|
View:
New views
5 Messages
—
Rating Filter:
Alert me
|
|
|
Re: UCC Batch Mode Problem under Windows - Job-Filegetsresubmitted over and over againThanks for you quick answer.
It is a Windows XP SP2 running in vmware and e: is an additional drive(disk-file) that I mounted into Windows(via vmware) since c: got too small. That could possibly be the cause? So it might/should work on a plain windows xp without vmware? I tried your suggestion and now its different; I now get stderr and stdout files. But it stills submitts more and more of the date job, but now after some .job-outpus it list the stderr/stdout files. I observed the RUNNING-JOBS directory and more and more jobs-files are created. And again, sometimes the date.u is deleted and sometimes its not. E:\ucc-1.1.1\bin>ucc.bat batch -t 1 -f -i jobs\ E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\a76c5cd0-fca5-4644-b915-cc9be65d42e3.job E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\b7b12728-ae22-440c-9092-735dc578ccdb.job E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\fdee18b5-fcff-4e45-a00f-4a82ed4b7f4e.job E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\f85e857e-5ba2-4c93-a086-b8957bb8a7b7.job E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\039b2639-9ab0-4690-b0bf-ccb1097ddff9.job E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\506f236b-97ab-412a-84bd-84d7b71e4e02.job E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\1b27ba3a-d1b0-4cbf-8db9-610af4ec595b.job E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\9d990a95-7521-4dd6-9879-f01830ca84a5.job E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\b490d3f2-2bad-47f1-8669-371d0230e326.job E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\883fccdf-9a3c-4910-bb6b-7a8119c446f7.job E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\35b6a777-5ff1-4602-9573-0c8fc2eff557.job E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\0bda6348-5865-4da7-870d-9c97029b9107.job E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\e01d639a-7b5c-41e4-bb6f-a0b34fc88dc7.job E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\c45e928d-d32d-4c5f-b510-8f202b7b4cdb.job E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\fe70815f-a5ce-4732-b5c4-a3b9f094e862.job E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\a8333d86-21ac-47dc-a1ac-4181c9bdf75c.job E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\02519f69-7fc9-4737-b4c1-b3f4d14a08b6.job E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\d509a5b3-3355-410b-826e-59dbf3339fe6.job E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\931324c1-5f5a-49fa-8a64-a391960ba755.job E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\5c8387e2-cef7-4423-b105-905c4f6a694a.job E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\668ad279-5553-4d4a-ac27-7784f13bdf47.job E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\51921d0e-6308-4d77-ad2a-c956cf478a32.job E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\d8c7ff37-d726-4129-8a53-da141c524a09.job E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\daecb272-0750-4cf5-b5f2-32e9ea985960.job E:\ucc-1.1.1\bin\.\a76c5cd0-fca5-4644-b915-cc9be65d42e3.stdout E:\ucc-1.1.1\bin\.\a76c5cd0-fca5-4644-b915-cc9be65d42e3.stderr E:\ucc-1.1.1\bin\.\b7b12728-ae22-440c-9092-735dc578ccdb.stdout E:\ucc-1.1.1\bin\.\b7b12728-ae22-440c-9092-735dc578ccdb.stderr E:\ucc-1.1.1\bin\.\fdee18b5-fcff-4e45-a00f-4a82ed4b7f4e.stdout E:\ucc-1.1.1\bin\.\fdee18b5-fcff-4e45-a00f-4a82ed4b7f4e.stderr E:\ucc-1.1.1\bin\.\f85e857e-5ba2-4c93-a086-b8957bb8a7b7.stdout E:\ucc-1.1.1\bin\.\f85e857e-5ba2-4c93-a086-b8957bb8a7b7.stderr E:\ucc-1.1.1\bin\.\039b2639-9ab0-4690-b0bf-ccb1097ddff9.stdout E:\ucc-1.1.1\bin\.\039b2639-9ab0-4690-b0bf-ccb1097ddff9.stderr E:\ucc-1.1.1\bin\.\506f236b-97ab-412a-84bd-84d7b71e4e02.stdout E:\ucc-1.1.1\bin\.\506f236b-97ab-412a-84bd-84d7b71e4e02.stderr E:\ucc-1.1.1\bin\.\1b27ba3a-d1b0-4cbf-8db9-610af4ec595b.stdout E:\ucc-1.1.1\bin\.\1b27ba3a-d1b0-4cbf-8db9-610af4ec595b.stderr E:\ucc-1.1.1\bin\.\9d990a95-7521-4dd6-9879-f01830ca84a5.stdout E:\ucc-1.1.1\bin\.\9d990a95-7521-4dd6-9879-f01830ca84a5.stderr E:\ucc-1.1.1\bin\.\b490d3f2-2bad-47f1-8669-371d0230e326.stdout E:\ucc-1.1.1\bin\.\b490d3f2-2bad-47f1-8669-371d0230e326.stderr E:\ucc-1.1.1\bin\.\883fccdf-9a3c-4910-bb6b-7a8119c446f7.stdout E:\ucc-1.1.1\bin\.\883fccdf-9a3c-4910-bb6b-7a8119c446f7.stderr E:\ucc-1.1.1\bin\.\35b6a777-5ff1-4602-9573-0c8fc2eff557.stdout E:\ucc-1.1.1\bin\.\35b6a777-5ff1-4602-9573-0c8fc2eff557.stderr E:\ucc-1.1.1\bin\.\0bda6348-5865-4da7-870d-9c97029b9107.stdout E:\ucc-1.1.1\bin\.\0bda6348-5865-4da7-870d-9c97029b9107.stderr E:\ucc-1.1.1\bin\.\e01d639a-7b5c-41e4-bb6f-a0b34fc88dc7.stdout E:\ucc-1.1.1\bin\.\e01d639a-7b5c-41e4-bb6f-a0b34fc88dc7.stderr E:\ucc-1.1.1\bin\.\c45e928d-d32d-4c5f-b510-8f202b7b4cdb.stdout E:\ucc-1.1.1\bin\.\c45e928d-d32d-4c5f-b510-8f202b7b4cdb.stderr E:\ucc-1.1.1\bin\.\fe70815f-a5ce-4732-b5c4-a3b9f094e862.stdout E:\ucc-1.1.1\bin\.\fe70815f-a5ce-4732-b5c4-a3b9f094e862.stderr E:\ucc-1.1.1\bin\.\a8333d86-21ac-47dc-a1ac-4181c9bdf75c.stdout E:\ucc-1.1.1\bin\.\a8333d86-21ac-47dc-a1ac-4181c9bdf75c.stderr E:\ucc-1.1.1\bin\.\02519f69-7fc9-4737-b4c1-b3f4d14a08b6.stdout E:\ucc-1.1.1\bin\.\02519f69-7fc9-4737-b4c1-b3f4d14a08b6.stderr E:\ucc-1.1.1\bin\.\d509a5b3-3355-410b-826e-59dbf3339fe6.stdout E:\ucc-1.1.1\bin\.\d509a5b3-3355-410b-826e-59dbf3339fe6.stderr E:\ucc-1.1.1\bin\.\931324c1-5f5a-49fa-8a64-a391960ba755.stdout E:\ucc-1.1.1\bin\.\931324c1-5f5a-49fa-8a64-a391960ba755.stderr E:\ucc-1.1.1\bin\.\5c8387e2-cef7-4423-b105-905c4f6a694a.stdout E:\ucc-1.1.1\bin\.\5c8387e2-cef7-4423-b105-905c4f6a694a.stderr E:\ucc-1.1.1\bin\.\668ad279-5553-4d4a-ac27-7784f13bdf47.stdout E:\ucc-1.1.1\bin\.\668ad279-5553-4d4a-ac27-7784f13bdf47.stderr E:\ucc-1.1.1\bin\.\51921d0e-6308-4d77-ad2a-c956cf478a32.stdout E:\ucc-1.1.1\bin\.\51921d0e-6308-4d77-ad2a-c956cf478a32.stderr E:\ucc-1.1.1\bin\.\d8c7ff37-d726-4129-8a53-da141c524a09.stdout E:\ucc-1.1.1\bin\.\d8c7ff37-d726-4129-8a53-da141c524a09.stderr E:\ucc-1.1.1\bin\.\daecb272-0750-4cf5-b5f2-32e9ea985960.stdout E:\ucc-1.1.1\bin\.\daecb272-0750-4cf5-b5f2-32e9ea985960.stderr etc And with the verbose option he gives me that over and over again: [ucc batch] Processing request: E:\ucc-1.1.1\bin\jobs\date.u And then stops and job-processing-output begins. Best Greetings Richard Zitat von Bernd Schuller <b.schuller@...>: > hi, > > Richard Grunzke wrote: >> Hello, >> >> I have a problem with the batch mode under windows and hopefully somebody >> might be able to help. Job gets resubmitted over and over again. >> >> I use the batch mode with ucc: >> E:\ucc-1.1.1\bin>ucc.bat batch -f -i jobs\ > > ucc is not very well tested on Windows (beyond basic usage), so I assume > this is simply a bug. Is "E" any special drive, such as a network mount, > USB stick or something? > > Also, can you try with a single thread? I.e. use > E:\ucc-1.1.1\bin>ucc.bat batch -t 1 -f -i jobs > > We had a similar problem related to NFS mounted directories, > but this problem was fixed in the 1.1 version. > > Best regards, > Bernd. > >> >> Copy date.u into the jobs-directory: >> E:\ucc-1.1.1\bin>copy date.u jobs\ >> Date.u is the standard file from the samples directory. >> >> And now the following happens, the date-jobs gets resubmitted over and >> over again and it doesn't stop: >> E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\03d5cea3-9c11-4a99-8d24-27b2fef3faa4.job > [...] > >> Can't find a target system. >> java.lang.NullPointerException >> at de.fzj.unicore.ucc.helpers.Runner.matches(Runner.java:239) >> at de.fzj.unicore.ucc.helpers.Runner.findTSS(Runner.java:207) >> at de.fzj.unicore.ucc.helpers.Runner.doSubmit(Runner.java:154) >> at de.fzj.unicore.ucc.helpers.Runner.run(Runner.java:101) >> at de.fzj.unicore.ucc.actions.Batch.processRequest(Batch.java:349) >> at de.fzj.unicore.ucc.actions.Batch$1.run(Batch.java:289) >> at >> java.util.concurrent.ThreadPoolExecutor$Worker.runTask(Unknown Source) >> at >> java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source) >> at java.lang.Thread.run(Unknown Source) > [...] >> >> Under Linux this is no problem at all: >> ucc batch -f -i jobs/ & >> cp date.u jobs/ >> /home/richard/ucc/jobs/RUNNING_JOBS/0fe71db3-e17b-4666-bc98-5e64f2bad810.job >> /home/richard/ucc/./0fe71db3-e17b-4666-bc98-5e64f2bad810.stdout >> /home/richard/ucc/./0fe71db3-e17b-4666-bc98-5e64f2bad810.stderr >> >> >> ps: >> E:\ucc-1.1.1\bin>java -version >> java version "1.5.0_14" >> Java(TM) 2 Runtime Environment, Standard Edition (build 1.5.0_14-b03) >> Java HotSpot(TM) Client VM (build 1.5.0_14-b03, mixed mode, sharing) >> [...] > > -- > Dr. Bernd Schuller | mail: b.schuller@... > | phone: +49 2461 61-8736 > (fax: -6656) > Distributed Systems and Grid Computing | personal blog: > Juelich Supercomputing Centre | > http://www.jroller.com/page/gridhaus > http://www.fz-juelich.de/jsc | > > > ------------------------------------------------------------------- > ------------------------------------------------------------------- > Forschungszentrum Jülich GmbH > 52425 Jülich > > Sitz der Gesellschaft: Jülich > Eingetragen im Handelsregister des Amtsgerichts Düren Nr. HR B 3498 > Vorsitzende des Aufsichtsrats: MinDir'in Bärbel Brumme-Bothe > Geschäftsführung: Prof. Dr. Achim Bachem (Vorsitzender), > Dr. Ulrich Krafft (stellv. Vorsitzender), Prof. Dr. Harald Bolt, > Dr. Sebastian M. Schmidt > ------------------------------------------------------------------- > ------------------------------------------------------------------- > > ------------------------------------------------------------------------- > Check out the new SourceForge.net Marketplace. > It's the best place to buy or sell services for > just about anything Open Source. > http://sourceforge.net/services/buy/index.php > _______________________________________________ > Unicore-support mailing list > Unicore-support@... > https://lists.sourceforge.net/lists/listinfo/unicore-support > > ------------------------------------------------------------------------- Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://sourceforge.net/services/buy/index.php _______________________________________________ Unicore-support mailing list Unicore-support@... https://lists.sourceforge.net/lists/listinfo/unicore-support |
|
|
Re: UCC Batch Mode Problem under Windows- Job-Filegetsresubmittedover and over againHi,
Richard Grunzke wrote: > Thanks for you quick answer. > > It is a Windows XP SP2 running in vmware and e: is an additional > drive(disk-file) that I mounted into Windows(via vmware) since c: got too > small. > That could possibly be the cause? So it might/should work on a plain > windows xp without vmware? I think so, yes. But it is a ucc bug anyway, I'll have a look into this. I'm testing on Windows / VMWare as well :-) Best Regards, Bernd. > I tried your suggestion and now its different; > I now get stderr and stdout files. But it stills submitts more and > more of the date job, but now after some .job-outpus it list the > stderr/stdout files. > I observed the RUNNING-JOBS directory and more and more jobs-files are > created. > And again, sometimes the date.u is deleted and sometimes its not. >[...] > > Zitat von Bernd Schuller <b.schuller@...>: > >> hi, >> >> Richard Grunzke wrote: >>> Hello, >>> >>> I have a problem with the batch mode under windows and hopefully somebody >>> might be able to help. Job gets resubmitted over and over again. >>> >>> I use the batch mode with ucc: >>> E:\ucc-1.1.1\bin>ucc.bat batch -f -i jobs\ >> ucc is not very well tested on Windows (beyond basic usage), so I assume >> this is simply a bug. Is "E" any special drive, such as a network mount, >> USB stick or something? >> >> Also, can you try with a single thread? I.e. use >> E:\ucc-1.1.1\bin>ucc.bat batch -t 1 -f -i jobs >> >> We had a similar problem related to NFS mounted directories, >> but this problem was fixed in the 1.1 version. >> >> Best regards, >> Bernd. >> >>> Copy date.u into the jobs-directory: >>> E:\ucc-1.1.1\bin>copy date.u jobs\ >>> Date.u is the standard file from the samples directory. >>> >>> And now the following happens, the date-jobs gets resubmitted over and >>> over again and it doesn't stop: >>> E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\03d5cea3-9c11-4a99-8d24-27b2fef3faa4.job >> [...] >> >>> Can't find a target system. >>> java.lang.NullPointerException >>> at de.fzj.unicore.ucc.helpers.Runner.matches(Runner.java:239) >>> at de.fzj.unicore.ucc.helpers.Runner.findTSS(Runner.java:207) >>> at de.fzj.unicore.ucc.helpers.Runner.doSubmit(Runner.java:154) >>> at de.fzj.unicore.ucc.helpers.Runner.run(Runner.java:101) >>> at de.fzj.unicore.ucc.actions.Batch.processRequest(Batch.java:349) >>> at de.fzj.unicore.ucc.actions.Batch$1.run(Batch.java:289) >>> at >>> java.util.concurrent.ThreadPoolExecutor$Worker.runTask(Unknown Source) >>> at >>> java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source) >>> at java.lang.Thread.run(Unknown Source) >> [...] >>> Under Linux this is no problem at all: >>> ucc batch -f -i jobs/ & >>> cp date.u jobs/ >>> /home/richard/ucc/jobs/RUNNING_JOBS/0fe71db3-e17b-4666-bc98-5e64f2bad810.job >>> /home/richard/ucc/./0fe71db3-e17b-4666-bc98-5e64f2bad810.stdout >>> /home/richard/ucc/./0fe71db3-e17b-4666-bc98-5e64f2bad810.stderr >>> >>> >>> ps: >>> E:\ucc-1.1.1\bin>java -version >>> java version "1.5.0_14" >>> Java(TM) 2 Runtime Environment, Standard Edition (build 1.5.0_14-b03) >>> Java HotSpot(TM) Client VM (build 1.5.0_14-b03, mixed mode, sharing) >>> [...] >> -- -- Dr. Bernd Schuller | mail: b.schuller@... | phone: +49 2461 61-8736 (fax: -6656) Distributed Systems and Grid Computing | personal blog: Juelich Supercomputing Centre | http://www.jroller.com/page/gridhaus http://www.fz-juelich.de/jsc | ------------------------------------------------------------------- ------------------------------------------------------------------- Forschungszentrum Jülich GmbH 52425 Jülich Sitz der Gesellschaft: Jülich Eingetragen im Handelsregister des Amtsgerichts Düren Nr. HR B 3498 Vorsitzende des Aufsichtsrats: MinDir'in Bärbel Brumme-Bothe Geschäftsführung: Prof. Dr. Achim Bachem (Vorsitzender), Dr. Ulrich Krafft (stellv. Vorsitzender), Prof. Dr. Harald Bolt, Dr. Sebastian M. Schmidt ------------------------------------------------------------------- ------------------------------------------------------------------- ------------------------------------------------------------------------- Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://sourceforge.net/services/buy/index.php _______________________________________________ Unicore-support mailing list Unicore-support@... https://lists.sourceforge.net/lists/listinfo/unicore-support |
|
|
Re: UCC Batch Mode Problem under Windows- Job-Filegetsresubmittedover and over againHello,
I now tried the same test(ucc in batch mode) under a Windows without VMWare, and its the same error, the date-job gets resubmitted over and over again. Do you have any other suggestions I could try? Did you maybe have the time to look into it? I would be very much interested. :) If you need someone to test please say so. Thanks Best Greetings, Richard Grunzke Quoting Bernd Schuller <b.schuller@...>: > Hi, > > Richard Grunzke wrote: >> Thanks for you quick answer. >> >> It is a Windows XP SP2 running in vmware and e: is an additional >> drive(disk-file) that I mounted into Windows(via vmware) since c: got too >> small. >> That could possibly be the cause? So it might/should work on a plain >> windows xp without vmware? > > I think so, yes. But it is a ucc bug anyway, I'll have a look into this. > I'm testing on Windows / VMWare as well :-) > > Best Regards, > Bernd. > > >> I tried your suggestion and now its different; >> I now get stderr and stdout files. But it stills submitts more and >> more of the date job, but now after some .job-outpus it list the >> stderr/stdout files. >> I observed the RUNNING-JOBS directory and more and more jobs-files are >> created. >> And again, sometimes the date.u is deleted and sometimes its not. >> [...] >> >> Zitat von Bernd Schuller <b.schuller@...>: >> >>> hi, >>> >>> Richard Grunzke wrote: >>>> Hello, >>>> >>>> I have a problem with the batch mode under windows and hopefully somebody >>>> might be able to help. Job gets resubmitted over and over again. >>>> >>>> I use the batch mode with ucc: >>>> E:\ucc-1.1.1\bin>ucc.bat batch -f -i jobs\ >>> ucc is not very well tested on Windows (beyond basic usage), so I assume >>> this is simply a bug. Is "E" any special drive, such as a network mount, >>> USB stick or something? >>> >>> Also, can you try with a single thread? I.e. use >>> E:\ucc-1.1.1\bin>ucc.bat batch -t 1 -f -i jobs >>> >>> We had a similar problem related to NFS mounted directories, >>> but this problem was fixed in the 1.1 version. >>> >>> Best regards, >>> Bernd. >>> >>>> Copy date.u into the jobs-directory: >>>> E:\ucc-1.1.1\bin>copy date.u jobs\ >>>> Date.u is the standard file from the samples directory. >>>> >>>> And now the following happens, the date-jobs gets resubmitted over and >>>> over again and it doesn't stop: >>>> E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\03d5cea3-9c11-4a99-8d24-27b2fef3faa4.job >>> [...] >>> >>>> Can't find a target system. >>>> java.lang.NullPointerException >>>> at de.fzj.unicore.ucc.helpers.Runner.matches(Runner.java:239) >>>> at de.fzj.unicore.ucc.helpers.Runner.findTSS(Runner.java:207) >>>> at de.fzj.unicore.ucc.helpers.Runner.doSubmit(Runner.java:154) >>>> at de.fzj.unicore.ucc.helpers.Runner.run(Runner.java:101) >>>> at >>>> de.fzj.unicore.ucc.actions.Batch.processRequest(Batch.java:349) >>>> at de.fzj.unicore.ucc.actions.Batch$1.run(Batch.java:289) >>>> at >>>> java.util.concurrent.ThreadPoolExecutor$Worker.runTask(Unknown Source) >>>> at >>>> java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source) >>>> at java.lang.Thread.run(Unknown Source) >>> [...] >>>> Under Linux this is no problem at all: >>>> ucc batch -f -i jobs/ & >>>> cp date.u jobs/ >>>> /home/richard/ucc/jobs/RUNNING_JOBS/0fe71db3-e17b-4666-bc98-5e64f2bad810.job >>>> /home/richard/ucc/./0fe71db3-e17b-4666-bc98-5e64f2bad810.stdout >>>> /home/richard/ucc/./0fe71db3-e17b-4666-bc98-5e64f2bad810.stderr >>>> >>>> >>>> ps: >>>> E:\ucc-1.1.1\bin>java -version >>>> java version "1.5.0_14" >>>> Java(TM) 2 Runtime Environment, Standard Edition (build 1.5.0_14-b03) >>>> Java HotSpot(TM) Client VM (build 1.5.0_14-b03, mixed mode, sharing) >>>> [...] >>> -- > > -- > Dr. Bernd Schuller | mail: b.schuller@... > | phone: +49 2461 61-8736 > (fax: -6656) > Distributed Systems and Grid Computing | personal blog: > Juelich Supercomputing Centre | > http://www.jroller.com/page/gridhaus > http://www.fz-juelich.de/jsc | > > > ------------------------------------------------------------------- > ------------------------------------------------------------------- > Forschungszentrum Jülich GmbH > 52425 Jülich > > Sitz der Gesellschaft: Jülich > Eingetragen im Handelsregister des Amtsgerichts Düren Nr. HR B 3498 > Vorsitzende des Aufsichtsrats: MinDir'in Bärbel Brumme-Bothe > Geschäftsführung: Prof. Dr. Achim Bachem (Vorsitzender), > Dr. Ulrich Krafft (stellv. Vorsitzender), Prof. Dr. Harald Bolt, > Dr. Sebastian M. Schmidt > ------------------------------------------------------------------- > ------------------------------------------------------------------- > > ------------------------------------------------------------------------- > Check out the new SourceForge.net Marketplace. > It's the best place to buy or sell services for > just about anything Open Source. > http://sourceforge.net/services/buy/index.php > _______________________________________________ > Unicore-support mailing list > Unicore-support@... > https://lists.sourceforge.net/lists/listinfo/unicore-support > > ------------------------------------------------------------------------- This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ _______________________________________________ Unicore-support mailing list Unicore-support@... https://lists.sourceforge.net/lists/listinfo/unicore-support |
|
|
Re: UCC Batch Mode Problem under Windows- Job-Filegets resubmitted over and over againHi Richard,
thanks for the reminder :-) this got lost somehow. Anyway I think I know what is happening here and I have a fix for this. If you can't wait for the next ucc release I can send you a patched ucc-1.1.1.jar file... Best regards, Bernd. Richard Grunzke wrote: > Hello, > > I now tried the same test(ucc in batch mode) under a Windows without > VMWare, and its the same error, the date-job gets resubmitted over and > over again. > > Do you have any other suggestions I could try? > Did you maybe have the time to look into it? > I would be very much interested. :) > If you need someone to test please say so. > > > Thanks > > Best Greetings, > Richard Grunzke > > > Quoting Bernd Schuller <b.schuller@...>: > >> Hi, >> >> Richard Grunzke wrote: >>> Thanks for you quick answer. >>> >>> It is a Windows XP SP2 running in vmware and e: is an additional >>> drive(disk-file) that I mounted into Windows(via vmware) since c: got too >>> small. >>> That could possibly be the cause? So it might/should work on a plain >>> windows xp without vmware? >> I think so, yes. But it is a ucc bug anyway, I'll have a look into this. >> I'm testing on Windows / VMWare as well :-) >> >> Best Regards, >> Bernd. >> >> >>> I tried your suggestion and now its different; >>> I now get stderr and stdout files. But it stills submitts more and >>> more of the date job, but now after some .job-outpus it list the >>> stderr/stdout files. >>> I observed the RUNNING-JOBS directory and more and more jobs-files are >>> created. >>> And again, sometimes the date.u is deleted and sometimes its not. >>> [...] >>> >>> Zitat von Bernd Schuller <b.schuller@...>: >>> >>>> hi, >>>> >>>> Richard Grunzke wrote: >>>>> Hello, >>>>> >>>>> I have a problem with the batch mode under windows and hopefully somebody >>>>> might be able to help. Job gets resubmitted over and over again. >>>>> >>>>> I use the batch mode with ucc: >>>>> E:\ucc-1.1.1\bin>ucc.bat batch -f -i jobs\ >>>> ucc is not very well tested on Windows (beyond basic usage), so I assume >>>> this is simply a bug. Is "E" any special drive, such as a network mount, >>>> USB stick or something? >>>> >>>> Also, can you try with a single thread? I.e. use >>>> E:\ucc-1.1.1\bin>ucc.bat batch -t 1 -f -i jobs >>>> >>>> We had a similar problem related to NFS mounted directories, >>>> but this problem was fixed in the 1.1 version. >>>> >>>> Best regards, >>>> Bernd. >>>> >>>>> Copy date.u into the jobs-directory: >>>>> E:\ucc-1.1.1\bin>copy date.u jobs\ >>>>> Date.u is the standard file from the samples directory. >>>>> >>>>> And now the following happens, the date-jobs gets resubmitted over and >>>>> over again and it doesn't stop: >>>>> E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\03d5cea3-9c11-4a99-8d24-27b2fef3faa4.job >>>> [...] >>>> >>>>> Can't find a target system. >>>>> java.lang.NullPointerException >>>>> at de.fzj.unicore.ucc.helpers.Runner.matches(Runner.java:239) >>>>> at de.fzj.unicore.ucc.helpers.Runner.findTSS(Runner.java:207) >>>>> at de.fzj.unicore.ucc.helpers.Runner.doSubmit(Runner.java:154) >>>>> at de.fzj.unicore.ucc.helpers.Runner.run(Runner.java:101) >>>>> at >>>>> de.fzj.unicore.ucc.actions.Batch.processRequest(Batch.java:349) >>>>> at de.fzj.unicore.ucc.actions.Batch$1.run(Batch.java:289) >>>>> at >>>>> java.util.concurrent.ThreadPoolExecutor$Worker.runTask(Unknown Source) >>>>> at >>>>> java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source) >>>>> at java.lang.Thread.run(Unknown Source) >>>> [...] >>>>> Under Linux this is no problem at all: >>>>> ucc batch -f -i jobs/ & >>>>> cp date.u jobs/ >>>>> /home/richard/ucc/jobs/RUNNING_JOBS/0fe71db3-e17b-4666-bc98-5e64f2bad810.job >>>>> /home/richard/ucc/./0fe71db3-e17b-4666-bc98-5e64f2bad810.stdout >>>>> /home/richard/ucc/./0fe71db3-e17b-4666-bc98-5e64f2bad810.stderr >>>>> >>>>> >>>>> ps: >>>>> E:\ucc-1.1.1\bin>java -version >>>>> java version "1.5.0_14" >>>>> Java(TM) 2 Runtime Environment, Standard Edition (build 1.5.0_14-b03) >>>>> Java HotSpot(TM) Client VM (build 1.5.0_14-b03, mixed mode, sharing) >>>>> [...] >>>> -- >> -- >> Dr. Bernd Schuller | mail: b.schuller@... >> | phone: +49 2461 61-8736 >> (fax: -6656) >> Distributed Systems and Grid Computing | personal blog: >> Juelich Supercomputing Centre | >> http://www.jroller.com/page/gridhaus >> http://www.fz-juelich.de/jsc | >> >> >> ------------------------------------------------------------------- >> ------------------------------------------------------------------- >> Forschungszentrum Jülich GmbH >> 52425 Jülich >> >> Sitz der Gesellschaft: Jülich >> Eingetragen im Handelsregister des Amtsgerichts Düren Nr. HR B 3498 >> Vorsitzende des Aufsichtsrats: MinDir'in Bärbel Brumme-Bothe >> Geschäftsführung: Prof. Dr. Achim Bachem (Vorsitzender), >> Dr. Ulrich Krafft (stellv. Vorsitzender), Prof. Dr. Harald Bolt, >> Dr. Sebastian M. Schmidt >> ------------------------------------------------------------------- >> ------------------------------------------------------------------- >> >> ------------------------------------------------------------------------- >> Check out the new SourceForge.net Marketplace. >> It's the best place to buy or sell services for >> just about anything Open Source. >> http://sourceforge.net/services/buy/index.php >> _______________________________________________ >> Unicore-support mailing list >> Unicore-support@... >> https://lists.sourceforge.net/lists/listinfo/unicore-support >> >> > > > > ------------------------------------------------------------------------- > This SF.Net email is sponsored by the Moblin Your Move Developer's challenge > Build the coolest Linux based applications with Moblin SDK & win great prizes > Grand prize is a trip for two to an Open Source event anywhere in the world > http://moblin-contest.org/redirect.php?banner_id=100&url=/ > _______________________________________________ > Unicore-support mailing list > Unicore-support@... > https://lists.sourceforge.net/lists/listinfo/unicore-support > -- Dr. Bernd Schuller | mail: b.schuller@... | phone: +49 2461 61-8736 (fax: -6656) Distributed Systems and Grid Computing | personal blog: Juelich Supercomputing Centre | http://www.jroller.com/page/gridhaus http://www.fz-juelich.de/jsc | ------------------------------------------------------------------- ------------------------------------------------------------------- Forschungszentrum Jülich GmbH 52425 Jülich Sitz der Gesellschaft: Jülich Eingetragen im Handelsregister des Amtsgerichts Düren Nr. HR B 3498 Vorsitzende des Aufsichtsrats: MinDir'in Bärbel Brumme-Bothe Geschäftsführung: Prof. Dr. Achim Bachem (Vorsitzender), Dr. Ulrich Krafft (stellv. Vorsitzender), Prof. Dr. Harald Bolt, Dr. Sebastian M. Schmidt ------------------------------------------------------------------- ------------------------------------------------------------------- ------------------------------------------------------------------------- This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ _______________________________________________ Unicore-support mailing list Unicore-support@... https://lists.sourceforge.net/lists/listinfo/unicore-support |
|
|
Re: UCC Batch Mode Problem under Windows- Job-Filegets resubmitted over and over againHello Bernd,
> Hi Richard, > > thanks for the reminder :-) this got lost somehow. > Anyway I think I know what is happening here and I have a > fix for this. If you can't wait for the next ucc release I can > send you a patched ucc-1.1.1.jar file... Yes please do so, it would be really great and most appreciated! Thanks! Best Greetings, Richard > > Best regards, > Bernd. > > > Richard Grunzke wrote: >> Hello, >> >> I now tried the same test(ucc in batch mode) under a Windows without >> VMWare, and its the same error, the date-job gets resubmitted over and >> over again. >> >> Do you have any other suggestions I could try? >> Did you maybe have the time to look into it? >> I would be very much interested. :) >> If you need someone to test please say so. >> >> >> Thanks >> >> Best Greetings, >> Richard Grunzke >> >> >> Quoting Bernd Schuller <b.schuller@...>: >> >>> Hi, >>> >>> Richard Grunzke wrote: >>>> Thanks for you quick answer. >>>> >>>> It is a Windows XP SP2 running in vmware and e: is an additional >>>> drive(disk-file) that I mounted into Windows(via vmware) since c: got too >>>> small. >>>> That could possibly be the cause? So it might/should work on a plain >>>> windows xp without vmware? >>> I think so, yes. But it is a ucc bug anyway, I'll have a look into this. >>> I'm testing on Windows / VMWare as well :-) >>> >>> Best Regards, >>> Bernd. >>> >>> >>>> I tried your suggestion and now its different; >>>> I now get stderr and stdout files. But it stills submitts more and >>>> more of the date job, but now after some .job-outpus it list the >>>> stderr/stdout files. >>>> I observed the RUNNING-JOBS directory and more and more jobs-files are >>>> created. >>>> And again, sometimes the date.u is deleted and sometimes its not. >>>> [...] >>>> >>>> Zitat von Bernd Schuller <b.schuller@...>: >>>> >>>>> hi, >>>>> >>>>> Richard Grunzke wrote: >>>>>> Hello, >>>>>> >>>>>> I have a problem with the batch mode under windows and >>>>>> hopefully somebody >>>>>> might be able to help. Job gets resubmitted over and over again. >>>>>> >>>>>> I use the batch mode with ucc: >>>>>> E:\ucc-1.1.1\bin>ucc.bat batch -f -i jobs\ >>>>> ucc is not very well tested on Windows (beyond basic usage), so I assume >>>>> this is simply a bug. Is "E" any special drive, such as a network mount, >>>>> USB stick or something? >>>>> >>>>> Also, can you try with a single thread? I.e. use >>>>> E:\ucc-1.1.1\bin>ucc.bat batch -t 1 -f -i jobs >>>>> >>>>> We had a similar problem related to NFS mounted directories, >>>>> but this problem was fixed in the 1.1 version. >>>>> >>>>> Best regards, >>>>> Bernd. >>>>> >>>>>> Copy date.u into the jobs-directory: >>>>>> E:\ucc-1.1.1\bin>copy date.u jobs\ >>>>>> Date.u is the standard file from the samples directory. >>>>>> >>>>>> And now the following happens, the date-jobs gets resubmitted over and >>>>>> over again and it doesn't stop: >>>>>> E:\ucc-1.1.1\bin\jobs\RUNNING_JOBS\03d5cea3-9c11-4a99-8d24-27b2fef3faa4.job >>>>> [...] >>>>> >>>>>> Can't find a target system. >>>>>> java.lang.NullPointerException >>>>>> at de.fzj.unicore.ucc.helpers.Runner.matches(Runner.java:239) >>>>>> at de.fzj.unicore.ucc.helpers.Runner.findTSS(Runner.java:207) >>>>>> at de.fzj.unicore.ucc.helpers.Runner.doSubmit(Runner.java:154) >>>>>> at de.fzj.unicore.ucc.helpers.Runner.run(Runner.java:101) >>>>>> at >>>>>> de.fzj.unicore.ucc.actions.Batch.processRequest(Batch.java:349) >>>>>> at de.fzj.unicore.ucc.actions.Batch$1.run(Batch.java:289) >>>>>> at >>>>>> java.util.concurrent.ThreadPoolExecutor$Worker.runTask(Unknown Source) >>>>>> at >>>>>> java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source) >>>>>> at java.lang.Thread.run(Unknown Source) >>>>> [...] >>>>>> Under Linux this is no problem at all: >>>>>> ucc batch -f -i jobs/ & >>>>>> cp date.u jobs/ >>>>>> /home/richard/ucc/jobs/RUNNING_JOBS/0fe71db3-e17b-4666-bc98-5e64f2bad810.job >>>>>> /home/richard/ucc/./0fe71db3-e17b-4666-bc98-5e64f2bad810.stdout >>>>>> /home/richard/ucc/./0fe71db3-e17b-4666-bc98-5e64f2bad810.stderr >>>>>> >>>>>> >>>>>> ps: >>>>>> E:\ucc-1.1.1\bin>java -version >>>>>> java version "1.5.0_14" >>>>>> Java(TM) 2 Runtime Environment, Standard Edition (build 1.5.0_14-b03) >>>>>> Java HotSpot(TM) Client VM (build 1.5.0_14-b03, mixed mode, sharing) >>>>>> [...] >>>>> -- >>> -- >>> Dr. Bernd Schuller | mail: b.schuller@... >>> | phone: +49 2461 61-8736 >>> (fax: -6656) >>> Distributed Systems and Grid Computing | personal blog: >>> Juelich Supercomputing Centre | >>> http://www.jroller.com/page/gridhaus >>> http://www.fz-juelich.de/jsc | >>> >>> >>> ------------------------------------------------------------------- >>> ------------------------------------------------------------------- >>> Forschungszentrum Jülich GmbH >>> 52425 Jülich >>> >>> Sitz der Gesellschaft: Jülich >>> Eingetragen im Handelsregister des Amtsgerichts Düren Nr. HR B 3498 >>> Vorsitzende des Aufsichtsrats: MinDir'in Bärbel Brumme-Bothe >>> Geschäftsführung: Prof. Dr. Achim Bachem (Vorsitzender), >>> Dr. Ulrich Krafft (stellv. Vorsitzender), Prof. Dr. Harald Bolt, >>> Dr. Sebastian M. Schmidt >>> ------------------------------------------------------------------- >>> ------------------------------------------------------------------- >>> >>> ------------------------------------------------------------------------- >>> Check out the new SourceForge.net Marketplace. >>> It's the best place to buy or sell services for >>> just about anything Open Source. >>> http://sourceforge.net/services/buy/index.php >>> _______________________________________________ >>> Unicore-support mailing list >>> Unicore-support@... >>> https://lists.sourceforge.net/lists/listinfo/unicore-support >>> >>> >> >> >> >> ------------------------------------------------------------------------- >> This SF.Net email is sponsored by the Moblin Your Move Developer's challenge >> Build the coolest Linux based applications with Moblin SDK & win >> great prizes >> Grand prize is a trip for two to an Open Source event anywhere in the world >> http://moblin-contest.org/redirect.php?banner_id=100&url=/ >> _______________________________________________ >> Unicore-support mailing list >> Unicore-support@... >> https://lists.sourceforge.net/lists/listinfo/unicore-support >> > > -- > Dr. Bernd Schuller | mail: b.schuller@... > | phone: +49 2461 61-8736 > (fax: -6656) > Distributed Systems and Grid Computing | personal blog: > Juelich Supercomputing Centre | > http://www.jroller.com/page/gridhaus > http://www.fz-juelich.de/jsc | > > > ------------------------------------------------------------------- > ------------------------------------------------------------------- > Forschungszentrum Jülich GmbH > 52425 Jülich > > Sitz der Gesellschaft: Jülich > Eingetragen im Handelsregister des Amtsgerichts Düren Nr. HR B 3498 > Vorsitzende des Aufsichtsrats: MinDir'in Bärbel Brumme-Bothe > Geschäftsführung: Prof. Dr. Achim Bachem (Vorsitzender), > Dr. Ulrich Krafft (stellv. Vorsitzender), Prof. Dr. Harald Bolt, > Dr. Sebastian M. Schmidt > ------------------------------------------------------------------- > ------------------------------------------------------------------- > > ------------------------------------------------------------------------- > This SF.Net email is sponsored by the Moblin Your Move Developer's challenge > Build the coolest Linux based applications with Moblin SDK & win great prizes > Grand prize is a trip for two to an Open Source event anywhere in the world > http://moblin-contest.org/redirect.php?banner_id=100&url=/ > _______________________________________________ > Unicore-support mailing list > Unicore-support@... > https://lists.sourceforge.net/lists/listinfo/unicore-support > > ------------------------------------------------------------------------- This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ _______________________________________________ Unicore-support mailing list Unicore-support@... https://lists.sourceforge.net/lists/listinfo/unicore-support |
| Free Forum Powered by Nabble | Forum Help |