Boinc defunct processes minutes after startup.

Message boards : Number crunching : Boinc defunct processes minutes after startup.

To post messages, you must log in.

AuthorMessage
William King

Send message
Joined: 30 May 10
Posts: 1
Credit: 618,596
RAC: 0
Message 70760 - Posted: 20 Jul 2011, 21:12:53 UTC

I have two servers with exactly the same hardware and software configurations. One of them processing WU's just fine, the other shows no CPU usage under htop and all of the boinc processes show up as defunct or zombies.

Such as:

boinc 28765 6.6 1.7 342320 285388 ? SNl 14:04 0:19 ../../projects/boinc.bakerlab.org_rosetta/minirosetta_3.14_x86_64-pc-linux-gnu @monomer_all_boinc_1lu9B_127.nonlocal.pctid_0.09.tmscore_0.23359._nonlocal_tex.boinc.flags -silent_gz -mute all -out:file:silent default.out -in:file:boinc_wu_zip monomer_all_boinc_1lu9B_127.nonlocal.pctid_0.09.tmscore_0.23359._nonlocal_tex.boinc.zip -nstruct 10000 -cpu_run_time 10800 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -run::rng mt19937 -constant_seed -jran 3286016
boinc 28794 6.7 0.0 0 0 ? ZN 14:04 0:20 [minirosetta_3.1] <defunct>
ID: 70760 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mikey
Avatar

Send message
Joined: 5 Jan 06
Posts: 1895
Credit: 9,135,082
RAC: 4,703
Message 70767 - Posted: 22 Jul 2011, 11:05:15 UTC - in response to Message 70760.  

I have two servers with exactly the same hardware and software configurations. One of them processing WU's just fine, the other shows no CPU usage under htop and all of the boinc processes show up as defunct or zombies.

Such as:

boinc 28765 6.6 1.7 342320 285388 ? SNl 14:04 0:19 ../../projects/boinc.bakerlab.org_rosetta/minirosetta_3.14_x86_64-pc-linux-gnu @monomer_all_boinc_1lu9B_127.nonlocal.pctid_0.09.tmscore_0.23359._nonlocal_tex.boinc.flags -silent_gz -mute all -out:file:silent default.out -in:file:boinc_wu_zip monomer_all_boinc_1lu9B_127.nonlocal.pctid_0.09.tmscore_0.23359._nonlocal_tex.boinc.zip -nstruct 10000 -cpu_run_time 10800 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -run::rng mt19937 -constant_seed -jran 3286016
boinc 28794 6.7 0.0 0 0 ? ZN 14:04 0:20 [minirosetta_3.1] <defunct>


No clue but since it is a Linux Server you might want to ask in the Linux section.
ID: 70767 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Number crunching : Boinc defunct processes minutes after startup.



©2024 University of Washington
https://www.bakerlab.org