Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

Greg_BE
Joined: 30 May 06
Posts: 5690
Credit: 5,859,226
RAC: 64
Message 109692 - Posted: 29 Aug 2024, 22:08:12 UTC

It's back, but with 12,000 active users against 19,000 tasks, they were gone in a heartbeat.
That's 1.6 tasks per system on average.

Bill Swisher
Joined: 10 Jun 13
Posts: 25
Credit: 30,583,968
RAC: 16,757
Message 109693 - Posted: 30 Aug 2024, 4:30:59 UTC - in response to Message 109692.  

It's back, but with 12,000 active users against 19,000 tasks, they were gone in a heartbeat.
That's 1.6 tasks per system on average.


Only if you're running an Intel GPU, according to the Event Log. I've no Intel computers (to speak of). So I have no Rosetta work, and Denis is still on summer break, leaving only WCG. Yet again.

Grant (SSSF)
Joined: 28 Mar 20
Posts: 1615
Credit: 16,543,924
RAC: 3,224
Message 109694 - Posted: 30 Aug 2024, 6:34:32 UTC - in response to Message 109693.  

Only if you're running an Intel GPU according to the Event Log.
???
There is no GPU application here at Rosetta, CPU only.
Grant
Darwin NT

Grant (SSSF)
Joined: 28 Mar 20
Posts: 1615
Credit: 16,543,924
RAC: 3,224
Message 109695 - Posted: 30 Aug 2024, 6:36:13 UTC - in response to Message 109692.  
Last modified: 30 Aug 2024, 6:39:42 UTC

It's back, but with 12,000 active users
According to the Server Status page, that's 1,200.
And the last batch of work that was sent out was not even 5,000 Tasks.

So it's still a matter of 20 minutes or less before they are all gone.
Grant
Darwin NT
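
The per-system averages in this exchange can be checked with a little arithmetic. The sketch below is only an illustration: it treats "users" and "systems" as interchangeable, as the posts do, and it uses 5,000 as an upper bound because the batch was described as "not even 5,000 Tasks". The function name is made up for this example.

```python
def tasks_per_host(tasks, hosts):
    """Average number of tasks available per active host."""
    return tasks / hosts


# Figures from the first post: 19,000 tasks against 12,000 active users.
first_estimate = tasks_per_host(19_000, 12_000)

# Figures from the Server Status correction: 1,200 active users and,
# as an upper bound only, 5,000 tasks in the last batch.
corrected_upper_bound = tasks_per_host(5_000, 1_200)

print(f"{first_estimate:.1f}")         # prints 1.6
print(f"{corrected_upper_bound:.1f}")  # prints 4.2
```

Either way the average is a handful of tasks per system at most, which fits the observation that a batch disappears within minutes.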

Bill Swisher
Joined: 10 Jun 13
Posts: 25
Credit: 30,583,968
RAC: 16,757
Message 109697 - Posted: 30 Aug 2024, 14:25:39 UTC - in response to Message 109694.  

Yer right! My eyes went wonky for a bit and I misread the task name. It was an entry for WCG.

[VENETO] boboviz
Joined: 1 Dec 05
Posts: 1972
Credit: 9,122,051
RAC: 2,089
Message 109704 - Posted: 4 Sep 2024, 14:56:39 UTC - in response to Message 109684.  

boinc-process is down again, so there's a Validation backlog once again that continues to grow.


Also today, like an old friend....

[VENETO] boboviz
Joined: 1 Dec 05
Posts: 1972
Credit: 9,122,051
RAC: 2,089
Message 109717 - Posted: 9 Sep 2024, 19:53:56 UTC

I said that I hoped for a lot of work when summer finished,
'cause in September universities, research centers, etc. will reopen.

And....no work :-(

mrchips
Joined: 11 Nov 09
Posts: 10
Credit: 13,708,271
RAC: 4,518
Message 109723 - Posted: 11 Sep 2024, 11:30:10 UTC

System is down again!!!

Sid Celery
Joined: 11 Feb 08
Posts: 2062
Credit: 40,473,890
RAC: 1,510
Message 109733 - Posted: 17 Sep 2024, 0:54:47 UTC

I've only been dipping in and out recently, but all servers are running atm and my system grabbed tasks 30 mins ago.
Server status page updated 7 minutes ago and more Rosetta Beta tasks seem to be available.
No idea how many until the front page updates.
Fingers crossed.

Sid Celery
Joined: 11 Feb 08
Posts: 2062
Credit: 40,473,890
RAC: 1,510
Message 109734 - Posted: 17 Sep 2024, 4:12:23 UTC - in response to Message 109733.  

Looks like half a million

Grant (SSSF)
Joined: 28 Mar 20
Posts: 1615
Credit: 16,543,924
RAC: 3,224
Message 109735 - Posted: 17 Sep 2024, 5:05:18 UTC
Last modified: 17 Sep 2024, 5:06:14 UTC

Is anyone getting errors with these Tasks, within a minute or so, with this in the Stderr output?

<core_client_version>8.0.2</core_client_version>
<![CDATA[
<message>
Incorrect function.
 (0x1) - exit code 1 (0x1)</message>
<stderr_txt>
command: projects/boinc.bakerlab.org_rosetta/rosetta_beta_6.06_windows_x86_64.exe @srmpnn12_10_hallucinated_127_36_dldesign_0_cycle0.flags -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -mute all -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 3384584
Extracting in slot directory: minirosetta_database.zip
Using database: minirosetta_database
Cannot find database: minirosetta_database

</stderr_txt>
]]>


Try resetting the Project.
Once again, there is an issue with where things are, and where your existing installation actually has them (or not).
One of my systems started processing with no problems; the other produced nothing but errors until resetting the project sorted it out.
Grant
Darwin NT
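
For anyone wanting to check their own failed Tasks for the same symptom, here is a minimal sketch. It only looks for the "Cannot find database:" line from the Stderr output quoted above, and it builds (without running) a `boinccmd --project URL reset` command line. The function names are made up for this example, and the project URL shown in the comment is an assumption based on the `boinc.bakerlab.org_rosetta` directory name; use whatever URL your BOINC Manager lists for the project.

```python
import re

# Pattern taken from the Stderr output quoted in the post above.
MISSING_DB = re.compile(r"^Cannot find database:\s*(\S+)", re.MULTILINE)


def missing_database(stderr_text):
    """Return the database name the task could not find, or None."""
    match = MISSING_DB.search(stderr_text)
    return match.group(1) if match else None


def reset_command(project_url):
    """Build, but do not run, a boinccmd call that resets one project."""
    return ["boinccmd", "--project", project_url, "reset"]


sample = (
    "Extracting in slot directory: minirosetta_database.zip\n"
    "Using database: minirosetta_database\n"
    "Cannot find database: minirosetta_database\n"
)

if missing_database(sample):
    # Assumed URL, not confirmed in this thread:
    # https://boinc.bakerlab.org/rosetta/
    print(" ".join(reset_command("https://boinc.bakerlab.org/rosetta/")))
```

Resetting the project from BOINC Manager does the same job; the command-line form is just handy on headless systems.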

Sid Celery
Joined: 11 Feb 08
Posts: 2062
Credit: 40,473,890
RAC: 1,510
Message 109736 - Posted: 17 Sep 2024, 16:26:37 UTC - in response to Message 109735.  

Is anyone getting errors with these Tasks, within a minute or so, with this in the Stderr output?

<core_client_version>8.0.2</core_client_version>
<![CDATA[
<message>
Incorrect function.
 (0x1) - exit code 1 (0x1)</message>
<stderr_txt>
command: projects/boinc.bakerlab.org_rosetta/rosetta_beta_6.06_windows_x86_64.exe @srmpnn12_10_hallucinated_127_36_dldesign_0_cycle0.flags -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -mute all -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 3384584
Extracting in slot directory: minirosetta_database.zip
Using database: minirosetta_database
Cannot find database: minirosetta_database

</stderr_txt>
]]>


Try resetting the Project.
Once again, there is an issue with where things are, and where your existing installation actually has them (or not).
One of my systems started processing with no problems; the other produced nothing but errors until resetting the project sorted it out.

No. And I think the fact that one of your systems works fine and the other doesn't backs that up.
Why it should be happening with one and not the other, I have no idea.

Grant (SSSF)
Joined: 28 Mar 20
Posts: 1615
Credit: 16,543,924
RAC: 3,224
Message 109739 - Posted: 17 Sep 2024, 22:33:36 UTC - in response to Message 109736.  

Why it should be happening with one and not the other, I have no idea.
Neither do I, but it has been a recurring problem over at Ralph (when it's working, which it isn't again, and when it has work).
Several times it's been necessary to reset the project to stop errors occurring because the updated application doesn't have all the files it needs, or it's looking for them in the wrong place.

Both systems have the same hardware (CPU, motherboard), similar GPUs (RTX 2060 & RTX 2060 Super), same video driver, same AV software, same OS & updates, same version of BOINC, same projects, same configuration settings.
They are the same. Yet weirdness continues to occur.
Grant
Darwin NT

kotenok2000
Joined: 22 Feb 11
Posts: 256
Credit: 460,340
RAC: 269
Message 109740 - Posted: 17 Sep 2024, 22:38:00 UTC - in response to Message 109739.  

Compare project directories then.
Copy both to a USB HDD and then compare with WinMerge.
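
If WinMerge isn't to hand, the same comparison can be roughed out with Python's standard library. This is a minimal sketch, not a replacement for a proper diff tool: it lists files present on only one side and files whose contents differ, comparing byte for byte. The function name and the example paths in the comment are hypothetical.

```python
import filecmp
from pathlib import Path


def compare_dirs(left, right):
    """Compare two directory trees.

    Returns three sorted lists of relative paths (as strings):
    files only in `left`, files only in `right`, and files present
    in both whose contents differ.
    """
    left, right = Path(left), Path(right)
    left_files = {p.relative_to(left) for p in left.rglob("*") if p.is_file()}
    right_files = {p.relative_to(right) for p in right.rglob("*") if p.is_file()}

    only_left = sorted(p.as_posix() for p in left_files - right_files)
    only_right = sorted(p.as_posix() for p in right_files - left_files)
    # shallow=False forces a content comparison rather than trusting
    # file size and modification time alone.
    differing = sorted(
        p.as_posix()
        for p in left_files & right_files
        if not filecmp.cmp(left / p, right / p, shallow=False)
    )
    return only_left, only_right, differing


# Example call with hypothetical copies of each system's project directory:
# only_a, only_b, differing = compare_dirs("E:/system_a/rosetta", "E:/system_b/rosetta")
```

A file that shows up on the working system but not on the failing one would be the first thing to look at.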

Grant (SSSF)
Joined: 28 Mar 20
Posts: 1615
Credit: 16,543,924
RAC: 3,224
Message 109741 - Posted: 18 Sep 2024, 0:23:30 UTC - in response to Message 109740.  

Compare project directories then.
Copy both to a USB HDD and then compare with WinMerge.
Too late now, but something to think about if it occurs again on one system and not the other.
Grant
Darwin NT

©2024 University of Washington
https://www.bakerlab.org