Validator down... :-(

dcdc
Joined: 3 Nov 05
Posts: 1831
Credit: 119,526,853
RAC: 9,592
Message 71495 - Posted: 25 Oct 2011, 22:42:34 UTC - in response to Message 71494.  

Everything is normal here as of now:
uploads, downloads, validation, no pendings.
Though from what you are saying here, something else is not.
There's certainly something not right, not only with Rosetta@Home but with RALPH@Home as well.
On R@H, WUs are uploaded and reported but then just sit as "pending". This was working at some point yesterday.
And on RALPH@Home, you cannot upload any finished WUs due to a "can not attach to shared memory" error on the server(s).

Don't know how many resources Rosetta@Home and RALPH@Home share, but it looks to me as if whatever they fixed yesterday isn't in fact working properly...

Ralf


The validator is probably working through a backlog of results after their downtime, hence the pending status of tasks.
ID: 71495 · Rating: 0
TPCBF
Joined: 29 Nov 10
Posts: 111
Credit: 5,070,625
RAC: 2,159
Message 71497 - Posted: 26 Oct 2011, 1:50:49 UTC - in response to Message 71495.  

The validator is probably working through a backlog of results after their downtime, hence the pending status of tasks.
That backlog would have existed earlier today as well and all the previously pending WUs had cleared out but now they just keep piling up again. If it is "just" a backlog, at least one or two of those should be validated once in a while. But none of them has in more than 12h...

The silence of the sysadmins is really deafening... :-(

Ralf
ID: 71497 · Rating: 0
TPCBF
Joined: 29 Nov 10
Posts: 111
Credit: 5,070,625
RAC: 2,159
Message 71505 - Posted: 27 Oct 2011, 4:34:40 UTC

Well, WUs are being validated, but at a snail's pace right now. Usually, they would not sit more than 5-10 minutes as pending; now I always have about a dozen that will sit for about a day before being validated...

Ralf
ID: 71505 · Rating: 0
dcdc
Joined: 3 Nov 05
Posts: 1831
Credit: 119,526,853
RAC: 9,592
Message 71508 - Posted: 27 Oct 2011, 9:26:27 UTC - in response to Message 71461.  

Well, never a dull moment...

Does anyone know what the issue is here or is this (just) another "it's weekend and no sysadmin is around" kind of typical R@H thing again? :-(

Ralf


It's slow but it isn't a problem.
ID: 71508 · Rating: 0
TPCBF
Joined: 29 Nov 10
Posts: 111
Credit: 5,070,625
RAC: 2,159
Message 71511 - Posted: 27 Oct 2011, 15:44:03 UTC - in response to Message 71508.  

Well, never a dull moment...

Does anyone know what the issue is here or is this (just) another "it's weekend and no sysadmin is around" kind of typical R@H thing again? :-(

Ralf


It's slow but it isn't a problem.
Now it is only slow, but that was not the case when I wrote my original message days ago, which you are (mis)quoting here.

Ralf
ID: 71511 · Rating: 0
Cutchet Salvador
Joined: 1 Feb 10
Posts: 17
Credit: 10,690,439
RAC: 0
Message 71512 - Posted: 27 Oct 2011, 18:44:46 UTC - in response to Message 71511.  
Last modified: 27 Oct 2011, 18:46:05 UTC

I do not understand this scorn towards the R@H contributors, who work for free and at our own expense.
I do not understand how anybody with a university education can show such contempt for those who make possible research that would take many years without our collaboration.
I believe they do not understand public relations and the benefits it brings to the community.
They have not deigned, as so many other times, to open their mouths to give even the most minimal explanation or to apologize.
Nulla ethica sine aestethica
ID: 71512 · Rating: 0
dcdc
Joined: 3 Nov 05
Posts: 1831
Credit: 119,526,853
RAC: 9,592
Message 71515 - Posted: 27 Oct 2011, 21:29:05 UTC - in response to Message 71511.  

Well, never a dull moment...

Does anyone know what the issue is here or is this (just) another "it's weekend and no sysadmin is around" kind of typical R@H thing again? :-(

Ralf


It's slow but it isn't a problem.
Now it is only slow, but that was not the case when I wrote my original message days ago, which you are (mis)quoting here.

Ralf

Sorry, I meant to quote:


Well, WUs are being validated, but at a snail's pace right now. Usually, they would not sit more than 5-10 minutes as pending; now I always have about a dozen that will sit for about a day before being validated...

Ralf


I understand that you were adding an update to your previous comment - I was pointing out to any outsiders who might be reading and might not understand the setup that the validator being slow isn't a technical problem.
ID: 71515 · Rating: 0
.clair.
Joined: 2 Jan 07
Posts: 274
Credit: 26,399,595
RAC: 0
Message 71527 - Posted: 29 Oct 2011, 2:42:03 UTC

At the moment it seems the validators are swamped with failed work units, and this is slowing the whole process down to a snail's pace.
In the compute errors thread it is stated that the bad work units have been removed from the server, though a lot of them are already out here with us, crashing and doing other "interesting" things.
It will take a little time to clear them and get back to normal.
ID: 71527 · Rating: 0
granno21
Joined: 19 Jan 06
Posts: 6
Credit: 314,667
RAC: 0
Message 71535 - Posted: 30 Oct 2011, 9:14:16 UTC - in response to Message 71527.  

At the moment it seems the validators are swamped with failed work units, and this is slowing the whole process down to a snail's pace.
In the compute errors thread it is stated that the bad work units have been removed from the server, though a lot of them are already out here with us, crashing and doing other "interesting" things.
It will take a little time to clear them and get back to normal.


Is there a time frame for when the validator will be back to its normal processing time?


ID: 71535 · Rating: 0
[PUGLIA] kidkidkid3
Joined: 14 Sep 10
Posts: 11
Credit: 2,348,063
RAC: 0
Message 71721 - Posted: 2 Dec 2011, 20:44:10 UTC

Hi all,
all Rosetta status indicators are "green", but I have a lot of WUs pending.
Does anyone have the same problem?
Thanks in advance for your help.

I'm an old Italian programmer (do you know cards?). Now I recycle/repair my friends' old PCs, and they are revived for research.
A long trip begins with a little step ...
ID: 71721 · Rating: 0
rochester new york
Joined: 2 Jul 06
Posts: 2842
Credit: 2,020,043
RAC: 0
Message 71722 - Posted: 2 Dec 2011, 20:51:13 UTC - in response to Message 71721.  
Last modified: 2 Dec 2011, 20:55:48 UTC

Hi all,
all Rosetta status indicators are "green", but I have a lot of WUs pending.
Does anyone have the same problem?
Thanks in advance for your help.


None here... a small handful of validator errors
ID: 71722 · Rating: 0
yoner
Joined: 17 Sep 05
Posts: 10
Credit: 2,581,874
RAC: 0
Message 71726 - Posted: 3 Dec 2011, 2:16:26 UTC

I also have a bunch of work units that are waiting to be validated... about a day's worth or so...
ID: 71726 · Rating: 0
P . P . L .
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 71727 - Posted: 3 Dec 2011, 4:39:49 UTC

My newly returned tasks are getting validated within minutes, but I have seven tasks from when this started the other day still stuck!

ID: 71727 · Rating: 0
[PUGLIA] kidkidkid3
Joined: 14 Sep 10
Posts: 11
Credit: 2,348,063
RAC: 0
Message 71734 - Posted: 3 Dec 2011, 23:39:11 UTC

Good morning ... Vietnam !
I have 27 WUs pending with 667.77 credits claimed.
A lot of WUs have claimed credit equal to granted credit.
What happened?
Thanks in advance for your help.
I'm an old Italian programmer (do you know cards?). Now I recycle/repair my friends' old PCs, and they are revived for research.
A long trip begins with a little step ...
ID: 71734 · Rating: 0
yoner
Joined: 17 Sep 05
Posts: 10
Credit: 2,581,874
RAC: 0
Message 71737 - Posted: 4 Dec 2011, 4:24:37 UTC

I'm now up to about 2 days' worth of pending work units - about 6500 credits' worth...

This is output coming from multiple computers...

What's wrong?
ID: 71737 · Rating: 0
Leonator
Joined: 27 Aug 06
Posts: 2
Credit: 45,712,177
RAC: 0
Message 71742 - Posted: 4 Dec 2011, 8:05:35 UTC

I have 11,714.14 credits that have been waiting for approval since November 30, and their number is growing. (((((((((((
ID: 71742 · Rating: 0
Leonator
Joined: 27 Aug 06
Posts: 2
Credit: 45,712,177
RAC: 0
Message 71761 - Posted: 7 Dec 2011, 15:11:26 UTC - in response to Message 71742.  

I have 11,714.14 credits, (((((((((((

24k+ credits... WTF?!

ID: 71761 · Rating: 0
luisr
Joined: 11 Sep 06
Posts: 3
Credit: 1,039,929
RAC: 0
Message 71762 - Posted: 7 Dec 2011, 16:29:03 UTC

4,507 credits pending since 2 December
ID: 71762 · Rating: 0
luisr
Joined: 11 Sep 06
Posts: 3
Credit: 1,039,929
RAC: 0
Message 71770 - Posted: 9 Dec 2011, 4:48:18 UTC - in response to Message 71762.  

Finally, after 8 days, credits are validating...

4,507 credits pending since 2 December


ID: 71770 · Rating: 0



©2024 University of Washington
https://www.bakerlab.org