Statistics consultation

Message boards : Number crunching : Statistics consultation

To post messages, you must log in.

AuthorMessage
Trotador

Send message
Joined: 30 May 09
Posts: 108
Credit: 291,214,977
RAC: 2
Message 65552 - Posted: 14 Mar 2010, 18:53:02 UTC

Hi,

I've noted that Rosetta's statistics pages do not allow selecting validated, in progress, errored or pending validation tasks as in other BOINC projects. Is it on purpose? Could it be added? I think it would be a very useful tool to review and check how everything goes.

Sorry if the answer is already available somewhere but I did not manage to find it.

regards,
ID: 65552 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2117
Credit: 41,159,202
RAC: 15,498
Message 65554 - Posted: 15 Mar 2010, 2:56:38 UTC

Click on your name in this thread, click again on the word "View" alongside your computers, then click "Results" against each computer you have.

I notice you have quite a few Client Errors. If you click on the "Task ID" you'll see the output file and can usually work out what caused the error. Why the error was caused is another matter.

You'll rarely find tasks pending validations as they're usually dealt with just a few minutes after upload, but if there are any they'll be listed in your results screen too.
ID: 65554 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Trotador

Send message
Joined: 30 May 09
Posts: 108
Credit: 291,214,977
RAC: 2
Message 65557 - Posted: 15 Mar 2010, 6:16:20 UTC

Yes the reason of my post is to investigate those errors but i still can't get to show just tasks with errors in one page and i have to browse through all pages to find the ones that errored which is rather unconfortable you know.
ID: 65557 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 65559 - Posted: 15 Mar 2010, 12:30:49 UTC

The views you describe are incorporated in to BOINC server code at a version that is newer then is currently installed at Rosetta@home. So, it's not that anything was disabled here. It is just that when the server was last upgraded, this feature was not yet available.

...and before you say "then upgrade the server", please keep in mind that it isn't simply a matter of click a button and your server is upgraded. All of the customizations for Rosetta@home have to be manually incorporated into any new release. And every time you do that, you risk error. So upgrades tend to be minimized as much as possible.
Rosetta Moderator: Mod.Sense
ID: 65559 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2117
Credit: 41,159,202
RAC: 15,498
Message 65563 - Posted: 16 Mar 2010, 1:19:12 UTC - in response to Message 65557.  

Yes the reason of my post is to investigate those errors but I still can't get to show just tasks with errors in one page and I have to browse through all pages to find the ones that errored which is rather uncomfortable you know.

I misunderstood your question first time round, sorry. Still, it only took me two minutes to review all your tasks and you only need to check out the newer tasks each day. The results page only seems to show the last 7 days (or so) reported so it'll never become that long a list.
ID: 65563 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Trotador

Send message
Joined: 30 May 09
Posts: 108
Credit: 291,214,977
RAC: 2
Message 65585 - Posted: 19 Mar 2010, 9:54:37 UTC
Last modified: 19 Mar 2010, 9:56:02 UTC

Yes, as simple as that..., well I've done it and the summary is below.

Two type of errors, one that made fail all replicas, not only mine, and seem related with messing during the compression of the file while the second one on the other hand seems well related to one of my machines and read as "access violation" to certain addresses

Any advise on what look in my hardware/software configuration will be much appreciated.

It is a Q9400@3,2 GHz, XP 32 bits SP3, first idea is try to give the cPU/memory more voltage it it could be hardware related and or look for memory modules failures.

Task ID 323751279
Name t300__boinc_filtered_loopbuild_threading_cst_lb_tex_IGNORE_THE_REST_16900_8712_0
Reason: Access Violation (0xc0000005) at address 0x008D8D30 write attempt to address 0x00000000

Task ID 323511118
Name ina2inaN_to_NOE__18638_4331_0

Task ID 323295986
Name abinitio_withrelax_nodisulf_nohomfrag_cst0.01_129_B_1louA_SAVE_ALL_OUT_17407_1802_0
Reason: Access Violation (0xc0000005) at address 0x006E8DA1 write attempt to address 0x9503C59E

Task ID 324064377
Name quota_6_1tif__BOINC_abrelax.v1_SAVE_ALL_OUT_18319_981_0
BOINC:: Error reading and gzipping output datafile: default.out

Task ID 324064378
Name abinitio_withrelax_nodisulf_nohomfrag_nonative_cst0.1_129_B_1srrA_SAVE_ALL_OUT_17951_329_0
Reason: Access Violation (0xc0000005) at address 0x44F58BC9 read attempt to address 0x44F58BC9

Task ID 324064382
Name quota_6_rama_1tul__BOINC_abrelax.v1_SAVE_ALL_OUT_18320_986_0
Reason: Access Violation (0xc0000005) at address 0xE046DBD0 read attempt to address 0xE046DBD0

Task ID 324064394
Name Rossmann3x3_abinitio_SAVE_ALL_OUT_design_k019_003_18632_1842_0
Reason: Access Violation (0xc0000005) at address 0x0067E0E8 read attempt to address 0x00000008

Task ID 324245070 -****
Name gunn_fragments_SAVE_ALL_OUT_-1wtyA__18642_1703_0
BOINC:: Error reading and gzipping output datafile: default.out

Task ID 324254752
Name gunn_fragments_SAVE_ALL_OUT_-1npuA__18642_441_0
Reason: Access Violation (0xc0000005) at address 0x004C8C76 read attempt to address 0x00000010

Task ID 324254851
Name tyrsim_3gbn_1tyj_Protein_interface_design_25Feb2010_18415_595_0
Reason: Access Violation (0xc0000005) at address 0x00A93166 read attempt to address 0x7B806820

Task ID 324254855
Name t382__boinc_filtered_loopbuild_threading_cst_all_tex_IGNORE_THE_REST_16902_9038_0
Reason: Access Violation (0xc0000005) at address 0xE046A510 read attempt to address 0xE046A510

Task ID 324276886-****
Name gunn_fragments_SAVE_ALL_OUT_-1wtyA__18642_1755_0
BOINC:: Error reading and gzipping output datafile: default.out

Task ID 324497156
Name lr5_dun08_slow_corrections_it03_A_rlbd_1bm8_SAVE_ALL_OUT_IGNORE_THE_REST_DECOY_18090_1175_0
Reason: Access Violation (0xc0000005) at address 0xE044EDF0 read attempt to address 0xE044EDF0

Task ID 324500366
Name quota_6_rama_1bk2__BOINC_abrelax.v1_SAVE_ALL_OUT_18320_1143_0
Reason: Access Violation (0xc0000005) at address 0x008D8B30 read attempt to address 0x0EDC5040

Task ID 324903954
Name lr5_combined_torsion_it01_run01_A_rlbn_1a32_SAVE_ALL_OUT_IGNORE_THE_REST_NATIVE_NOCON_18670_31_0
Reason: Access Violation (0xc0000005) at address 0xE059E0B0 read attempt to address 0xE059E0B0

Task ID 324905512
Name Rossmann3x3_abinitio_SAVE_ALL_OUT_design_k023_004_18666_837_0
Reason: Access Violation (0xc0000005) at address 0x006E8DA1 write attempt to address 0x8BFC718F

Task ID 325023482
Name lr5_combined_torsion_it01_run01_A_rlbd_2vik_SAVE_ALL_OUT_IGNORE_THE_REST_DECOY_18669_634_0
Reason: Access Violation (0xc0000005) at address 0x008D8D30 write attempt to address 0x00000000

Task ID 325163896
Name lr5_combined_torsion_it01_run01_A_rlbd_2i9c_SAVE_ALL_OUT_IGNORE_THE_REST_DECOY_18669_1118_0
Reason: Access Violation (0xc0000005) at address 0xE044EDF0 read attempt to address 0xE044EDF0

Task ID 325248616
Name t362__boinc_filtered_loopbuild_threading_cst_lb_tex_IGNORE_THE_REST_16900_9623_0
Reason: Access Violation (0xc0000005) at address 0xE046DBD0 read attempt to address 0xE046DBD0

Task ID 325288480
Name gunn_fragments_SAVE_ALL_OUT_-1vls___18642_1284_0

Task ID 325288481
Name gunn_fragments_SAVE_ALL_OUT_-2hnfA__18642_1172_0
Reason: Access Violation (0xc0000005) at address 0x006E8DA1 write attempt to address 0xA14A60EC
ID: 65585 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Trotador

Send message
Joined: 30 May 09
Posts: 108
Credit: 291,214,977
RAC: 2
Message 65586 - Posted: 19 Mar 2010, 10:24:03 UTC

I'm starting with a HD defragmentation

regards
ID: 65586 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1831
Credit: 119,526,853
RAC: 9,592
Message 65587 - Posted: 19 Mar 2010, 10:51:39 UTC

0xc0000005 might be due to a faulty driver but I'd try testing your RAM and PSU first (testing a PSU isn't easy though).

If you haven't already, try memtest x86+: http://www.memtest.org/#downiso
and
Prime95 (I usually try both the 2nd (max heat, power etc) and 3rd (blend) options: http://www.mersenne.org/freesoft/ (the download link is under step 3 - ignore the rest!)
ID: 65587 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Trotador

Send message
Joined: 30 May 09
Posts: 108
Credit: 291,214,977
RAC: 2
Message 65588 - Posted: 19 Mar 2010, 12:03:11 UTC - in response to Message 65587.  

Yes prime95 and linx were used to test stability under heavy CPU/memory load.

I've finished defragmentation, let's how it behaves in the next hours.

regards,

ID: 65588 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Trotador

Send message
Joined: 30 May 09
Posts: 108
Credit: 291,214,977
RAC: 2
Message 65594 - Posted: 19 Mar 2010, 18:09:59 UTC - in response to Message 65588.  
Last modified: 19 Mar 2010, 18:10:13 UTC

Yes prime95 and linx were used to test stability under heavy CPU/memory load.

I've finished defragmentation, let's how it behaves in the next hours.

regards,



Desfragmentation did not did it. Another one errored.

Upping voltages.

regards
ID: 65594 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2117
Credit: 41,159,202
RAC: 15,498
Message 65595 - Posted: 20 Mar 2010, 3:54:06 UTC - in response to Message 65585.  

Any advise on what look in my hardware/software configuration will be much appreciated.

Unfortunately nothing that's sure to solve the problem. But when I see problems on machines that run perfectly on other machines I update my software to the latest stable version and look to uninstall any program that I don't use, in case there are unexpected conflicts.

I notice you use Boinc 6.2.28 when the latest is 6.10.18. Update this, then use Windows update, then something like Filehippo Update checker and perhaps run Secunia Software Inspector. A full virus and malware scan may also be useful and finally chkdsk.

If you're feeling adventurous you can review your startup programs and active services as well.

This may not solve the problem, but it will ensure you have a secure baseline machine.
ID: 65595 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Number crunching : Statistics consultation



©2024 University of Washington
https://www.bakerlab.org