Client errors

Message boards : Number crunching : Client errors

To post messages, you must log in.

AuthorMessage
premier

Send message
Joined: 30 Dec 05
Posts: 14
Credit: 23,872,868
RAC: 0
Message 76864 - Posted: 23 Jun 2014, 10:44:53 UTC
Last modified: 23 Jun 2014, 10:49:53 UTC

Hi guys,

i am wondering, what's wrong with my PC. Recently i'm having lot of client errors, and I try to figure out what's wrong. Seems to me like it's somthing to hard disk related problem, but i am not sure. I have marked interesting parts:


<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
[2014- 6-23 1:29:22:] :: BOINC:: Initializing ... ok.
[2014- 6-23 1:29:22:] :: BOINC :: boinc_init()
BOINC:: Setting up shared resources ... ok.
BOINC:: Setting up semaphores ... ok.
BOINC:: Updating status ... ok.
BOINC:: Registering timer callback... ok.
BOINC:: Worker initialized successfully.
command: projects/boinc.bakerlab.org_rosetta/minirosetta_3.52_windows_x86_64.exe -abinitio::fastrelax 1 -ex2aro 1 -frag3 00001.200.3mers -in:file:native 00001.pdb -silent_gz 1 -frag9 00001.200.9mers -out:file:silent default.out -ex1 1 -abinitio::rsd_wt_loop 0.5 -relax::default_repeats 15 -abinitio::use_filters false -abinitio::increase_cycles 10 -abinitio::rsd_wt_helix 0.5 -abinitio::rg_reweight 0.5 -in:file:boinc_wu_zip benchmark_0008_master_babd28351e57425d68b32333be5a837fb7cd5818_ploops_42_input_0002_no_lig_fragments_contact_opt_iteration_1_0aeeba3d61a044ac8d3d2107721e4aee_data.zip -out:file:silent default.out -silent_gz -mute all -nstruct 10000 -cpu_run_time 10800 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -run::rng mt19937 -constant_seed -jran 2970556
Registering options..
Registered extra options.
Initializing broker options ...
Registered extra options.
Initializing core...
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Setting WU description ...
Unpacking zip data: ../../projects/boinc.bakerlab.org_rosetta/minirosetta_database_3d2618f.zip
Unpacking WU data ...
Unpacking data: ../../projects/boinc.bakerlab.org_rosetta/benchmark_0008_master_babd28351e57425d68b32333be5a837fb7cd5818_ploops_42_input_0002_no_lig_fragments_contact_opt_iteration_1_0aeeba3d61a044ac8d3d2107721e4aee_data.zip
Setting database description ...
Setting up checkpointing ...
Setting up graphics native ...
Setting up folding (abrelax) ...
Beginning folding (abrelax) ...
BOINC:: Worker startup.
Starting watchdog...
Watchdog active.
Starting work on structure: _00001
Starting work on structure: _00002
Starting work on structure: _00003
Starting work on structure: _00004
Starting work on structure: _00005
Starting work on structure: _00006
Starting work on structure: _00007
Starting work on structure: _00008
Starting work on structure: _00009
Starting work on structure: _00010
Starting work on structure: _00011
Starting work on structure: _00012
Starting work on structure: _00013
Starting work on structure: _00014
WARNING! attempt to create gzipped file ../../projects/boinc.bakerlab.org_rosetta/benchmark_0008_master_babd28351e57425d68b32333be5a837fb7cd5818_ploops_42_input_0002_no_lig_fragments_contact_opt_iteration_1_0aeeba3d61a044ac8d3d2107721e4aee_fold_SAVE_ALL_OUT_169893_2693_0_0 failed.
======================================================
DONE :: 1 starting structures 10368.2 cpu seconds
This process generated 14 decoys from 14 attempts
======================================================
BOINC :: WS_max 5.27295e+008

BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down cleanly ...
called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
<file_name>benchmark_0008_master_babd28351e57425d68b32333be5a837fb7cd5818_ploops_42_input_0002_no_lig_fragments_contact_opt_iteration_1_0aeeba3d61a044ac8d3d2107721e4aee_fold_SAVE_ALL_OUT_169893_2693_0_0</file_name>
<error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>


and another one:


<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code -1 (0xffffffff)
</message>
<stderr_txt>
[2014- 6-23 5:37:34:] :: BOINC:: Initializing ... ok.
[2014- 6-23 5:37:34:] :: BOINC :: boinc_init()
BOINC:: Setting up shared resources ... ok.
BOINC:: Setting up semaphores ... ok.
BOINC:: Updating status ... ok.
BOINC:: Registering timer callback... ok.
BOINC:: Worker initialized successfully.
command: projects/boinc.bakerlab.org_rosetta/minirosetta_3.52_windows_x86_64.exe -out:file:silent default.out -in:file:s 00001.pdb -frag3 00001.200.3mers -in:file:native 00001.pdb -frag9 00001.200.9mers -silent_gz 1 -ex2aro 1 -relax::default_repeats 15 -in:file:fullatom 1 -run:protocol relax -ex1 1 -in:file:boinc_wu_zip benchmark_0008_master_babd28351e57425d68b32333be5a837fb7cd5818_binchen_picked_2_12_0128_contact_opt_iteration_3_8409e15a086f4e0e85ea4ce1db5d01c9_data.zip -out:file:silent default.out -silent_gz -mute all -in:file:native 00001.pdb -in:file:fullatom -in:file:s 00001.pdb -nstruct 10000 -cpu_run_time 10800 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -run::rng mt19937 -constant_seed -jran 2319732
Registering options..
Registered extra options.
Initializing broker options ...
Registered extra options.
Initializing core...
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Setting WU description ...
Unpacking zip data: ../../projects/boinc.bakerlab.org_rosetta/minirosetta_database_3d2618f.zip
Unpacking WU data ...
Unpacking data: ../../projects/boinc.bakerlab.org_rosetta/benchmark_0008_master_babd28351e57425d68b32333be5a837fb7cd5818_binchen_picked_2_12_0128_contact_opt_iteration_3_8409e15a086f4e0e85ea4ce1db5d01c9_data.zip
Setting database description ...
Setting up checkpointing ...
Setting up graphics native ...
BOINC:: Worker startup.
Starting watchdog...
Watchdog active.
[2014- 6-23 7: 6:28:] :: BOINC:: Initializing ... ok.
[2014- 6-23 7: 6:28:] :: BOINC :: boinc_init()
BOINC:: Setting up shared resources ... ok.
BOINC:: Setting up semaphores ... ok.
BOINC:: Updating status ... ok.
BOINC:: Registering timer callback... ok.
BOINC:: Worker initialized successfully.
command: projects/boinc.bakerlab.org_rosetta/minirosetta_3.52_windows_x86_64.exe -out:file:silent default.out -in:file:s 00001.pdb -frag3 00001.200.3mers -in:file:native 00001.pdb -frag9 00001.200.9mers -silent_gz 1 -ex2aro 1 -relax::default_repeats 15 -in:file:fullatom 1 -run:protocol relax -ex1 1 -in:file:boinc_wu_zip benchmark_0008_master_babd28351e57425d68b32333be5a837fb7cd5818_binchen_picked_2_12_0128_contact_opt_iteration_3_8409e15a086f4e0e85ea4ce1db5d01c9_data.zip -out:file:silent default.out -silent_gz -mute all -in:file:native 00001.pdb -in:file:fullatom -in:file:s 00001.pdb -nstruct 10000 -cpu_run_time 10800 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -run::rng mt19937 -constant_seed -jran 2319732
Registering options..
Registered extra options.
Initializing broker options ...
Registered extra options.
Initializing core...
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Setting WU description ...
Unpacking zip data: ../../projects/boinc.bakerlab.org_rosetta/minirosetta_database_3d2618f.zip
Unpacking WU data ...
Unpacking data: ../../projects/boinc.bakerlab.org_rosetta/benchmark_0008_master_babd28351e57425d68b32333be5a837fb7cd5818_binchen_picked_2_12_0128_contact_opt_iteration_3_8409e15a086f4e0e85ea4ce1db5d01c9_data.zip
Setting database description ...
Setting up checkpointing ...
Setting up graphics native ...
BOINC:: Worker startup.
Starting watchdog...
Watchdog active.
Continuing computation from checkpoint: chk_00001_00034_FastRelax__chk1_fa ... success!
Continuing computation from checkpoint: chk_00001_00034_FastRelax__chk2_fa ... success!
Continuing computation from checkpoint: chk_00001_00034_FastRelax__chk3_fa ... success!
Continuing computation from checkpoint: chk_00001_00034_FastRelax__chk4_fa ... success!
Continuing computation from checkpoint: chk_00001_00034_FastRelax__chk5_fa ... success!
Continuing computation from checkpoint: chk_00001_00034_FastRelax__chk6_fa ... success!
Continuing computation from checkpoint: chk_00001_00034_FastRelax__chk7_fa ... success!
Continuing computation from checkpoint: chk_00001_00034_FastRelax__chk8_fa ... success!
Continuing computation from checkpoint: chk_00001_00034_FastRelax__chk9_fa ... success!
Continuing computation from checkpoint: chk_00001_00034_FastRelax__chk10_fa ... success!
Continuing computation from checkpoint: chk_00001_00034_FastRelax__chk11_fa ... success!
Continuing computation from checkpoint: chk_00001_00034_FastRelax__chk12_fa ... success!
Continuing computation from checkpoint: chk_00001_00034_FastRelax__chk13_fa ... success!
Continuing computation from checkpoint: chk_00001_00034_FastRelax__chk14_fa ... success!
Continuing computation from checkpoint: chk_00001_00034_FastRelax__chk15_fa ... success!
Continuing computation from checkpoint: chk_00001_00034_FastRelax__chk16_fa ... success!
Continuing computation from checkpoint: chk_00001_00034_FastRelax__chk17_fa ... success!
Continuing computation from checkpoint: chk_00001_00034_FastRelax__chk18_fa ... success!
Continuing computation from checkpoint: chk_00001_00034_FastRelax__chk19_fa ... success!
Continuing computation from checkpoint: chk_00001_00034_FastRelax__chk20_fa ... success!
Continuing computation from checkpoint: chk_00001_00034_FastRelax__chk21_fa ... success!
Continuing computation from checkpoint: chk_00001_00034_FastRelax__chk22_fa ... success!
[2014- 6-23 7:57:30:] :: BOINC:: Initializing ... ok.
[2014- 6-23 7:57:30:] :: BOINC :: boinc_init()
BOINC:: Setting up shared resources ... ok.
BOINC:: Setting up semaphores ... ok.
BOINC:: Updating status ... ok.
BOINC:: Registering timer callback... ok.
BOINC:: Worker initialized successfully.
command: projects/boinc.bakerlab.org_rosetta/minirosetta_3.52_windows_x86_64.exe -out:file:silent default.out -in:file:s 00001.pdb -frag3 00001.200.3mers -in:file:native 00001.pdb -frag9 00001.200.9mers -silent_gz 1 -ex2aro 1 -relax::default_repeats 15 -in:file:fullatom 1 -run:protocol relax -ex1 1 -in:file:boinc_wu_zip benchmark_0008_master_babd28351e57425d68b32333be5a837fb7cd5818_binchen_picked_2_12_0128_contact_opt_iteration_3_8409e15a086f4e0e85ea4ce1db5d01c9_data.zip -out:file:silent default.out -silent_gz -mute all -in:file:native 00001.pdb -in:file:fullatom -in:file:s 00001.pdb -nstruct 10000 -cpu_run_time 10800 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -run::rng mt19937 -constant_seed -jran 2319732
Registering options..
Registered extra options.
Initializing broker options ...
Registered extra options.
Initializing core...
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Setting WU description ...
Unpacking zip data: ../../projects/boinc.bakerlab.org_rosetta/minirosetta_database_3d2618f.zip
Unpacking WU data ...
Unpacking data: ../../projects/boinc.bakerlab.org_rosetta/benchmark_0008_master_babd28351e57425d68b32333be5a837fb7cd5818_binchen_picked_2_12_0128_contact_opt_iteration_3_8409e15a086f4e0e85ea4ce1db5d01c9_data.zip

CreateFile error 32 when trying set file time
error: cannot create ./00001.200.9mers
error: cannot create ./00001.pdb

Setting database description ...
Setting up checkpointing ...
Setting up graphics native ...

ERROR: Cannot open PDB file "00001.pdb"
ERROR:: Exit from: ......srccoreimport_poseimport_pose.cc line: 216


</stderr_txt>
]]>


and another


<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
[2014- 6-22 4:52:23:] :: BOINC:: Initializing ... ok.
[2014- 6-22 4:52:23:] :: BOINC :: boinc_init()
BOINC:: Setting up shared resources ... ok.
BOINC:: Setting up semaphores ... ok.
BOINC:: Updating status ... ok.
BOINC:: Registering timer callback... ok.
BOINC:: Worker initialized successfully.
command: projects/boinc.bakerlab.org_rosetta/minirosetta_3.52_windows_x86_64.exe -abinitio::fastrelax 1 -ex2aro 1 -frag3 00001.200.3mers -in:file:native 00001.pdb -silent_gz 1 -frag9 00001.200.9mers -out:file:silent default.out -ex1 1 -abinitio::rsd_wt_loop 0.5 -relax::default_repeats 15 -abinitio::use_filters false -abinitio::increase_cycles 10 -abinitio::rsd_wt_helix 0.5 -abinitio::rg_reweight 0.5 -in:file:boinc_wu_zip benchmark_0014_alex_metric_58c8eb0a169e5c49e1cef5331416f2432f6d8260_ploops_42_input_0002_no_lig_fragments_contact_opt_iteration_1_c9a5c0bc028747f194305301d0dfc23b_data.zip -out:file:silent default.out -silent_gz -mute all -nstruct 10000 -cpu_run_time 10800 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -run::rng mt19937 -constant_seed -jran 2550967
Registering options..
Registered extra options.
Initializing broker options ...
Registered extra options.
Initializing core...
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Setting WU description ...
Unpacking zip data: ../../projects/boinc.bakerlab.org_rosetta/minirosetta_database_3d2618f.zip
Unpacking WU data ...
Unpacking data: ../../projects/boinc.bakerlab.org_rosetta/benchmark_0014_alex_metric_58c8eb0a169e5c49e1cef5331416f2432f6d8260_ploops_42_input_0002_no_lig_fragments_contact_opt_iteration_1_c9a5c0bc028747f194305301d0dfc23b_data.zip
Setting database description ...
Setting up checkpointing ...
Setting up graphics native ...
Setting up folding (abrelax) ...
Beginning folding (abrelax) ...
BOINC:: Worker startup.
Starting watchdog...
Watchdog active.
Starting work on structure: _00001
Starting work on structure: _00002
Starting work on structure: _00003
Starting work on structure: _00004
Starting work on structure: _00005
Starting work on structure: _00006
Starting work on structure: _00007
Starting work on structure: _00008
Starting work on structure: _00009
Starting work on structure: _00010
Starting work on structure: _00011
Starting work on structure: _00012
Starting work on structure: _00013
Starting work on structure: _00014
Starting work on structure: _00015
WARNING! attempt to create gzipped file ../../projects/boinc.bakerlab.org_rosetta/benchmark_0014_alex_metric_58c8eb0a169e5c49e1cef5331416f2432f6d8260_ploops_42_input_0002_no_lig_fragments_contact_opt_iteration_1_c9a5c0bc028747f194305301d0dfc23b_fold_SAVE_ALL_OUT_170254_1122_0_0 failed.
======================================================
DONE :: 1 starting structures 10314.1 cpu seconds
This process generated 15 decoys from 15 attempts
======================================================
BOINC :: WS_max 5.32865e+008

BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down cleanly ...
called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
<file_name>benchmark_0014_alex_metric_58c8eb0a169e5c49e1cef5331416f2432f6d8260_ploops_42_input_0002_no_lig_fragments_contact_opt_iteration_1_c9a5c0bc028747f194305301d0dfc23b_fold_SAVE_ALL_OUT_170254_1122_0_0</file_name>
<error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>


I wouldn't bother that if only they were validate errors, but they are marked as client errors.
I have also noted, that I've been granted with credits for those tasks. So my question is.

Is it really something wrong with my PC or just something wrong with R@H?
ID: 76864 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Murasaki
Avatar

Send message
Joined: 20 Apr 06
Posts: 303
Credit: 511,418
RAC: 0
Message 76875 - Posted: 24 Jun 2014, 23:30:01 UTC

Several other users have reported recent client and validate errors in the Minirosetta 3.52 thread so it probably isn't your machine (unless you are seeing other problems as well).
ID: 76875 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
krypton
Volunteer moderator
Project developer
Project scientist

Send message
Joined: 16 Nov 11
Posts: 108
Credit: 2,164,309
RAC: 0
Message 76890 - Posted: 27 Jun 2014, 0:57:57 UTC

My guess is that the filesystem you are using has a length limit for file names. I know some Windows based machines have this.

I'll let the group know, not to submit jobs with such large names.

Thank you for reporting the error!
ID: 76890 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
premier

Send message
Joined: 30 Dec 05
Posts: 14
Credit: 23,872,868
RAC: 0
Message 76929 - Posted: 30 Jun 2014, 6:32:28 UTC
Last modified: 30 Jun 2014, 6:49:49 UTC

I'm operating at Windows 7 64 bit version, and yeah it has 260 byte limit and this could be it.

relative path is
../../projects/boinc.bakerlab.org_rosetta/benchmark_0008_master_babd28351e57425d68b32333be5a837fb7cd5818_ploops_42_input_0002_no_lig_fragments_contact_opt_iteration_1_0aeeba3d61a044ac8d3d2107721e4aee_fold_SAVE_ALL_OUT_169893_2693_0_0 = 224 chars + 1 <NUL>, but when we replace realative to absolute:

C:Program DataBOINCprojects/boinc.bakerlab.org_rosetta/benchmark_0008_master_babd28351e57425d68b32333be5a837fb7cd5818_ploops_42_input_0002_no_lig_fragments_contact_opt_iteration_1_0aeeba3d61a044ac8d3d2107721e4aee_fold_SAVE_ALL_OUT_169893_2693_0_0 it's 249+1 <NUL>, total 250 chars.

Here is what i found in network:


So it should work, but maybe there is different true name of the file (maybe there is tar.gz at the end? Then it would be 256 chars, still 4 bytes left :) Maybe tehre is something more.

Oh and one more thing, according to M$ site:

The Windows API has many functions that also have Unicode versions to permit an extended-length path for a maximum total path length of 32,767 characters. This type of path is composed of components separated by backslashes, each up to the value returned in the lpMaximumComponentLength parameter of the GetVolumeInformation function (this value is commonly 255 characters). To specify an extended-length path, use the "\?" prefix. For example, "\?D:very long path".

The "\?" prefix can also be used with paths constructed according to the universal naming convention (UNC). To specify such a path using UNC, use the "\?UNC" prefix. For example, "\?UNCservershare", where "server" is the name of the computer and "share" is the name of the shared folder. These prefixes are not used as part of the path itself. They indicate that the path should be passed to the system with minimal modification, which means that you cannot use forward slashes to represent path separators, or a period to represent the current directory, or double dots to represent the parent directory. Because you cannot use the "\?" prefix with a relative path, relative paths are always limited to a total of MAX_PATH characters.



I guess we need a client update :)
ID: 76929 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
krypton
Volunteer moderator
Project developer
Project scientist

Send message
Joined: 16 Nov 11
Posts: 108
Credit: 2,164,309
RAC: 0
Message 76941 - Posted: 1 Jul 2014, 5:56:07 UTC

Thank you premier! This is very helpful.
ID: 76941 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Monty

Send message
Joined: 14 Oct 07
Posts: 4
Credit: 146,497
RAC: 0
Message 76961 - Posted: 5 Jul 2014, 16:53:13 UTC
Last modified: 5 Jul 2014, 17:24:19 UTC

Client errors on tasks 672296503 and 672297344 .
Exit status -1073741819 (0xffffffffc0000005)
- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x00947DEB write attempt to address 0x00000000

------------------------
Greetings from Germany
ID: 76961 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Killersocke@rosetta

Send message
Joined: 13 Nov 06
Posts: 29
Credit: 2,579,125
RAC: 0
Message 76962 - Posted: 5 Jul 2014, 22:40:28 UTC

Same in this Tread

Minirosetta 3.52
ID: 76962 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Number crunching : Client errors



©2024 University of Washington
https://www.bakerlab.org