Message boards : Number crunching : Rosetta 4.0+
Previous · 1 . . . 4 · 5 · 6 · 7 · 8 · 9 · 10 . . . 19 · Next
Author | Message |
---|---|
Conan Send message Joined: 11 Oct 05 Posts: 150 Credit: 4,209,348 RAC: 1,803 |
Could be this is your problem (and mine...) but don't hold your breath for a fix. This issue is still occuring since the 1st of Feburary 2018 (see begining of this thread). Still getting Rosetta@home | [error] Process creation failed: (unknown error) - error code 193 (0xc1) Also posted on the 1/2/18 after the first report of this issue was a possible reason for the error that has not been taken up it appears, " Looks like Rosetta 4.06 (4.07) has been compiled using Visual Studio 2015 which does not create XP compatiable programme files" Could this be the issue? Rosetta Mini runs without an issue and I have no problems with my Linux machines with either Rosetta Mini or Rosetta work units. Both Ralph and Rosetta are affected, this fault should of been picked up over at Ralph first before then being released here at Rosetta. Thanks Conan " |
Viking69 Send message Joined: 3 Oct 05 Posts: 20 Credit: 6,815,776 RAC: 2,618 |
I am getting a lot of failures on this current version too. But mine state a memory error, and My PC has never ran out of available RAM. I have 16 GB and I never see it above 12GB used. One of my errors https://boinc.bakerlab.org/result.php?resultid=1004968726 Hi all you enthusiastic crunchers..... |
Jim1348 Send message Joined: 19 Jan 06 Posts: 881 Credit: 52,257,545 RAC: 0 |
I am getting a lot of failures on this current version too. But mine state a memory error, and My PC has never ran out of available RAM. I have 16 GB and I never see it above 12GB used. These are awfully big values: Peak working set size 1,492.68 MB I normally see about half that (or less); there must be something wrong with the work units. |
biodoc Send message Joined: 19 Feb 06 Posts: 14 Credit: 30,717,792 RAC: 0 |
I'm seeing quite a number of identical computation errors on 2 of my machines. They seem to occur in "bunches". For example, the ryzen had no errors for a couple of days and then I got about 180 of them within a few minutes. My other 2 machines are error free but they are running older kernels (mint 7.3 and 18.3) computer 1 https://boinc.bakerlab.org/rosetta/results.php?hostid=3420461&offset=0&show_names=0&state=6&appid= CPU type: 2 x Intel(R) Xeon(R) CPU E5-2690 v2 @ 3.00GHz [Family 6 Model 62 Stepping 4] Number of processors: 40 Coprocessors: NVIDIA Quadro FX 4800 (1535MB) driver: 340.10 OpenCL: 1.0 Operating System: Linux Mint 19 Tara [4.15.0-23-generic|libc 2.27 (Ubuntu GLIBC 2.27-3ubuntu1)] BOINC version: 7.9.3 Memory: 96602.15 MB Error: <core_client_version>7.9.3</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63)</message> <stderr_txt> command: ../../projects/boinc.bakerlab.org_rosetta/rosetta_4.07_x86_64-pc-linux-gnu -run:protocol jd2_scripting @flags_rb_06_28_403_572__t000__0_C1_robetta -silent_gz -mute all -out:file:silent default.out -in:file:boinc_wu_zip input_rb_06_28_403_572__t000__0_C1_robetta.zip -nstruct 10000 -cpu_run_time 28800 -watchdog -boinc:max_nstruct 600 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -run::rng mt19937 -constant_seed -jran 3482852 rosetta_4.07_x86_64-pc-linux-gnu: loadlocale.c:129: _nl_intern_locale_data: Assertion `cnt < (sizeof (_nl_value_type_LC_TIME) / sizeof (_nl_value_type_LC_TIME[0]))' failed. SIGABRT: abort called Stack trace (17 frames): [0x5efead0] [0x5ffe380] [0x607e517] [0x60083a8] [0x6002794] [0x60027ee] [0x6000f73] [0x6001996] [0x60007df] [0x600020e] [0x5f1d10e] [0x5f1d73e] [0x5f1707a] [0x5f17202] [0x412631] [0x5fff8cc] [0x610b97] Exiting... </stderr_txt> ]]> Computer 2 CPU type: AMD Ryzen 7 2700X Eight-Core Processor [Family 23 Model 8 Stepping 2] Number of processors: 16 Coprocessors: NVIDIA GeForce GTX 780 Ti (3015MB) driver: 390.67 OpenCL: 1.2 Operating System: Antergos Linux [4.17.2-1-ARCH] BOINC version: 7.8.6 Memory: 16037.01 MB <core_client_version>7.8.6</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63)</message> <stderr_txt> command: ../../projects/boinc.bakerlab.org_rosetta/rosetta_4.07_x86_64-pc-linux-gnu -relax::minimize_bond_lengths 1 -out:file:silent_struct_type binary -ignore_unrecognized_res 1 -abinitio::rsd_wt_loop 0.5 -abinitio::use_filters false -abinitio::rg_reweight 0.5 -relax::default_repeats 2 -beta 1 -abinitio::increase_cycles 10 -ex2aro 1 -frag9 00001.200.9mers -abinitio::fastrelax 1 -abinitio::detect_disulfide_before_relax 1 -ex1 1 -relax::minimize_bond_angles 1 -beta_cart 1 -in:file:native 00001.pdb -relax::dualspace 1 -optimization::default_max_cycles 200 -abinitio::rsd_wt_helix 0.5 -frag3 00001.200.3mers -in:file:boinc_wu_zip DRH_curve_X_h21_l4_h29_l2_09658_4_loop_11_0001_one_capped_0001_fragments_data.zip -out:file:silent default.out -silent_gz 1 -mute all -nstruct 10000 -cpu_run_time 28800 -watchdog -boinc:max_nstruct 600 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -run::rng mt19937 -constant_seed -jran 1195778 rosetta_4.07_x86_64-pc-linux-gnu: loadlocale.c:129: _nl_intern_locale_data: Assertion `cnt < (sizeof (_nl_value_type_LC_TIME) / sizeof (_nl_value_type_LC_TIME[0]))' failed. SIGABRT: abort called Stack trace (17 frames): [0x5efead0] [0x5ffe380] [0x607e517] [0x60083a8] [0x6002794] [0x60027ee] [0x6000f73] [0x6001996] [0x60007df] [0x600020e] [0x5f1d10e] [0x5f1d73e] [0x5f1707a] [0x5f17202] [0x412631] [0x5fff8cc] [0x610b97] Exiting... </stderr_txt> ]]> |
mmonnin Send message Joined: 2 Jun 16 Posts: 59 Credit: 24,317,585 RAC: 70,732 |
Yeah...just abort Rosetta app tasks. Mini works fine. Project admins could fix it if they allow an app selection in user preferences. For some reason my 2700x works fine but not my 1950x. Both on ubuntu 18.04 |
Trotador Send message Joined: 30 May 09 Posts: 108 Credit: 291,214,977 RAC: 1 |
/etc/locale.conf I installed BOINC with a .sh file so it is completely within the BOINC folder, what would I have to modify to circumvent the locale error? <core_client_version>7.6.31</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63) </message> <stderr_txt> command: ../../projects/boinc.bakerlab.org_rosetta/rosetta_4.07_x86_64-pc-linux-gnu @rb_07_16_509_735_ab_t000__h002_robetta_FLAGS -in::file::fasta t000__h002.fasta -psipred_ss2 t000__h002.spider3_ss2 -kill_hairpins t000__h002.nobuformat.spider3_ss2 -abinitio::use_filters true -in:file:boinc_wu_zip rb_07_16_509_735_ab_t000__h002_robetta.zip -frag3 rb_07_16_509_735_ab_t000__h002_robetta.200.3mers.index.gz -fragA rb_07_16_509_735_ab_t000__h002_robetta.200.14mers.index.gz -fragB rb_07_16_509_735_ab_t000__h002_robetta.200.11mers.index.gz -nstruct 10000 -cpu_run_time 28800 -watchdog -boinc:max_nstruct 600 -checkpoint_interval 120 -mute all -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -run::rng mt19937 -constant_seed -jran 3102768 rosetta_4.07_x86_64-pc-linux-gnu: loadlocale.c:129: _nl_intern_locale_data: Assertion `cnt < (sizeof (_nl_value_type_LC_TIME) / sizeof (_nl_value_type_LC_TIME[0]))' failed. SIGABRT: abort called Stack trace (17 frames): [0x5efead0] [0x5ffe380] [0x607e517] [0x60083a8] [0x6002794] [0x60027ee] [0x6000f73] [0x6001996] [0x60007df] [0x600020e] [0x5f1d10e] [0x5f1d73e] [0x5f1707a] [0x5f17202] [0x412631] [0x5fff8cc] [0x610b97] Exiting... </stderr_txt> ]]> |
henfredemars Send message Joined: 26 Aug 18 Posts: 1 Credit: 2,917,734 RAC: 1,015 |
It took me hours to find a fix for this! I am so glad that others have found this problem and found a solution. Setting the locale to C using systemd's service file worked perfectly. Please don't statically link to glibc. That's just a bad idea. Hint: Ubuntu users, you can use systemctl show boinc-client.service | grep Path...to find the service file. |
[AF>Libristes]Maeda Send message Joined: 18 Aug 11 Posts: 4 Credit: 4,526,367 RAC: 0 |
Thanks for tips :) I stopped this project since May on one of my computer (that have a 'on the edge' distro) and just tried your solution. Let's see if all's right now ! |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2125 Credit: 41,249,734 RAC: 9,368 |
While reporting that weird PF task in another thread, I was aware I had many others finished with a Compute Error and should actually be in this thread. Problems of various types with PF tasks have been going on all year, still unsolved. Next post will show my current Compute error tasks. Another week goes by, another 7 PF* tasks coming up with the same "nan" error after running to apparent completion |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2125 Credit: 41,249,734 RAC: 9,368 |
Further PF tasks resulting in a Compute Error. All ran several hours before failing. None called for an excessive amount of memory. All seem to be Rosetta v4.07 windows_x86_64 jobs (needs verifying) - none are Mini Rosetta 3.78 tasks Note: not all PF tasks are erroring out - many go through fine. But many don't, all showing error "chi angle must be between -180 and 180: nan" PF08894.10_nojmps_aivan_SAVE_ALL_OUT_03_09_686468_2970_0 PF04317.11_jmps_aivan_SAVE_ALL_OUT_03_09_686470_3937_1 PF09820.8_nojumps_aivan_SAVE_ALL_OUT_03_09_686286_6530_1 PF06775.13_nojmps_aivan_SAVE_ALL_OUT_03_09_686691_257_0 PF14460.5_nojmps_aivan_SAVE_ALL_OUT_03_09_686691_1533_0 PF13731.5_nojmps_aivan_SAVE_ALL_OUT_03_09_686649_2900_0 PF14092.5_nojmps_aivan_SAVE_ALL_OUT_03_09_686649_6572_0 PF07774.12_nojmps_aivan_SAVE_ALL_OUT_03_09_686649_4239_0 PF04317.11_jmps_aivan_SAVE_ALL_OUT_03_09_686470_12934_1
One very specific weird PF error reported here. Related, it seems, but different PF13731.5_jmps_aivan_SAVE_ALL_OUT_03_09_686650_5044_0 |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2125 Credit: 41,249,734 RAC: 9,368 |
One new thing to report, confirmed here this morning: ALL "cis_paper_simulation" tasks seem to fail within seconds. Less of a problem to me as no computing time is wasted. Errors of 2 types, as shown below. cis_paper_simulation_3_4_5_nmet_SAVE_ALL_OUT_686721_56_0 cis_paper_simulation_3_nmet_SAVE_ALL_OUT_686724_38_0 cis_paper_simulation_2_3_4_5_nmet_SAVE_ALL_OUT_686713_301_0 cis_paper_simulation_4_nmet_SAVE_ALL_OUT_686726_458_0 cis_paper_simulation_3_4_5_nmet_SAVE_ALL_OUT_686721_560_0 cis_paper_simulation_2_3_nmet_SAVE_ALL_OUT_686716_579_1 cis_paper_simulation_1_2_3_5_nmet_SAVE_ALL_OUT_686699_548_1 cis_paper_simulation_1_2_nmet_SAVE_ALL_OUT_686704_165_1 cis_paper_simulation_1_4_nmet_SAVE_ALL_OUT_686710_164_1 cis_paper_simulation_1_2_4_nmet_SAVE_ALL_OUT_686702_165_1 cis_paper_simulation_3_4_nmet_SAVE_ALL_OUT_686722_164_1 cis_paper_simulation_4_nmet_SAVE_ALL_OUT_686726_163_1 cis_paper_simulation_2_3_5_nmet_SAVE_ALL_OUT_686715_739_1 cis_paper_simulation_1_2_3_4_nmet_SAVE_ALL_OUT_686698_619_1 Typical error report Outcome Computation error cis_paper_simulation_1_3_4_5_nmet_SAVE_ALL_OUT_686705_304_0 cis_paper_simulation_1_3_4_nmet_SAVE_ALL_OUT_686706_458_0 Outcome Computation error |
neil Send message Joined: 22 Dec 06 Posts: 3 Credit: 18,162,630 RAC: 170 |
Greetings: I have four identical machines running Ubuntu 18.04.1 LTS (GNU/Linux 4.15.0-33-generic x86_64), each with a dual-core Intel E8400 cpu. For the last few weeks I've also been unable to run any 4.07 units, they all throw a Computation Error after a couple of seconds... I've tried the suggestion for editing the /etc/default/locale file, rebooting, etc... No joy. Unless someone has an actual working suggestion, I'm just going to flat stop running Rosetta and switch to World Community Grid, at least until Rosetta gets their act together. Sorry! |
aad Send message Joined: 5 Jan 06 Posts: 9 Credit: 194,209,187 RAC: 186 |
I have several Linux machines running Rosetta. The ones I upgraded from Linux Mint 18.3 to Mint 19 works fine with all wu' s. The ' clean install' machines on Mint 19 all fail to run the 4.07 wu' s. |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2125 Credit: 41,249,734 RAC: 9,368 |
A few more PF task errors, but this time a different error and on my 2 Laptops rather than my Desktop. The Desktop is running fine now as it's not running (m)any PF tasks. One an Intel Core 2 Duo T6600 running W7, the other an AMD FX9830 running W10, but both showing the exact same error message: fp0910_db_37_43_msd3_Y_A_fragments_abinitio_SAVE_ALL_OUT_686991_726_0 fp0910_db_131_43_msd3_Y_fragments_abinitio_SAVE_ALL_OUT_686994_273_0 fp0910_db_37_43_msd4_X_A_fragments_abinitio_SAVE_ALL_OUT_686990_1178_1 ERROR: ERROR: FragmentIO: could not open file 00001.200.9mers |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,633,151 RAC: 7,242 |
A lot of errors after few seconds 1029496426 1029496442 etc File: ......srccoreposeutil.cc:703 File: ......srccoreposeutil.cc:703 And others "atom......not found" |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,633,151 RAC: 7,242 |
And others "atom......not found" Up to now: Atom 38 Atom 246 Atom 84 Atom 66 Atom 186 |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,633,151 RAC: 7,242 |
Today 5 wus finished and 50 failed. Please admins, if you want to test your app, use Ralph (i'm happy to help you in this beta project) But don't waste my time and my internet connection with Rosetta, that is stable production. |
Admin Project administrator Send message Joined: 1 Jul 05 Posts: 4805 Credit: 0 RAC: 0 |
Sorry for the recent batches of bad workunits. I've talked to the researcher who was responsible and we canceled them as soon as we noticed the issue. |
James W Send message Joined: 25 Nov 12 Posts: 130 Credit: 1,766,254 RAC: 0 |
Application version Rosetta v4.07 windows_intelx86 Device: 1759960, Task: 1035040309, and WU 932446587. Name: BDW_mal_pep_CSPFab43peptide21_0001_renumbered_N-10_C-2_0003_abinitio_1_abinitio_SAVE_ALL_OUT_699471_159_0 Status: 1 (0x00000001) Unknown error code. <core_client_version>7.10.2</core_client_version> ERROR: ERROR: FragmentIO: could not open file 00001.500.6mers Run time was only 27 seconds prior to error, with CPU time 18 sec. |
Jim1348 Send message Joined: 19 Jan 06 Posts: 881 Credit: 52,257,545 RAC: 0 |
Same here, but Rosetta 4.08 (Ubuntu 16.04). <core_client_version>7.12.0</core_client_version> <![CDATA[ <message> process exited with code 1 (0x1, -255)</message> <stderr_txt> command: ../../projects/boinc.bakerlab.org_rosetta/rosetta_4.08_x86_64-pc-linux-gnu -fragA 00001.500.6mers -fragB 00001.500.4mers -in:file:fasta 00001.fasta -abinitio::increase_cycles 10 -mute all -abinitio::fastrelax -relax::default_repeats 15 -abinitio::rsd_wt_helix 0.5 -abinitio::rsd_wt_loop 0.5 -abinitio::use_filters false -ex1 -ex2aro -in:file:boinc_wu_zip cp_CSPFab43peptide21_0001_renumbered_N-10_C-2_0003_abinitio_1_fold_data.zip -out:file:silent default.out -silent_gz -in:file:native 00001.pdb -out:file:silent_struct_type binary -detect_disulf true -fix_disulf disulf -constraints::cst_file CB_cst -constraints:cst_weight 1 -number_9mer_frags 150 -number_3mer_frags 150 -beta -nstruct 10000 -cpu_run_time 28800 -watchdog -boinc:max_nstruct 600 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -run::rng mt19937 -constant_seed -jran 1887503 ERROR: ERROR: FragmentIO: could not open file 00001.500.6mers ERROR:: Exit from: src/core/fragment/FragmentIO.cc line: 233 BACKTRACE: [0x5a57db6] [0x398a65a] [0x12cfd45] [0x12dbc88] [0x12e4ef4] [0x41281d] [0x5ff3ccc] [0x6108e7] BOINC:: Error reading and gzipping output datafile: default.out 15:51:52 (21457): called boinc_finish(1) </stderr_txt> https://boinc.bakerlab.org/result.php?resultid=1035160110 |
Message boards :
Number crunching :
Rosetta 4.0+
©2024 University of Washington
https://www.bakerlab.org