Message boards : Number crunching : Minirosetta 1.97
Author | Message |
---|---|
Send message Joined: 16 Jun 08 Posts: 1235 Credit: 14,372,156 RAC: 313 |
*** WARNING *** - the Microsoft updates today for my second 64-bit Vista computer disabled its ability to reach the internet; fortunately, it's a laptop I don't consider ready to run BOINC projects yet, especially those that don't run well with less than 100% CPU. Recovery started, but not finished. Note - I finally found which update; one needed for Vista SP1 just before it offers to let you install the Vista SP2 update. So if you're already at Vista SP2, ignore that part of the message. Recovery of that machine finished, except for that one failed update. |
Send message Joined: 5 Jun 06 Posts: 154 Credit: 279,018 RAC: 0 |
I had to kill this work unit because after 40 minutes it showed 0% progress, which I thought was ridiculous. The graphics showed it still initializing. Why waste processing time on something that is going nowhere? Even restarting the client was to no avail. 243l_A_58_I_ddg_predictions_82409_010_WT.243l_A_58_I_.out_14659_1 |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
Just watching one of these "ddg_predictions" tasks running on my own machine. At the risk of overstepping my duties, I'm going to recommend that anyone with no more than 512MB of memory cancel tasks with "ddg_predictions" in the name. I've emailed the Project Team asking about these. They have the complete picture (beyond all of the posts in this thread) of how these tasks are running and can assess whether they are producing useful results, so I expect further details will follow soon. At this point, I feel confident these tasks are consistently using more memory than is feasible for a 512MB machine, which is why I'm making the suggestion above. Rosetta Moderator: Mod.Sense |
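The moderator's rule of thumb can be sketched as a small filter. This is only a minimal Python sketch, not a BOINC API: the function name `tasks_to_cancel` and the idea of passing task names and installed memory as plain arguments are assumptions made for illustration. The 512 MB threshold and the "ddg_predictions" substring come from the post above, and the task names are ones reported in this thread.

```python
def tasks_to_cancel(task_names, installed_memory_mb, limit_mb=512):
    """Return the task names the rule of thumb suggests cancelling.

    Hypothetical helper: machines with more than `limit_mb` of memory
    keep everything; others drop tasks named "ddg_predictions".
    """
    if installed_memory_mb > limit_mb:
        return []
    return [name for name in task_names if "ddg_predictions" in name]


tasks = [
    "243l_A_58_I_ddg_predictions_82409_010_WT.243l_A_58_I_.out_14659_1",
    "lr13_seq_score12_ss5.0_rlbd_1tig_IGNORE_THE_REST_DECOY_14612_3390_0",
]

# On a 512 MB machine only the ddg_predictions task is flagged.
print(tasks_to_cancel(tasks, installed_memory_mb=512))
```

The actual cancelling would still be done by hand in BOINC Manager; the sketch only decides which names match the recommendation.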
TimL Send message Joined: 16 Sep 06 Posts: 17 Credit: 15,509,973 RAC: 0 |
|
Sid Celery Send message Joined: 11 Feb 08 Posts: 2220 Credit: 42,304,766 RAC: 24,305 |
Tempting fate, I know, but I thought I'd check for any errors to report in the last week and went back through every 1.97 WU I've ever received. No errors at all. Surely that can't be right... ;) For all those people who went off in a huff over perceived problems earlier in the month, can someone tell them the coast is currently very clear? Even credits per WU have edged back up too. Whisper it quietly... |
Send message Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
Whatever they did sure cleaned up the errors. It's been clean as far back as Rosie will let me look, and it's all perfect. I did have one group of tasks that came back as no reply, which is kind of odd. No effect on credit though. |
Send message Joined: 16 Jun 08 Posts: 1235 Credit: 14,372,156 RAC: 313 |
whatever they did sure cleaned up the errors. I've had one batch of tasks that came back as no reply on one of my computers, but I thought this was the result of a recent power failure, partly since all BOINC projects this computer participates in were affected, not just Rosetta@home. |
frederick corse Send message Joined: 7 Oct 05 Posts: 10 Credit: 1,545,999 RAC: 0 |
Hello, I am now running 8tim_Q_178_A_ddg_predictions_82409_1252 and it reports that it is using 1.5G of memory. It has been running for over 40 minutes and it is still initializing. The first time it was sent out, it came back as no reply. |
frederick corse Send message Joined: 7 Oct 05 Posts: 10 Credit: 1,545,999 RAC: 0 |
Hello BOINC, I sampled the program: core::scoring::etable::count_pair::CPCrossoverBehavior) 1 operator new(unsigned long) 1 malloc 1 std::basic_string<char, std::char_traits<char>, std::allocator<char> >::basic_string(char const*, std::allocator<char> const&) 1 char* std::string::_S_construct<char const*>(char const*, char const*, std::allocator<char> const&, std::forward_iterator_tag) ore::graph::PointGraphEdgeData> >, double, utility::vector1<bool, std::allocator<bool> >) 1 operator new(unsigned long) 1 malloc 1 malloc_zone_malloc 1 szone_malloc_should_clear 1 small_malloc_from_free_list 1 0xffffffff 1 _sigtramp 1 __i686.get_pc_thunk.bx 1 core::graph::residue_point_graph_from_pose(core::pose::Pose const&, core::graph::UpperEdgeGraph<core::graph::PointGraphVertexData, core::gr 1 std::basic_string<char, std::char_traits<char>, std::allocator<char> >::~basic_string() 1 __i686.get_pc_thunk.bx 4 core::scoring::etable::count_pair::CountPairAll::count(int, int, double&) const 3 core::scoring::etable::count_pair::CountPairIntraRes<core::scoring::etable::count_pair::CountPairCrossover3>::count(int, int, double&) const 2 core::graph::find_neighbors_restricted(utility::pointer::owning_ptr<core::graph::UpperEdgeGraph<core::graph::PointGraphVertexData, core::graph::PointGraphEdgeData> >, d |
Astropoint Send message Joined: 13 Oct 05 Posts: 7 Credit: 3,530,505 RAC: 2,803 |
I had a WU stuck for about 4 hours on 0% and using 1.2GB of memory before I aborted it. https://boinc.bakerlab.org/rosetta/result.php?resultid=278489672 This is the 2nd one that I remember from the past couple of weeks |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
frederick, see my earlier post on ddg_predictions tasks. I haven't seen any for a while, so I'm guessing they cancelled any new ones. It sounds like the one you got was reissued because the original copy missed the deadline. They seem to consume a lot of memory, do not interact with BOINC Manager to report their progress, have rather long-running models, and do not display anything more than the basic framework of the graphic... but they do seem to eventually complete. It looks like your preferred runtime is about 4 hours. Please let that one run for at least 8 hours before considering aborting it. I believe it will be completed before that time anyway. Your other ddg_predictions task finished in about 5 hours, and it ran that long because it was only able to complete a single model (the minimum amount of useful results a task can produce). Rosetta Moderator: Mod.Sense |
Send message Joined: 5 Aug 09 Posts: 5 Credit: 1,356,008 RAC: 0 |
I have now run out of tasks to work on. However, my computer is still trying to get new work units: when I checked the messages section, it was listing “Requesting new tasks Scheduler request completed: got 0 new tasks.” Also, the server status page shows the scheduler as running. Is anyone else getting this problem? |
LizzieBarry Send message Joined: 25 Feb 08 Posts: 76 Credit: 201,862 RAC: 0 |
I have now run out of tasks to work on. Sort of, yes. Eventually something came through, but it looks like the work generator is struggling to keep up again. |
P . P . L . Send message Joined: 20 Aug 06 Posts: 581 Credit: 4,865,274 RAC: 0 |
This one was stuck or not making much progress. After 4 hrs 14 min it was on Model: 1, Step: 3. I aborted it, sorry. lr13_seq_score12_ss5.0_rlbd_1tig_IGNORE_THE_REST_DECOY_14612_3390_0 https://boinc.bakerlab.org/rosetta/workunit.php?wuid=254147223 |
AMD_is_logical Send message Joined: 20 Dec 05 Posts: 299 Credit: 31,460,681 RAC: 0 |
I currently have over 100 results in my "pending" list. Also, I notice that I have quite a few results taking around 2 hours or less. (My runtime setting is 12 hours.) |
Quercus Petraea Send message Joined: 12 Oct 07 Posts: 1 Credit: 6,279,104 RAC: 0 |
Many "pending" granted credit in my list too! |
[AF>france>pas-de-calais]symaski62 Send message Joined: 19 Sep 05 Posts: 47 Credit: 33,871 RAC: 0 |
https://boinc.bakerlab.org/rosetta/result.php?resultid=279674844 <core_client_version>6.6.36</core_client_version> <![CDATA[ <stderr_txt> [2009- 9-10 7: 5:29:] :: BOINC:: Initializing ... ok. [2009- 9-10 7: 5:29:] :: BOINC :: boinc_init() BOINC:: Setting up shared resources ... ok. BOINC:: Setting up semaphores ... ok. BOINC:: Updating status ... ok. BOINC:: Registering timer callback... ok. BOINC:: Worker initialized successfully. Registering options.. Registered extra options. Initializing broker options ... Registered extra options. Initializing core... Initializing options.... ok Options::initialize() Options::adding_options() Options::initialize() Check specs. Options::initialize() End reached Loaded options.... ok Processed options.... ok Initializing random generators... ok Initialization complete. Setting WU description ... Unpacking zip data: ../../projects/boinc.bakerlab.org_rosetta/minirosetta_database_rev32257.zip Setting database description ... Setting up checkpointing ... Setting up graphics native ... BOINC:: Worker startup. Starting watchdog... Watchdog active. # cpu_run_time_pref: 14400 [2009- 9-10 9:59:56:] :: BOINC:: Initializing ... ok. [2009- 9-10 9:59:56:] :: BOINC :: boinc_init() BOINC:: Setting up shared resources ... ok. BOINC:: Setting up semaphores ... ok. BOINC:: Updating status ... ok. BOINC:: Registering timer callback... ok. BOINC:: Worker initialized successfully. Registering options.. Registered extra options. Initializing broker options ... Registered extra options. Initializing core... Initializing options.... ok Options::initialize() Options::adding_options() Options::initialize() Check specs. Options::initialize() End reached Loaded options.... ok Processed options.... ok Initializing random generators... ok Initialization complete. Setting WU description ... Unpacking zip data: ../../projects/boinc.bakerlab.org_rosetta/minirosetta_database_rev32257.zip Setting database description ... Setting up checkpointing ... 
Setting up graphics native ... BOINC:: Worker startup. Starting watchdog... Watchdog active. Continuing computation from checkpoint: chk_3BDC-ALA132GLU_0001_00020_FastRelax__chk1_fa ... success! Continuing computation from checkpoint: chk_3BDC-ALA132GLU_0001_00020_FastRelax__chk2_fa ... success! Continuing computation from checkpoint: chk_3BDC-ALA132GLU_0001_00020_FastRelax__chk3_fa ... success! Continuing computation from checkpoint: chk_3BDC-ALA132GLU_0001_00020_FastRelax__chk4_fa ... success! Continuing computation from checkpoint: chk_3BDC-ALA132GLU_0001_00020_FastRelax__chk5_fa ... success! Continuing computation from checkpoint: chk_3BDC-ALA132GLU_0001_00020_FastRelax__chk6_fa ... success! Continuing computation from checkpoint: chk_3BDC-ALA132GLU_0001_00020_FastRelax__chk7_fa ... success! Continuing computation from checkpoint: chk_3BDC-ALA132GLU_0001_00020_FastRelax__chk8_fa ... success! Continuing computation from checkpoint: chk_3BDC-ALA132GLU_0001_00020_FastRelax__chk9_fa ... success! Continuing computation from checkpoint: chk_3BDC-ALA132GLU_0001_00020_FastRelax__chk10_fa ... success! Continuing computation from checkpoint: chk_3BDC-ALA132GLU_0001_00020_FastRelax__chk11_fa ... success! # cpu_run_time_pref: 14400 ====================================================== DONE :: 21 starting structures 10500.8 cpu seconds This process generated 21 decoys from 21 attempts ====================================================== BOINC :: Watchdog shutting down... BOINC :: BOINC support services shutting down cleanly ... called boinc_finish </stderr_txt> ]]> |
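The summary at the end of a stderr log like the one above can be pulled out with a few regular expressions. This is a rough Python sketch under stated assumptions: the function name `parse_summary` is hypothetical, the patterns are inferred only from the wording of this one log, and other minirosetta versions may format these lines differently.

```python
import re


def parse_summary(stderr_text):
    """Extract the run-time preference and the DONE summary from stderr text.

    Returns None when any of the three expected lines is missing.
    """
    pref = re.search(r"cpu_run_time_pref:\s*(\d+)", stderr_text)
    done = re.search(
        r"DONE\s*::\s*(\d+) starting structures\s+([\d.]+) cpu seconds",
        stderr_text,
    )
    decoys = re.search(r"generated (\d+) decoys from (\d+) attempts", stderr_text)
    if not (pref and done and decoys):
        return None

    cpu_seconds = float(done.group(2))
    n_decoys = int(decoys.group(1))
    return {
        "preferred_hours": int(pref.group(1)) / 3600,
        "starting_structures": int(done.group(1)),
        "cpu_seconds": cpu_seconds,
        "decoys": n_decoys,
        "attempts": int(decoys.group(2)),
        # Average CPU time per decoy; None if no decoys were produced.
        "cpu_seconds_per_decoy": cpu_seconds / n_decoys if n_decoys else None,
    }


# Excerpt copied from the log in the post above.
sample = (
    "# cpu_run_time_pref: 14400 "
    "DONE :: 21 starting structures 10500.8 cpu seconds "
    "This process generated 21 decoys from 21 attempts"
)
print(parse_summary(sample))
```

For this log the preference works out to 4 hours, and the task finished 21 decoys in 10500.8 CPU seconds, which is consistent with a run that completed normally.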
Send message Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
There is nothing wrong with this task other than that it is in the pending credit queue. Please do not post so much information unless you truly have a bug to report. https://boinc.bakerlab.org/rosetta/result.php?resultid=279674844 |
Send message Joined: 30 Aug 08 Posts: 3 Credit: 478,187 RAC: 0 |
I've got about 16 WUs waiting to upload and no new work yet. The servers say all is OK, yet when you ping srv4.bakerlab.org it times out: C:\Program Files\Support Tools>ping srv4.bakerlab.org Pinging srv4.bakerlab.org [140.142.20.112] with 32 bytes of data: Request timed out. Request timed out. Request timed out. Request timed out. Ping statistics for 140.142.20.112: Packets: Sent = 4, Received = 0, Lost = 4 (100% loss), |
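The statistics line of a Windows ping report like the one above can be read programmatically. This is a minimal Python sketch, assuming the English-language output format quoted in the post; the function name `packet_loss` is hypothetical.

```python
import re


def packet_loss(ping_output):
    """Parse 'Sent = N, Received = N, Lost = N' from Windows ping output.

    Returns None if the statistics line is not found.
    """
    match = re.search(
        r"Sent = (\d+), Received = (\d+), Lost = (\d+)", ping_output
    )
    if match is None:
        return None
    sent, received, lost = (int(g) for g in match.groups())
    loss_percent = 100.0 * lost / sent if sent else 0.0
    return {
        "sent": sent,
        "received": received,
        "lost": lost,
        "loss_percent": loss_percent,
    }


# Statistics line copied from the post above.
report = (
    "Ping statistics for 140.142.20.112: "
    "Packets: Sent = 4, Received = 0, Lost = 4 (100% loss),"
)
print(packet_loss(report))
```

Total loss on ping is a hint rather than proof that a server is down: some hosts and firewalls drop ICMP echo requests while still serving HTTP, so the BOINC scheduler messages remain the better indicator.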
P . P . L . Send message Joined: 20 Aug 06 Posts: 581 Credit: 4,865,274 RAC: 0 |
symm_lr8_seq_score12_A_rlbd_1t2i_IGNORE_THE_REST_DECOY_14880_289 https://boinc.bakerlab.org/rosetta/workunit.php?wuid=257005065 Are we getting these again? I seem to remember this type from weeks ago. Some of them caused problems back then too; another user had a problem with it as well. When this one restarted it had done over 3 hrs, and it then went back to Model: 1 / Step: 0. It doesn't look like it checkpointed at all, so I aborted it. |
©2025 University of Washington
https://www.bakerlab.org