Duke-UNC Brain Imaging and Analysis Center
BIAC Forums | Profile | Register | Active Topics | Members | Search | FAQ
Username:
Password:
Save Password   Forgot your Password?
 All Forums
 Support Forums
 Cluster Support
 Potential Scheduled Downtime ( 12/09 )
 New Topic  Reply to Topic
 Printer Friendly
Author Previous Topic Topic Next Topic  

petty
BIAC Staff

USA
453 Posts

Posted - Dec 05 2011 :  2:23:32 PM  Show Profile  Reply with Quote

The cluster will be potentially be down this Friday afternoon ( 12/9 ) due to the expansion of Munin.

We will find out this week if service engineers from BluArc will be expanding/updating Munin. If they are able to make it this Friday, then systems will need to be shut-down for a few hours.

I will reply when we have a finalized plan. Currently we are waiting to hear back, but i just wanted to give notice that the cluster nodes may be off Friday afternoon.

Thanks,
-Chris

petty
BIAC Staff

USA
453 Posts

Posted - Dec 06 2011 :  1:57:32 PM  Show Profile  Reply with Quote
this will be happening, please plan accordingly:

cluster nodes will be turned off around 1230pm ... the head node will remain on.

http://www.biac.duke.edu/forums/topic.asp?TOPIC_ID=1554
Go to Top of Page

dvsmith
Advanced Member

USA
218 Posts

Posted - Dec 10 2011 :  3:06:04 PM  Show Profile  Visit dvsmith's Homepage  Reply with Quote
Hey Chris,

Are some of the nodes still down? I was getting ready to restart a set of jobs, but it looks like something is weird with some of the nodes. My test jobs just sit in the queue indefinitely.

If these nodes are hung, will it affect my ability to work on other nodes without having to deal with random problems later? Put differently, if I start submitting a bunch of jobs, will I just wind up with a bunch of jobs that get stuck on these nodes? I'm happy waiting, if it lessens later headaches...

Thanks,
David

[smith@hugin neglect_mvpa]$ qstat
job-ID prior name user state submit/start at queue slots ja-task-ID
-----------------------------------------------------------------------------------------------------------------
3049142 0.25091 nodetest.s smith qw 12/10/2011 13:17:52 1
3049150 0.25043 nodetest.s smith qw 12/10/2011 13:18:41 1
3049155 0.25013 nodetest.s smith qw 12/10/2011 13:19:11 1
3049156 0.25007 nodetest.s smith qw 12/10/2011 13:19:17 1
3049157 0.25000 nodetest.s smith qw 12/10/2011 13:19:24 1
Go to Top of Page

petty
BIAC Staff

USA
453 Posts

Posted - Dec 10 2011 :  8:13:55 PM  Show Profile  Reply with Quote
Two nodes are still down and three others are interactive only.

Jobs can't go to the down nodes

Edited by - petty on Dec 11 2011 1:33:07 PM
Go to Top of Page
  Previous Topic Topic Next Topic  
 New Topic  Reply to Topic
 Printer Friendly
Jump To:
BIAC Forums © 2000-2010 Brain Imaging and Analysis Center Go To Top Of Page
This page was generated in 0.47 seconds. Snitz Forums 2000