Duke-UNC Brain Imaging and Analysis Center
 Potential Scheduled Downtime ( 12/09 )

T O P I C    R E V I E W
petty Posted - Dec 05 2011 : 2:23:32 PM

The cluster will potentially be down this Friday afternoon (12/9) due to the expansion of Munin.

We will find out this week whether service engineers from BluArc will be expanding/updating Munin. If they are able to make it this Friday, then systems will need to be shut down for a few hours.

I will reply when we have a finalized plan. Currently we are waiting to hear back, but I just wanted to give notice that the cluster nodes may be off Friday afternoon.

Thanks,
-Chris
3   L A T E S T    R E P L I E S    (Newest First)
petty Posted - Dec 10 2011 : 8:13:55 PM
Two nodes are still down and three others are interactive only.

Jobs can't go to the down nodes.
dvsmith Posted - Dec 10 2011 : 3:06:04 PM
Hey Chris,

Are some of the nodes still down? I was getting ready to restart a set of jobs, but it looks like something is weird with some of the nodes. My test jobs just sit in the queue indefinitely.

If these nodes are hung, will it affect my ability to work on other nodes without having to deal with random problems later? Put differently, if I start submitting a bunch of jobs, will I just wind up with a bunch of jobs that get stuck on these nodes? I'm happy waiting, if it lessens later headaches...

Thanks,
David

[smith@hugin neglect_mvpa]$ qstat
job-ID   prior    name        user   state  submit/start at      queue  slots  ja-task-ID
-----------------------------------------------------------------------------------------
3049142  0.25091  nodetest.s  smith  qw     12/10/2011 13:17:52         1
3049150  0.25043  nodetest.s  smith  qw     12/10/2011 13:18:41         1
3049155  0.25013  nodetest.s  smith  qw     12/10/2011 13:19:11         1
3049156  0.25007  nodetest.s  smith  qw     12/10/2011 13:19:17         1
3049157  0.25000  nodetest.s  smith  qw     12/10/2011 13:19:24         1
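
For anyone who wants to check for waiting jobs from a script rather than by eye, here is a rough sketch. It assumes the plain-text qstat layout shown in this thread, with the job ID in the first whitespace-separated column and the state in the fifth; `pending_jobs` is a hypothetical helper name, not a BIAC or Grid Engine tool, and it would misread rows whose job name or user contains spaces.

```python
def pending_jobs(qstat_text):
    """Return the job IDs whose state column is 'qw' (queued, waiting)."""
    pending = []
    for line in qstat_text.splitlines():
        fields = line.split()
        # Data rows start with a numeric job ID; this skips the header
        # line and the dashed rule. Assumption: state is the 5th column.
        if len(fields) >= 5 and fields[0].isdigit() and fields[4] == "qw":
            pending.append(fields[0])
    return pending


# Sample text copied from the listing in this thread (first two rows only).
sample = """\
job-ID prior name user state submit/start at queue slots ja-task-ID
-------------------------------------------------------------------
3049142 0.25091 nodetest.s smith qw 12/10/2011 13:17:52 1
3049150 0.25043 nodetest.s smith qw 12/10/2011 13:18:41 1
"""

print(pending_jobs(sample))
```

A job sitting in qw only means it has not been dispatched yet; on its own it does not show which node, if any, is at fault.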
petty Posted - Dec 06 2011 : 1:57:32 PM
This will be happening; please plan accordingly:

Cluster nodes will be turned off around 12:30 PM. The head node will remain on.

http://www.biac.duke.edu/forums/topic.asp?TOPIC_ID=1554
