Duke-UNC Brain Imaging and Analysis Center
BIAC Forums | Profile | Register | Active Topics | Members | Search | FAQ
Username:
Password:
Save Password   Forgot your Password?
 All Forums
 Support Forums
 Cluster Support
 Job status email notification
 New Topic  Reply to Topic
 Printer Friendly
Author Previous Topic Topic Next Topic  

Adria
New Member

19 Posts

Posted - Nov 10 2011 :  4:03:19 PM  Show Profile  Reply with Quote
I am running a job that is taking a while to finish. Is there a way I can have the cluster notify me via email when the job is complete?

petty
BIAC Staff

USA
453 Posts

Posted - Nov 10 2011 :  4:08:27 PM  Show Profile  Reply with Quote
you can add a notification if you didn't do it at submission time:

qalter -m e -M email@address.com JOBID

-m is the email notication flag 'e' means ending
Go to Top of Page

Adria
New Member

19 Posts

Posted - Nov 10 2011 :  5:09:46 PM  Show Profile  Reply with Quote
Thank you! Now what if I have a job running, but it doesn't finish before I leave work. If I log out of my computer, and return tomorrow, will it start off where it ended the day before, or will it rerun the whole job?
Go to Top of Page

petty
BIAC Staff

USA
453 Posts

Posted - Nov 10 2011 :  8:14:36 PM  Show Profile  Reply with Quote
If its a batch job it will keep running, those are not tied to your computer at all.

It it's interactive it will die when your shell closes.
Go to Top of Page

Adria
New Member

19 Posts

Posted - Nov 17 2011 :  3:07:52 PM  Show Profile  Reply with Quote
Hi, thanks for the help. I have another question. I have set up a qsub script for FSLVBM, but I don't know when and where to enter the command to send a notification to my email address. I have been referencing the conversation between you and user dit11: http://www.biac.duke.edu/forums/topic.asp?TOPIC_ID=1491&SearchTerms=qsub
Go to Top of Page

syam.gadde
BIAC Staff

USA
421 Posts

Posted - Nov 17 2011 :  3:13:16 PM  Show Profile  Reply with Quote
Most of the job submission info is listed here:
http://wiki.biac.duke.edu/biac:cluster:submit

If you are running a command/script through qsub, it should automatically set your email correctly, but you can explicitly set it in your script with a line like:
#$ -M user@somewhere.edu
Go to Top of Page

Adria
New Member

19 Posts

Posted - Nov 17 2011 :  3:35:49 PM  Show Profile  Reply with Quote
So does that mean that qsub will automatically send me an email without me having to send out an option, or are there command line options I have to use?
Go to Top of Page

petty
BIAC Staff

USA
453 Posts

Posted - Nov 17 2011 :  4:00:25 PM  Show Profile  Reply with Quote
Put these two things in your qsub script

#$ -m ea
#$ -M user@school.edu


"-m ea" means to email you if the job ends or is altered
"-M " is the email address to send the notification too

you'll need both arguments
Go to Top of Page

Adria
New Member

19 Posts

Posted - Nov 17 2011 :  4:14:52 PM  Show Profile  Reply with Quote
I typed in the -M command and the job went through, but I got an error saying "[akv7@node51 data]$ qsub -v EXPERIMENTS=fse.01 fslvbm.sh
Invalid EMAIL detected!
searching users for : akv7
changing email to adria.vidovic@duke.edu
Your job 2834398 ("fslvbm.sh") has been submitted
[akv7@node51 data]$ more fslvbm.sh
#!/bin/sh"

When I went back to check to see if I put in the right email address, it turns out that I typed it correctly. What do you think is happening?
Go to Top of Page

petty
BIAC Staff

USA
453 Posts

Posted - Nov 17 2011 :  4:35:36 PM  Show Profile  Reply with Quote
2 things: You need the "#" symbol infront of all of your SGE flags inside the script:

# -- END USER DIRECTIVE --
$ -M adria.vidovic@duke.edu
# -- BEGIN USER SCRIPT --

needs to be

# -- END USER DIRECTIVE --
#$ -M adria.vidovic@duke.edu
# -- BEGIN USER SCRIPT --

and you need to delete:
#$ -M someone@somewhere.edu



Go to Top of Page

Adria
New Member

19 Posts

Posted - Nov 17 2011 :  4:46:20 PM  Show Profile  Reply with Quote
Ok great. Thank you!
Go to Top of Page

Adria
New Member

19 Posts

Posted - Nov 17 2011 :  5:10:24 PM  Show Profile  Reply with Quote
Hi, Just for future reference, what does SGE stand for?
Go to Top of Page

petty
BIAC Staff

USA
453 Posts

Posted - Nov 17 2011 :  6:21:56 PM  Show Profile  Reply with Quote
Sun grid engine
Go to Top of Page

Adria
New Member

19 Posts

Posted - Dec 05 2011 :  09:24:36 AM  Show Profile  Reply with Quote
I received this error in the email I had BIAC send me.
Job 2834410 (fslvbm.sh) Aborted
Exit Status = -1
Signal = unknown signal
User = akv7
Queue = users.q@node56.local
Host = node56.local
Start Time = <unknown>
End Time = <unknown>
CPU = NA
Max vmem = NA
failed before writing exit_status because:
shepherd exited with exit status 19: before writing exit_status

What does this mean and is there anything I need to change in my script to make it work? Thank you!

Go to Top of Page

petty
BIAC Staff

USA
453 Posts

Posted - Dec 05 2011 :  09:34:39 AM  Show Profile  Reply with Quote
looks like if failed before it could even make it to a node, which is why all those values are blank.

you can try "dos2unix" on the script to remove any invalid characters, otherwise its likely a scripting error somewhere early in the script

Go to Top of Page
  Previous Topic Topic Next Topic  
 New Topic  Reply to Topic
 Printer Friendly
Jump To:
BIAC Forums © 2000-2010 Brain Imaging and Analysis Center Go To Top Of Page
This page was generated in 0.42 seconds. Snitz Forums 2000