Notify you that one or more of your jobs was running on a compute node that crashed due to a system software problem.
Failure of job(s) 919137 due to a system software problem at OSC
OSC Help <OSCHelp@osc.edu>
Your job failed and was not at fault. You should resubmit the job. Usually the problems are caused by another job running on the node.
These emails are sent by a systems administrator as part of the node cleanup process.
We don’t have a mechanism to turn off these emails. If they really bother you, contact OSC Help and we’ll try to accommodate you.
If you request a whole node (nodes=1:ppn=12 on Oakley or nodes=1:ppn=8 on Glenn) your jobs will be less susceptible to problems caused by other jobs. Other than that, be assured that we work hard to keep jobs from interfering with each other.