The Ohio Supercomputer Center (OSC) is experiencing an email delivery problem with several types of messages from MyOSC. 

 OSC is preparing to update Slurm on its production systems to version 23.11.4 on March, 27. 

Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolution Description Postedsort ascending Updated
Scheduling temporarily suspended on Oakley Batch Resolved

We are migrating the batch scheduler on Oakley to a new virtual machine. In order to accomplish this, the scheduler will be temporarily offline for about four hours on December 16th. Running jobs... Read more

8 years 3 months ago 8 years 3 months ago
pdsh -j broken on Oakley Batch, system software Resolved

pdsh -j is broken on Oakley.  It was broken by updates during the September downtime.  We are currently working on resolving the issue.

Users who require... Read more

8 years 3 months ago 5 years 9 months ago
Estimated charging for serial jobs on Oakley is incorrect Batch Resolved

Currently, the estimated RU charge reported at the end of a job shows an incorrect value for serial jobs on Oakley of the entire node. Jobs are being charged the correct amount in the official... Read more

8 years 5 months ago 5 years 9 months ago
Downtime Update: All Major Services Online Resolved

Friday, Sept 25th 12PM Noon:

  • Oakley is back online and has resumed running jobs.  
  • Ruby... Read more
8 years 6 months ago 8 years 6 months ago
Problems with Project Space (/nfs/gpfs) filesystem Resolved

(9/8/15 14:21 Eastern) Project space appears to be back to normal operation. We are running some tests to verify that the problem is fully resolved.


As of early afternoon, Sept. 8,... Read more

8 years 6 months ago 8 years 6 months ago
Lustre bug causing Oakley login node crashes filesystem, login Resolved

Over the past two weeks we have experienced Oakely login node crashes potentially caused by a Lustre bug.  The bug (or issue otherwise) seems to be activated when a user does operations on a... Read more

8 years 7 months ago 8 years 5 months ago
Unscheduled GPFS Outage filesystem Resolved

As of 11:30PM on June 16th, we have removed the GPFS filesystem from service due to a number of hardware failures. At this point, further hardware failures would put a large portion of the entire... Read more

8 years 9 months ago 8 years 9 months ago
warning: libhwloc.so.1 may conflict with libhwloc.so.5 Resolved

Sometimes when building MPI programs the following warning appears.  It is harmless and can be safely ignored.

ld: warning: libhwloc.so.1, needed by /usr/local/mvapich2/1.7-intel/lib/... Read more          
8 years 11 months ago 8 years 5 months ago
Matlab PCT broken due to pbsrsh modification Matlab Resolved

A change was made to the system wide pbsrsh script which Matlab relies on.  It has been discovered that this change has broken the parallel computing toolbox (... Read more

8 years 11 months ago 8 years 5 months ago
module spider/avail/show not showing MPI dependent modules Ruby Resolved

On Ruby, the commands:

  • module spider
  • module avail
  • module show... Read more
8 years 11 months ago 8 years 5 months ago

Pages