The Ohio Supercomputer Center (OSC) is experiencing an email delivery problem with several types of messages from MyOSC. 

OSC is preparing to update Slurm on its production systems to version 23.11.4 on March 27.

Known issues

Unresolved known issues

A known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available, which may include using different software or hardware.

A known issue with a Resolved Resolution state has been corrected.

Known Issues

Title Category Resolution Description Posted Updated
Oakley login node down login Resolved

One of the Oakley login nodes is down. We are currently working on bringing it back online. SSH connections to oakley.osc.edu may time out. A workaround is to connect directly to oakley01.osc.edu...

10 years 2 weeks ago 10 years 2 weeks ago
Oakley login node down Resolved

One of the Oakley login...

9 years 2 months ago 9 years 1 month ago
Oakley and Owens queue issue Batch Resolved

We are experiencing a problem with the queuing system on Oakley and Owens that is delaying or preventing new jobs from running. Our systems staff is investigating.

6 years 3 months ago 6 years 3 months ago
Nvidia drivers on Oakley GPU Resolved

We upgraded the drivers for the Nvidia GPUs on all of our clusters during the downtime this week. Unfortunately, we are noticing some subtle problems with the GPUs on Oakley. We will be rolling...

7 years 5 months ago 5 years 9 months ago
Nsight GPU profiler not working due to DCGM conflict GPU, Infrastructure Resolved

UPDATE (Mar 15, 2023)

After the downtime on Mar. 14, 2023, OSC enabled a new Slurm option --gres=nsight. DCGM will be disabled on the nodes for the job with the Slurm option,...

1 year 3 weeks ago 1 year 1 week ago
NFS service disruption 6/29/16 filesystem Resolved

OSC experienced errors with NFS services on the morning of June 29 between 08:37 and 09:12 that may have caused some jobs to fail or led to other unexpected behavior. The...

7 years 9 months ago 7 years 9 months ago
NFS outage on Thursday Jan 17 from 7am to 8am filesystem Resolved

Update:

This work has been canceled and will be done during downtime on Feb. 5. 

Original Post:

On Thursday, January 17th from 7 am to 8 am OSC will have a GPFS...

5 years 2 months ago 5 years 2 months ago
Network card re-seat Network Resolved

At 8 AM on Tuesday, July 9th, 2013, we will be re-seating a network card in a switch at our operations center. A brief (~10 minute) outage may occur. Jobs will pause for the...

10 years 8 months ago 10 years 8 months ago
Negative Balance Emails client portal Resolved

Negative balance emails continue to be sent once an application is submitted.

To confirm whether or not you have truly submitted an application for additional resources and that you can...

4 years 10 months ago 4 years 6 months ago
NAMD 2.11 precompiled binaries do not work Software Resolved

NAMD 2.11 precompiled binaries do not work. Please use the NAMD 2.11 build installed from source, which is available via the module namd/2.11.

The NAMD 2.11 issue involves changes to the command charmrun...

8 years 1 month ago 5 years 2 months ago
