The Ohio Supercomputer Center (OSC) is experiencing an email delivery problem with several types of messages from MyOSC. 

 OSC is preparing to update Slurm on its production systems to version 23.11.4 on March, 27. 

Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolution Description Postedsort descending Updated
Oakley login nodes and ruby02 will not be accessible between 9:00-9:30am on 10/18/2016 login Resolved

We upgraded to RHEL 6.8 for both Oakley and Ruby clusters during the October 12th's downtime. Unfortunately, we are noticing some NFS problem that has been causing rsh, or ssh sessions to hang on... Read more

7 years 5 months ago 7 years 5 months ago
Project space giving errors "No space left on device" filesystem Resolved

11/01/2016 11:52AM Update: This issue has been fixed. 

We have become aware of a problem with the Project storage space that gives errors "No space left on device". The... Read more

7 years 4 months ago 7 years 4 months ago
LAMMPS 14May16 velocity command problem on Owens Software Resolved
(workaround)

LAMMPS 14May16 on Owens can hang when using the velocity command.  Inputs that hang on Owens work on Oakley and Ruby.  LAMMPS 31Mar17 on Owens also works.  Here is an example failing input snippet... Read more

7 years 3 months ago 1 year 11 months ago
Performance Regression of GPU Nodes on Ruby GPU, Ruby Resolved

We currently have performance regression of Ruby's GPU nodes. Some of the GPU nodes on Ruby will remain in a power-saving state even after an application starts using them, resulting in... Read more

7 years 3 months ago 5 years 9 months ago
Dec 27, 2016: Issues with /fs/project filesystem Resolved

Dec 27, 2016 3:46PM Update: Both project and scratch file systems (/fs/project and /fs/scratch ) are back to normal now.  Some users' jobs may be... Read more

7 years 3 months ago 7 years 3 months ago
Issues with VDI through OnDemand Batch Resolved

Update: Jan 23, 2017 3PM: this issue has been fixed. 

There is an issue with OSC OnDemand -> Desktops -> Virtual Desktop Interface (VDI) such that you get "qsub: submit error ..."... Read more

7 years 2 months ago 7 years 2 months ago
Abaqus license contention Batch, Licensing Resolved

We have noticed some abaqus jobs end up in BatchHold. Once the job is in BatchHold, it will never start. This is because of sharing the abaqus licenses between Oakley and Owens. We have opened a... Read more

7 years 1 month ago 5 years 9 months ago
Critical change about using $PFSDIR directory at OSC Batch Resolved

Starting from Thursday, Feb 2nd, the $PFSDIR directory on scratch (/fs/scratch) won’t be created by job prologue. For example, if you simply use the command cd $PFSDIR,... Read more

7 years 1 month ago 7 years 1 month ago
Update on 02/24/2017: All services available Outage Resolved

02/24/17 3:50PM Update: All Services have been restored including:

  • Oakley cluster with full capacity for general access
  • Ruby cluster with full capacity for... Read more
7 years 1 month ago 7 years 3 weeks ago
Rolling reboot of compute and login nodes of all clusters, starting from Wednesday morning, March 22, 2017 login, Owens, Ruby Resolved

4:56PM 3/28/2017 Update: The rolling reboots of all systems are completed. 

All compute nodes and login nodes of Owens, Oakley, and Ruby clusters will need to be rebooted... Read more

7 years 2 weeks ago 7 years 1 day ago

Pages