Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolution Description Posted Updatedsort ascending
Issue with GPFS on Owens since April 14, 2017 Batch, filesystem, Owens Resolved

3:10PM 4/18/2017 Update: Rolling reboots on Owens have started to address this GPFS issue. 

We have had issues with GPFS mounts on Owens Cluster since Friday afternoon,... Read more

5 years 5 months ago 5 years 5 months ago
Rolling reboot of all clusters, starting from Wednesday morning, April 19, 2017 Batch, Maintenance, Owens, Ruby Resolved

1:40PM 4/27/2017 Update: Rolling reboots are completed. 

3:10PM 4/18/2017 Update: Rolling reboots on Owens have started to address GPFS errors occured... Read more

5 years 5 months ago 5 years 5 months ago
Scratch and Project are hung; schedulings have been paused Batch, filesystem Resolved

1:00PM 4/6/2017 Update:  The Scratch and Project file systems are back to normal service. Scheduling on systems are resumed. We are still investigating the causes to this problem... Read more

5 years 6 months ago 5 years 6 months ago
Owens is in Partial Service Owens Resolved

3:45PM April 3, 2017 Update: GPU nodes on Owens are available. 

206 Owens nodes are not accessible to users due to GPU testing and a bad Ethernet switch. It is expected... Read more

5 years 6 months ago 5 years 6 months ago
Rolling reboot of compute and login nodes of all clusters, starting from Wednesday morning, March 22, 2017 login, Owens, Ruby Resolved

4:56PM 3/28/2017 Update: The rolling reboots of all systems are completed. 

All compute nodes and login nodes of Owens, Oakley, and Ruby clusters will need to be rebooted... Read more

5 years 6 months ago 5 years 6 months ago
Update on 02/24/2017: All services available Outage Resolved

02/24/17 3:50PM Update: All Services have been restored including:

  • Oakley cluster with full capacity for general access
  • Ruby cluster with full capacity for... Read more
5 years 7 months ago 5 years 7 months ago
Critical change about using $PFSDIR directory at OSC Batch Resolved

Starting from Thursday, Feb 2nd, the $PFSDIR directory on scratch (/fs/scratch) won’t be created by job prologue. For example, if you simply use the command cd $PFSDIR,... Read more

5 years 8 months ago 5 years 7 months ago
Issues with VDI through OnDemand Batch Resolved

Update: Jan 23, 2017 3PM: this issue has been fixed. 

There is an issue with OSC OnDemand -> Desktops -> Virtual Desktop Interface (VDI) such that you get "qsub: submit error ..."... Read more

5 years 8 months ago 5 years 8 months ago
Dec 27, 2016: Issues with /fs/project filesystem Resolved

Dec 27, 2016 3:46PM Update: Both project and scratch file systems (/fs/project and /fs/scratch ) are back to normal now.  Some users' jobs may be... Read more

5 years 9 months ago 5 years 9 months ago
Problems with LAMMPS 14May16 Software Resolved

LAMMPS 14May16 was built with the USER-OMP package on Oakley, Ruby, and Owens. Its default behavior is to spawn too many OpenMP threads. lammps/14May16 batch scripts that do not use the USER-OMP... Read more

6 years 3 weeks ago 5 years 10 months ago

Pages