Known issues

Unresolved known issues

A known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available, which may include using different software or hardware.

A known issue with a Resolved Resolution state has been corrected.

Known Issues

Title Category Resolution Description Posted Updated
February 11 2014 Scheduled Downtime Outage Resolved

HPC systems are offline today for scheduled quarterly maintenance activity. For details, please visit osc.edu/n

12 years 4 days ago 12 years 3 days ago
(informational) GPFS maintenance work duplicate known issue filesystem Resolved

Maintenance work on the GPFS servers is scheduled to be performed today, 28 Feb 2020, at 2:00 p.m.

Although there is no direct impact expected to services at OSC, there may be short...

5 years 11 months ago 5 years 11 months ago
Job failures on some rolling-rebooted nodes on Owens since April 16, 2018 Owens Resolved

3:35 PM 4/30/2018 Update:

The cause is that NFSv4.1 was not configured correctly after the OS on Owens was updated from RHEL 7.3 to 7.4. We re-rebooted the Owens compute nodes...

7 years 10 months ago 7 years 9 months ago
Login Issues with my.osc.edu Outage Resolved

Update: this is fixed. 

Original post:

We are aware that some users may be experiencing issues logging into ...

1 year 4 months ago 1 year 4 months ago
Downtime Update: All Major Services Online Resolved

Friday, Sept 25th 12PM Noon:

  • Oakley is back online and has resumed running jobs.  
  • Ruby...
10 years 5 months ago 10 years 4 months ago
Very little free space for metadata on the scratch storage /fs/scratch filesystem Resolved

Updated 15:30 October 19:

The issue of little space for metadata on scratch storage is resolved. If you have any questions, please contact...

4 years 3 months ago 4 years 3 months ago
BWA Software Security Vulnerability Software Resolved

...

6 years 6 months ago 6 years 6 months ago
cp2k/2023.2 can produce huge output containing MKL messages Ascend, Cardinal, Pitzer, Software Resolved (workaround)

On all clusters, the cp2k executables from module cp2k/2023.2 can produce huge output files due to many repeated error messages from MKL, e.g.:

...
6 months 1 week ago 4 months 1 week ago
Issue with GPFS on Owens since April 14, 2017 Batch, filesystem, Owens Resolved

3:10PM 4/18/2017 Update: Rolling reboots on Owens have started to address this GPFS issue. 

We have had issues with GPFS mounts on the Owens Cluster since Friday afternoon,...

8 years 10 months ago 8 years 9 months ago
PyTorch jobs timeout and hanging GPU Resolved

We have observed that many PyTorch users frequently encounter random timeouts, which terminate their jobs but leave the process running on the node....

2 years 7 months ago 2 years 1 month ago