Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort descending Description Posted Updated
Ondemand error when terminating interactive app OnDemand Resolved
(workaround)

When trying to delete an interactive session through OnDemand, you may receive an error page about 'No such file'. This can be disregarded. Simply navigate back to the interactive sessions page... Read more

5 years 2 months ago 3 years 10 months ago
qsub filter rejects valid jobs Resolved

Job scripts submitted on Glenn, Oakley, or Ruby all go a submit filter before reaching the resource manager, Torque.  A bug has been discovered in our submit filter which prevents jobs with the... Read more

10 years 10 months ago 10 years 4 months ago
OpenMPI-HPCX 4.1.x hangs on writing files on a shared file system Software Resolved
(workaround)

Your job utilizing openmpi/4.1.x-hpcx (or 4.1.x on Ascend) might hang while writing files on a shared file system. This issue is caused by a ... Read more

9 months 1 week ago 9 months 1 week ago
Rolling reboot of all clusters, starting from 9:30 AM June 05, 2019 Batch, login, Owens, Pitzer, Ruby Resolved

Update #2 Posted on 14 June 2019 12:33 PM

The rolling reboots of all clusters are completed. Please contact oschelp@osc.edu if you... Read more

6 years 8 months ago 6 years 8 months ago
16 core nodes on Glenn temporarily unavailable Operations Resolved

This issue has been resolved. The 16-core nodes are online.

---------------------------------------------------------------------------------------

16 core nodes on Glenn are currently... Read more

12 years 11 months ago 12 years 10 months ago
GPFS problems with /fs/project and possibly /fs/scratch filesystem Resolved

There was an issue with GPFS clients that affected /fs/project and possibly /fs/scratch between around 3:30AM and 8:30AM on Sunday September 4th. Some jobs from clients were also impacted. 

... Read more
3 years 5 months ago 3 years 5 months ago
Systemic Problem on Cluster Computing service Operations Resolved

4:20PM 6/23/2017 Update: All HPC systems are back in production. This outage may cause failures of users' jobs. We'll update the community as more is known. 

... Read more
8 years 8 months ago 8 years 7 months ago
AlphaFold 3 GPU Out-of-Memory Error During Inference Software Resolved
(workaround)

When you run AlphaFold 3, you may encounter a GPU out-of-memory (OOM) failures during model execution. The job terminated with errors similar to:

Can't... Read more          
2 months 2 weeks ago 2 months 2 weeks ago
Storage Problems - GPFS services (Project / Scratch) filesystem Resolved

Updates at 15:52 March 11, 2020:

The issue with the Project file system that causes deletes of file system snapshots to fail has now been resolved. OSC Project file system... Read more

5 years 11 months ago 5 years 11 months ago
issue with OnDemand 6:09 - 8:39 pm Resolved

OnDemand, epi accounting queries, the Viper DB, the Medline DB, the Eweld DB,... Read more

11 years 6 months ago 11 years 6 months ago

Pages