Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort descending Description Posted Updated
GPFS problems with /fs/project and possibly /fs/scratch filesystem Resolved

There was an issue with GPFS clients that affected /fs/project and possibly /fs/scratch between around 3:30AM and 8:30AM on Sunday September 4th. Some jobs from clients were also impacted. 

... Read more
2 years 8 months ago 2 years 8 months ago
Rolling reboot of compute and login nodes of all clusters, starting from Wednesday morning, March 22, 2017 login, Owens, Ruby Resolved

4:56PM 3/28/2017 Update: The rolling reboots of all systems are completed. 

All compute nodes and login nodes of Owens, Oakley, and Ruby clusters will need to be rebooted... Read more

8 years 2 months ago 8 years 1 month ago
Storage Problems - GPFS services (Project / Scratch) filesystem Resolved

Updates at 15:52 March 11, 2020:

The issue with the Project file system that causes deletes of file system snapshots to fail has now been resolved. OSC Project file system... Read more

5 years 2 months ago 5 years 2 months ago
Account changes temporarily suspended Account Management Resolved

We are still experiencing some account problems related to Thursday's issue. As a result, we have taken my.osc.edu offline and cannot process email changes or password resets, either via self-... Read more

10 years 11 months ago 10 years 11 months ago
OnDemand unresponsive login Resolved

Some of the login nodes on Owens and Pitzer are in bad states. User can't log into OnDemand. And scratch is unresponsive sometimes. We are working on this issue. We will update when we have more... Read more

4 years 11 months ago 4 years 11 months ago
Lustre Updates filesystem Resolved

9/10/14 - We have not seen any additional crashes of the Lustre servers since making this change.

8/26/14 
- Lustre jobs are being accepted as of 10AM this... Read more

10 years 8 months ago 10 years 8 months ago
- --gpus-per-task is not working Batch Resolved

Updated: This is fixed. 

Original Post:

After the recent Slurm upgrade, the option --gpus-per-task is currently not functioning as... Read more

4 months 6 days ago 4 months 6 days ago
Stale File Handles on GPFS clients filesystem Resolved

OSC is experiencing some problems with the Project and Scratch filesystems that are resulting in some jobs seeing "stale file handles". We are investigating the problem and will provide updates as... Read more

6 years 4 months ago 6 years 4 months ago
Problems with home directory servers filesystem Resolved

We had several auto-reboots with our home directory servers, starting from around 11pm August 30. Jobs might be impacted. The systems are working properly now.  

... Read more
3 years 8 months ago 3 years 8 months ago
Problems with MVAPICH2 Owens, Ruby, Software Resolved

Some MVAPICH2 MPI installations on Oakley, Ruby, and Owens, such as the default module mvapich2/2.2 as well as mvapich2/2.1, appear to have a bug that is triggered by certain programs.  The... Read more

9 years 3 months ago 3 years 1 week ago

Pages