We are currently experiencing temporary instability on the Ascend login nodes.

Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort descending Description Posted Updated
OpenMPI job stopped at 'There are not enough slots available in the system to satisfy the slots' Owens, Pitzer, Software Resolved

Users would encounter a MPI job failed with openmpi/3.1.0-hpcx on Owens and Pitzer. The job would stop with the error  like "There are not enough slots available in the system to... Read more

5 years 10 months ago 5 years 9 months ago
Usage overview graph for storage client portal Resolved

The project storage section for usage overview displays a graph with a percentage for the amount of storage used. Although the values above the graph are correct, the graph itself shows 100%.... Read more

5 years 10 months ago 5 years 6 months ago
cuda-gdb segmentation fault on startup Owens, Pitzer, Software Resolved

The CUDA debugger, cuda-gdb, can raise a segmentation fault immediately upon execution.  A workaround before executing cuda-gdb is to unload the xalt module, e.g.: 

module unload... Read more          
6 years 1 month ago 4 years 2 months ago
Maintenance outage on the cluster export services Maintenance, OnDemand, Ruby Resolved

Update on 14 April 2020, 0903:

Work is completed.

Original message:

There will be maintenance on cluster export services on Tuesday, April... Read more

6 years 2 months ago 6 years 1 month ago
Error 'libim_client.so: undefined reference to uuid@' with MVAPICH2 in Conda environment Owens, Pitzer, Software Resolved

Users may encoutner an error like 'libim_client.so: undefined reference to `uuid_unparse@UUID_1.0' while compiling MPI applications with mvapich2 in some Conda enivronments. We found pre-installed... Read more

6 years 2 months ago 4 years 2 months ago
GPFS problems on Owens filesystem Resolved

Owens is experiencing a disruption of GPFS availability. At about 4:17PM today (January 6th), OSC monitoring noticed a problem with mounts of Project on the Owens supercomputer. Jobs may have been... Read more

6 years 5 months ago 6 years 5 months ago
my.osc.edu outbound email outage Account Management, client portal, login, Login Problems Resolved

There is an issue with my.osc.edu sending outbound emails, including the 'Forgot Password' emails.

There is ongoing work to resolve this issue as soon as possible. 

Contact OSC... Read more

6 years 4 months ago 6 years 4 months ago
Rolling reboot of Owens and Pitzer starting from February 3, 2020 login, Owens, Pitzer Resolved

2:11 PM 2/17/2020 Update:

The rolling reboot of Owens has been completed.

12:41 PM 2/10/2020 Update:

The rolling reboot of Pitzer has been... Read more

6 years 4 months ago 6 years 3 months ago
Storage Problems - GPFS services (Project / Scratch) filesystem Resolved

Updates at 15:52 March 11, 2020:

The issue with the Project file system that causes deletes of file system snapshots to fail has now been resolved. OSC Project file system... Read more

6 years 3 months ago 6 years 3 months ago
(informational) GPFS maintenance work duplicate known issue filesystem Resolved

Maintenance work on the GPFS servers is scheduled to be performed today, 28 Feb 2020 at 2:00p.m.

Although there is no direct impact expected to services at OSC, there may be short... Read more

6 years 3 months ago 6 years 3 months ago

Pages