We are currently experiencing temporary instability on the Ascend login nodes.

Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort ascending Description Posted Updated
NCCL hang on Ascend dual-GPU nodes Ascend, GPU, Software Resolved
(workaround)

Users may encounter the following message and experience NCCL hangs if the first operation is a barrier when running multi-GPU training. We have identified... Read more

1 year 1 week ago 1 year 6 days ago
Backups of /nfs/gpfs Backups Resolved

Changes to files on /nfs/gpfs may not be backed up during the following evening's backup, as would normally be expected. The backup software is attempting to recreate a full backup of the... Read more

13 years 1 month ago 13 years 1 month ago
openmpi/4.1.1 is deprecated Resolved

openmpi/4.1.1-hpcx will be removed on November 29th, 2022 due to InfiniBand drivers (MOFED) update. Please use compatible and bug-fixed version 'openmpi/4.1.2-hpcx' to run ORCA or your MPI... Read more

3 years 6 months ago 3 years 6 months ago
Proj13 file system difficulties filesystem Resolved

We are currently experiencing difficulties with the servers for the filesystem mounted at /nfs/proj13.

12 years 8 months ago 12 years 8 months ago
Outbound Emails from MyOSC are Blocked at MS 365 Servers Account Management, client portal Resolved

Outbound emails, including account verification emails, from MyOSC to institutional email addresses utilizing MS 365 are being blocked as phishing attempts at the institution's server.

... Read more

2 years 9 months ago 2 years 9 months ago
Apptainer apt GPG Error Caused by Private /tmp Software Resolved
(workaround)

Duplicate of:

https... Read more

4 months 6 days ago 1 month 2 weeks ago
abaqus with UMAT Software Resolved

On Owens, usage of user-defined material (UMAT) script for abaqus is limited as following:

abaqus 2017: correctly running on single and multi-nodes

abaqus 6.14 and 2016: correctly... Read more

8 years 3 months ago 8 years 4 days ago
/fs/ess Project Details Storage Reporting client portal Resolved

If you have a storage quota on /fs/ess, the project storage is showing as zero in project details. You can still view storage information in the "Usage Overview" tab.

5 years 11 months ago 5 years 10 months ago
Oakley login node down Resolved

One of the Oakley login... Read more

11 years 4 months ago 11 years 4 months ago
Core and Node labels on Classroom app are incorrect Resolved

The core and node labels on the Classroom app (class.osc.edu) incorrectly displays as '0', regardless of the requested number of cores for a job. While this label is incorrect, the job is still... Read more

1 year 5 months ago 1 year 4 months ago

Pages