Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort ascending Description Posted Updated
Poor network performance on some filesystems filesystem Resolved

We are experiencing some network performance issues on a cluster of servers involved with providing GPFS and some project filesystems. GPFS appears to be functioning acceptably, but proj01, proj02... Read more

11 years 12 months ago 11 years 12 months ago
A partial-node MPI job failed to start using Intel MPI mpiexec Owens, Pitzer, Software Resolved
(workaround)

A partial-node MPI job may fail to start using mpiexec from intelmpi/2019.3 and intelmpi/2019.7 with error messages like

[mpiexec@o0439.ten.osc.... Read more          
4 years 8 months ago 2 months 1 week ago
Rolling reboot of Owens cluster, starting from 8:30AM Oct 30, 2017 Batch, Owens Resolved

Updated on Nov 21, 2017 at 3:33PM:

It has been completed. 

Updated on October 20, 2017 at 4:19PM:

We will have a rolling reboot of Owens... Read more

7 years 9 months ago 7 years 8 months ago
OpenMPI 4 and NVHPC MPI Compatibility Issues with SLURM HWLOC Ascend, Cardinal, Software Resolved
(workaround)

A pure MPI application using mpirun or mpiexec with more ranks than the number of NUMA nodes may encounter an error similar to the following:... Read more

4 months 3 days ago 1 month 2 days ago
Job failures on some rolling-rebooted nodes on Owens since April 16, 2018 Owens Resolved

3:35 PM 4/30/2018 Update:

The cause is that NFSv4.1 is not configured correctly after OS on Owens was updated from RHEL 7.3 to 7.4. We re-rebooted the Owens compute nodes... Read more

7 years 3 months ago 7 years 2 months ago
Singularity: failed to run a container directly or pull an image from Singularity or Docker hub Software Resolved
(workaround)

You might encounter an error while run a container directly from a hub:

[pitzer-login01]$ apptainer run shub://vsoch/hello-world
Progress |===================================| 100.0%... Read more          
2 months 1 week ago 2 months 1 week ago
Ruby Rolling Reboot Resolved

2015/02/16 RUBY Rolling Reboot starting Today

 

A rolling reboot is required on Ruby to update a critical... Read more

10 years 5 months ago 10 years 5 months ago
Possible performance degradation after August 9th's downtime filesystem Resolved

Updates on May 20 2023:

verbsRDMA is enabled on Pitzer. 

Updates on Dec 14 2022:

verbsRDMA is enabled on Owens during December 13 downtime... Read more

2 years 11 months ago 2 years 1 month ago
Password Expiration Emails client portal Resolved

Password expiration notices are still being sent after you change your password.

To ensure your password change date has been updated and the account will not expire, please look at... Read more

6 years 2 months ago 6 years 2 months ago
GPFS problems on Owens filesystem Resolved

Owens is experiencing a disruption of GPFS availability. At about 4:17PM today (January 6th), OSC monitoring noticed a problem with mounts of Project on the Owens supercomputer. Jobs may have been... Read more

5 years 6 months ago 5 years 6 months ago

Pages