Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Titlesort descending Category Resolution Description Posted Updated
Problems with MVAPICH2 Owens, Ruby, Software Resolved

Some MVAPICH2 MPI installations on Oakley, Ruby, and Owens, such as the default module mvapich2/2.2 as well as mvapich2/2.1, appear to have a bug that is triggered by certain programs.  The... Read more

8 years 11 months ago 2 years 8 months ago
Problems with Project Space (/nfs/gpfs) filesystem Resolved

(9/8/15 14:21 Eastern) Project space appears to be back to normal operation. We are running some tests to verify that the problem is fully resolved.


As of early afternoon, Sept. 8,... Read more

9 years 4 months ago 9 years 4 months ago
Problems with the home directories filesystem Resolved

 We are currently seeing problems with the home directories at OSC's HPC facility.... Read more

4 years 8 months ago 4 years 8 months ago
Problems with the nightly backup on ess filesystem Backups Resolved

We are having problems with the nightly backup on ess filesystem, causing missing backups of the /fs/ess file system since Monday, August 9th. Backups of home directory and /fs/project are normal... Read more

3 years 5 months ago 3 years 5 months ago
Proj13 file system difficulties filesystem Resolved

We are currently experiencing difficulties with the servers for the filesystem mounted at /nfs/proj13.

11 years 3 months ago 11 years 3 months ago
Project space giving errors "No space left on device" filesystem Resolved

11/01/2016 11:52AM Update: This issue has been fixed. 

We have become aware of a problem with the Project storage space that gives errors "No space left on device". The... Read more

8 years 2 months ago 8 years 2 months ago
PyTorch jobs timeout and hanging GPU Resolved

We have observed that many PyTorch users frequently encounter random timeouts, which result in the termination of their jobs but leave the process running on the node.... Read more

1 year 6 months ago 1 year 2 weeks ago
qsub filter rejects valid jobs Resolved

Job scripts submitted on Glenn, Oakley, or Ruby all go a submit filter before reaching the resource manager, Torque.  A bug has been discovered in our submit filter which prevents jobs with the... Read more

9 years 10 months ago 9 years 3 months ago
quota exceeded error when using chgrp in /fs/ess directories filesystem Resolved

Users may receive an error when using the chgrp command on data in /fs/ess/ locations.

$ chgrp -v PEX1234 my-file.txt
chgrp: changing group of 'my-file.txt': Disk quota exceeded
failed... Read more          
1 year 11 months ago 1 year 11 months ago
Replacement of Owens Ethernet switches from Dec 14, 2018 Network, Owens Resolved

Updated on Jan 16, 2019, at 09:20 AM:

The replacement is done except for the three switches including the login nodes of Owens. We posted another notice for more... Read more

6 years 4 months ago 6 years 1 week ago

Pages