Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort descending Description Posted Updated
PyTorch jobs timeout and hanging GPU Unresolved

We have observed that many PyTorch users frequently encounter random timeouts, which result in the termination of their jobs but leave the process running on the node.... Read more

4 months 4 weeks ago 1 month 1 week ago
Pitzer Xfce Desktop through OnDemand is not working OnDemand Resolved

Pitzer Xfce Desktop through OnDemand is not working. Please choose 'Mate' desktop environment instead. We are working to fix this issue now and apologize for any inconvenience. Please contact ... Read more

4 years 10 months ago 4 years 10 months ago
Problems with home directory servers filesystem Resolved

We had several auto-reboots with our home directory servers, starting from around 11pm August 30. Jobs might be impacted. The systems are working properly now.  

... Read more
2 years 3 months ago 2 years 3 months ago
Problems with LAMMPS 14May16 Software Resolved

LAMMPS 14May16 was built with the USER-OMP package on Oakley, Ruby, and Owens. Its default behavior is to spawn too many OpenMP threads. lammps/14May16 batch scripts that do not use the USER-OMP... Read more

7 years 2 months ago 7 years 1 week ago
Ruby is offline Operations Resolved

The Ruby Transitional Cluster (only open to select research groups) is currently offline due to network problems. We expect it will return to service some time after the downtime.

10 years 2 months ago 9 years 10 months ago
Intermittent home directory performance issues filesystem Resolved

Users may experience performance issues in home directory. It is recommended to use temporary directory ($TMPDIR, or scratch) or project storage to minimize the impact on... Read more

5 months 1 week ago 4 months 3 weeks ago
Oakley and Owens queue issue Batch Resolved

We are experiencing a problem with the queuing system on oakley and owens that is delaying or preventing new jobs from running. Our systems staff is investigating.

 

5 years 11 months ago 5 years 11 months ago
OnDemand unresponsive login Resolved

Some of the login nodes on Owens and Pitzer are in bad states. User can't log into OnDemand. And scratch is unresponsive sometimes. We are working on this issue. We will update when we have more... Read more

3 years 6 months ago 3 years 6 months ago
Oakley login node problems Resolved

One of the Oakley login nodes (oakley01) has experienced some hardware failures and is temporarily out of service while repairs are ongoing.

Please limit your interactive use of the... Read more

8 years 11 months ago 8 years 11 months ago
module spider/avail/show not showing MPI dependent modules Ruby Resolved

On Ruby, the commands:

  • module spider
  • module avail
  • module show... Read more
8 years 7 months ago 8 years 2 months ago

Pages