Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort descending Description Posted Updated
PyTorch jobs timeout and hanging GPU Resolved

We have observed that many PyTorch users frequently encounter random timeouts, which result in the termination of their jobs but leave the process running on the node.... Read more

1 year 2 weeks ago 6 months 3 weeks ago
Oakley login node down login Resolved

One of the Oakley login nodes is down. We are currently working on bringing it back online. SSH connections to oakley.osc.edu may time out. A workaround is to connect directly to oakley01.osc.edu... Read more

10 years 4 months ago 10 years 4 months ago
Spurious warnings about balance being exhausted client portal Resolved

Due to the price changes and some specifics about MyOSC, you may get warnings... Read more

4 years 3 weeks ago 3 years 11 months ago
Intermittent issue with connecting to batch server Batch, Owens Resolved

Updated on June 18, 2018, at 3:15 PM:

This issue has been fixed. 

Posted on June 18, 2018, at 12:30 PM:

We've been having intermittent... Read more

6 years 1 month ago 6 years 1 month ago
LAMMPS ppn=40 GPU problem on Pitzer Pitzer, Software Resolved

On Pitzer both LAMMPS 22Aug18 and 5Jun19 gpu jobs with ppn=40 hang immediately after lammps begins execution.  A workaround is to use ppn=39.

5 years 9 months ago 4 years 6 months ago
Scheduling temporarily suspended on Oakley Batch Resolved

We are migrating the batch scheduler on Oakley to a new virtual machine. In order to accomplish this, the scheduler will be temporarily offline for about four hours on December 16th. Running jobs... Read more

8 years 7 months ago 8 years 7 months ago
ondemand outage OnDemand Resolved

Resolution notes

The problems with ondemand.osc.edu are now resolved.

Users will encounter errors using... Read more

2 years 3 months ago 2 years 3 months ago
Sign Up reCAPTCHA Error Resolved

If you fail to hit the reCAPTCHA and try to submit the form, you will receive an error regarding the reCAPTCHA.

If you hit the reCAPTCHA and re-submit, the error will remain.

... Read more

4 years 12 months ago 4 years 8 months ago
2/26 Downtime Difficulties Operations Resolved

All systems should be functioning normally. Please report any remaining issues to OSC Help.

----
A number of systems are still experiencing problems after yesterday's... Read more

11 years 5 months ago 11 years 5 months ago
Large MPI job startup hang with mvapich2/2.3 and mvapich2/2.3.1 Owens, Pitzer, Software Resolved
(workaround)

We have found that large MPI jobs may hang at startup with mvapich2/2.3 and mvapich/2.3.1 (on any compiler dependency) due to a known bug that has been fixed in release 2... Read more

4 years 8 months ago 2 years 3 months ago

Pages