Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort descending Description Posted Updated
Very little free space for metadata on the scratch storage /fs/scratch filesystem Resolved

Updated 15:30 October 19:

The issue of little space for metadata on scratch storage is resolved. If you have any questions, please contact... Read more

2 years 10 months ago 2 years 10 months ago
GPFS filesystem Errors on June 4 2019 filesystem Resolved

Update Posted on 04 June 2019 12:27 PM

We fixed the problem with both project and scratch filesystem and the service has been restored. Please contact ... Read more

5 years 3 months ago 5 years 3 months ago
Issues with VDI through OnDemand Batch Resolved

Update: Jan 23, 2017 3PM: this issue has been fixed. 

There is an issue with OSC OnDemand -> Desktops -> Virtual Desktop Interface (VDI) such that you get "qsub: submit error ..."... Read more

7 years 7 months ago 7 years 7 months ago
PyTorch jobs timeout and hanging GPU Resolved

We have observed that many PyTorch users frequently encounter random timeouts, which result in the termination of their jobs but leave the process running on the node.... Read more

1 year 1 month ago 7 months 3 weeks ago
Oakley login node down login Resolved

One of the Oakley login nodes is down. We are currently working on bringing it back online. SSH connections to oakley.osc.edu may time out. A workaround is to connect directly to oakley01.osc.edu... Read more

10 years 5 months ago 10 years 5 months ago
Spurious warnings about balance being exhausted client portal Resolved

Due to the price changes and some specifics about MyOSC, you may get warnings... Read more

4 years 1 month ago 4 years 4 weeks ago
my.osc.edu logins failing Account Management Resolved

Logins to my.osc.edu are failing. This is unrelated to our InfiniBand issue; a router change at OARnet is the believed cause. They are working on re-establishing the necessary routing.

10 years 1 month ago 10 years 1 month ago
Rolling reboot of Pitzer cluster, starting from Feb 03, 2021 Batch, login, Pitzer Resolved

Updates at 10AM Feb 11, 2021:

The rolling reboot is completed. 

Original Post:

We will have rolling reboots of Pitzer cluster including... Read more

3 years 7 months ago 3 years 6 months ago
LAMMPS ppn=40 GPU problem on Pitzer Pitzer, Software Resolved

On Pitzer both LAMMPS 22Aug18 and 5Jun19 gpu jobs with ppn=40 hang immediately after lammps begins execution.  A workaround is to use ppn=39.

5 years 10 months ago 4 years 7 months ago
Scheduling temporarily suspended on Oakley Batch Resolved

We are migrating the batch scheduler on Oakley to a new virtual machine. In order to accomplish this, the scheduler will be temporarily offline for about four hours on December 16th. Running jobs... Read more

8 years 8 months ago 8 years 8 months ago

Pages