vLLM versions prior to 0.14.1 are deprecated due to security issue CVE-2026-22778 which can allow remo

Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort descending Description Posted Updated
Matlab parallel pool cannot connect Matlab, Pitzer Unresolved

In some cases, matlab will hang with connecting to parallel pool on Pitzer. This was discovered in the 2026 January downtime. We are still investigating

1 month 1 week ago 1 month 1 week ago
GPU Memory Not Released Causing OOM in Subsequent Jobs GPU Unresolved

We have noticed that GPU memory is not being properly released in some jobs, causing subsequent jobs on the same nodes to run out of memory (OOM). We are currently working on a resolution... Read more

4 months 3 weeks ago 4 months 3 weeks ago
cp2k/2023.2 can produce huge output containing MKL messages Ascend, Cardinal, Pitzer, Software Resolved
(workaround)

On all clusters the cp2k executables from module cp2k/2023.2 can produce huge output files due to many many repeating errors from MKL, e.g.:

... Read more          
6 months 1 week ago 4 months 1 week ago
Issues with VDI through OnDemand Batch Resolved

Update: Jan 23, 2017 3PM: this issue has been fixed. 

There is an issue with OSC OnDemand -> Desktops -> Virtual Desktop Interface (VDI) such that you get "qsub: submit error ..."... Read more

9 years 3 weeks ago 9 years 3 weeks ago
PyTorch jobs timeout and hanging GPU Resolved

We have observed that many PyTorch users frequently encounter random timeouts, which result in the termination of their jobs but leave the process running on the node.... Read more

2 years 7 months ago 2 years 1 month ago
Oakley login node down login Resolved

One of the Oakley login nodes is down. We are currently working on bringing it back online. SSH connections to oakley.osc.edu may time out. A workaround is to connect directly to oakley01.osc.edu... Read more

11 years 11 months ago 11 years 11 months ago
Intermittent issue with connecting to batch server Batch, Owens Resolved

Updated on June 18, 2018, at 3:15 PM:

This issue has been fixed. 

Posted on June 18, 2018, at 12:30 PM:

We've been having intermittent... Read more

7 years 8 months ago 7 years 8 months ago
Spurious warnings about balance being exhausted client portal Resolved

Due to the price changes and some specifics about MyOSC, you may get warnings... Read more

5 years 7 months ago 5 years 6 months ago
Core label on OnDemand app is incorrect OnDemand Resolved

The core label on the OnDemand app incorrectly displays as '1', regardless of the requested number of cores for a job. While this label is incorrect, the job is still allocated the correct number... Read more

1 year 1 month ago 1 year 2 weeks ago
Armstrong inaccessible Resolved

Update: 2PM March 12th: Armstrong is back up and running.  Please notify oschelp@osc.edu of any lingering issues.


As of 10AM Thursday March 12th... Read more

10 years 11 months ago 10 years 11 months ago

Pages