vLLM versions prior to 0.14.1 are deprecated due to security issue CVE-2026-22778 which can allow remo

Known issues

Unresolved known issues

A known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available, which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolution Description Posted Updated
Podman storage error due to outdated configuration file Software Resolved (workaround)

If you experience a storage error such as:

 write /var/tmp/storage388772891/1: no space left on device.

you may have an outdated ~/....

6 months 2 weeks ago 6 months 2 weeks ago
LS-DYNA license outage since Oct 15, 2019 Licensing Resolved

Updated at 1:47 PM, Oct 16, 2019

The LS-DYNA license server has been operational again since 1:40 PM on Oct 16, 2019.

Original Post:

The ls-dyna...

6 years 4 months ago 6 years 4 months ago
OnDemand service is not available OnDemand Resolved

Update:

OnDemand service is available again.

Original post:

OnDemand service is not available and won't let users log in. We are working to fix it as soon as we can....

7 years 1 month ago 7 years 1 month ago
Fail to connect with VS Code 1.86 Resolved

VS Code 1.86 (aka the ‘January 2024’ update) requires glibc ≥ 2.28, which is not supported on the Pitzer and Owens clusters. Please downgrade to VS Code 1.85.

See this link for more...

2 years 1 week ago 1 year 9 months ago
Submit filter bug after downtime Batch Resolved

A change was made to a part of our batch software during the downtime that should have only affected users who are a part of multiple projects. We have found that there is a bug in the changes...

10 years 6 days ago 10 years 6 days ago
GPFS errors on compute nodes filesystem Resolved

We've seen an increase in transient problems that result in compute nodes losing access to the GPFS file systems for ~5 minutes.

Any jobs running on these nodes accessing files on GPFS may...

5 years 2 months ago 4 years 2 months ago
6/4/13 Scheduled Downtime Outage Resolved

HPC systems are currently offline for scheduled maintenance. See osc.edu/n for more information.

12 years 8 months ago 12 years 8 months ago
PyTorch hangs on dual-GPU node on Ascend Ascend, GPU Resolved (workaround)

PyTorch can hang on Ascend on dual-GPU nodes

Through internal testing, we have confirmed that the hang issue only occurs on Ascend dual-GPU (nextgen) nodes. We’re still unsure why...

9 months 2 weeks ago 9 months 2 weeks ago
GPU/VIS nodes for various OOD apps are broken Cardinal, OnDemand, Outage, Software Resolved

After the most recent downtime we discovered that various OOD apps relying on the "virtualgl" module on Cardinal were broken. We have since updated and pinned the latest virtualgl version...

2 weeks 10 hours ago 1 week 1 day ago
Rolling reboot of Owens cluster, starting from 9AM September 11, 2017 Batch, Owens Resolved

Update at 12:20 PM, September 25, 2017:

The rolling reboot of Owens is complete.

...
8 years 5 months ago 8 years 4 months ago
