Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort descending Description Posted Updated
GPU Memory Not Released Causing OOM in Subsequent Jobs GPU Unresolved

We have noticed that GPU memory is not being properly released in some jobs, causing subsequent jobs on the same nodes to run out of memory (OOM). We are currently working on a resolution... Read more

2 months 2 weeks ago 2 months 2 weeks ago
MVAPICH 3.0 hang due to PMI mismatch with Slurm Software Unresolved

Applications such as Quantum ESPRESSO, LAMMPS, and NWChem experienced hangs with MVAPICH 3.0 due to a PMI mismatch. MVAPICH 3.0 was built with PMI-1, while newer Slurm versions on RHEL 9... Read more

2 months 2 days ago 2 months 1 day ago
Rolling reboot of Owens cluster, starting from 9AM September 11, 2017 Batch, Owens Resolved

Updates on 12:20PM September 25, 2017: 

The rolling reboot of Owens is completed. 

... Read more
8 years 3 months ago 8 years 2 months ago
Vulnerability in R Programming language Resolved

Updates on 09/03:

The unpatched older R versions will be removed from the Owens cluster by October 9, 2024. If you are using... Read more

1 year 6 months ago 1 year 3 months ago
Statewide Intel compiler license checkout failures Licensing Resolved

This morning (9/10/14) we updated our Intel compiler licenses. We are seeing some unexpected license checkout failures in the logs (please click through to see details):

10:44:... Read more          
11 years 3 months ago 11 years 2 months ago
Intermittent home directory performance issues filesystem Resolved

Users may experience performance issues in home directory. It is recommended to use temporary directory ($TMPDIR, or scratch) or project storage to minimize the impact on... Read more

4 years 8 months ago 4 years 8 months ago
OnDemand service is not available OnDemand Resolved

Update:

OnDemand service is available again.

Original post:

OnDemand service is not available and won't let users log in. We are working to fix it as soon as we can.... Read more

6 years 10 months ago 6 years 10 months ago
Singularity: failed to pull a large Docker image Software Resolved
(workaround)

You might encounter an error while pulling a large Docker image:

[pitzer-login01]$ apptainer pull docker://qimme2/core
FATAL: Unable to pull docker://qiime2/core While... Read more          
6 months 4 weeks ago 2 weeks 1 day ago
Submit filter bug after downtime Batch Resolved

A change was made to a part of our batch software during the downtime that should have only affected users who are a part of multiple projects. We have found that there is a bug in the changes... Read more

9 years 10 months ago 9 years 10 months ago
weld predictor - slurm account error OnDemand, Software Resolved

Updated on 09/08/2022:

Users can choose the project code from a dropdown list to use. 

Original Post:

Users of weld predictor software in... Read more

3 years 4 months ago 3 years 3 months ago

Pages