Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort ascending Description Posted Updated
MOE license server down Licensing Resolved

The MOE license server is experiencing an unknown issue and potentially down.  We are working to resolve the issue.

1 year 2 weeks ago 1 year 2 weeks ago
8AM 9/11/13 - Brief network disruption to reboot a switch Network Resolved

At 8AM on September 11, 2013, we will be rebooting a network switch to replace a failed card in the switch. Network will be disrupted for 10 to 15 minutes while the work is done. Filesystem mounts... Read more

11 years 3 weeks ago 11 years 1 week ago
OpenMPI job stopped at 'There are not enough slots available in the system to satisfy the slots' Owens, Pitzer, Software Resolved

Users would encounter a MPI job failed with openmpi/3.1.0-hpcx on Owens and Pitzer. The job would stop with the error  like "There are not enough slots available in the system to... Read more

4 years 2 months ago 4 years 1 month ago
Owens batch is down Owens Resolved

Updated at 9:07PM on Dec 20, 2017 :

Owens batch was restored by updating Torque resource manager at 6:37pm Dec 19, 2017. 

Original Post at 4:45PM on Dec 19... Read more

6 years 9 months ago 6 years 9 months ago
Can not change GPU compute mode on Oakley GPU Resolved

Update: The driver version has been updated and the issue has been fixed.

 

In updating the driver version for Oakley's NVIDIA GPUs the NVML libraries that are used in conjunction... Read more

9 years 10 months ago 9 years 8 months ago
qsub filter rejects valid jobs Resolved

Job scripts submitted on Glenn, Oakley, or Ruby all go a submit filter before reaching the resource manager, Torque.  A bug has been discovered in our submit filter which prevents jobs with the... Read more

9 years 6 months ago 8 years 12 months ago
Security vulnerabilities on ARM Forge versions prior to 22.0.x Software Resolved
(workaround)

ARM identified security vulnerabilities on ARM Forge versions prior to 22.0.x as follow:

  • Security update #1: A locally exploitable code-injection vulnerability was identified in... Read more
2 years 3 months ago 2 years 3 months ago
Rolling reboot of all clusters, starting from 9:30 AM June 05, 2019 Batch, login, Owens, Pitzer, Ruby Resolved

Update #2 Posted on 14 June 2019 12:33 PM

The rolling reboots of all clusters are completed. Please contact oschelp@osc.edu if you... Read more

5 years 4 months ago 5 years 3 months ago
Balance could be non-existent client portal Resolved

Balances may be none existent in my.osc.edu and OSCusage command. Balances are being properly accounted for in the background.

The bug has been identified and a patch will be released ASAP... Read more

4 years 10 months ago 4 years 9 months ago
Abaqus license contention Batch, Licensing Resolved

We have noticed some abaqus jobs end up in BatchHold. Once the job is in BatchHold, it will never start. This is because of sharing the abaqus licenses between Oakley and Owens. We have opened a... Read more

7 years 8 months ago 6 years 4 months ago

Pages