We are currently experiencing temporary instability on the Ascend login nodes.

Known issues

Unresolved known issues

A known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with a Resolved Resolution state has been corrected.

Known Issues

Title Category Resolution Description Posted Updated
Owens batch is down Owens Resolved

Updated at 9:07PM on Dec 20, 2017:

Owens batch was restored by updating the Torque resource manager at 6:37pm on Dec 19, 2017.

Original Post at 4:45PM on Dec 19...

8 years 5 months ago 8 years 5 months ago
Problems with the home directories filesystem Resolved

We are currently seeing problems with the home directories at OSC's HPC facility....

6 years 1 week ago 6 years 1 week ago
Can not change GPU compute mode on Oakley GPU Resolved

Update: The driver version has been updated and the issue has been fixed.

In updating the driver version for Oakley's NVIDIA GPUs, the NVML libraries that are used in conjunction...

11 years 6 months ago 11 years 4 months ago
Abaqus Parallel Job Failure with PMPI Due to Out-of-Memory (OOM) Error Cardinal Resolved (workaround)

You may encounter the following error while running an Abaqus parallel job with PMPI:

Traceback (most recent call last):
 File "SMAPylModules/SMAPylDriverPy.m/src/driverAnalysis.py",...
1 year 5 months ago 1 year 1 month ago
Scratch filesystem is down filesystem, OnDemand Resolved

Updated at 2:30pm on Feb 1st:

The scratch filesystem is back. OnDemand is also available now.

Original Post:

Scratch filesystem is down now....

7 years 4 months ago 7 years 4 months ago
Rolling reboots on owens and pitzer starting 18 Aug 2021 Batch, Connectivity, Maintenance Resolved

We will have rolling reboots of the Owens and Pitzer clusters, including login and compute nodes, starting at 9am on August 18, 2021. The rolling reboots are for urgent security updates.

The...

4 years 9 months ago 4 years 9 months ago
GPFS hang Issue on 09/08/2016 filesystem Resolved

On Thursday, Sept 8, starting at 19:37, we had a bad interaction that appears to have been caused by the backup client and the GPFS servers. This resulted in a GPFS hang that propagated I/O...

9 years 9 months ago 9 years 9 months ago
Docker container runtime error on desktop due to DBUS session Software Resolved (workaround)

When running a container using the podman or docker command on a desktop system, you may encounter an error like the following:

Error: OCI...
10 months 1 week ago 10 months 1 week ago
8AM 9/11/13 - Brief network disruption to reboot a switch Network Resolved

At 8AM on September 11, 2013, we will be rebooting a network switch to replace a failed card in the switch. The network will be disrupted for 10 to 15 minutes while the work is done. Filesystem mounts...

12 years 9 months ago 12 years 8 months ago
GPFS filesystem Problem on Oct 24 2019 filesystem Resolved

Updated at 4:45 PM on Oct 24, 2019

The issue is fixed. GPFS filesystems and OnDemand are back.

Original Post

We are having issues with GPFS filesystem...

6 years 7 months ago 6 years 7 months ago
