The Ohio Supercomputer Center (OSC) is experiencing an email delivery problem with several types of messages from MyOSC. 

 OSC is preparing to update Slurm on its production systems to version 23.11.4 on March, 27. 

Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort descending Description Posted Updated
my.osc.edu logins failing Account Management Resolved

Logins to my.osc.edu are failing. This is unrelated to our InfiniBand issue; a router change at OARnet is the believed cause. They are working on re-establishing the necessary routing.

9 years 8 months ago 9 years 8 months ago
Rolling reboot of Pitzer cluster, starting from Feb 03, 2021 Batch, login, Pitzer Resolved

Updates at 10AM Feb 11, 2021:

The rolling reboot is completed. 

Original Post:

We will have rolling reboots of Pitzer cluster including... Read more

3 years 1 month ago 3 years 1 month ago
Can not change GPU compute mode on Oakley GPU Resolved

Update: The driver version has been updated and the issue has been fixed.

 

In updating the driver version for Oakley's NVIDIA GPUs the NVML libraries that are used in conjunction... Read more

9 years 4 months ago 9 years 2 months ago
CP2K 6.1 would fail on Pitzer Cascade Lakes (48-core) node: Pitzer Resolved
(workaround)

CP2K 6.1 would fail with the following error when running on Pitzer Cascade Lakes (48-core) node:

Program received signal SIGFPE: Floating-point exception - erroneous arithmetic... Read more          
2 years 9 months ago 1 year 11 months ago
Scratch filesystem is down filesystem, OnDemand Resolved

Updated on 2:30pm Feb 1st:

Scratch filesystem is back. OnDemand is also available now. 

Original Post:

Scratch filesystem is down now.... Read more

5 years 1 month ago 5 years 1 month ago
GPFS hang Issue on 09/08/2016 filesystem Resolved

On Thursday, Sept 8 starting at 19:37, we had some bad interaction that appears to have been caused by the backup client, and the GPFS servers. This resulted in a GPFS hang that propagated I/O... Read more

7 years 6 months ago 7 years 6 months ago
openmpi/4.1.1 is deprecated Resolved

openmpi/4.1.1-hpcx will be removed on November 29th, 2022 due to InfiniBand drivers (MOFED) update. Please use compatible and bug-fixed version 'openmpi/4.1.2-hpcx' to run ORCA or your MPI... Read more

1 year 4 months ago 1 year 4 months ago
8AM 9/11/13 - Brief network disruption to reboot a switch Network Resolved

At 8AM on September 11, 2013, we will be rebooting a network switch to replace a failed card in the switch. Network will be disrupted for 10 to 15 minutes while the work is done. Filesystem mounts... Read more

10 years 6 months ago 10 years 6 months ago
OSC OnDemand is not responsive OnDemand Resolved

OSC OnDemand is not responsive now. We are investigating the problem now. Please use other ways like ssh to connect to OSC HPC systems. 

We apologize for any inconvenience this may cause... Read more

4 years 3 weeks ago 4 years 3 weeks ago
Owens batch is down Owens Resolved

Updated at 9:07PM on Dec 20, 2017 :

Owens batch was restored by updating Torque resource manager at 6:37pm Dec 19, 2017. 

Original Post at 4:45PM on Dec 19... Read more

6 years 3 months ago 6 years 3 months ago

Pages