The Ohio Supercomputer Center (OSC) is experiencing an email delivery problem with several types of messages from MyOSC. 

 OSC is preparing to update Slurm on its production systems to version 23.11.4 on March, 27. 

Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort descending Description Posted Updated
February 11 2014 Scheduled Downtime Outage Resolved

HPC systems are offline today for scheduled quarterly maintenance activity. For details, please visit osc.edu/n

10 years 1 month ago 10 years 1 month ago
Singularity: reached your pull rate limit Owens, Pitzer, Software Resolved
(workaround)

You might encounter an error while pulling a large Docker image:

ERROR: toomanyrequests: Too Many Requests.

or

You have reached your pull rate limit. You may... Read more          
2 years 9 months ago 1 year 11 months ago
Job failures on some rolling-rebooted nodes on Owens since April 16, 2018 Owens Resolved

3:35 PM 4/30/2018 Update:

The cause is that NFSv4.1 is not configured correctly after OS on Owens was updated from RHEL 7.3 to 7.4. We re-rebooted the Owens compute nodes... Read more

5 years 11 months ago 5 years 11 months ago
Ruby Rolling Reboot Resolved

2015/02/16 RUBY Rolling Reboot starting Today

 

A rolling reboot is required on Ruby to update a critical... Read more

9 years 1 month ago 9 years 1 month ago
Symlinks to /fs/project directories were missed filesystem Resolved

Symlinks to /fs/project directories were missed for a short period of time both on Tuesday afternoon October 11(right after downtime) and Wedenday morning October 12. It might result in job... Read more

1 year 5 months ago 1 year 5 months ago
Password Expiration Emails client portal Resolved

Password expiration notices are still being sent after you change your password.

To ensure your password change date has been updated and the account will not expire, please look at... Read more

4 years 11 months ago 4 years 10 months ago
PyTorch jobs timeout and hanging GPU Resolved

We have observed that many PyTorch users frequently encounter random timeouts, which result in the termination of their jobs but leave the process running on the node.... Read more

8 months 2 weeks ago 2 months 3 weeks ago
BWA Software Security Vulnerability Software Resolved

... Read more

4 years 8 months ago 4 years 8 months ago
Spurious warnings about balance being exhausted client portal Resolved

Due to the price changes and some specifics about MyOSC, you may get warnings... Read more

3 years 8 months ago 3 years 7 months ago
Issue with GPFS on Owens since April 14, 2017 Batch, filesystem, Owens Resolved

3:10PM 4/18/2017 Update: Rolling reboots on Owens have started to address this GPFS issue. 

We have had issues with GPFS mounts on Owens Cluster since Friday afternoon,... Read more

6 years 11 months ago 6 years 11 months ago

Pages