The Ohio Supercomputer Center (OSC) is experiencing an email delivery problem with several types of messages from MyOSC. 

Known issues

Unresolved known issues

A known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available, which may include using different software or hardware.

A known issue with a Resolved Resolution state has been corrected.

Known Issues

Title Category Resolution Description Posted Updated
Rolling reboots of all clusters starting from Monday Feb 5, 2018 Batch, Owens, Ruby Resolved

Posted on Feb 22 at 1:25PM:

The rolling reboots have been completed. 

Posted on Jan 30, 2018 at 4:00PM:

We will have rolling reboots of...

6 years 2 months ago 6 years 1 month ago
Rolling reboot of Owens and Ruby, starting from 8 AM Monday, August 6, 2018 Owens, Ruby Resolved

10:16 AM 8/13/2018 Update:

The rolling reboot of Owens has been completed.

9:00 AM 8/10/2018 Update:

The rolling reboot of Ruby has been...

5 years 8 months ago 5 years 8 months ago
(informational) GPFS maintenance work duplicate known issue filesystem Resolved

Maintenance work on the GPFS servers is scheduled to be performed today, 28 Feb 2020 at 2:00p.m.

Although there is no direct impact expected to services at OSC, there may be short...

4 years 1 month ago 4 years 1 month ago
Matlab PCT broken due to pbsrsh modification Matlab Resolved

A change was made to the system-wide pbsrsh script, which Matlab relies on. It has been discovered that this change has broken the parallel computing toolbox (...

8 years 11 months ago 8 years 6 months ago
Slow Processing of Password Changes Account Management, client portal Resolved

Password changes are taking longer than usual to process through the system, up to 17 minutes in some test cases.

We are working on resolving this issue.

4 years 9 months ago 2 years 4 months ago
Singularity: reached your pull rate limit Owens, Pitzer, Software Resolved (workaround)

You might encounter an error while pulling a large Docker image:

ERROR: toomanyrequests: Too Many Requests.

or

You have reached your pull rate limit. You may...
2 years 10 months ago 2 years 2 weeks ago
Update on 02/24/2017: All services available Outage Resolved

02/24/17 3:50PM Update: All Services have been restored including:

  • Oakley cluster with full capacity for general access
  • Ruby cluster with full capacity for...
7 years 1 month ago 7 years 1 month ago
Symlinks to /fs/project directories were missed filesystem Resolved

Symlinks to /fs/project directories were missed for a short period of time both on Tuesday afternoon, October 11 (right after downtime), and Wednesday morning, October 12. It might result in job...

1 year 6 months ago 1 year 6 months ago
Cannot login to clusters Resolved

As of around 3PM today (Thursday 6/12), we have reports of users being unable to log in to the clusters. The error message given will make it sound like your password is incorrect, although it...

9 years 10 months ago 9 years 10 months ago
PyTorch jobs timeout and hanging GPU Resolved

We have observed that many PyTorch users frequently encounter random timeouts, which result in the termination of their jobs but leave the process running on the node....

9 months 1 week ago 3 months 1 week ago
