The Ohio Supercomputer Center (OSC) is experiencing an email delivery problem with several types of messages from MyOSC. 

Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort descending Description Posted Updated
Symlinks to /fs/project directories were missed filesystem Resolved

Symlinks to /fs/project directories were missed for a short period of time both on Tuesday afternoon October 11(right after downtime) and Wedenday morning October 12. It might result in job... Read more

1 year 6 months ago 1 year 6 months ago
Segmentation fault from openmpi/1.10-hpcx and 2.0-hpcx on Owens Owens, Software Resolved

We have found that recent MPI jobs using openmpi/1.10-hpcx and openmpi/2.0-hpcx on Owens may complete or hang until the job is killed, but receive segmentation fault. Some applications might be ... Read more

4 years 9 months ago 4 years 8 months ago
PyTorch jobs timeout and hanging GPU Resolved

We have observed that many PyTorch users frequently encounter random timeouts, which result in the termination of their jobs but leave the process running on the node.... Read more

10 months 1 day ago 4 months 1 day ago
Poor network performance on some filesystems filesystem Resolved

We are experiencing some network performance issues on a cluster of servers involved with providing GPFS and some project filesystems. GPFS appears to be functioning acceptably, but proj01, proj02... Read more

10 years 9 months ago 10 years 9 months ago
Spurious warnings about balance being exhausted client portal Resolved

Due to the price changes and some specifics about MyOSC, you may get warnings... Read more

3 years 10 months ago 3 years 9 months ago
Rolling reboot of Owens cluster, starting from 8:30AM Oct 30, 2017 Batch, Owens Resolved

Updated on Nov 21, 2017 at 3:33PM:

It has been completed. 

Updated on October 20, 2017 at 4:19PM:

We will have a rolling reboot of Owens... Read more

6 years 6 months ago 6 years 5 months ago
Scheduling suspended Batch Resolved

We have temporarily suspended scheduling due to some problems with the parallel scratch file system.

9 years 7 months ago 9 years 7 months ago
Very little free space for metadata on the scratch storage /fs/scratch filesystem Resolved

Updated 15:30 October 19:

The issue of little space for metadata on scratch storage is resolved. If you have any questions, please contact... Read more

2 years 6 months ago 2 years 6 months ago
NFS outage on Thursday Jan 17 from 7am to 8am filesystem Resolved

Update:

This work has been canceled and will be done during downtime on Feb. 5. 

Original Post:

On Thursday, January 17th from 7 am to 8 am OSC will have a GPFS... Read more

5 years 3 months ago 5 years 3 months ago
June 7th downtime to finish at 6:30PM Connectivity, filesystem, Infrastructure, login, Login Problems, Maintenance, Operations, Outage Resolved

Update: Downtime completed at 6:30PM, June 7th.

 

The June 7th downtime is now slated to be completed at 6:30PM.  Previous estimate was 5PM.

All systems and services will... Read more

7 years 11 months ago 7 years 11 months ago

Pages