The Ohio Supercomputer Center (OSC) is experiencing an email delivery problem with several types of messages from MyOSC. 

 OSC is preparing to update Slurm on its production systems to version 23.11.4 on March, 27. 

Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Titlesort descending Category Resolution Description Posted Updated
Critical change about using $PFSDIR directory at OSC Batch Resolved

Starting from Thursday, Feb 2nd, the $PFSDIR directory on scratch (/fs/scratch) won’t be created by job prologue. For example, if you simply use the command cd $PFSDIR,... Read more

7 years 1 month ago 7 years 1 month ago
cuda-gdb segmentation fault on startup Owens, Pitzer, Software Resolved

The CUDA debugger, cuda-gdb, can raise a segmentation fault immediately upon execution.  A workaround before executing cuda-gdb is to unload the xalt module, e.g.: 

module unload... Read more          
3 years 11 months ago 1 year 11 months ago
Data on /fs/scratch is not accessible filesystem Resolved

Updated on 10:30 AM July 3rd, 2019:

Data on /fs/scratch is accessible now. We are working with the vendor to find the root cause and apologize for any inconvenience.  ... Read more

4 years 9 months ago 4 years 9 months ago
Dec 27, 2016: Issues with /fs/project filesystem Resolved

Dec 27, 2016 3:46PM Update: Both project and scratch file systems (/fs/project and /fs/scratch ) are back to normal now.  Some users' jobs may be... Read more

7 years 3 months ago 7 years 3 months ago
DOWNTIME EXTENDED UNTIL MORNING OF 12/13/17 Resolved

We have extended the 12/12/2017 downtime until 7AM on 12/13/17 to complete filesystem maintenance that has taken longer than expected.

6 years 3 months ago 6 years 3 months ago
Downtime Update: All Major Services Online Resolved

Friday, Sept 25th 12PM Noon:

  • Oakley is back online and has resumed running jobs.  
  • Ruby... Read more
8 years 6 months ago 8 years 6 months ago
Email Issues client portal Resolved

OSU is having ongoing periodic problems with Microsoft (their mail hosting provider) severely delaying outbound email. There is no solution being offered and no timeline for getting it resolved.... Read more

4 years 5 months ago 4 years 2 months ago
Emergency InfiniBand Shutdown (All systems) Network Resolved

We have returned to service. It appears that we have resolved the networking issues enough to allow jobs to run safely. We will continue working with our vendors to fix any remaining hardware... Read more

9 years 8 months ago 9 years 8 months ago
Emergency maintenance in OSC’s data center Feb 10 2022 Outage, Owens, Pitzer Resolved

OSC will shut down significant portions of the Owens and Pitzer clusters for several hours this afternoon (Thursday, Feb.... Read more

2 years 1 month ago 2 years 1 month ago
Emergency UPS Maintenance Maintenance Resolved

A UPS in the data center requires some emergency maintenance to be undertaken at 2PM on Oct 11 2023. There is a very small chance that parts of Owens and of the C18 Pitzer nodes may lose power as... Read more

5 months 2 weeks ago 5 months 2 weeks ago

Pages