We are currently experiencing outages affecting multiple services, including OnDemand (ondemand.osc.edu) and login nodes of HPC systems.

Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolution Description Postedsort ascending Updated
Partial Unavailability of Owens Nodes: 07/29 - 08/06 Outage Resolved

Partial Owens nodes will be unavailable from July 29, 2024, to August 6, 2024, due to power changes in preparation for the Cardinal installation. This outage won't affect any running batch jobs,... Read more

10 months 2 weeks ago 10 months 1 week ago
Schrodinger license check in issue Licensing Resolved

Schrödinger is an application that uses a FlexNet license server. To run a job, the application needs to check out licenses from the server and check it back in once the job is completed. However... Read more

11 months 2 weeks ago 6 months 1 week ago
Vulnerability in R Programming language Resolved

Updates on 09/03:

The unpatched older R versions will be removed from the Owens cluster by October 9, 2024. If you are using... Read more

1 year 2 weeks ago 9 months 1 week ago
Login problem Resolved

Around 10am EDT, OSC began to experience problems with our infrastructure.  Login sessions began to be affected around 11:30 EDT. The problem was resolved around 1pm EDT, and all systems have... Read more

1 year 1 month ago 1 year 1 month ago
Conda activate will be enabled for python and miniconda3 modules Resolved

If you've previously utilized conda init to enable the conda activate command, your shell configuration file such as .bashrc would have been altered with... Read more

1 year 1 month ago 1 year 1 month ago
Parallel job with IntelMPI hangs Software Resolved

... Read more

1 year 2 months ago 10 months 1 week ago
Multi-node job hang with ORCA 5 Owens, Pitzer, Software Resolved
(workaround)

You may experience a multi-node job hang if the job runs into a module that requires heavy I/O, e.g., MP2 or CCSD. Additionally, it potentially leads to our GPFS performance issue. We have... Read more

1 year 2 months ago 1 month 1 week ago
Backup Issues Backups Resolved

Updates on 04/17:

OSC has conducted thorough validations to ensure the integrity of our backup data for user home directories and the /fs/ess filesystem. 

... Read more

1 year 2 months ago 1 year 1 month ago
Slurm to be Upgraded to Version 23.11.4 Owens, Pitzer Resolved

Updates on 04/08/2024:

The rolling reboots are completed. 

Updates:

We will perform rolling reboots on this... Read more

1 year 3 months ago 1 year 2 months ago
Slurm on Pitzer is offline Resolved

The Slurm scheduler for Pitzer is currently offline. We are working with the vendor for the fix. Sorry for the inconvenience.

1 year 3 months ago 1 year 3 months ago

Pages