Due to a critical security vulnerability we need to reboot public-facing systems to deploy a mitigation against the vulnerabilit

Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort descending Description Posted Updated
Unavailability of intel channel among the conda channel list: Software Resolved

If you are getting an error: 

UnavailableInvalidChannel: HTTP 403 FORBIDDEN for channel intel <... Read more

1 year 8 months ago 1 year 8 months ago
16 core nodes on Glenn temporarily unavailable Operations Resolved

This issue has been resolved. The 16-core nodes are online.

---------------------------------------------------------------------------------------

16 core nodes on Glenn are currently... Read more

13 years 2 months ago 13 years 1 month ago
Problems with GPFS filesystem, and OnDemand is not working filesystem Resolved

Updates on 12:00pm June 10:

The issue is fixed. All impacted services return to production.

We apologize for the inconvience. If you have any questions, please... Read more

4 years 11 months ago 4 years 11 months ago
Systemic Problem on Cluster Computing service Operations Resolved

4:20PM 6/23/2017 Update: All HPC systems are back in production. This outage may cause failures of users' jobs. We'll update the community as more is known. 

... Read more
8 years 10 months ago 8 years 10 months ago
Instability on Clusters after May 13 Downtime Resolved

We've been experiencing some instability on the clusters (particularly Cardinal and Ascend) following the recent May 13 downtime, especially with parallel job processing. If you notice any unusual... Read more

12 months 1 day ago 11 months 1 week ago
issue with OnDemand 6:09 - 8:39 pm Resolved

OnDemand, epi accounting queries, the Viper DB, the Medline DB, the Eweld DB,... Read more

11 years 9 months ago 11 years 9 months ago
GPFS problems with /fs/project and possibly /fs/scratch filesystem Resolved

There was an issue with GPFS clients that affected /fs/project and possibly /fs/scratch between around 3:30AM and 8:30AM on Sunday September 4th. Some jobs from clients were also impacted. 

... Read more
3 years 8 months ago 3 years 8 months ago
Major network switch outage Network Resolved

01:20 PM 11/14/2018 Update:

... Read more

7 years 6 months ago 7 years 6 months ago
AlphaFold 3 GPU Out-of-Memory Error During Inference Software Resolved
(workaround)

When you run AlphaFold 3, you may encounter a GPU out-of-memory (OOM) failures during model execution. The job terminated with errors similar to:

Can't... Read more          
5 months 2 weeks ago 5 months 2 weeks ago
Storage Problems - GPFS services (Project / Scratch) filesystem Resolved

Updates at 15:52 March 11, 2020:

The issue with the Project file system that causes deletes of file system snapshots to fail has now been resolved. OSC Project file system... Read more

6 years 2 months ago 6 years 2 months ago

Pages