Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort ascending Description Posted Updated
Rolling reboot of all clusters, starting from Wednesday morning, April 19, 2017 Batch, Maintenance, Owens, Ruby Resolved

1:40PM 4/27/2017 Update: Rolling reboots are completed. 

3:10PM 4/18/2017 Update: Rolling reboots on Owens have started to address GPFS errors occured... Read more

7 years 2 months ago 7 years 1 month ago
Parallel job with IntelMPI hangs Software Resolved

... Read more

2 months 1 week ago 2 months 1 week ago
Lustre is still offline. HPC systems back up Maintenance Resolved

Day One of the scheduled downtime has been completed, and HPC operations have resumed. As planned, Lustre work will extend into Day Two. Jobs using /fs/lustre or $PFSDIR cannot run until this work... Read more

9 years 11 months ago 9 years 11 months ago
starccm outage Feb 21, 2021 Licensing, Outage, Owens, Software Resolved

Updated on Feb 25: 

StarCCM license outage is restored.

Original post:

OSC's starccm software license will expire at 12 a.m., Sunday, Feb... Read more

3 years 4 months ago 3 years 3 months ago
Replacement of Owens Ethernet switches from Dec 14, 2018 Network, Owens Resolved

Updated on Jan 16, 2019, at 09:20 AM:

The replacement is done except for the three switches including the login nodes of Owens. We posted another notice for more... Read more

5 years 9 months ago 5 years 5 months ago
Problems with Project Space (/nfs/gpfs) filesystem Resolved

(9/8/15 14:21 Eastern) Project space appears to be back to normal operation. We are running some tests to verify that the problem is fully resolved.


As of early afternoon, Sept. 8,... Read more

8 years 9 months ago 8 years 9 months ago
Security vulnerabilities on ARM Forge versions prior to 22.0.x Software Resolved
(workaround)

ARM identified security vulnerabilities on ARM Forge versions prior to 22.0.x as follow:

  • Security update #1: A locally exploitable code-injection vulnerability was identified in... Read more
1 year 11 months ago 1 year 11 months ago
Segmentation fault from openmpi/1.10-hpcx and 2.0-hpcx on Owens Owens, Software Resolved

We have found that recent MPI jobs using openmpi/1.10-hpcx and openmpi/2.0-hpcx on Owens may complete or hang until the job is killed, but receive segmentation fault. Some applications might be ... Read more

4 years 11 months ago 4 years 10 months ago
quota exceeded error when using chgrp in /fs/ess directories filesystem Resolved

Users may receive an error when using the chgrp command on data in /fs/ess/ locations.

$ chgrp -v PEX1234 my-file.txt
chgrp: changing group of 'my-file.txt': Disk quota exceeded
failed... Read more          
1 year 4 months ago 1 year 3 months ago
Poor network performance on some filesystems filesystem Resolved

We are experiencing some network performance issues on a cluster of servers involved with providing GPFS and some project filesystems. GPFS appears to be functioning acceptably, but proj01, proj02... Read more

10 years 11 months ago 10 years 11 months ago

Pages