Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolution Description Postedsort ascending Updated
Issue with submitting job array Batch, Owens Resolved

3:30 PM 5/10/2018 Original Post:

User may have been getting the following error message when trying to submit a PBS job using job arrays:

qsub: submit error (Maximum number of... Read more          
6 years 5 months ago 2 years 10 months ago
Job failures on some rolling-rebooted nodes on Owens since April 16, 2018 Owens Resolved

3:35 PM 4/30/2018 Update:

The cause is that NFSv4.1 is not configured correctly after OS on Owens was updated from RHEL 7.3 to 7.4. We re-rebooted the Owens compute nodes... Read more

6 years 5 months ago 6 years 5 months ago
Rolling reboot of Owens cluster, starting from Monday, April 16, 2018 Owens Resolved

12:00 PM 5/7/2018 Update:

The rolling reboot of Owens has been completed. 

Posted on April 11, 2018, at 3:45... Read more

6 years 6 months ago 6 years 5 months ago
abaqus: partial node jobs Software Resolved

If you run a parallel (or even serial!) job, but not using all the cpus... Read more

6 years 6 months ago 6 years 4 months ago
Occasional failures in file permissions filesystem Resolved

Users may experience occasional failures in file permissions with our filesystem. We've opened a case with the vendor for further investigations. If you get 'permission denied' message when you... Read more

6 years 6 months ago 2 years 10 months ago
abaqus with UMAT Software Resolved

On Owens, usage of user-defined material (UMAT) script for abaqus is limited as following:

abaqus 2017: correctly running on single and multi-nodes

abaqus 6.14 and 2016: correctly... Read more

6 years 7 months ago 6 years 4 months ago
Rolling reboots of all clusters starting from Monday Feb 5, 2018 Batch, Owens, Ruby Resolved

Posted on Feb 22 at 1:25PM:

The rolling reboots have been completed. 

Posted on Jan 30, 2018 at 4:00PM:

We will have rolling reboots of... Read more

6 years 8 months ago 6 years 7 months ago
Oakley and Owens queue issue Batch Resolved

We are experiencing a problem with the queuing system on oakley and owens that is delaying or preventing new jobs from running. Our systems staff is investigating.

 

6 years 9 months ago 6 years 9 months ago
Owens batch is down Owens Resolved

Updated at 9:07PM on Dec 20, 2017 :

Owens batch was restored by updating Torque resource manager at 6:37pm Dec 19, 2017. 

Original Post at 4:45PM on Dec 19... Read more

6 years 9 months ago 6 years 9 months ago
Rolling reboot of login nodes of clusters at 7:00AM Dec 19, 2017 login Resolved

We will have rolling reboot of login nodes of clusters at 7:00AM Dec 19, 2017 for GPFS version upgrade. It is supposed to be completed in a short period of time. f you encounter any login issues,... Read more

6 years 9 months ago 6 years 9 months ago

Pages