Known Issues

Known Issues

Title Category Resolution Description Postedsort ascending Updated
Job failures on some rolling-rebooted nodes on Owens since April 16, 2018 Owens Unresolved

9:30 AM 4/18/2018 Original Post:

Users may have been experiencing job failures on Owens cluster since April 16, 2018. Some Owens nodes after being rebooted fail to pick up... Read more

1 week 1 hour ago 1 week 1 hour ago
Rolling reboot of Owens cluster, starting from Monday, April 16, 2018 Owens Unresolved

We will have a rolling reboot of login and compute nodes of Owens cluster starting from Monday, April 16, 2018, to update its OS from RHEL 7.3 to 7.4. This rolling reboot won't affect any running... Read more

1 week 6 days ago 1 week 1 hour ago
abaqus: partial node jobs Software Unresolved

If you run a parallel (or even serial!) job, but not using all the cpus... Read more

1 month 2 days ago 1 month 2 days ago
Occasional failures in file permissions filesystem Unresolved

Users may experience occasional failures in file permissions with our filesystem. We've opened a case with the vendor for further investigations. If you get 'permission denied' message when you... Read more

1 month 5 days ago 2 weeks 6 days ago
abaqus with UMAT Software Unresolved

On Owens, usage of user-defined material (UMAT) script for abaqus is limited as following:

abaqus 2017: correctly running on single and multi-nodes

abaqus 6.14 and 2016: correctly... Read more

1 month 2 weeks ago 1 month 2 weeks ago
VASP job with Out-of-Memory crashes compute node(s) Batch, Owens, Software Unresolved

There is a bug with VASP 5.4.1 built with mvapich2/2.2 on Owens... Read more

11 months 3 weeks ago 11 months 3 weeks ago
Abaqus license contention Batch, Licensing Unresolved

We have noticed some abaqus jobs end up in BatchHold. Once the job is in BatchHold, it will never start. This is because of sharing the abaqus licenses between Oakley and Owens. We have opened a... Read more

1 year 2 months ago 1 year 2 months ago
Performance Regression of GPU Nodes on Ruby GPU, Ruby Unresolved

We currently have performance regression of Ruby's GPU nodes. Some of the GPU nodes on Ruby will remain in a power-saving state even after an application starts using them, resulting in... Read more

1 year 4 months ago 1 year 4 months ago
LAMMPS 14May16 velocity command problem on Owens Software Unresolved

LAMMPS 14May16 on Owens can hang when using the velocity command.  Inputs that hang on Owens work on Oakley and Ruby.  LAMMPS 31Mar17 on Owens also works.  Here is an example failing input snippet... Read more

1 year 4 months ago 6 months 1 week ago
Nvidia drivers on Oakley GPU Unresolved

We upgraded the drivers for the Nvidia GPUs on all of our clusters during the downtime this week. Unfortunately, we are noticing some subtle problems with the GPUs on Oakley. We will be rolling... Read more

1 year 6 months ago 1 year 6 months ago

Pages