Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolution Description Postedsort ascending Updated
pdsh -j broken on Oakley Batch, system software Resolved

pdsh -j is broken on Oakley.  It was broken by updates during the September downtime.  We are currently working on resolving the issue.

Users who require... Read more

9 years 1 week ago 6 years 6 months ago
Estimated charging for serial jobs on Oakley is incorrect Batch Resolved

Currently, the estimated RU charge reported at the end of a job shows an incorrect value for serial jobs on Oakley of the entire node. Jobs are being charged the correct amount in the official... Read more

9 years 2 months ago 6 years 6 months ago
Downtime Update: All Major Services Online Resolved

Friday, Sept 25th 12PM Noon:

  • Oakley is back online and has resumed running jobs.  
  • Ruby... Read more
9 years 3 months ago 9 years 2 months ago
Problems with Project Space (/nfs/gpfs) filesystem Resolved

(9/8/15 14:21 Eastern) Project space appears to be back to normal operation. We are running some tests to verify that the problem is fully resolved.


As of early afternoon, Sept. 8,... Read more

9 years 3 months ago 9 years 3 months ago
Lustre bug causing Oakley login node crashes filesystem, login Resolved

Over the past two weeks we have experienced Oakely login node crashes potentially caused by a Lustre bug.  The bug (or issue otherwise) seems to be activated when a user does operations on a... Read more

9 years 3 months ago 9 years 2 months ago
Unscheduled GPFS Outage filesystem Resolved

As of 11:30PM on June 16th, we have removed the GPFS filesystem from service due to a number of hardware failures. At this point, further hardware failures would put a large portion of the entire... Read more

9 years 6 months ago 9 years 5 months ago
warning: libhwloc.so.1 may conflict with libhwloc.so.5 Resolved

Sometimes when building MPI programs the following warning appears.  It is harmless and can be safely ignored.

ld: warning: libhwloc.so.1, needed by /usr/local/mvapich2/1.7-intel/lib/... Read more          
9 years 7 months ago 9 years 2 months ago
Matlab PCT broken due to pbsrsh modification Matlab Resolved

A change was made to the system wide pbsrsh script which Matlab relies on.  It has been discovered that this change has broken the parallel computing toolbox (... Read more

9 years 7 months ago 9 years 2 months ago
module spider/avail/show not showing MPI dependent modules Ruby Resolved

On Ruby, the commands:

  • module spider
  • module avail
  • module show... Read more
9 years 7 months ago 9 years 2 months ago
qsub filter rejects valid jobs Resolved

Job scripts submitted on Glenn, Oakley, or Ruby all go a submit filter before reaching the resource manager, Torque.  A bug has been discovered in our submit filter which prevents jobs with the... Read more

9 years 8 months ago 9 years 2 months ago

Pages