Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort ascending Description Posted Updated
2/13/2014 0730 - Reboot of login nodes Outage Resolved

We need to reboot all of the login nodes on our production clusters to fix a minor issue from the downtime. We will be conducting this reboot at 7:30AM on Thursday, February 13th 2014. We expect... Read more

9 years 3 months ago 9 years 3 months ago
LS-DYNA License problems on all clusters Software Resolved

We are experiencing problems with the LS-DYNA license server. We will resume working on restoring access to this software on October 24th, 2018.

==

It has been fixed and working... Read more

4 years 7 months ago 4 years 7 months ago
Problems with the nightly backup on ess filesystem Backups Resolved

We are having problems with the nightly backup on ess filesystem, causing missing backups of the /fs/ess file system since Monday, August 9th. Backups of home directory and /fs/project are normal... Read more

1 year 9 months ago 1 year 9 months ago
Estimated charging for serial jobs on Oakley is incorrect Batch Resolved

Currently, the estimated RU charge reported at the end of a job shows an incorrect value for serial jobs on Oakley of the entire node. Jobs are being charged the correct amount in the official... Read more

7 years 7 months ago 4 years 11 months ago
Error when downloading SRA data on computing nodes Owens, Pitzer, Software Resolved
(workaround)

NCBI blocks any connection from computing nodes because they are behind firewalls. Thus OSC users cannot use SRA tools to download data "on-the-fly" at runtime on computing nodes, e.g. 'fastq-dump... Read more

3 years 10 months ago 1 year 1 month ago
OnDemand 2.1 - Issue downloading large directories OnDemand Resolved

There is an issue with OnDemand or Awesim after the new release when downloading directories that are large in size.

Users may encouter issues downloading large directories. Staff are... Read more

2 months 3 weeks ago 2 months 3 weeks ago
VASP job with Out-of-Memory crashes compute node(s) Batch, Owens, Software Resolved
(workaround)

... Read more

6 years 2 weeks ago 1 year 1 month ago
cuda-gdb segmentation fault on startup Owens, Pitzer, Software Resolved

The CUDA debugger, cuda-gdb, can raise a segmentation fault immediately upon execution.  A workaround before executing cuda-gdb is to unload the xalt module, e.g.: 

module unload... Read more          
3 years 1 month ago 1 year 1 month ago
Emergency InfiniBand Shutdown (All systems) Network Resolved

We have returned to service. It appears that we have resolved the networking issues enough to allow jobs to run safely. We will continue working with our vendors to fix any remaining hardware... Read more

8 years 10 months ago 8 years 10 months ago
A partial-node MPI job failed to start using Intel MPI mpiexec Owens, Pitzer, Software Resolved
(workaround)

A partial-node MPI job may fail to start using mpiexec from intelmpi/2019.3 and intelmpi/2019.7 with error messages like

[mpiexec@o0439.ten.osc.... Read more          
2 years 7 months ago 1 year 1 month ago

Pages