Known issues

Unresolved known issues

A known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available, which may include using different software or hardware.

A known issue with a Resolved Resolution state has been corrected.

Known Issues

Title Category Resolution Description Posted Updated
Oakley login nodes and ruby02 will not be accessible between 9:00-9:30am on 10/18/2016 login Resolved

We upgraded to RHEL 6.8 on both the Oakley and Ruby clusters during the October 12th downtime. Unfortunately, we are noticing an NFS problem that has been causing rsh or ssh sessions to hang on...

7 years 10 months ago 7 years 10 months ago
GPFS problems on Owens filesystem Resolved

Owens is experiencing a disruption of GPFS availability. At about 4:17PM today (January 6th), OSC monitoring noticed a problem with mounts of Project on the Owens supercomputer. Jobs may have been...

4 years 8 months ago 4 years 8 months ago
Password changes may be delayed Infrastructure Resolved

Due to an infrastructure problem, password changes via ARMSTRONG may be delayed until further notice.

10 years 9 months ago 10 years 9 months ago
abaqus: partial node jobs Software Resolved

If you run a parallel (or even serial!) job, but do not use all the CPUs...

6 years 5 months ago 6 years 3 months ago
myosc outage - May 17, 2021 client portal Resolved

Resolution: Access to myosc was restored.

myosc is currently unavailable.

...
3 years 3 months ago 3 years 3 months ago
OnDemand, Awesim, and DB Services down morning of Feb 12 Resolved

Update: Reboot was successful. OnDemand, Awesim, and Database services are back online. Report any issues to oschelp@osc.edu.


A short reboot...

9 years 7 months ago 9 years 7 months ago
Lustre bug causing Oakley login node crashes filesystem, login Resolved

Over the past two weeks we have experienced Oakley login node crashes potentially caused by a Lustre bug. The bug (or issue otherwise) seems to be activated when a user does operations on a...

9 years 2 weeks ago 8 years 11 months ago
"Forgot your password?" Unavailable Account Management, client portal Resolved

Password changes cannot be completed via the "Forgot your password?" tool at the login page of the Client Portal (my.osc.edu).

Passwords can be changed once you log into the Client Portal...

5 years 2 months ago 5 years 2 months ago
Nsight GPU profiler not working due to DCGM conflict GPU, Infrastructure Resolved

UPDATE (Mar 15, 2023)

After the downtime on Mar. 14, 2023, OSC enabled a new Slurm option --gres=nsight. DCGM will be disabled on the nodes for the job with the Slurm option,...

1 year 6 months ago 1 year 5 months ago
Scratch and Project are hung; scheduling has been paused Batch, filesystem Resolved

1:00PM 4/6/2017 Update: The Scratch and Project file systems are back to normal service. Scheduling on the systems has resumed. We are still investigating the causes of this problem...

7 years 5 months ago 7 years 5 months ago
