Known Issues

Known Issues

Title Category Resolution Description Posted Updated
PBS commands on Owens are not working Batch, Owens Resolved

Update posted on July 12, 2017 at 1:50PM:

We have fixed the problem with the batch management system on Owens and queues on Owens have been opened again for jobs.

... Read more

6 months 1 week ago 6 months 1 week ago
Rolling reboot of Owens cluster, starting from 9AM June 28, 2017 Owens Resolved

Update posted on July 7, 2017 at 2:00PM:

Rolling reboot of login and compute nodes of Owens cluster is completed. 

... Read more
6 months 3 weeks ago 6 months 2 weeks ago
Systemic Problem on Cluster Computing service Operations Resolved

4:20PM 6/23/2017 Update: All HPC systems are back in production. This outage may cause failures of users' jobs. We'll update the community as more is known. 

... Read more
7 months 3 days ago 6 months 3 weeks ago
my.osc.edu is NOT available Account Management Resolved

my.osc.edu has not been fully restored after yesterday's downtime. You can change your password, but you will not be able to use the new password on my.osc.edu. The updated password will work to... Read more

7 months 4 weeks ago 7 months 4 weeks ago
"pbsdcp" is not working on Oakley Oakley Resolved

12:35PM 5/24/2017 Update: pbsdcp   has been fixed on Oakley.

pbsdcp   is not working on Oakley and returns a missing library error as below:... Read more

7 months 4 weeks ago 7 months 4 weeks ago
Issue with GPFS on Owens since April 14, 2017 Batch, filesystem, Owens Resolved

3:10PM 4/18/2017 Update: Rolling reboots on Owens have started to address this GPFS issue. 

We have had issues with GPFS mounts on Owens Cluster since Friday afternoon,... Read more

9 months 6 days ago 8 months 3 weeks ago
Rolling reboot of all clusters, starting from Wednesday morning, April 19, 2017 Batch, Maintenance, Oakley, Owens, Ruby Resolved

1:40PM 4/27/2017 Update: Rolling reboots are completed. 

3:10PM 4/18/2017 Update: Rolling reboots on Owens have started to address GPFS errors occured... Read more

9 months 6 days ago 8 months 3 weeks ago
Scratch and Project are hung; schedulings have been paused Batch, filesystem Resolved

1:00PM 4/6/2017 Update:  The Scratch and Project file systems are back to normal service. Scheduling on systems are resumed. We are still investigating the causes to this problem... Read more

9 months 2 weeks ago 9 months 2 weeks ago
Owens is in Partial Service Owens Resolved

3:45PM April 3, 2017 Update: GPU nodes on Owens are available. 

206 Owens nodes are not accessible to users due to GPU testing and a bad Ethernet switch. It is expected... Read more

9 months 3 weeks ago 9 months 2 weeks ago
Rolling reboot of compute and login nodes of all clusters, starting from Wednesday morning, March 22, 2017 login, Oakley, Owens, Ruby Resolved

4:56PM 3/28/2017 Update: The rolling reboots of all systems are completed. 

All compute nodes and login nodes of Owens, Oakley, and Ruby clusters will need to be rebooted... Read more

10 months 1 week ago 9 months 3 weeks ago

Pages