Interruption of both Project and Scratch filesystems on 10/13/2016 |
filesystem |
Resolved |
We experienced a brief interruption of both the Project and Scratch filesystems at about 5:15PM on October 13, 2016. User jobs may have been affected.
We will post more details later.
|
7 years 7 months ago |
7 years 7 months ago |
libibumad.so.2 missing on Oakley |
Software |
Resolved |
Update: We think this is fixed. Please submit a ticket if you encounter further problems.
As a result of updates made during yesterday's downtime, software built with mvapich2/... |
7 years 7 months ago |
7 years 7 months ago |
Problems with LAMMPS 14May16 |
Software |
Resolved |
LAMMPS 14May16 was built with the USER-OMP package on Oakley, Ruby, and Owens. Its default behavior is to spawn too many OpenMP threads. lammps/14May16 batch scripts that do not use the USER-OMP... |
7 years 8 months ago |
7 years 5 months ago |
GPFS hang Issue on 09/08/2016 |
filesystem |
Resolved |
On Thursday, Sept 8, starting at 19:37, we had a bad interaction that appears to have involved the backup client and the GPFS servers. This resulted in a GPFS hang that propagated I/O... |
7 years 8 months ago |
7 years 8 months ago |
All HPC systems are available |
Login Problems, Operations, Outage |
Resolved |
8/24/16 3:57PM: All HPC systems are available, including:
- Oakley cluster for general access
- Ruby cluster for restricted access
- Owens cluster for...
|
7 years 9 months ago |
7 years 8 months ago |
Some issues remain after downtime |
Login Problems, Operations, Outage |
Resolved |
15 July 2016, 5:00PM update: some additional issues we are facing
- We are experiencing periodic hangs of the GPFS client file system software used with the new storage environment. We...
|
7 years 10 months ago |
6 years 7 months ago |
NFS service disruption 6/29/16 |
filesystem |
Resolved |
OSC experienced errors with NFS services on the morning of June 29, between 08:37 and 09:12, that may have caused job failures or other unexpected behavior. The... |
7 years 10 months ago |
7 years 10 months ago |
June 7th downtime to finish at 6:30PM |
Connectivity, filesystem, Infrastructure, login, Login Problems, Maintenance, Operations, Outage |
Resolved |
Update: Downtime completed at 6:30PM, June 7th.
The June 7th downtime is now slated to be completed at 6:30PM. The previous estimate was 5PM.
All systems and services will... |
7 years 11 months ago |
7 years 11 months ago |
Globus Online Transfers Failing |
Connectivity, filesystem, Web Services |
Resolved |
We are currently investigating multiple reports that Globus Online transfers between OSC and other sites are failing. Transfers to/from Globus Personal Endpoints do not seem to be affected.
... |
8 years 1 month ago |
5 years 11 months ago |
Submit filter bug after downtime |
Batch |
Resolved |
A change made to part of our batch software during the downtime should have affected only users who belong to multiple projects. We have found that there is a bug in the changes... |
8 years 3 months ago |
8 years 3 months ago |