Globus Online Transfers Failing |
Connectivity, filesystem, Web Services |
Resolved |
We are currently investigating multiple reports of Globus Online transfers to/from OSC to other sites are failing. Transfers to/from Globus Personal Endpoints do not seem to be affected.
... Read more |
8 years 8 months ago |
6 years 6 months ago |
June 7th downtime to finish at 6:30PM |
Connectivity, filesystem, Infrastructure, login, Login Problems, Maintenance, Operations, Outage |
Resolved |
Update: Downtime completed at 6:30PM, June 7th.
The June 7th downtime is now slated to be completed at 6:30PM. Previous estimate was 5PM.
All systems and services will... Read more |
8 years 6 months ago |
8 years 6 months ago |
NFS service disruption 6/29/16 |
filesystem |
Resolved |
OSC experienced errors with NFS services the morning of June 29 between 08:37 and 09:12 that may have caused some jobs to fail, or other unexpected behavior. The... Read more |
8 years 5 months ago |
8 years 5 months ago |
Some issues remain after downtime |
Login Problems, Operations, Outage |
Resolved |
15 July 2016, 5:00PM update: some additional issues we are facing
- We are experiencing periodic hangs of the GPFS client file system software used with the new storage environment. We... Read more
|
8 years 5 months ago |
7 years 2 months ago |
All HPC systems are available |
Login Problems, Operations, Outage |
Resolved |
8/24/16 3:57PM: All HPC systems are availalbe including:
- Oakley cluster for general access
- Ruby cluster for restricted access
- Owens cluster for... Read more
|
8 years 3 months ago |
8 years 3 months ago |
GPFS hang Issue on 09/08/2016 |
filesystem |
Resolved |
On Thursday, Sept 8 starting at 19:37, we had some bad interaction that appears to have been caused by the backup client, and the GPFS servers. This resulted in a GPFS hang that propagated I/O... Read more |
8 years 3 months ago |
8 years 3 months ago |
Problems with LAMMPS 14May16 |
Software |
Resolved |
LAMMPS 14May16 was built with the USER-OMP package on Oakley, Ruby, and Owens. Its default behavior is to spawn too many OpenMP threads. lammps/14May16 batch scripts that do not use the USER-OMP... Read more |
8 years 3 months ago |
8 years 2 weeks ago |
libibumad.so.2 missing on Oakley |
Software |
Resolved |
Update: We think this is fixed. Please submit a ticket if you encounter further problems.
As a result of updates made during yesterday's downtime, software built with mvapich2/... Read more |
8 years 2 months ago |
8 years 2 months ago |
Interruption of both Project and Scratch filesystems on 10/13/2016 |
filesystem |
Resolved |
We experienced a brief interruption of both Project and Scratch filesystems at about 5:15PM October 13, 2016. User jobs may have been effected.
We'll update more details later.
|
8 years 2 months ago |
8 years 2 months ago |
Nvidia drivers on Oakley |
GPU |
Resolved |
We upgraded the drivers for the Nvidia GPUs on all of our clusters during the downtime this week. Unfortunately, we are noticing some subtle problems with the GPUs on Oakley. We will be rolling... Read more |
8 years 2 months ago |
6 years 6 months ago |