ondemand gpu request error Nov 2021 |
Batch, OnDemand, Pitzer |
Resolved |
When requesting an interactive session in ondemand and requesting gpu resources, users may see an error similar similar to "sbatch: error: Invalid generic resource (gres) specification"
... Read more |
1 year 9 months ago |
1 year 9 months ago |
Rolling reboot of Owens cluster, starting from Monday, April 16, 2018 |
Owens |
Resolved |
12:00 PM 5/7/2018 Update:
The rolling reboot of Owens has been completed.
Posted on April 11, 2018, at 3:45... Read more |
5 years 5 months ago |
5 years 4 months ago |
Replacement of Owens Ethernet switches from Dec 14, 2018 |
Network, Owens |
Resolved |
Updated on Jan 16, 2019, at 09:20 AM:
The replacement is done except for the three switches including the login nodes of Owens. We posted another notice for more... Read more |
5 years 1 week ago |
4 years 8 months ago |
Balance could be non-existent |
client portal |
Resolved |
Balances may be none existent in my.osc.edu and OSCusage command. Balances are being properly accounted for in the background.
The bug has been identified and a patch will be released ASAP... Read more |
3 years 10 months ago |
3 years 8 months ago |
Problems with Project Space (/nfs/gpfs) |
filesystem |
Resolved |
(9/8/15 14:21 Eastern) Project space appears to be back to normal operation. We are running some tests to verify that the problem is fully resolved.
As of early afternoon, Sept. 8,... Read more |
8 years 2 weeks ago |
8 years 2 weeks ago |
Segmentation fault from openmpi/1.10-hpcx and 2.0-hpcx on Owens |
Owens, Software |
Resolved |
We have found that recent MPI jobs using openmpi/1.10-hpcx and openmpi/2.0-hpcx on Owens may complete or hang until the job is killed, but receive segmentation fault. Some applications might be ... Read more |
4 years 2 months ago |
4 years 1 month ago |
starccm outage Feb 21, 2021 |
Licensing, Outage, Owens, Software |
Resolved |
Updated on Feb 25:
StarCCM license outage is restored.
Original post:
OSC's starccm software license will expire at 12 a.m., Sunday, Feb... Read more |
2 years 7 months ago |
2 years 7 months ago |
Rolling reboot of all clusters, starting from Wednesday morning, April 19, 2017 |
Batch, Maintenance, Owens, Ruby |
Resolved |
1:40PM 4/27/2017 Update: Rolling reboots are completed.
3:10PM 4/18/2017 Update: Rolling reboots on Owens have started to address GPFS errors occured... Read more |
6 years 5 months ago |
6 years 4 months ago |
Lustre is still offline. HPC systems back up |
Maintenance |
Resolved |
Day One of the scheduled downtime has been completed, and HPC operations have resumed. As planned, Lustre work will extend into Day Two. Jobs using /fs/lustre or $PFSDIR cannot run until this work... Read more |
9 years 2 months ago |
9 years 2 months ago |
Security vulnerabilities on ARM Forge versions prior to 22.0.x |
Software |
Resolved (workaround) |
ARM identified security vulnerabilities on ARM Forge versions prior to 22.0.x as follow:
- Security update #1: A locally exploitable code-injection vulnerability was identified in... Read more
|
1 year 2 months ago |
1 year 2 months ago |