Pitzer

Pitzer Downtime September 22, 2020

An approximately 3-hour downtime for the Pitzer system is scheduled starting from 9 a.m. Tuesday, September 22, 2020, to finalize the Slurm transition on the Pitzer system. This will affect both the Pitzer legacy compute and login nodes, as well as the new Pitzer hardware. During this downtime time, users will not be able to access Pitzer and submit jobs. Other OSC services, including Owens and Ruby Clusters, web portals, and HPC file servers will be available.

Pitzer Expansion Early User Program

In preparation for the deployment of the new hardware as well as the Slurm migration on Pitzer in fall 2020, OSC would like to invite all members of the client community to participate in the Pitzer Expansion Early User Program in order to test the functionality of new hardware, including Dual GPU and Quad GPU features, and test the new Slurm scheduler and its TORQUE/Moab compatibility. Jobs that are eligible for the early user program will not be charged and there is no registration required, all OSC users are eligible.

Slurm Migration Issues

This page documents the known issues for migrating jobs from Torque to Slurm.

$PBS_NODEFILE and $SLURM_JOB_NODELIST

Please be aware that $PBS_NODEFILE is a file while $SLURM_JOB_NODELIST is a string variable. 

The analog on Slurm to cat $PBS_NODEFILE is srun hostname | sort -n 

Environment variables are not evaluated in job script directives

Environment variables do not work in a slurm directive inside a job script.

 
1 Start 2 Complete

Please report the problem here when you use Slurm

CAPTCHA
This question is for testing whether you are a human visitor and to prevent automated spam submissions.

Pitzer compute unavailable between 7am Aug 18 and noon Aug 20, 2020

A downtime for all OSC HPC systems is scheduled from 7 a.m. to 9 p.m., Tuesday, August 18, 2020. Pitzer login nodes will be available at the end of the normal downtime window. However, all compute nodes of Pitzer cluster will be unavailable through noon on August 20, 2020 to allow for cooling changes for the Pitzer expansion. To stay up to date on system notices, follow @HPCNotices on Twitter. As always, you can contact us at OSC Help.

Slurm Migration

Overview

Slurm, which stands for Simple Linux Utility for Resource Management, is a widely used open-source HPC resource management and scheduling system that originated at Lawrence Livermore National Laboratory.

It is decided that OSC will be implementing Slurm for job scheduling and resource management, to replace the Torque resource manager and Moab scheduling system that it currently uses, over the course of 2020.

Pages