Pitzer

Some MKL environment variables have incorrect paths

MKL module files define some helper environment variables with incorrect paths.  This can yield link time errors.  All three clusters are affected.  We are working to correct the module files.   A workaround for users is to redefine the environment variable with the correct path; this requires some computational maturity.  We recommend users contact oschelp@osc.edu for assistance.  An example error from Cardinal with module intel-oneapi-mkl/2023.2.0 that defined environment variable MKL_LIBS_INT64 follows:

MPI-IO issues on home directories with Intel MPI 2019.3

Certain MPI-IO operations with intelmpi/2019.3 may crash, fail or proceed with errors on the home directory. We do not expect the same issue on our GPFS file system, such as the project space and the scratch space. The problem might be related to the known issue reported by HDF5 group. Please read the section "Problem Reading A Collectively Written Dataset in Parallel" from HDF5 Known Issues for more detail.

Affected versions

2019.3

Rolling reboot of Ascend, Owens and Pitzer starting from Oct 25 2023

Update on Nov 8 2023:

Rolling reboots of all clusters are completed. 

Update on Nov 3 2023:

Rolling reboots of Ascend and Pitzer clusters are completed. 

Original Post:

We will have rolling reboots of Ascend, Owens and Pitzer clusters including login and compute nodes, starting from 9AM Wednesday October 25, to perform NVIDIA driver and Slurm upgrades.

Pages