You may encounter the following error while running an Abaqus parallel job with PMPI:
Traceback (most recent call last):
File "SMAPylModules/SMAPylDriverPy.m/src/driverAnalysis.py", line 263, in run
File "SMAPylModules/SMAPylDriverPy.m/src/driverExplicit.py", line 214, in analyze
File "SMAPylModules/SMAPylDriverPy.m/src/driverExplicitMPI.py", line 36, in runXpl
File "SMAPylModules/SMAPylDriverPy.m/src/driverPhase.py", line 575, in run
File "SMAPylModules/SMAPylDriverPy.m/src/driverPhase.py", line 567, in _run
driverExceptions.AbaqusExecutionError: ('Abaqus/Explicit Analysis', 255, 'knee_bolster_nsm')
slurmstepd: error: Detected 1 oom_kill event in StepId=2822.batch. Some of the step tasks have been OOM Killed.
Cause of the Error
This error occurs because the job is terminated due to the MPI process abnormally running out of memory. This triggers an Out-of-Memory (OOM) event, leading to Slurm job termination.
Affected versions
2022 and 2024
Workaround
Switch to the default MPI implementation (IntelMPI) to run the job. This avoids the memory issue associated with PMPI.