FAQ

Computing

My job is PENDING with Reason=Resources

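This reason means the job is waiting for the resources it requested to become available. Check your submit script as well: a typo in a resource request can keep the job pending.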
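
To see why your jobs are waiting, it can help to list the reason Slurm reports for each pending job. The sketch below assumes a hypothetical helper named count_pending_reasons that summarizes the output of squeue; only the squeue options (-u, -t, -h, -o with %i and %r) come from Slurm itself.

```shell
# Hypothetical helper: count pending jobs per reason.
# Input: one line per job, "<jobid> <reason>", as printed by
#   squeue -h -o "%i %r"
count_pending_reasons() {
    # The reason is the second field; a reason containing spaces is
    # counted by its first word only.
    awk '{ count[$2]++ } END { for (r in count) print r, count[r] }' | sort
}

# On the cluster (requires Slurm):
#   squeue -u "$USER" -t PENDING -h -o "%i %r" | count_pending_reasons
```

For a single job, scontrol show job JOBID prints the full request (CPUs, memory, partition), which makes a mistyped value easier to spot.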

When I submit a job, I get: AssocGrpJobsLimit

You have reached your job quota on the requested partition.

When I submit another job, I get: AssocGrpCPUsLimit

You have reached your CPU quota on the requested partition.
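
To compare your current usage with these quotas, you can count your running jobs and the CPUs they hold. This is a minimal sketch: sum_cpus is a hypothetical helper, and the quota values themselves are site-specific and not shown here.

```shell
# Hypothetical helper: add up a column of CPU counts, one number per line,
# as printed by: squeue -h -o "%C"
sum_cpus() {
    awk '{ total += $1 } END { print total + 0 }'
}

# On the cluster (requires Slurm):
#   squeue -u "$USER" -t RUNNING -h | wc -l               # number of running jobs
#   squeue -u "$USER" -t RUNNING -h -o "%C" | sum_cpus    # CPUs currently in use
```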

Storage

My $HOME directory is full

The default quota is 100 GB. Clean up your files, or ask for a quota increase if possible.

How much space am I using in my $HOME?

You can check this with:

quota -s
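
quota -s reports the total; to find what is taking the space, du can rank the directories under $HOME. A minimal sketch, assuming GNU du (for --max-depth) and a hypothetical helper name, largest_dirs:

```shell
# Hypothetical helper: list the N largest directories directly under a directory.
# Sizes are in KiB; the first line is the total for the directory itself.
largest_dirs() {
    dir=${1:-$HOME}
    n=${2:-10}
    du -k --max-depth=1 "$dir" 2>/dev/null | sort -rn | head -n "$n"
}

# Example:
#   largest_dirs "$HOME" 10
```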

CPUs available on a node

How many CPUs are available on a node?

Use this command:

$ scontrol show node compute-0-4
NodeName=compute-0-4 Arch=x86_64 CoresPerSocket=12
   CPUAlloc=20 CPUErr=0 CPUTot=48 CPULoad=10.00 Features=rack-0,48CPUs
   Gres=ioperso:1
   NodeAddr=10.1.255.245 NodeHostName=compute-0-4 Version=14.11
   OS=Linux RealMemory=129180 AllocMem=115000 Sockets=4 Boards=1
   State=MIXED ThreadsPerCore=1 TmpDisk=260810 Weight=20528104
   BootTime=2015-09-02T11:30:42 SlurmdStartTime=2015-10-06T09:53:03
   CurrentWatts=0 LowestJoules=0 ConsumedJoules=0
   ExtSensorsJoules=n/s ExtSensorsWatts=0 ExtSensorsTemp=n/s

In this output, CPUAlloc=20 and CPUTot=48: 20 CPUs are allocated on a 48-CPU node.
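
The same figures can be extracted in a script. The sketch below uses a hypothetical helper, free_cpus, that reads the output of scontrol show node for a single node and prints CPUTot minus CPUAlloc:

```shell
# Hypothetical helper: print the number of unallocated CPUs on one node.
# Input: the output of "scontrol show node <nodename>" for a single node.
free_cpus() {
    awk '{
        for (i = 1; i <= NF; i++) {
            if ($i ~ /^CPUAlloc=/) { split($i, a, "="); alloc = a[2] }
            if ($i ~ /^CPUTot=/)   { split($i, t, "="); tot = t[2] }
        }
    } END { print tot - alloc }'
}

# On the cluster (requires Slurm):
#   scontrol show node compute-0-4 | free_cpus
```

For the node shown above this prints 28 (48 minus 20).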

Hyperthreading

I can’t launch a hyperthreaded job

By default, hyperthreading is disabled. To enable it, add this line to your submit script:

#SBATCH --ntasks-per-core=2
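
In context, the directive goes at the top of the submit script, before any command. A minimal sketch, written here through a here-document so the file can be created in one step; the file name ht_job.sh, the job name, the task count and ./my_program are placeholders, not site defaults:

```shell
# Write a hypothetical submit script named ht_job.sh.
cat > ht_job.sh <<'EOF'
#!/bin/bash
#SBATCH --job-name=ht_test
#SBATCH --ntasks=4
#SBATCH --ntasks-per-core=2

# With up to 2 tasks per core, the 4 tasks can be placed on 2 physical cores.
srun ./my_program
EOF

# Submit it on the cluster (requires Slurm):
#   sbatch ht_job.sh
```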

Compilation

Which options should I use to get a program that runs on the entire cluster?

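Do not use a -x option such as -xAVX, which restricts the binary to CPUs that support that instruction set. Alternatively, use -axCORE-AVX2,AVX,SSE4.2, which generates a code path for each listed instruction set and selects one at run time.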

Which options match which CPUs?

  • xeonv1 : -xAVX
  • xeonv2 : -xAVX
  • xeonv3 : -xCORE-AVX2
  • xeonv4 : -xCORE-AVX2
  • moonshot : -xSSE4.2
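
The mapping above can be turned into a small helper for build scripts. This is a sketch: flag_for_cpu is a hypothetical name, the CPU labels are the ones listed above, and icc is assumed to be the Intel compiler in use.

```shell
# Hypothetical helper: print the Intel compiler flag for a given CPU type.
# Unknown types fall back to the portable multi-path option.
flag_for_cpu() {
    case "$1" in
        xeonv1|xeonv2) echo "-xAVX" ;;
        xeonv3|xeonv4) echo "-xCORE-AVX2" ;;
        moonshot)      echo "-xSSE4.2" ;;
        *)             echo "-axCORE-AVX2,AVX,SSE4.2" ;;
    esac
}

# Example (assumes icc and a source file my_program.c, both placeholders):
#   icc -O2 "$(flag_for_cpu xeonv3)" -o my_program my_program.c
```

A binary built with a specific -x flag should only be run on the matching nodes; the fallback value is the one to use when the job may land anywhere on the cluster.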

For more information about compiler flags: https://software.intel.com/en-us/articles/performance-tools-for-software-developers-intel-compiler-options-for-sse-generation-and-processor-specific-optimizations

MPI

Should I use mpirun or srun?

To use MPI with Slurm, you have to use srun.
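
A minimal sketch of an MPI submit script, assuming a hypothetical executable ./my_mpi_program and a placeholder task count; srun starts one process per task requested with --ntasks, so no process count is passed on the command line:

```shell
# Write a hypothetical MPI submit script named mpi_job.sh.
cat > mpi_job.sh <<'EOF'
#!/bin/bash
#SBATCH --job-name=mpi_test
#SBATCH --ntasks=8

# srun takes the number of tasks from the allocation (8 here).
srun ./my_mpi_program
EOF

# Submit it on the cluster (requires Slurm):
#   sbatch mpi_job.sh
```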