Job id 35226904 stuck

Hi there,

Actually, slurmctld does evaluate the job regularly, but it still has a lower priority than other pending jobs (cf. Job priority explanation):

[root@login2 ~]# scontrol show Job=35226904 | \
 grep -e 'Reason' \
      -e 'LastSchedEval'
   JobState=PENDING Reason=Priority Dependency=(null)
   SuspendTime=None SecsPreSuspend=0 LastSchedEval=2020-06-18T14:02:26
[root@login2 ~]# sprio -u cucci
          JOBID PARTITION     USER   PRIORITY       SITE        AGE  FAIRSHARE    JOBSIZE  PARTITION        QOS
       35226904 mono-shar    cucci       4647          0         36        859          2       3750          0
[root@login2 ~]# 

However, your job should not have been accepted on the mono-shared-EL7 partition, since it requests more memory per CPU (16000M for 1 CPU, see below) than is permitted there (10GB, cf. https://baobab.unige.ch/enduser/src/enduser/enduser.html#partitions-and-limits ):

[root@login2 ~]# scontrol show Job=35226904 | \
 grep -e Partition \
      -e TRES
   Partition=mono-shared-EL7 AllocNode:Sid=login2:138783
   TRES=cpu=1,mem=16000M,node=1,billing=1
[root@login2 ~]# 
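
For reference, the check behind this is plain arithmetic on the TRES line above: requested memory divided by requested CPUs, compared with the partition limit. A minimal sketch, assuming the 10GB limit corresponds to 10240M (my assumption; with 10000M the conclusion is the same). The variable names are only illustrative:

```shell
#!/bin/bash
# TRES string copied from the scontrol output above.
tres='cpu=1,mem=16000M,node=1,billing=1'

# Assumed per-CPU limit for mono-shared-EL7: 10GB taken as 10240M.
limit_mb=10240

# Split the TRES string on commas and pick out the cpu count and the
# memory value. This sketch only handles memory given with an "M" suffix.
cpus=$(printf '%s\n' "$tres" | tr ',' '\n' | sed -n 's/^cpu=\([0-9]*\)$/\1/p')
mem_mb=$(printf '%s\n' "$tres" | tr ',' '\n' | sed -n 's/^mem=\([0-9]*\)M$/\1/p')

# Integer division is enough here since the job asks for a single CPU.
per_cpu_mb=$(( mem_mb / cpus ))

if [ "$per_cpu_mb" -gt "$limit_mb" ]; then
    verdict="over limit"
else
    verdict="within limit"
fi

echo "per-CPU memory: ${per_cpu_mb}M (limit ${limit_mb}M): ${verdict}"
```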

Should I move the job to another partition, shared-EL7 for example? Or would you prefer to reduce the requested memory?
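
Either option can be applied to the pending job with scontrol update. The sketch below only prints the two commands (a dry run) rather than running them. The value 10000 for MinMemoryCPU is just an example at or below the limit, and I am assuming here that shared-EL7 accepts 16000M per CPU:

```shell
#!/bin/bash
jobid=35226904

# Option 1: keep the 16000M request and move the job to shared-EL7.
move_cmd="scontrol update JobId=${jobid} Partition=shared-EL7"

# Option 2: stay on mono-shared-EL7 and lower the per-CPU memory (in MB).
# 10000 is an illustrative value, not a recommendation.
shrink_cmd="scontrol update JobId=${jobid} MinMemoryCPU=10000"

# Dry run: show the commands instead of executing them.
echo "$move_cmd"
echo "$shrink_cmd"
```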

Thx, bye,
Luca