Information about Jobs

TaskCommand
List all jobs for a usersqueue --me
List all running jobs for a usersqueue --me -t RUNNING
List all pending jobs for a user
squeue --me -t PENDING
List all current jobs in the normal partition for a user
squeue --me -p normal
List detailed information for a job (useful for troubleshooting):
scontrol show job -dd <jobid>
Only available after a job has completed
To get statistics on completed jobs by jobID
sacct -j <jobid> --format=JobID,JobName,MaxRSS,Elapsed
To view the same information for all jobs of a user
sacct -u <username> --format=JobID,JobName,MaxRSS,Elapsed
Show priorities for waiting jobs
sprio -l

Controlling Jobs

TaskCommand
To cancel one job
scancel <jobid>
To cancel all the jobs for a user
scancel -u <username>
To cancel all the pending jobs for a user
scancel -t PENDING -u <username>
To cancel one or more jobs by name
scancel --name myJobName
To pause a particular job
scontrol hold <jobid>
To resume a particular job
scontrol resume <jobid>
To requeue (cancel and rerun) a particular job
scontrol requeue <jobid>
  • Keine Stichwörter