问题
I have submitted a job to a SLURM queue, the job has run and completed. I then check the completed jobs using the sacct
command. But looking at the results of the sacct command I notice additional results that I did not expect:
JobID JobName State NCPUS Timelimit
5297048 test COMPLETED 1 00:10:00
5297048.bat+ batch COMPLETED 1
5297048.ext+ extern COMPLETED 1
Can anyone explain what the 'batch' and 'extern' jobs are and what their purpose is. Why does the extern job always complete even when the primary job fails.
I have attempted to search the documentation but have not found a satisfactory and complete answer.
EDIT: Here's the script I am submitting to produce the above sacct
output:
#!/bin/bash
echo test_script > done.txt
With the following sbatch
command:
sbatch -A BRIDGE-CORE-SL2-CPU --nodes=1 --ntasks=1 -p skylake --cpus-per-task 1 -J jobname -t 00:10:00 --output=./output.out --error=./error.err < test.sh
回答1:
A job is comprised of several job steps. Each job step is shown independently. In your output you have $JOBID, which stands for the overall reservation, $JOBID.batch, which represents the main script you submitted.
Regarding external
... I'm not sure, but I guess that you started one job step and you named it as "external". In that case, that is the information of that job step.
If you show us the script you submitted we can clarify some soubts.
来源:https://stackoverflow.com/questions/52447602/slurm-sacct-shows-batch-and-extern-job-names