I have a SLURM job I submit with sbatch
, such as
sbatch --gres gpu:Tesla-V100:1 job.sh
job.sh
trains a model on a V100 GPU. The code itself does not log GPU memory usage.
Is there a SLURM command to query peak GPU memory usage once the job is finished?