Questions tagged [pbs]

29 questions
5
votes
0 answers

torque pbs 4.0.1 job stays queued ('Q') state; the scheduler seems not receiving any notification

I am using torque 4.0.1 on openSUSE 12.1 in a cluster environment. When I qsub a job (simple as "echo hello"), it remains in 'Q' state, and never gets scheduled. I can force the job to run with qrun, and it is executed on the first node without…
liding
  • 51
  • 1
  • 4
3
votes
2 answers

How to find out the PBS version installed in a cluster?

Whenever I submit a job array, I get one job ID in this form: 463880-1.hostname with 463880 being the job ID and 1 the array ID, one job for each "array position". Today I'm using a different machine, and it looks like a different PBS version,…
RSFalcon7
  • 163
  • 1
  • 6
3
votes
0 answers

Email notifications per JOB ARRAY not per job in PBS torque

Is there a way to configure torque to send email notifications on start and end of job array, and not per job, Im managing job arrays of thousands of jobs, and I dont want to get flooded by mails. But indeed I want to know when the entire job array…
3
votes
2 answers

Can read/write jobs be put in a queue?

I share a server with HAL. The server has 32 GB of memory. I rarely use more than 1 GB of memory, and when I do, it is for a few minutes at a time, and I don't mind sending such jobs to the back of the line. HAL read/writes large files (e.g. using…
3
votes
2 answers

Torque jobs does not enter "E" state (unless "qrun")

Jobs I add to the queue stays there in "Queued" state without attempts to be executed (unless I manually qrun them) /var/spool/torque/server_logs say just 04/11/2011 12:43:27;0100;PBS_Server;Job;16.localhost;enqueuing into batch, state 1 hop…
Vi.
  • 821
  • 11
  • 19
2
votes
1 answer

Job submitted to Torque does not generate error/log file

As stated, I have just installed Torque on a Ubuntu 16.04 machine. The submitted jobs complete just fine but the -e and -o flags seem to not be working. No error and log files are created even though I have given the flag an absolute path to the…
user121392
  • 13
  • 1
  • 6
2
votes
0 answers

Torque pbs queue system runs queue in reverse

I have a small compute cluster set up on Redhat 7.1. It runs the PBS torque queue system with version 5.1.1. When I queue several jobs it starts to run the jobs in "backwards" priority. It starts with the job which was submitted last. Is there any…
Pe2
  • 21
  • 2
2
votes
1 answer

PBS Jobs - serial vs parallel job which one is better for my scenario

I have access to a cluster that uses 'torque' (I think) and we use PBS scripts to submit jobs. I need to run more then 200 instances of an app that I've developed in java. The app acts as a peer forming a P2P network, which means that those…
BraCa
  • 133
  • 4
2
votes
1 answer

pbs/Torque interactive jobs: bash trick to run command after launch

I'm using pbs interactive jobs in bash, launched using qsub -I, and I'd like to run some command after launch. After this command is run, I'd like to continue to use the interactive reservation as usual. I'm unsure how to accomplish this, because…
BigChief
  • 398
  • 1
  • 2
  • 12
2
votes
1 answer

How can I set up interactive-job-only or batch-job-only partition on a SLURM cluster?

I'm managing a PBS/torque HPC cluster, and now I'm setting up another cluster with SLURM. On the PBS cluster, I can set a queue to accept only interactive jobs by qmgr -c "set queue interactive_q disallowed_types = batch" and to accept only batch…
wdg
  • 143
  • 1
  • 5
1
vote
0 answers

How to setup bash environment inside Torque PBS? Why source ~/.bashrc doesn't work?

I have successfully installed Torque PBS on my ubuntu server. Job submission is fine. However there is an annoying thing. I found bash environment is not right inside PBS for example echo 'echo $PATH > ~/res.txt' | qsub and the content of res.txt…
user15964
  • 121
  • 5
1
vote
0 answers

PBS Torque Limit Resource by Time of Day

I am using Torque to manage software that I have a limited number of licenses of (4 to be exact). During the day I need to keep 2 licenses free for use of setting up cases and at night and over the weekend I can use all 4 licenses to solve cases. Is…
LWhitson2
  • 111
  • 3
1
vote
0 answers

Torque queue issue

I am having troubles with Torque + Maui. The problem is the following: I have 2 queues, each queue has 10 associated nodes. If i submit 10k jobs to the first queue and i submit 1 job to the second one, the job in the second one remains in Q…
Andrea
  • 11
  • 1
1
vote
1 answer

pbs check how much cpu a job is using / job % cpu usage

On a cluster a job is submitted to run on N CPUs. qstat -f doesn't show how much CPU resources the job is using other than the time. How does one check whether 100% of each CPU power is used? It is essential to determining the resources required for…
user102906
1
vote
1 answer

Why does qdel comman return 'Unknown Job Id'

OS Version: CentOS release 4.6 (Final) Kernel \r on an \m 2.6.9-100.ELsmp Problem When I run qdel i get the following error: qdel: Unknown Job Id 20432.scyld.localdomain Information Output of qstat -n: head0.localdomain: …
1
2