I have a complex fortran MPI application running under a Torque/Maui system. When I run my application it produces a huge unique output (~20 GB). To avoid that, I produced a RunJob script that breaks up the running in 5 pieces, each producing smaller outputs much easier to handle.
For the moment my RunJob script stops correctly at the end of the first piece and also produces the correct output. However, when it tries to restart I get the following error message:
qsub: Bad UID for job execution MSG=ruserok failed validating username/username from compute-0-0.local
I know that this problem comes from the fact the Torque/Maui system by default does not allow a node to submit a job.
In fact, when I type this:
qmgr -c 'l s' | grep allow_node_submit
I got:
allow_node_submit = False
I do not have an administrator account just a user one
My questions are:
- Is it possible to set allow_node_submit = true on the gmgr being a user ? How ? (- I guess not)
- If question 1 = false, is there another way to work around this ? How ?
All the best.