Hello all.
Receiving the error in the title above. I have verified that the file exists.
OoD is on one server and AWS Parallel Cluster is on a second server.
I have searched the logs and couldn’t find anything that would tell me what this error is.
I’m using the RStudio interactive app example as a test.
Thanks.
On which server? It needs to exist on the server that OOD is installed on. Or you can add a submit_host
to the cluster.d/slurm.yml
and the system will ssh into that host and issue commands (like /opt/slurm/bin/sbatch
).
By default the system will use the localhost’s filesystem. localhost being the host that OOD is installed & running on.
Thanks for the clarification, very helpful.
I do have submit_host configured in cluster.d/slurm.yml.
sh-4.4$ sudo cat example2.yml
v2:
metadata:
title: “test-al2”
login:
host: “10.121.123.123”
submit_host: “10.121.123.123”
job:
adapter: “slurm”
cluster: “test-al2”
bin: “/opt/slurm/bin”
conf: “/opt/slurm/etc/slurm.conf”
batch_connect:
basic:
script_wrapper: |
module purge
%s
vnc:
script_wrapper: |
module purge
export PATH=“/usr/local/turbovnc/bin:$PATH”
export WEBSOCKIFY_CMD=“/usr/local/websockify/run”
%s
It should be within the job
section, not login
.
v2:
job:
submit_host: '10.10.123.213'
1 Like