Failed to submit session with the following error: No such file or directory - /opt/slurm/bin/sbatch

Hello all.
I'm receiving the error in the title above. I have verified that the file exists.
OOD is on one server and AWS ParallelCluster is on a second server.

I have searched the logs and couldn’t find anything that would tell me what this error is.
I’m using the RStudio interactive app example as a test.

Thanks.

On which server? It needs to exist on the server that OOD is installed on. Alternatively, you can add a submit_host to clusters.d/slurm.yml, and the system will ssh into that host to issue commands (like /opt/slurm/bin/sbatch).

By default the system uses the localhost’s filesystem, localhost being the host that OOD is installed and running on.

Thanks for the clarification, very helpful.
I do have submit_host configured in my clusters.d file:
sh-4.4$ sudo cat example2.yml
v2:
  metadata:
    title: "test-al2"
  login:
    host: "10.121.123.123"
    submit_host: "10.121.123.123"
  job:
    adapter: "slurm"
    cluster: "test-al2"
    bin: "/opt/slurm/bin"
    conf: "/opt/slurm/etc/slurm.conf"
  batch_connect:
    basic:
      script_wrapper: |
        module purge
        %s
    vnc:
      script_wrapper: |
        module purge
        export PATH="/usr/local/turbovnc/bin:$PATH"
        export WEBSOCKIFY_CMD="/usr/local/websockify/run"
        %s

submit_host should be within the job section, not login.

v2:
  job:
    submit_host: '10.10.123.213'
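
Applied to the file posted earlier in this thread, a sketch of the corrected layout might look like the following. It assumes the login node and the submit host are the same machine (10.121.123.123, as in that file) and that the batch_connect section stays as it was; the only change is that submit_host moves from login to job.

```yaml
v2:
  metadata:
    title: "test-al2"
  login:
    host: "10.121.123.123"
  job:
    adapter: "slurm"
    cluster: "test-al2"
    bin: "/opt/slurm/bin"
    conf: "/opt/slurm/etc/slurm.conf"
    # Moved here from the login section: OOD will ssh to this host to run sbatch.
    submit_host: "10.121.123.123"
```

With this layout, /opt/slurm/bin/sbatch should only need to exist on the submit host, not on the OOD server, since the command is issued over ssh.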

That was it. Thank you.
