SSH Slurm’s job submission and control clients

Greetings!

I am trying to send a Slurm job via an SSH connection on the node on which OOD is installed and I get the following error:
"
sbatch: error: Batch job submission failed: Zero Bytes were transmitted or received
"

The OOD server is a separate server from the cluster.
I have already configured my .yml file in the following way:

v2:
  job:
    adapter: slurm
    submit_host: my_cluster.scheduler
    bin: /usr/bin
    conf: /etc/slurm/slurm.conf
    host: my_cluster.scheduler
    strict_host_checking: false
  login:
    host: my_cluster.scheduler
  metadata:
    title: Slurm Cluster

And I can correctly submit a job from the Job-Composer.

Thank you in advance for any help

Hi and welcome!

A quick google search of the issue seems to indicate there’s some issue in your Slurm setup. Like there’s a connectivity issue or similar. I’d check the slurm logs first as something somewhere (the controller or the deamons) may have indicated why it rejected your job.

Hello, thank you for your reply.
Indeed the logs showed an authentication error due to the munge.key
My mistake for not looking earlier.

Thank you very much for your support,
Vittorio

1 Like

No problem at all! Just open another topic if you have more issues.