s.park
(s.park)
July 28, 2025, 2:37pm
1
Hello,
I got error when I submit an interactive desktop.
my slurm.conf is readable for everyone.
# ls -l /etc/slurm/slurm.conf
-rw-r--r--. 1 root root 7922 Jul 28 14:06 /etc/slurm/slurm.conf
Indise of slurm.conf, ClusterName
is set.
ondemand-web
, slurm controller
and cluster host
are different but the communication between them are okay.
/etc/ood/config/clusters.d/my_cluster.yml
is below.
---
v2:
metadata:
title: "My Cluster"
login:
host: "hpc-lg01.my_cluster.com"
job:
adapter: "slurm"
bin: "/usr/bin"
conf: "/etc/slurm/slurm.conf"
batch_connect:
basic:
script_wrapper: |
module purge
%s
set_host: "host=$(hostname -A | awk '{print $1}')"
vnc:
script_wrapper: |
module purge
export PATH="/opt/TurboVNC/bin:$PATH"
export WEBSOCKIFY_CMD="/usr/bin/websockify"
%s
set_host: "host=$(hostname -A | awk '{print $1}')"
and /etc/ood/config/apps/bc_desktop/my_cluster.yml
---
title: "Cluster Desktop"
cluster: "my_cluster"
attributes:
desktop: "xfce"
I might miss something but I couldn’t find it.
I already test interactive desktop with dev-cluster and it worked well. But, with my_cluster, it does not work.
Thank you in advance for any help.
PARK
I see this as 644 - but I wonder about the directory above it. Can you ls -l
as a regular unprivileged user? Unable to read/permission denied is pretty straight forward.
Remember to ls -l
on the OnDemand machine itself, this is where the command is being issued, so this is where you have to test to validate it.
s.park
(s.park)
July 28, 2025, 2:44pm
3
Thank you for your answer.
Here it is for the permission.
s.park@ondemand-web01 ~ $ ls -l /etc/slurm/
total 16
-rw-r--r--. 1 root root 7922 Jul 28 14:06 slurm.conf
PARK
Maybe it’s an SELinux thing?
Can you just issue sbatch
with and without CONF environment variable set to that /etc file and see if it works from a command line?
s.park
(s.park)
July 28, 2025, 3:13pm
5
With slurm.conf
,
This is script file for sbatch
#!/bin/bash
echo "Hello world!"
I submitted this job with sbatch.
s.park@ondemand-web01 ~ $ sbatch tx.sh
Submitted batch job 948700
and output is..
s.park@ondemand-web01 ~ $ cat slurm-948700.out
Hello world!
Without slurm.conf
I got failed.
s.park@ondemand-web01 ~ $ sbatch tx.sh
sbatch: error: resolve_ctls_from_dns_srv: res_nsearch error: Unknown host
sbatch: error: fetch_config: DNS SRV lookup failed
sbatch: error: _establish_config_source: failed to fetch config
sbatch: fatal: Could not establish a configuration source
Also, as expected, without slurm.conf
, tired to submit desktop on the web and failed with this error.
PARK
Not without the file itself, with and without the CONF=/etc/slurm/slurm.conf
environment variable set.
s.park
(s.park)
July 28, 2025, 4:34pm
7
I’ve comment the conf line in cluster .yml file as below.
job:
adapter: "slurm"
bin: "/usr/bin"
# conf: "/etc/slurm/slurm.conf"
However, with or without conf environment variable, same error is occured.
PARK
It’s likely an SELinux issue. Do you have SELinux enabled?
1 Like
And if so, check out /var/log/audit.log and see if there are errors there.
s.park
(s.park)
July 28, 2025, 4:50pm
10
Yes, SELinux was enforced.
I disabled it and then it works well.
Thank you for your guide.
PARK
Pull requests to this component are welcome! It’s always slightly different on different systems, so any patches we get on it from the community are greatly appreciated!
1 Like