Hello!
I’m using OpenOnDemand 4.0.8 and Slurm 25.11.3
When running jobs via job composer, the job is queued and assigned a pending status with the reason “user env retrieval failed required held.”
This is because the sssd service configuration file on the compute node contains the following setting:
access_provider = ldap
ldap_access_filter =(memberOf=cn=admins,ou=groups,dc=hpc,dc=frccsc,dc=ru)
If I comment out these lines, the job starts and runs.
However, there’s something confusing. When running a job from the Slurm Login host, it always runs, and there’s no need to comment out these lines.
How should I configure OpenOnDemand to allow the job to run with the specified sssd service settings?
I’m attaching the OpenOnDemand settings:
cluster.yml
---
v2:
metadata:
title: "FRCCSC HPC"
login:
host: "login.hpc.frccsc.ru"
job:
adapter: "slurm"
bin: "/usr/bin/"
conf: "/etc/slurm/slurm.conf"
copy_environment: true
defaul.sh (job file created via job composer)
#!/bin/bash
#SBATCH --partition=main
#SBATCH -w g5500-1
#SBATCH --ntasks=1
#SBATCH --nodes=1
#SBATCH --ntasks-per-node=1
#SBATCH --cpus-per-task=2
#SBATCH -o slurm.out
######## It is NOT recommended to change settings in this section ########
export SLURM_EXPORT_ENV=ALL
##########################################################################
## Run the program
date && hostname
I am ready to provide additional information if necessary.
Thank you!
Best regards!