We have seen sporadic OOD login timeouts right after a user authenticates. They correspond to log messages like:
[Tue Mar 12 12:41:43.274829 2024] [proxy_http:error] [pid 185398:tid 139664266614528] (70007)The timeout specified has expired: [client 155.101.16.32:45342] AH01102: error reading status line from remote server httpd-UDS:0, referer: https://ondemand-class.chpc.utah.edu/pun/sys/dashboard/batch_connect/sessions
These messages appear much more often than the login timeouts themselves, which are quite rare. We have traced them to a timed-out squeue call, which runs roughly every 5-10 seconds for each logged-in user. I assume the PUN runs squeue periodically to query the state of the user’s jobs and update their status in OOD.
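In case it helps anyone reproduce the comparison, here is a minimal sketch of how the AH01102 messages could be tallied per minute from the Apache error log. It assumes the error log format shown in the message above; the function name `count_timeouts_per_minute` is hypothetical, and the only sample data is the log line quoted earlier.

```python
import re
from collections import Counter

# Captures the date/minute and the year from an Apache error log timestamp,
# and only matches lines carrying the AH01102 code.
# Assumption: lines follow the same format as the message quoted above.
LINE_RE = re.compile(
    r"^\[\w{3} (\w{3}\s+\d{1,2} \d{2}:\d{2}):\d{2}\.\d+ (\d{4})\].*AH01102"
)

def count_timeouts_per_minute(lines):
    """Return a Counter keyed by 'Mon DD HH:MM YYYY' for AH01102 lines."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.match(line)
        if match:
            counts[f"{match.group(1)} {match.group(2)}"] += 1
    return counts

# The single log line from this post, used as sample input.
sample = [
    "[Tue Mar 12 12:41:43.274829 2024] [proxy_http:error] "
    "[pid 185398:tid 139664266614528] (70007)The timeout specified has expired: "
    "[client 155.101.16.32:45342] AH01102: error reading status line from remote "
    "server httpd-UDS:0, referer: https://ondemand-class.chpc.utah.edu/pun/sys/"
    "dashboard/batch_connect/sessions",
]

for minute, n in sorted(count_timeouts_per_minute(sample).items()):
    print(minute, n)
```

Feeding it the lines of the real error log instead of `sample` would show whether the message rate changes once the query interval is changed.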
What is the default interval of these queries, and can it be changed? I would like to experiment with making it longer to see whether the timeout log messages persist.
Thanks,
Martin