Feasibility of Open OnDemand

Hey everyone! :slight_smile:
I am new here and just have one quick question to see if Open OnDemand might be a good fit for something I am working on:

I would like to host a large language model (LLM) on our HPC infrastructure and make it accessible to multiple users through a front-end web app. The idea is to have “one” shared model running (ideally on a GPU), instead of each user spinning up their own instance and burning extra compute.

Is it possible to build something like this using Open OnDemand where users interact with the same backend model via a shared app?

Thanks in advance!

Conceptually, yes, it’s possible, but it really depends more on your underlying system and configuration than on anything Open OnDemand specific.

Whenever somebody asks whether Open OnDemand can do X, my default answer is: can a knowledgeable user on your system do X with some combination of existing software, workflows, and configuration? If so, then yes, Open OnDemand can do X.

In this particular situation, the solution boils down to: how would you naturally allow multiple users to access a single running job / process? There are a variety of ways I could see this happening, depending on which resource manager you use and how the system is configured. For example, maybe the job runs under a community account that everyone has sudo access to. Or maybe you primarily interact with the process via files, in which case group permissions could be set appropriately on those files (see the sketch below). And so on.
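To make the file-based option concrete, here's a minimal sketch in Python, assuming a hypothetical setup where the single shared model job watches a group-writable spool directory for prompt files. The directory path, file suffix, and group setup are all placeholders for whatever your site would actually use, not anything Open OnDemand provides:

```python
# Minimal sketch: a per-user front end drops prompt files into a spool
# directory that the single shared model job polls. Everything below
# (paths, suffixes, the group arrangement) is hypothetical.
import os
import stat
import tempfile

SPOOL_DIR = "/scratch/llm-shared/inbox"  # hypothetical group-writable dir

def submit_prompt(prompt: str) -> str:
    """Write a prompt file where the shared model job can pick it up."""
    # mkstemp gives each request a unique filename, so concurrent
    # submissions from different users don't clobber each other.
    fd, path = tempfile.mkstemp(suffix=".prompt", dir=SPOOL_DIR)
    with os.fdopen(fd, "w") as handle:
        handle.write(prompt)
    # Make the file group-readable so the community account running the
    # model job can read it; setting the setgid bit on SPOOL_DIR itself
    # would make new files inherit the shared group automatically.
    os.chmod(path, stat.S_IRUSR | stat.S_IWUSR | stat.S_IRGRP)
    return path
```

The shared job would then read each prompt, run inference once on the one GPU-resident model, and write the response somewhere the submitting user can read it back; the same group-permission logic applies in the other direction.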


Thank you, Alan, for the detailed reply. :slight_smile: