PrivateGPT Reverse

smohr · June 14, 2024, 4:30pm

Hello, I am working on setting up PrivateGPT in OOD. It launches as a web app using uvicorn, with Gradio as the UI. I am able to launch the service: it comes up on a port and OOD connects to it. When I hit the button to connect I do see the UI, but parts of the page don’t load properly. There should be some JavaScript and CSS that is loaded. I see the following in the application’s log when I make the connection.

11:59:11.747 [INFO ] uvicorn.access - 172.23.100.12:38880 - "GET / HTTP/1.1" 200

If I then launch a remote desktop and open Firefox, I can reach the service and it loads just fine. I then see the following in the application log.

12:00:04.644 [INFO ] uvicorn.access - 172.23.7.1:49042 - "GET / HTTP/1.1" 200
12:00:04.804 [INFO ] uvicorn.access - 172.23.7.1:49042 - "GET /assets/Index-52a9d5ff.css HTTP/1.1" 200
12:00:04.849 [INFO ] uvicorn.access - 172.23.7.1:49056 - "GET /info HTTP/1.1" 200
12:00:04.879 [INFO ] uvicorn.access - 172.23.7.1:49056 - "GET /theme.css HTTP/1.1" 200
12:00:05.236 [INFO ] uvicorn.access - 172.23.7.1:49108 - "GET /assets/Index-e45a2b11.css HTTP/1.1" 200
12:00:05.239 [INFO ] uvicorn.access - 172.23.7.1:49108 - "GET /assets/Index-64f7cc27.js HTTP/1.1" 200
12:00:05.244 [INFO ] uvicorn.access - 172.23.7.1:49108 - "GET /assets/Index-a90cda25.css HTTP/1.1" 200
12:00:05.245 [INFO ] uvicorn.access - 172.23.7.1:49124 - "GET /assets/Index-b2efa79d.js HTTP/1.1" 200
12:00:05.466 [INFO ] uvicorn.access - 172.23.7.1:49150 - "POST /run/predict HTTP/1.1" 200
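
A quick way to compare the two captures is to group the requested paths by client address. The sketch below is stdlib-only Python; the regular expression is an assumption based on the log lines quoted in this post, not a documented uvicorn format, and the function name is made up for the example.

```python
import re
from collections import defaultdict

# Assumed shape of the access-log lines quoted in this post.
# Accepts both straight and curly double quotes around the request.
LINE_RE = re.compile(
    r'uvicorn\.access - (?P<client>[\d.]+):\d+ - '
    r'["“](?P<method>[A-Z]+) (?P<path>\S+) HTTP/[\d.]+["”] (?P<status>\d{3})'
)

def paths_by_client(log_text):
    """Return {client_ip: [request paths in the order they were logged]}."""
    seen = defaultdict(list)
    for line in log_text.splitlines():
        match = LINE_RE.search(line)
        if match:
            seen[match.group("client")].append(match.group("path"))
    return dict(seen)

SAMPLE = """\
11:59:11.747 [INFO ] uvicorn.access - 172.23.100.12:38880 - "GET / HTTP/1.1" 200
12:00:04.644 [INFO ] uvicorn.access - 172.23.7.1:49042 - "GET / HTTP/1.1" 200
12:00:04.804 [INFO ] uvicorn.access - 172.23.7.1:49042 - "GET /assets/Index-52a9d5ff.css HTTP/1.1" 200
12:00:04.849 [INFO ] uvicorn.access - 172.23.7.1:49056 - "GET /info HTTP/1.1" 200
"""

result = paths_by_client(SAMPLE)
for client, paths in result.items():
    print(client, paths)
```

If the client that came through Open OnDemand shows only `/`, the follow-up asset requests never reached the app, which suggests the browser asked for them at a URL that the proxy did not route to the job.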

I did find a GitHub issue with a similar-sounding problem behind an nginx proxy, which was resolved. I am unsure if it is applicable, but it may be a step in the right direction. The solution was to use the following settings.

        proxy_buffering off;
        proxy_redirect off;
        proxy_http_version 1.1;
        proxy_set_header Upgrade $http_upgrade;
        proxy_set_header Connection "upgrade";
        proxy_set_header Host $host;

github.com/zylon-ai/private-gpt

Issue running behind nginx

opened 08:49AM - 01 Mar 24 UTC

closed 04:19AM - 04 Mar 24 UTC

ramashishb-yubi

Hi, I am trying to run privateGPT behind nginx, with nginx acting as a reverse proxy. The privateGPT server starts fine and is accessible on localhost:8001. However, when running behind nginx, I see the following requests being made to localhost:8001:

        http://localhost:8001/info
        http://localhost:8001/theme.css

These fail because localhost:8001 is not accessible anymore, so the application fails to load correctly. My nginx config is bare minimum:

        location / { proxy_pass http://localhost:8001; }

Is there any change needed to make it work behind a reverse proxy? Thanks, Ramashish

Thank You,
Steven Mohr

jeff.ohrstrom · June 14, 2024, 4:48pm

Yeah, Gradio can be kinda wonky. When I deployed Stable Diffusion web UI (which is built on Gradio), we were able to supply a --subpath argument. Looking at the source code for this app, in the settings.yml you can only set the port, so that doesn’t seem like it’s an option for you.

github.com

OSC/bc_classroom_stable_diffusion/blob/9c5bfbe6cae3316923454f6f2d298e50a4d19d33/template/script.sh.erb#L66-L77


      
          python $OSC_STABLE_DIFFUSION_PATH/launch.py \
            --data-dir "$CLASS_DIR" \
            --ui-settings-file "$CONFIG" \
            --gradio-auth <%= CurrentUser.name %>:$password \
            --api-auth <%= CurrentUser.name %>:$password \
            --skip-install \
            --port $port \
            --listen \
            --skip-prepare-environment \
            --subpath "rnode/$HOST_CFG/$PORT_CFG" \
            --server-name $(hostname) \
            --no-half

It may just work for you if you use /rnode in your view.html.erb. That may proxy correctly (not sure if you started with /node or not).
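
For context on `/node` versus `/rnode`: Open OnDemand's `/node/<host>/<port>/...` proxy forwards the full request path to the backend, while `/rnode/<host>/<port>/...` strips that prefix first. The Python sketch below only models that path mapping; the function name and the example host are hypothetical, and it is not OnDemand's actual implementation.

```python
import re

# Matches /node/<host>/<port>/<rest> or /rnode/<host>/<port>/<rest>.
PROXY_RE = re.compile(
    r"^/(?P<kind>r?node)/(?P<host>[^/]+)/(?P<port>\d+)(?P<rest>/.*)?$"
)

def backend_path(request_path):
    """Model the path the backend app would see for a proxied request."""
    match = PROXY_RE.match(request_path)
    if match is None:
        raise ValueError(f"not a node/rnode URL: {request_path}")
    if match.group("kind") == "node":
        # /node: the backend sees the whole path, prefix included.
        return request_path
    # /rnode: the backend sees only what follows the host and port.
    return match.group("rest") or "/"

print(backend_path("/node/c001.example.edu/8001/theme.css"))
print(backend_path("/rnode/c001.example.edu/8001/theme.css"))
```

An app that only serves from `/` therefore needs `/rnode`, while an app that can be told its own URL prefix can work with `/node`.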

That said, you may have to use an nginx proxy within the job to set the URLs right. You shouldn’t need to often, but some apps just don’t work right or can’t recognize that they’re behind a proxy.

mcuma · June 14, 2024, 6:35pm

Hi Jeff,

is the nginx proxy documented somewhere? Or if not can you please document it? I think we’ll have to explore this option with quite a few of the LLM frontends that are out there (like the OpenWebUI we talked about on Tuesday).

Thanks.

jeff.ohrstrom · June 14, 2024, 6:48pm

This was before my time, so I’ve never done it personally, but I am aware of this repository (that at a glance uses an nginx container).

But again, I didn’t develop that personally and indeed have never even used the app. It was decommissioned at some point.

mjbludwig · June 14, 2024, 6:53pm

@jeff.ohrstrom I wonder if this would be a potential solution for things like CryoSPARC, which also can’t handle URIs being changed?

jeff.ohrstrom · June 14, 2024, 6:57pm

Looks like it could, but again I can’t offer a lot of guidance there. Looks like nginx has substitution directives.

http://nginx.org/en/docs/http/ngx_http_sub_module.html
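
The sub module's `sub_filter` directive replaces one string with another in response bodies. As a rough Python illustration of the kind of rewrite that could help here (this is not nginx code, and the prefix, host, and HTML snippet are made up for the example), root-relative asset URLs can be prefixed with the proxy path:

```python
import re

def prefix_root_relative(html, prefix):
    """Prepend `prefix` to href/src values that start with a single '/'."""
    prefix = prefix.rstrip("/")
    # (?!/) skips protocol-relative URLs such as //cdn.example.com/x.js
    pattern = re.compile(r'\b(href|src)="/(?!/)')
    return pattern.sub(lambda m: f'{m.group(1)}="{prefix}/', html)

PAGE = '<link href="/theme.css"><script src="/assets/Index-64f7cc27.js"></script>'
rewritten = prefix_root_relative(PAGE, "/rnode/c001.example.edu/8001/")
print(rewritten)
```

A body rewrite like this only touches URLs that appear literally in the response, so URLs that the front end builds at runtime in JavaScript may still escape it.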

smohr · June 14, 2024, 8:24pm

Thanks for the quick responses on this. I had no luck with node set in the view.html.erb. I can at least reach the UI in some capacity with rnode. In the PrivateGPT settings there is an option to set a path. So far, changes to this setting haven’t made much of a difference.

ui:
  enabled: true
  path: /

jeff.ohrstrom · June 14, 2024, 8:31pm

I don’t see path in any of the settings ymls, but I think this is what you’d set it to if it is indeed a configuration option.

path: '/rnode/$host/$port'

or this below - sometimes trailing /s matter.

path: '/rnode/$host/$port/'

jeff.ohrstrom · June 14, 2024, 8:33pm

Maybe with no quotes. IDK how the templating system interacts with environment variables, but quoting and not quoting are other variations you may need to try.
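
Whether `$host` and `$port` get expanded depends entirely on the settings loader, which this thread does not establish. As a generic illustration only (this uses Python's `os.path.expandvars`, not PrivateGPT's loader, and the values are made up), an unset variable is left in the string literally, which is worth checking for in the rendered path:

```python
import os

# Made-up values for the example; a real job would set these itself.
os.environ["host"] = "c001.example.edu"
os.environ["port"] = "8001"
os.environ.pop("missing", None)

expanded = os.path.expandvars("/rnode/$host/$port/")
unexpanded = os.path.expandvars("/rnode/$missing/$port/")

print(expanded)    # both variables are set, so both are substituted
print(unexpanded)  # the unset variable stays as the literal text "$missing"
```

If the path the app ends up with still contains a literal `$host`, no expansion happened, and the value would need to be written into the settings file by the job script instead.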

smohr · June 20, 2024, 2:38pm

Is it possible for the reverse proxy that Open OnDemand uses to connect to a Unix domain socket? This app uses uvicorn and can be started with any options it supports.

jeff.ohrstrom · June 20, 2024, 3:58pm

Yes and no.

Yes in the sense that all Passenger apps boot on Unix sockets. So when you access /pun/sys/dashboard, for example, Apache connects to nginx (which has booted the application) through a Unix socket.

No in the sense that this topic is about a batch connect application. Batch connect (or interactive) applications boot on a compute node (i.e., a different machine than the one that’s running OnDemand). Connecting to that machine through a Unix socket is impossible, because the connection has to travel over the network from the OnDemand machine to the compute node.

smohr · June 20, 2024, 4:00pm

Makes perfect sense.

smohr · June 26, 2024, 2:31pm

I am still stuck on this. I took the suggestion and built an nginx container, configured as a reverse proxy. For inspiration I used the GitHub issue about this specific app from my initial post and the nginx config from the Shiny app that was posted.

server {
    listen 8085;

    location / {
        proxy_pass http://unix:/tmp/gp.sock;
        proxy_buffering off;
        proxy_redirect off;
        proxy_http_version 1.1;
        proxy_set_header Upgrade $http_upgrade;
        proxy_set_header Connection "upgrade";
        proxy_set_header Host $host;
    }
}

I tried starting uvicorn on an IP and I also tried using a socket.

#uvicorn.run(app, host="0.0.0.0", port=settings().server.port)

uvicorn.run(app, uds="/tmp/gp.sock", port=settings().server.port)

If I start a remote desktop and start the app and proxy in that session I can then open a web browser and access the app on the port for the reverse proxy just fine. The page loads and everything looks fine.

When I go through Open OnDemand I can reach the app, and I can see it make the initial GET, but I still have the same issue: it doesn’t load the CSS and JS.

I figured using nginx might give a bit more flexibility and help, but there is still something in Open OnDemand that isn’t working right. Do you have any suggestions?
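
One plausible explanation for seeing only the initial GET (an assumption, not something confirmed in this thread) is that the page references its assets with root-relative or absolute URLs, which the browser resolves outside the `/rnode/<host>/<port>/` prefix. Python's `urllib.parse.urljoin` applies the standard URL resolution rules that browsers also follow; the OnDemand hostname and compute node below are hypothetical.

```python
from urllib.parse import urljoin

# Hypothetical URL of the app as seen through the OnDemand proxy.
PAGE_URL = "https://ood.example.edu/rnode/c001.example.edu/8085/"

relative = urljoin(PAGE_URL, "theme.css")                    # stays under the prefix
root_relative = urljoin(PAGE_URL, "/theme.css")              # drops the prefix
absolute = urljoin(PAGE_URL, "http://localhost:8001/info")   # ignores the page URL

for label, url in [("relative", relative),
                   ("root-relative", root_relative),
                   ("absolute", absolute)]:
    print(f"{label:14} {url}")
```

Only the first form reaches the job through the proxy, which would also be consistent with the app working when opened directly from a remote desktop. The browser's network tab shows which form is actually being requested.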

smohr · July 8, 2024, 7:39pm

Feel free to close this. I got pretty close but could not get the page to render properly with proxy or reverse proxy. Ultimately I had it launch in a VNC session and start Firefox in full-screen kiosk mode. It’s not ideal, but it is getting the job done.

system · January 4, 2025, 7:40pm

This topic was automatically closed 180 days after the last reply. New replies are no longer allowed.
