Draft: Ollama API with Podman Compose #93

Open
jroddev wants to merge 3 commits into main

Conversation


@jroddev commented Jan 29, 2025

WIP for using podman-compose instead of docker compose in the Ollama API docs.

Needs someone else to verify before merging these updates. I have only tested on my Aurora install with an Nvidia GPU.

Nvidia GPU passthrough seems to need sudo until containers/podman#19338 is fixed.
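Until that lands, a rough sketch of the workaround (assuming the compose file described below sits in the current directory):

    # run the compose stack as root so the Nvidia CDI devices are usable
    sudo podman-compose up -d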

Make some changes required to use podman instead of docker
docs/ai.md Outdated
ports:
- 11434:11434
volumes:
- ./ollama_v:/root/.ollama:z
Author

Permission denied in podman without the :z SELinux relabel on the volume.
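For context, the full podman-compose service might look roughly like this, combining this hunk with the GPU device from the hunk below (the image name is an assumption; substitute whatever the docs currently use):

    services:
      ollama:
        image: docker.io/ollama/ollama:latest
        ports:
          - 11434:11434
        volumes:
          - ./ollama_v:/root/.ollama:z
        devices:
          - nvidia.com/gpu=all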

docs/ai.md Outdated
volumes:
- ./ollama_v:/root/.ollama:z
devices:
- nvidia.com/gpu=all
Author

This might work with docker-compose as well. If so, we can probably unify the two blocks (:z worked with docker-compose).

Author

Does not work with docker:

Error response from daemon: could not select device driver "cdi" with capabilities: []
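So the CDI device syntax appears to be podman-specific. If the two blocks can't be unified, the docker side would presumably stay on Compose's GPU device reservation syntax; a rough sketch:

    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: all
              capabilities: [gpu]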

@castrojo
Member

We could also just put a quadlet in there and tell people to paste it into the right file? I'm thinking, if we're going podman, we should go full podman/quadlet which is what they prefer.

We should also leave the docker example there too: if someone needs to get something done and they need ollama, they shouldn't have to learn podman at the same time, so offering both feels great. What do you think?


@jroddev commented Jan 30, 2025

@castrojo I think that makes sense. I suspect that it will still need to run as root, but I'll give it a try and report back.
Also, the LM Studio AppImage worked out of the box.

split podman from the docker section, also add a quadlet version
ContainerName=ollama
AutoUpdate=yes
PublishPort=11434:11434
Volume=./ollama_v:/root/.ollama:z
Author

Not sure what the volume path should be in the quadlet.
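A relative path like ./ollama_v has no obvious anchor in a quadlet, so one option might be a named podman volume. A minimal sketch of the full unit, with the image name and volume name as assumptions:

    [Unit]
    Description=Ollama API

    [Container]
    ContainerName=ollama
    Image=docker.io/ollama/ollama:latest
    # registry-based auto-updates
    AutoUpdate=registry
    PublishPort=11434:11434
    # "ollama-data" is a named podman volume, created on first start
    Volume=ollama-data:/root/.ollama

    [Install]
    WantedBy=default.target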


@jroddev commented Jan 30, 2025

OK so the quadlet kind of works.

  • Starts fine under --user with ~/.local/share/systemd/ollama.container (see the command sketch after this list)
  • nvidia gpu is working (without sudo or running as a --system unit)
  • I still needed to brew install ollama on the host to get the CLI frontend (or you could exec into the container)
  • ollama is picking up existing models from my system (I think from ~/.ollama). I'm not sure how since it's not mounted into the container. Maybe the frontend is doing it?
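For anyone else verifying, a quick sketch of the commands involved, assuming the file is named ollama.container (so the generated unit is ollama.service); the model name is just a placeholder:

    # pick up the new/edited quadlet and start the container
    systemctl --user daemon-reload
    systemctl --user start ollama

    # either talk to the API on port 11434, or exec the CLI inside the container
    podman exec -it ollama ollama run llama3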

Successfully merging this pull request may close these issues.

docker-compose: Passing gpu with driver: cdi is not supported