Replies: 2 comments
-
@minixxie you found any solution for this ? |
Beta Was this translation helpful? Give feedback.
-
In default config Qdrant is setup to run in local mode using local_data/private_gpt/qdrant which is ephemeral storage not shared across pods. What is worse, this is temporary storage and it would be lost if Kubernetes restarts the pod. This the limitation of running in local ephemeral storage. In order to share data across pods, you can use Postgres with pgvector extension as vector store and nodestore. You also need to deploy Postgres - in another container or on AWS RDS. ==settings.yaml== |
Beta Was this translation helpful? Give feedback.
-
Hello,
First thank you so much for providing this awesome project!
I'm able to run this in kubernetes, but when I try to scale out to 2 replicas (2 pods), I found that the documents ingested are not shared among 2 pods.
First, I found the data being persisted in "local_data/" folder, so I found the doc and spin up qdrant, and change the settings.yaml as follow:
I saw the log of the pod showing the check on qdrant was successful:
After I ingested the doc inside the 1st pod:
I'm able to get the doc from the list endpoint:
However, if I check the list endpoint in the 2nd pod, it's empty:
This means they are not sharing the data from the vector database? Is there any way to run it in HA mode, so all replicas share the same set of documents ingested?
docker image I'm using: 3x3cut0r/privategpt:0.2.0
OS: mac OS mac book pro (Apple M2)
runtime: colima:
Beta Was this translation helpful? Give feedback.
All reactions