Optimize documents embedding in the add method from the MilvusVectortore #1140

solenyk · 2024-08-01T08:45:30Z

Today I was using Spring AI and ran into a problem with org.springframework.ai.vectorstore.MilvusVectorStore. In method add(List documents), when going through each document separately, a request embeddingModel.embed(document); is sent for each of them. I have 4,000 documents in my case, so the method takes a very long time (50 minutes).

To speed up the process, I extended from MilvusVectorStore and already after executing the loop over the documents I used the List<List> embed(List texts) method from org.springframework.ai.embedding.EmbeddingModel with contentArray, which significantly accelerated execution (now performance lasts 1 minute).

sobychacko · 2024-08-02T21:53:19Z

...ring-ai-milvus-store/src/main/java/org/springframework/ai/vectorstore/MilvusVectorStore.java

@@ -69,6 +69,8 @@ public class MilvusVectorStore implements VectorStore, InitializingBean {

 	public static final int INVALID_EMBEDDING_DIMENSION = -1;

+	private static final int MAX_EMBEDDING_ARRAY_DIMENSIONS = 2048;


Is there a reason for this limit? Milvus documentation suggests much higher limits for dimensionality - https://milvus.io/docs/limitations.md#Dimensions-of-a-vector.

The reason I added this dimensions limit is that when using OpenAiEmbeddingModel implementation of the EmbeddingModel is an OpenAi API limit:
https://platform.openai.com/docs/api-reference/embeddings/create

I attached the link for the first version of the API, but there is the same situation for the second one!

I renamed MAX_EMBEDDING_ARRAY_DIMENSIONS to MAX_OPENAI_EMBEDDING_ARRAY_DIMENSIONS for greater clarity. Also, since in this class there is an OPENAI_EMBEDDING_DIMENSION_SIZE variable, this change helps maintain consistency in variable naming style.

sobychacko · 2024-08-02T21:53:59Z

...ring-ai-milvus-store/src/main/java/org/springframework/ai/vectorstore/MilvusVectorStore.java

@@ -69,6 +69,8 @@ public class MilvusVectorStore implements VectorStore, InitializingBean {



Please add your name as an author to the class.

…tore

sobychacko · 2024-08-27T21:33:55Z

@solenyk Based on the ideas in this PR, we decided to make some API changes that are overarching to how we embed documents and then use those embeddings in other vector stores. At the core of those changes is a new BatchingStrategy API. Take a look at this PR: #1215

These changes were included in the recently released M2. Currently, the batching call is only applied to the Milvus vector store. We are going to evaluate other vector stores for similar usage.

Thank you for the PR; it certainly geared us toward providing it as a general pattern that other vector stores can use. I am closing the PR now. More PR contributions are welcomed!

sobychacko requested changes Aug 2, 2024

View reviewed changes

solenyk force-pushed the optimize-milvus-vector-store-documents-embedding branch from a5c9f07 to 9afd59f Compare August 5, 2024 07:28

solenyk requested a review from sobychacko August 5, 2024 07:28

solenyk force-pushed the optimize-milvus-vector-store-documents-embedding branch 3 times, most recently from f49e257 to 4a9be1f Compare August 5, 2024 08:32

Optimize documents embedding in the add method from the MilvusVectorS…

15359aa

…tore

solenyk force-pushed the optimize-milvus-vector-store-documents-embedding branch from 4a9be1f to 15359aa Compare August 5, 2024 08:57

sobychacko mentioned this pull request Aug 13, 2024

Introduce batching support for embedding documents #1214

Closed

sobychacko closed this Aug 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize documents embedding in the add method from the MilvusVectortore #1140

Optimize documents embedding in the add method from the MilvusVectortore #1140

solenyk commented Aug 1, 2024 •

edited

Loading

sobychacko Aug 2, 2024

solenyk Aug 5, 2024

solenyk Aug 5, 2024 •

edited

Loading

sobychacko Aug 2, 2024

solenyk Aug 5, 2024

sobychacko commented Aug 27, 2024 •

edited

Loading

		@@ -69,6 +69,8 @@ public class MilvusVectorStore implements VectorStore, InitializingBean {

		public static final int INVALID_EMBEDDING_DIMENSION = -1;

		private static final int MAX_EMBEDDING_ARRAY_DIMENSIONS = 2048;

Optimize documents embedding in the add method from the MilvusVectortore #1140

Optimize documents embedding in the add method from the MilvusVectortore #1140

Conversation

solenyk commented Aug 1, 2024 • edited Loading

sobychacko Aug 2, 2024

Choose a reason for hiding this comment

solenyk Aug 5, 2024

Choose a reason for hiding this comment

solenyk Aug 5, 2024 • edited Loading

Choose a reason for hiding this comment

sobychacko Aug 2, 2024

Choose a reason for hiding this comment

solenyk Aug 5, 2024

Choose a reason for hiding this comment

sobychacko commented Aug 27, 2024 • edited Loading

solenyk commented Aug 1, 2024 •

edited

Loading

solenyk Aug 5, 2024 •

edited

Loading

sobychacko commented Aug 27, 2024 •

edited

Loading