Introduce batching support for embedding documents #1214

sobychacko · 2024-08-13T15:59:47Z

Introduce a way for EmbeddingModel implementations to embed Document objects using a batching strategy.

sobychacko · 2024-08-13T16:00:54Z

Using #1140 as a starting point, introduce a new batching API that all vector stores and embedding models can use.

- When embedding documents, allow batching the documents using some criteria.
- `BatchingStrategy` interface with a `TokenCountBatchingStrategy` implementation that uses the OpenAI max input token size of 8191 as the default.
- Add a default method in `EmbeddingModel` to embed documents using this new batching strategy.
- Change `MilvusVectorStore` to make use of this new batching API.
- Add unit tests for `TokenCountBatchingStrategy`.
- Add an OpenAI integration test that calls the embed API using batching.

Resolves spring-projects#1214
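The token-count batching idea described above can be illustrated with a minimal, self-contained sketch. This is not the Spring AI implementation: the interface shape, the class names `TokenBatchingSketch` and `TokenCountBatching`, the use of plain strings instead of `Document` objects, and the whitespace-based token estimate are all assumptions made for illustration. Only the default limit of 8191 tokens comes from the description above.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of token-count batching; names and signatures are illustrative only.
class TokenBatchingSketch {

    // Default taken from the issue text: the OpenAI max input token size.
    static final int DEFAULT_MAX_TOKENS = 8191;

    /** Stand-in for a batching strategy: splits inputs into ordered batches. */
    interface BatchingStrategy {
        List<List<String>> batch(List<String> documents);
    }

    /** Greedily fills each batch until adding the next document would exceed the limit. */
    static class TokenCountBatching implements BatchingStrategy {
        private final int maxTokensPerBatch;

        TokenCountBatching() {
            this(DEFAULT_MAX_TOKENS);
        }

        TokenCountBatching(int maxTokensPerBatch) {
            this.maxTokensPerBatch = maxTokensPerBatch;
        }

        // Assumption: a crude whitespace word count stands in for a real tokenizer.
        static int estimateTokens(String text) {
            String trimmed = text.trim();
            return trimmed.isEmpty() ? 0 : trimmed.split("\\s+").length;
        }

        @Override
        public List<List<String>> batch(List<String> documents) {
            List<List<String>> batches = new ArrayList<>();
            List<String> current = new ArrayList<>();
            int currentTokens = 0;
            for (String doc : documents) {
                int tokens = estimateTokens(doc);
                // A single document larger than the limit cannot fit in any batch.
                if (tokens > maxTokensPerBatch) {
                    throw new IllegalArgumentException(
                            "Document exceeds the per-batch token limit: " + tokens);
                }
                // Close the current batch when the next document would overflow it.
                if (currentTokens + tokens > maxTokensPerBatch) {
                    batches.add(current);
                    current = new ArrayList<>();
                    currentTokens = 0;
                }
                current.add(doc);
                currentTokens += tokens;
            }
            if (!current.isEmpty()) {
                batches.add(current);
            }
            return batches;
        }
    }

    public static void main(String[] args) {
        // A tiny limit of 5 tokens makes the batch boundaries easy to see.
        BatchingStrategy strategy = new TokenCountBatching(5);
        List<List<String>> batches = strategy.batch(List.of("a b c", "d e", "f g h", "i"));
        System.out.println(batches);
    }
}
```

An embedding call would then run once per batch rather than once for the whole input, so that no single request exceeds the model's input limit. Rejecting an oversized document outright is one possible design choice; a real implementation might instead truncate or split it.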

impactCn · 2024-08-15T01:42:11Z

@sobychacko Hi, I also encountered the same problem with PG. The segmentation process I use is described in #1200.
Can you guide me on how to use this batch API?

- When embedding documents, allow batching the documents using some criteria.
- `BatchingStrategy` interface with a `TokenCountBatchingStrategy` implementation that uses the OpenAI max input token size of 8191 as the default.
- Add a default method in `EmbeddingModel` to embed documents using this new batching strategy.
- Change `MilvusVectorStore` to make use of this new batching API.
- Add unit tests for `TokenCountBatchingStrategy`.
- Add an OpenAI integration test that calls the embed API using batching.

Resolves spring-projects#1214

Other vector stores will be updated separately.

sobychacko · 2024-09-04T23:39:33Z

@impactCn Sorry for the delay in responding. We just merged similar batching changes for the PG vector store. Can you take a look and see if that satisfies your use case? If there is a gap, we can improve on that. Thanks!

sobychacko self-assigned this Aug 13, 2024

sobychacko mentioned this issue Aug 13, 2024

GH-1214: Batching strategy for embedding documents #1215

Closed

markpollack closed this as completed in 949f1ed Aug 20, 2024

sobychacko mentioned this issue Sep 3, 2024

Add batching strategy for all vector store implementations #1261

Open

15 tasks

sobychacko mentioned this issue Sep 11, 2024

When too much data is imported, timeouts may easily occur when executing the embedding model. #1199

Closed
