performance: Optimize ColBERT index free search with torch.topk #219

Merged
merged 2 commits into AnswerDotAI:main on Aug 7, 2024

Conversation

@Diegi97 (Contributor) commented Jun 4, 2024

The following line is a bottleneck when using the index-free search:

https://github.com/bclavie/RAGatouille/blob/796b49388cad0822564f30ef0b1464d021186637/ragatouille/models/colbert.py#L480

The change I introduce reduces the search time over 25k documents from 5.571 s to 0.023 s on my local setup (Intel i7 CPU, RTX 4090 GPU).
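
The idea, sketched below with illustrative variable names (not the actual ones in colbert.py): instead of sorting every document score just to keep the best k, torch.topk extracts only the k largest scores and their indices directly on the tensor, staying on whatever device the scores live on.

import torch

# Illustrative only: `scores` stands in for the per-document relevance
# scores computed during index-free search (one score per document).
scores = torch.randn(25_000, device="cuda" if torch.cuda.is_available() else "cpu")
k = 5

# Full sort just to keep k items: O(n log n) over all scores.
full_sort_indices = torch.argsort(scores, descending=True)[:k]

# torch.topk returns the k largest values and their indices without
# sorting the whole tensor.
top_scores, top_indices = torch.topk(scores, k=k)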

Script to reproduce:

from ragatouille import RAGPretrainedModel
from datasets import load_dataset

# Load the pretrained model
r = RAGPretrainedModel.from_pretrained('colbert-ir/colbertv2.0')

# Load the dataset
dataset = load_dataset('mteb/scidocs', 'corpus')
docs = dataset['corpus']['text']
print(f"Number of documents: {len(docs)}")

# Encode the documents
encodings = r.encode(docs, bsize=256)

# Perform search on encoded documents
import timeit

def search():
    return r.search_encoded_docs('Recurrent Neural Networks', k=5)

# Timing the searches
rnn_time = timeit.timeit(search, number=7)

print(f"Search 'Recurrent Neural Networks': {rnn_time / 7:.3f} s per loop")

@bclavie (Collaborator) commented Aug 7, 2024

Thanks a lot! Let this be the first merge of the long-overdue RAGatouille overhaul 😄

@bclavie merged commit de3c620 into AnswerDotAI:main on Aug 7, 2024
2 checks passed