Python ORAGle: Using documentation as a knowledge base for programming questions

In this project, we combine retrieval-augmented generation (RAG) with the Gemma-7b-it model to build a question-answering LLM for Python-related problems. The system relies on three core components:

Online documentation and tutorials in PDF format as a knowledge base
ChromaDB as a vector database, with gte-large (based on Google's BERT framework) as its embedding model
Gemma-7b-it as the LLM
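
The way these three components fit together can be illustrated with a minimal, dependency-free sketch. It is not the code from this repository: the bag-of-words `embed` function is a toy stand-in for gte-large, the in-memory list stands in for ChromaDB, and the prompt wording, function names, and sample chunks are all assumptions made for illustration. The resulting prompt is what would be passed to Gemma-7b-it.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy bag-of-words embedding (assumed stand-in for gte-large)."""
    return Counter(re.findall(r"[a-z0-9_]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question, chunks, k=2):
    """Return the k chunks most similar to the question.

    A vector database would do this lookup over precomputed embeddings;
    here the chunks are simply re-embedded and sorted in memory.
    """
    q = embed(question)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]


def build_prompt(question, context_chunks):
    """Assemble a prompt for the LLM (the wording is hypothetical)."""
    context = "\n\n".join(context_chunks)
    return (
        "Answer the question using only the documentation excerpts below.\n\n"
        f"Documentation:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )


# Hypothetical documentation chunks, as might be extracted from a PDF.
chunks = [
    "list.append(x) adds an item to the end of the list.",
    "dict.get(key, default) returns the value for key, or default if the key is missing.",
    "str.split(sep) returns a list of the words in the string, using sep as the delimiter.",
]

question = "How do I add an item to the end of a list?"
top = retrieve(question, chunks, k=1)
prompt = build_prompt(question, top)
print(prompt)
```

In a real setup, the embedding and nearest-neighbour search would be handled by the vector database, and the prompt would go to the model rather than being printed.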

We also provide an automated benchmark that evaluates retrieval quality for a given database; it likewise relies on Gemma-7b-it as a core component.
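
One common way to score retrieval quality is a hit rate at k: for each test question with a known source chunk, check whether that chunk appears among the top k retrieved results. The sketch below assumes this metric and assumes that the question and chunk pairs already exist (for example, generated by an LLM from the chunks); the README does not state how the benchmark actually works, so the metric, the function names, and the toy keyword retriever are all hypothetical.

```python
def hit_rate_at_k(eval_pairs, retrieve_ids, k=3):
    """Fraction of questions whose source chunk is among the top-k results.

    eval_pairs:   list of (question, source_chunk_id) tuples
    retrieve_ids: callable (question, k) -> list of chunk ids, best first
    """
    if not eval_pairs:
        return 0.0
    hits = sum(
        1 for question, source_id in eval_pairs
        if source_id in retrieve_ids(question, k)
    )
    return hits / len(eval_pairs)


# Hypothetical keyword index standing in for the real vector database.
INDEX = {
    "chunk-append": "append list end",
    "chunk-split": "split string separator",
    "chunk-get": "get dict default",
}


def toy_retrieve_ids(question, k):
    """Rank chunk ids by keyword overlap with the question (toy retriever)."""
    words = set(question.lower().split())
    ranked = sorted(
        INDEX,
        key=lambda cid: len(words & set(INDEX[cid].split())),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical evaluation set; the last question deliberately shares no
# keywords with its source chunk.
eval_pairs = [
    ("how to append to a list", "chunk-append"),
    ("how to split a string", "chunk-split"),
    ("how to read a missing key", "chunk-get"),
]

rate_at_1 = hit_rate_at_k(eval_pairs, toy_retrieve_ids, k=1)
rate_at_3 = hit_rate_at_k(eval_pairs, toy_retrieve_ids, k=3)
print(rate_at_1, rate_at_3)
```

Comparing the score at several values of k shows how often the relevant chunk is found at all versus how often it is ranked first.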

Repository files: chromadb, gemma, plots, .gitignore, LICENSE, README.md, embedding_finetune.ipynb, python_rag.ipynb

cowolff/Python-ORAGle
