This Project is an implementation of a RAG Chatbot that is using a Database of Research Papers to answer questions about Carbon Capture. This is a Team Project made possible by our Professor, Dr.Kritesh Kumar Gupta. It was an awesome experience working on this project, and me and my team look forward to do more fun projects in the future!
-
Install the required packages:
pip install -r requirements.txt -
Create a .env file in the root directory and add your API Keys:
HUGGINGFACEHUB_API_TOKEN="YOUR_HUGGING_FACE_API_TOKEN"If you still face issues, tryhuggingface-cli loginin the terminal
GOOGLE_API_KEY = "YOUR_GOOGLE_API_KEY"
LLAMA_CLOUD_API_KEY = "YOUR_LLAMA_CLOUD_API_KEY"
GROQ_API_KEY = "YOUR_GROQ_API" -
For the first run, to create a Vector Database, run
py create_database.py
Add your files in "data\pdfs" Add texts in "data\texts" (Optional)(Text files are not preprocessed) -
Run the GUI using the following command
streamlit run GUI.py
Gemini QA-Pair generation is not a part of this code, please visit Google AI Studio and generate QP pairs with a PDF and add response to the text folder!
You may face dependency issues with multiple modules used in this program So, it is recommended to use Python 3.10.6 for this project

