VidMentor🦙: Mentor for Online Learning Based on Large Language Model

Powered by llama3, Whisper, Paddleocr, bge-base-en-v1.5, KeyBert, xlm-roberta_punctuation_fullstop_truecase and paraphrase-multilingual-MiniLM-L12-v2, we construct an agent to implement online Q&A, video segmentation, Inter-class quizzes for multi educational videos understanding. We hope to expand the functionality and effectiveness of online education.

Pipeline

Demo

We use the videos from link as exmaple (you can download from link) and you can find demo of VidMentor here.

Project Structure

├── 📂 checkpoints                    #save model checkpoints
├── 📂 videos                         #save all origin videos 
├── 📂 asset                          #save necessary files 
├── 📂 backend                        
│   ├── 📄 backend_audio.py           #extract audio info into database
│   ├── 📄 backend_search.py          #support search and answer in website demo
│   ├── 📄 backend_visual.py          #extract visual info into database  
│   ├── 📄 backend_llm.py             #support building llm agents 
├── 📂 database                       #save all video's data
├── 📂 utils           
│   ├── 📄 tamplate.py                #provide different tamplates for different llm agents
│   ├── 📄 trees.py                   #provide tools to generate mind map
│   ├── 📄 utils.py                   #provide some useful common tools            
├── 📂 models                                
│   ├── 📄 bgemodel.py                #bgemodel method         
│   ├── 📄 llm_model.py               #llm model method
│   ├── 📄 whisper_model.py           #whisper model method
│   ├── 📄 keybert_model.py           #keybert method         
│   ├── 📄 punctuator_model.py        #punctuator model method
├── 📄 README.md                      #readme file
├── 📄 TUTORIAL.md                    #tutorial for vidmentor
├── 📄 requirements.txt               #packages requirement
├── 📄 st_demo.py                     #run streamlit website demo
├── 📄 download_ckpt.py               #download all model into local
├── 📄 build_database.py              #build database

Environment Preparing

1. Create Conda Environment

# Make sure you have git-lfs installed (https://git-lfs.com)
git lfs install
git clone https://github.com/Kailuo-Lai/VidMentor.git
conda create -n vidmentor python=3.9
conda activate vidmentor
cd VidMentor
pip install -r requirements.txt

2. Install Graphviz

Downlowd Graphviz from link.
Add Graphviz to your system path.

3. Download Model Weight

python download_ckpt.py

4. LLM Quantization

Build llama.cpp from link.
Quantize the llama3 weight in the checkpoints folder following the instructions from link
Change the argument --llm_version in st_demo.py and build_database.py to the output file name of the quantized llama3 weight.

Tutorial

You can find the tutorial of VidMentor🦙 here.

Acknowledge

We are grateful for the following awesome projects

llama3: An open-source large language model created by Meta
Whisper: Robust Speech Recognition via Large-Scale Weak Supervision
PaddleOCR: Awesome multilingual OCR toolkits based on PaddlePaddle
KeyBert: A minimal method for keyword extraction with BERT
bge-base-en-v1.5: A general embedding model created by BAAI
paraphrase-multilingual-MiniLM-L12-v2: A multilingual text embedding
xlm-roberta_punctuation_fullstop_truecase: An xlm-roberta model fine-tuned to restore punctuation

Contributors

Thanks to all the contributors who have helped to make this project better!

_{Yifan Wu} 💻	_Kailuo 💻	_chenminghao 💻
Add your contributions

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VidMentor🦙: Mentor for Online Learning Based on Large Language Model

Pipeline

Demo

Project Structure

Environment Preparing

1. Create Conda Environment

2. Install Graphviz

3. Download Model Weight

4. LLM Quantization

Tutorial

Acknowledge

Contributors

About

Releases

Packages

Contributors 3

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.github/workflows		.github/workflows
__pycache__		__pycache__
asset		asset
backend		backend
checkpoints		checkpoints
database		database
llama.cpp @ ed67bcb		llama.cpp @ ed67bcb
models		models
utils		utils
videos		videos
.all-contributorsrc		.all-contributorsrc
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md
TUTORIAL.md		TUTORIAL.md
build_database.py		build_database.py
download_ckpt.py		download_ckpt.py
requirements.txt		requirements.txt
st_demo.py		st_demo.py

Kailuo-Lai/VidMentor

Folders and files

Latest commit

History

Repository files navigation

VidMentor🦙: Mentor for Online Learning Based on Large Language Model

Pipeline

Demo

Project Structure

Environment Preparing

1. Create Conda Environment

2. Install Graphviz

3. Download Model Weight

4. LLM Quantization

Tutorial

Acknowledge

Contributors

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages