- GPU: RTX 3090 x 2
- Platform: AutoDL
- NAME="Ubuntu"
- VERSION="20.04.4 LTS (Focal Fossa)"
- CUDA=12.4
- PyTorch=2.5.0
```shell
pip install -r requirements.txt
python main.py
python rag_naive.py
```
- We use Qwen2.5 as the LLM.
- So far, only the 1.5B-parameter version of Qwen2.5 has been tested.
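For reference, the Qwen2.5 chat models expect ChatML-style prompts. The helper below is only an illustrative sketch of that format (in practice `tokenizer.apply_chat_template` from the `transformers` library builds it for you; the default system message here is a placeholder, not the one this project uses):

```python
def qwen_chat_prompt(user_msg: str,
                     system_msg: str = "You are a helpful assistant.") -> str:
    """Build a ChatML-style prompt as used by Qwen2.5 chat models.

    Illustrative only: transformers' tokenizer.apply_chat_template
    produces this automatically from a list of message dicts.
    """
    return (
        f"<|im_start|>system\n{system_msg}<|im_end|>\n"
        f"<|im_start|>user\n{user_msg}<|im_end|>\n"
        "<|im_start|>assistant\n"  # generation continues from here
    )
```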
- The core code lives under the `src/` directory. The structure of `src/`:
```
src/
├── data/
│   ├── processed_data/
│   ├── data_augmentation.py
│   ├── data_preprocessor.py
│   └── __init__.py
├── training/
│   ├── dpo_trainer.py
│   ├── sft_trainer.py
│   ├── multi_task_trainer.py
│   └── __init__.py
├── models/
│   ├── model.py
│   └── __init__.py
├── ui/
│   ├── app.py
│   ├── mindmap.py
│   └── __init__.py
├── data/        # various datasets
├── utils.py
└── configs/
    ├── config.py
    └── __init__.py
```
- We use a travel dialogue dataset: CrossWOZ.
- Dataset Citation:
```bibtex
@article{zhu2020crosswoz,
  title={CrossWOZ: A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset},
  author={Zhu, Qi and Zhang, Zheng and Fang, Yan and Li, Xiang and Takanobu, Ryuichi and Li, Jinchao and Peng, Baolin and Gao, Jianfeng and Zhu, Xiaoyan and Huang, Minlie},
  journal={Transactions of the Association for Computational Linguistics},
  year={2020},
  url={https://arxiv.org/abs/2002.11893}
}
```
- The input we give to the RAG pipeline is question + context, where the context is built from the 5 dataset samples most similar to the question.
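A minimal sketch of that retrieval step, using a character-level Jaccard similarity as a stand-in for the project's actual similarity metric (the function names and prompt layout are illustrative, not the real implementation in `src/`):

```python
def jaccard(a: str, b: str) -> float:
    """Character-level Jaccard similarity; a stand-in for the real metric."""
    sa, sb = set(a), set(b)
    return len(sa & sb) / len(sa | sb) if sa | sb else 0.0

def build_rag_prompt(question: str, samples: list[str], k: int = 5) -> str:
    """Rank dataset samples by similarity to the question, keep the top k,
    and concatenate them as the context ahead of the question."""
    top_k = sorted(samples, key=lambda s: jaccard(question, s), reverse=True)[:k]
    context = "\n".join(top_k)
    return f"context:\n{context}\nquestion:\n{question}"
```

In the real pipeline the similarity would typically come from sentence embeddings rather than character overlap, but the top-5 selection and question + context assembly are the same idea.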
- We referred to many other projects while building this one:
- knowledge-graph-from-GPT
- ai-travel-agent
- GPT2
- RLHF_instructGPT