Course project for "基础学科交叉实践课程" (Interdisciplinary Practice Course in Fundamental Disciplines), summer 2024
cd combine
CUDA_VISIBLE_DEVICES=0 python run.py
CUDA_VISIBLE_DEVICES=0 python pred.py
CUDA_VISIBLE_DEVICES=0 python pred_long.py
CUDA_VISIBLE_DEVICES=0 python pred_streaming.py
CUDA_VISIBLE_DEVICES=0 python pred_longstreaming.py
python eval.py --model xxx
The original model, StreamingLLM, and LongStreamingLLM were tested with transformers==4.33.0. LongLM, however, requires transformers==4.38.2, so run the following command before testing it:
pip install transformers==4.38.2
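Because the two transformers versions are easy to mix up, a small pre-flight check can help before launching a test. The following is a minimal sketch and not part of this repository: the method labels and the helper names `required_version` and `version_matches` are hypothetical, and only the version numbers come from the note above.

```python
from importlib import metadata

# Versions as stated in this README. The keys are hypothetical labels
# for the four tested methods, not names used by the project's scripts.
REQUIRED_TRANSFORMERS = {
    "original": "4.33.0",
    "streamingllm": "4.33.0",
    "longstreamingllm": "4.33.0",
    "longlm": "4.38.2",
}


def required_version(method):
    """Return the transformers version this README lists for a method."""
    return REQUIRED_TRANSFORMERS[method.lower()]


def version_matches(method, installed=None):
    """Check whether the installed transformers version fits the method.

    If `installed` is not given, it is looked up from the current
    environment; a missing package counts as a mismatch.
    """
    if installed is None:
        try:
            installed = metadata.version("transformers")
        except metadata.PackageNotFoundError:
            return False
    return installed == required_version(method)
```

If `version_matches("longlm")` returns False, install the version reported by `required_version("longlm")` with pip before running the LongLM test.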
The results are in report.pdf (written in Chinese).
This project is based on the projects below: