A fine-tuned version of Qwen-7B-Chat, trained on a large amount of Cantonese data.
See also: Hugging Face & ModelScope
Star ⭐ my project if you like it!
- Download Configs
git clone https://github.com/stvlynn/Qwen-7B-Chat-Cantonese
- Download Release
cd Qwen-7B-Chat-Cantonese
wget --no-check-certificate --content-disposition https://github.com/stvlynn/Qwen-7B-Chat-Cantonese/releases/download/v1/model-00001-of-00008.safetensors
wget --no-check-certificate --content-disposition https://github.com/stvlynn/Qwen-7B-Chat-Cantonese/releases/download/v1/model-00002-of-00008.safetensors
wget --no-check-certificate --content-disposition https://github.com/stvlynn/Qwen-7B-Chat-Cantonese/releases/download/v1/model-00003-of-00008.safetensors
wget --no-check-certificate --content-disposition https://github.com/stvlynn/Qwen-7B-Chat-Cantonese/releases/download/v1/model-00004-of-00008.safetensors
wget --no-check-certificate --content-disposition https://github.com/stvlynn/Qwen-7B-Chat-Cantonese/releases/download/v1/model-00005-of-00008.safetensors
wget --no-check-certificate --content-disposition https://github.com/stvlynn/Qwen-7B-Chat-Cantonese/releases/download/v1/model-00006-of-00008.safetensors
wget --no-check-certificate --content-disposition https://github.com/stvlynn/Qwen-7B-Chat-Cantonese/releases/download/v1/model-00007-of-00008.safetensors
wget --no-check-certificate --content-disposition https://github.com/stvlynn/Qwen-7B-Chat-Cantonese/releases/download/v1/model-00008-of-00008.safetensors
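The eight shard downloads above can also be scripted as a loop. A minimal sketch, using the same v1 release URLs and `wget` flags as above:

```shell
# Build the list of all 8 safetensors shard URLs for the v1 release.
base="https://github.com/stvlynn/Qwen-7B-Chat-Cantonese/releases/download/v1"
for i in 1 2 3 4 5 6 7 8; do
  printf '%s/model-%05d-of-00008.safetensors\n' "$base" "$i"
done > shards.txt
# Then fetch every shard in one batch:
# wget --no-check-certificate --content-disposition -i shards.txt
```

Running the final (commented) `wget -i shards.txt` line fetches all eight shards with the same flags used above.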
Please refer to QwenLM/Qwen - Quickstart for usage instructions.
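For reference, inference with this checkpoint presumably follows the standard Qwen-7B-Chat pattern. A hedged sketch (the local path and prompt below are placeholders, not from this repo; Qwen's custom code requires `trust_remote_code=True`):

```python
def load_and_chat(model_dir: str, prompt: str) -> str:
    """Load the fine-tuned checkpoint and run one chat turn.

    Assumes the standard Qwen-7B-Chat interface: AutoTokenizer /
    AutoModelForCausalLM with trust_remote_code, plus Qwen's
    model.chat() helper.
    """
    # Deferred import so the function can be defined without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_dir, trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained(
        model_dir, device_map="auto", trust_remote_code=True
    ).eval()
    response, _history = model.chat(tokenizer, prompt, history=None)
    return response
```

For example, `load_and_chat("./Qwen-7B-Chat-Cantonese", "你好,介紹下你自己?")` would return a single Cantonese reply (the directory name here is a placeholder for wherever you saved the shards).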
| Parameter | Description | Value |
|---|---|---|
| Learning Rate | AdamW optimizer learning rate | 7e-5 |
| Weight Decay | Regularization strength | 0.8 |
| Gamma | Learning rate decay factor | 1.0 |
| Batch Size | Number of samples per batch | 1000 |
| Precision | Floating-point precision | fp16 |
| Learning Policy | Learning rate adjustment policy | cosine |
| Warmup Steps | Number of initial steps used to ramp up the learning rate | 0 |
| Total Steps | Total training steps | 1024 |
| Gradient Accumulation Steps | Number of steps to accumulate gradients before updating | 8 |
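As a sanity check on the schedule above: with zero warmup, the cosine policy decays the learning rate from the full 7e-5 at step 0 to zero at step 1024. A minimal sketch of that formula (my own illustration, not the actual training script):

```python
import math

# Hyperparameters taken from the table above.
LEARNING_RATE = 7e-5
TOTAL_STEPS = 1024
WARMUP_STEPS = 0

def cosine_lr(step: int) -> float:
    """Cosine-annealed learning rate with linear warmup (warmup is 0 here)."""
    if step < WARMUP_STEPS:
        # Linear ramp-up during warmup (unused when WARMUP_STEPS == 0).
        return LEARNING_RATE * step / WARMUP_STEPS
    progress = (step - WARMUP_STEPS) / (TOTAL_STEPS - WARMUP_STEPS)
    return 0.5 * LEARNING_RATE * (1.0 + math.cos(math.pi * progress))

# cosine_lr(0)    -> 7e-5   (peak)
# cosine_lr(512)  -> 3.5e-5 (halfway)
# cosine_lr(1024) -> 0.0    (fully decayed)
```

Note also that with a batch size of 1000 and 8 gradient-accumulation steps, the effective batch size is 1000 × 8 = 8000 samples per optimizer update.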
This is my first LLM fine-tuning project; please forgive any mistakes.
If you have any questions or suggestions, feel free to contact me.