
JiaqiGuoSunlune/data_scraper


Prepare

pip install -r requirements.txt

Run

Step 1: Pick data

Set the model path and the input file in run.py, and select the tensors to dump in pickdata.py (see Config below), then run:

python run.py

All of the selected tensors are then saved as .pt files in ./savedata.
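
Before converting, you can inspect the dumped tensors directly with torch.load; a minimal sketch that simply lists whatever run.py produced, so no file names are assumed:

import os
import torch

# Print the name, shape, and dtype of each dumped tensor in ./savedata.
for name in sorted(os.listdir("./savedata")):
    obj = torch.load(os.path.join("./savedata", name), map_location="cpu")
    if torch.is_tensor(obj):
        print(name, tuple(obj.shape), obj.dtype)
    else:
        print(name, type(obj))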

Step 2: Convert the data from .pt to .dat and split it into blocks

Activation

Set from_dir and to_dir in reshape.py, then run:

python reshape.py

The activation .dat files are then written to ./data_llama-3.1-8B-bf16-prefill-520-len-5.
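
At its core, the conversion writes the raw tensor bytes to disk; a minimal sketch, assuming bfloat16 data and a hypothetical file name, and ignoring the blockwise layout that reshape.py defines:

import torch

t = torch.load("./savedata/example.pt", map_location="cpu")  # hypothetical file name
# numpy has no bfloat16 dtype, so reinterpret the bits as uint16 first.
t.contiguous().view(torch.uint16).numpy().tofile("./example.dat")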

Weight

Set filedir and to_dir in reshape_weight.py, then run:

python reshape_weight.py

The weight .dat files are then written to ./data_llama-3.1-8B-bf16-prefill-520-len-5.
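
A quick sanity check after either conversion is to read a .dat file back and confirm its size; a sketch assuming bf16 values stored as raw uint16 words (the file name is hypothetical):

import numpy as np

raw = np.fromfile(
    "./data_llama-3.1-8B-bf16-prefill-520-len-5/example.dat",  # hypothetical name
    dtype=np.uint16,
)
print(raw.size, "bf16 elements,", raw.nbytes, "bytes")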

Config

Model, Dtype, and Generation Length

This is the run configuration.

run.py:

# config
config.max_new_tokens = 5 # decoding length (new tokens)
max_length = 36 # prefill length (prompt tokens)
dtype = torch.bfloat16 # model dtype
model_path = "/data/coding/llama-3.1-8b" # model path
input_file = "./data_for_gem5_prompt.txt" # input file containing a list of prompt strings
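
run.py's loading code is not reproduced in this README, but these values map onto a standard Hugging Face generation call; a minimal sketch, assuming a transformers-compatible checkpoint at model_path and one prompt per line in the input file:

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_path = "/data/coding/llama-3.1-8b"
tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(model_path, torch_dtype=torch.bfloat16)

with open("./data_for_gem5_prompt.txt") as f:
    prompt = f.readline().strip()  # first prompt from the input list

inputs = tokenizer(prompt, return_tensors="pt", truncation=True, max_length=36)
out = model.generate(**inputs, max_new_tokens=5)  # prefill 36 tokens, decode 5
print(tokenizer.decode(out[0]))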

Pick Tensor

These settings select which tensors to dump and which layers to take them from.

pickdata.py:

save_dir = "./savedata" # dir you want to save .pt data

data_need = [
    "activation_attention_norm_input",
    "activation_attention_input",
    "activation_attention_output_after_outproj",
    "activation_ffn_norm_input",
    "activation_ffn_input",
    "activation_ffn_output",
    "constant_ROPE_cos_pos",
    "constant_ROPE_sin_pos",
    "activation_logit_output_after_outproj",
    "activation_logit_output_before_outproj",
    "activation_transformerblock_output",
    "activation_token_embedding_tokens",
    "kv_cache_k_after_update",
    "kv_cache_v_after_update",
    # "activation_attention_output_before_reshape",
    # "activation_attention_xq_after_ROPE",
    # "activation_attention_xk_after_ROPE",
    # "activation_attention_xq_before_ROPE",
    # "activation_attention_xk_before_ROPE",
    # "activation_attention_xv",
    # "activation_attention_scores_before_div_sqrt_dim",
    # "kv_cache_k_quant_after_update",
    # "kv_cache_v_quant_after_update",
    # "kv_cache_k_rest_after_update",
    # "kv_cache_v_rest_after_update",
]

layer_need = range(32) # pick out all layers
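
pickdata.py's hook mechanism is not shown here, but conceptually each captured tensor is kept only if it passes both filters above; a hypothetical sketch (the save-file naming scheme is an assumption):

import os
import torch

def maybe_save(tensor, tensor_name, layer_idx):
    # Keep only the tensors and layers selected above.
    if tensor_name in data_need and layer_idx in layer_need:
        path = os.path.join(save_dir, f"{tensor_name}_layer{layer_idx}.pt")
        torch.save(tensor.detach().cpu(), path)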

Change Format and Blockwise Split

Activation

reshape.py:

from_dir = r'./savedata' # dir of .pt data
to_dir = r'./data_llama-3.1-8B-bf16-prefill-520-len-5' # dir you want to save .dat
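
Combined with the single-file sketch above, the conversion amounts to a walk over from_dir; again a sketch that assumes bf16 tensors and leaves the blockwise splitting to reshape.py:

import os
import torch

os.makedirs(to_dir, exist_ok=True)
for name in os.listdir(from_dir):
    if not name.endswith(".pt"):
        continue
    t = torch.load(os.path.join(from_dir, name), map_location="cpu")
    out_path = os.path.join(to_dir, name.replace(".pt", ".dat"))
    t.contiguous().view(torch.uint16).numpy().tofile(out_path)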

Weight

reshape_weight.py:

filedir = r'D:\data-tanyifan\modeldata\llama-3.1-8b' # dir of the model (raw string, so the backslashes are literal)
to_dir = r'./data_llama-3.1-8B-bf16-prefill-520-len-5' # dir you want to save .dat
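
reshape_weight.py reads the checkpoint itself rather than the dumped activations; a minimal sketch of walking the weights, assuming the model directory ships .safetensors shards (if it is a pytorch_model.bin checkpoint, torch.load works instead):

import glob
import os
from safetensors.torch import load_file

for shard in glob.glob(os.path.join(filedir, "*.safetensors")):
    for name, w in load_file(shard).items():
        print(name, tuple(w.shape), w.dtype)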
