MimiQ: Low-Bit Data-Free Quantization of Vision Transformer with Encouraging Inter-Head Attention Similarity
This folder contains the official implementation of MimiQ: Low-Bit Data-Free Quantization of Vision Transformer with Encouraging Inter-Head Attention Similarity.
Requirements:
- Python 3.9.18
- PyTorch 2.0.1
- Refer to requirements.txt for other dependencies
We recommend using a Python virtual environment to run this code.
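A minimal sketch using Python's built-in venv module (the environment name mimiq_env is arbitrary, not something the code requires):

python -m venv mimiq_env          # create an isolated environment
source mimiq_env/bin/activate     # activate it (Linux/macOS)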
You can then install the requirements with the command below.
pip install -r requirements.txt
Prepare the ImageNet dataset by linking it to the path the code expects:

mkdir -p /datasets
ln -s {YOUR IMAGENET FOLDER} /datasets/image
The repository is organized as follows:

mimiq_code
├── main.py
├── option.py
├── trainer.py
├── imagenet_{NETWORK}.hocon # Setting files
├── train.sh # Train script
├── generate_dataset.sh # Synthetic data generation script
├── merge_dataset.sh # Merge generated data into the dataset
├── ... # Utils
├── LICENSE.md
├── README.md
└── requirements.txt
For synthetic dataset reconstruction, run the data generation script below:
./generate_dataset.sh MODEL_NAME NUM_IMGS SAVE_PREFIX SAVE_PATH
- MODEL_NAME : Target network architecture.
  - ViT architectures: vit_{tiny|small|base}_patch16_224
  - DeiT architectures: deit_{tiny|small|base}_patch16_224
  - Swin architectures: swin_{tiny|small|base}_patch4_window7_224
- NUM_IMGS : The number of synthetic images per GPU
- SAVE_PREFIX : Offset added to the image index; useful for multi-GPU generation (see the example after this list). E.g., if SAVE_PREFIX=1000 and NUM_IMGS=100, the generated images will have IDs from 1000 to 1100.
- SAVE_PATH : Where to save the generated images.
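For example, to generate 256 images each on two GPUs, pin each run to one GPU with CUDA_VISIBLE_DEVICES and offset SAVE_PREFIX by NUM_IMGS; the model name and save path here are illustrative placeholders, not defaults of the script:

# GPU 0 writes IDs starting at 0; GPU 1 starts at 256 (assuming 0-indexed image IDs)
CUDA_VISIBLE_DEVICES=0 ./generate_dataset.sh deit_tiny_patch16_224 256 0 ./syn_data &
CUDA_VISIBLE_DEVICES=1 ./generate_dataset.sh deit_tiny_patch16_224 256 256 ./syn_data &
wait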
Then merge the generated images into a dataset:

./merge_dataset.sh SAVE_PATH MODEL_NAME
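Continuing the illustrative values from the generation example above:

./merge_dataset.sh ./syn_data deit_tiny_patch16_224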
For training, set the path of the validation set in the .hocon file. To quantize a model as described in the paper, run the training script below (an example invocation follows the parameter list):
./train.sh CONF_PATH ID LR QW QA GAMMA DATA_PATH LR_POLICY LR_STEP AQ_MODE
- CONF_PATH : Path to the .hocon setting file
- ID : Experiment ID; any unsigned integer, such as 1234 or 5678
- LR : Learning rate, default=0.001
- QW, QA : Weight and activation quantization bit-widths
- GAMMA : Attention-head distillation coefficient, default=10.0
- DATA_PATH : Synthetic dataset path
- LR_POLICY : Learning rate policy, default=multi_step
- LR_STEP : Learning rate decay steps, default=[50,100]
- AQ_MODE : Activation quantization method, either minmax or lsq, default=lsq
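As an illustration, a 4-bit weight / 4-bit activation run with the default hyperparameters could look like the following; the .hocon file name, experiment ID, and data path are hypothetical placeholders:

# LR_STEP is quoted so the shell does not treat the brackets as a glob pattern
./train.sh imagenet_deit_tiny.hocon 1234 0.001 4 4 10.0 ./syn_data multi_step "[50,100]" lsq

With the multi_step policy, the learning rate is expected to decay at the listed epochs (50 and 100 here).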
This project is licensed under the terms of the GNU General Public License v3.0; see LICENSE.md for details.