M-Attack-V2

Official implementation of our paper "Pushing the Frontier of Black-Box LVLM Attacks via Fine-Grained Detail Targeting".

M-Attack-V2 substantially improves M-Attack (v1) by reducing unstable local-gradient behavior and handling source-target asymmetry more explicitly.

Quick Start

Install dependencies

uv sync
uv run python -m spacy download en_core_web_sm

Add API keys in api_keys.yaml

gpt4o:
  - "your_openai_key"
claude:
  - "your_anthropic_key"
gemini:
  - "your_google_key"
# optional
gpt5:
  - "your_openai_key"
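A quick sanity check before launching the pipeline can catch placeholder keys left over from the template. The sketch below is not part of this repository: `missing_providers` is a hypothetical helper, and it assumes api_keys.yaml has already been parsed (for example with a YAML library) into a dict that maps provider names to lists of keys. It also assumes gpt4o, claude, and gemini are all needed, with gpt5 as the only optional entry.

```python
# Assumption: every provider except gpt5 must have at least one real key.
REQUIRED = ("gpt4o", "claude", "gemini")


def missing_providers(keys: dict) -> list:
    """Return the required providers that have no usable key (hypothetical helper)."""
    missing = []
    for name in REQUIRED:
        values = keys.get(name) or []
        # Treat empty strings and the "your_..." template placeholders as unusable.
        usable = [
            v for v in values
            if isinstance(v, str) and v.strip() and not v.startswith("your_")
        ]
        if not usable:
            missing.append(name)
    return missing


if __name__ == "__main__":
    # Illustrative values only; these are not real keys.
    example = {"gpt4o": ["your_openai_key"], "claude": ["example-key"], "gemini": []}
    print(missing_providers(example))
```

An empty list means every required provider has at least one key that does not look like a placeholder.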

Run end-to-end pipeline

uv run bash run_parallel.sh

run_parallel.sh runs:

generate_ad_sample_parallel.py
blackbox_text_generation.py
gpt_evaluate.py
keyword_matching_gpt.py
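The order above can be pictured as a sequential chain in which each stage must succeed before the next one starts. The sketch below is an illustration only and not the contents of run_parallel.sh: it assumes each script is launched without extra arguments, which the real script may not do, and `build_commands` and `run_all` are hypothetical names.

```python
import subprocess
import sys

# Stage order as listed in this README.
STAGES = [
    "generate_ad_sample_parallel.py",
    "blackbox_text_generation.py",
    "gpt_evaluate.py",
    "keyword_matching_gpt.py",
]


def build_commands(python=sys.executable):
    """Build one command per stage (assumption: no extra CLI arguments)."""
    return [[python, script] for script in STAGES]


def run_all():
    """Run the stages in order and stop at the first failure."""
    for cmd in build_commands():
        # check=True raises CalledProcessError on a non-zero exit code.
        subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Print the planned commands instead of executing them.
    for cmd in build_commands("python"):
        print(" ".join(cmd))
```

Stopping at the first failure avoids spending API calls on evaluation when an earlier stage produced no output.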

Required Data

Expected folders:

resources/images/bigscale or resources/images/bigscale_100
resources/images/target_images or resources/images/target_images_100
resources/retrieved_embeddings

keyword_matching_gpt.py expects a keywords file at .../target_images/1/keywords.json.

resources/embeddings is an optional retrieval cache; running retrieval.py creates it automatically.
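The folder layout above can be checked before a run. The following is a minimal sketch, assuming the paths are relative to the repository root and that either alternative in an "or" pair is acceptable; `missing_data` is a hypothetical helper and is not provided by this repository.

```python
from pathlib import Path

# Each entry lists alternatives; one existing directory per entry is enough.
EXPECTED = [
    ("resources/images/bigscale", "resources/images/bigscale_100"),
    ("resources/images/target_images", "resources/images/target_images_100"),
    ("resources/retrieved_embeddings",),
]


def missing_data(root="."):
    """Return the entries for which none of the alternative folders exists."""
    root = Path(root)
    return [alts for alts in EXPECTED if not any((root / a).is_dir() for a in alts)]


if __name__ == "__main__":
    for alts in missing_data():
        print("missing:", " or ".join(alts))
```

The check only looks at directories; it does not verify keywords.json or the contents of any folder.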

Advanced Docs

Retrieval pipeline: docs/retrieval.md
Hyperparameter template: docs/hyperparameters.md

Notes

Configure wandb.entity in config/ensemble_3models.yaml if you use Weights & Biases.
Do not commit api_keys.yaml.
Hydra config entry point is config/ensemble_3models.yaml.

Results and Method Details

Main Result

Main Algorithm

Framework Reformulation (v1 vs Ours)

M-Attack (v1, GitHub):

Asymmetric matching (ours):

MCA (Multi-Crop Alignment) improves expectation estimation by averaging alignment over multiple local crops. ATA (Auxiliary Target Alignment) improves target semantic sampling by using auxiliary target cues to obtain a more stable reference.
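The averaging idea behind MCA can be illustrated with a toy Monte Carlo estimate. This is a sketch of the general principle only, not the algorithm from the paper or the code in this repository: `toy_embed` is a hypothetical stand-in for a surrogate image encoder, images are plain nested lists, cosine similarity is an assumed alignment measure, and ATA is not modeled at all.

```python
import math
import random


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def toy_embed(patch):
    """Hypothetical encoder stand-in: row means followed by column means."""
    rows = [sum(r) / len(r) for r in patch]
    cols = [sum(c) / len(c) for c in zip(*patch)]
    return rows + cols


def random_crop(image, size, rng):
    """Cut a size x size patch at a uniformly random position."""
    top = rng.randrange(len(image) - size + 1)
    left = rng.randrange(len(image[0]) - size + 1)
    return [row[left:left + size] for row in image[top:top + size]]


def multi_crop_alignment(image, target_emb, size, num_crops, rng):
    """Average crop-to-target similarity over num_crops random crops."""
    sims = [
        cosine(toy_embed(random_crop(image, size, rng)), target_emb)
        for _ in range(num_crops)
    ]
    return sum(sims) / len(sims)


if __name__ == "__main__":
    rng = random.Random(0)
    image = [[rng.random() for _ in range(8)] for _ in range(8)]
    target_emb = toy_embed([[rng.random() for _ in range(4)] for _ in range(4)])
    print(multi_crop_alignment(image, target_emb, size=4, num_crops=16, rng=rng))
```

Averaging over several crops reduces the variance of the estimate compared with a single crop, which is the intuition the MCA description points to.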

📝 Citation

If you find this project useful in your research or applications, please consider giving it a star ⭐ and citing our work:

@article{zhao2026pushingfrontierblackboxlvlm,
  title={Pushing the Frontier of Black-Box LVLM Attacks via Fine-Grained Detail Targeting},
  author={Zhao, Xiaohan and Li, Zhaoyi and Luo, Yaxin and Cui, Jiacheng and Shen, Zhiqiang},
  journal={arXiv preprint arXiv:2602.17645},
  year={2026}
}
