[CVPR 2024] Tune-An-Ellipse: CLIP Has Potential to Find What You Want

showlab/Tune-An-Ellipse

Tune-An-Ellipse

This is the official PyTorch implementation of our paper:

Tune-An-Ellipse: CLIP Has Potential to Find What You Want
CVPR 2024 Highlight
Jinheng Xie, Songhe Deng, Bing Li, Haozhe Liu, Yawen Huang, Yefeng Zheng, Jurgen Schmidhuber, Bernard Ghanem, Linlin Shen, Mike Zheng Shou

[Paper] [Poster] [Demo]

Install

pip install -r requirements.txt

Demo

  • The code automatically downloads the CLIP model checkpoints.

GUI

python gradio_demo.py

GUI screenshot

CLI

python run.py --img_path source/cat.png --caption "jumping cat" 

The result image is saved to workspace/test/hd_tune.
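To process several image/caption pairs, the CLI call above can be wrapped in a small driver. This is a minimal sketch, assuming it is run from the repository root and that run.py needs only the two flags shown above. The helper names (`build_command`, `run_batch`) and the `pairs` list are hypothetical and not part of the repository.

```python
import subprocess
import sys


def build_command(img_path, caption, script="run.py"):
    """Build the argument list for one CLI invocation.

    Only the --img_path and --caption flags from the README are used.
    """
    return [sys.executable, script, "--img_path", img_path, "--caption", caption]


def run_batch(pairs, dry_run=True):
    """Build (and optionally execute) one command per (image, caption) pair.

    With dry_run=True nothing is executed; the commands are only returned.
    """
    commands = [build_command(img, cap) for img, cap in pairs]
    if not dry_run:
        for cmd in commands:
            # check=True raises CalledProcessError if a run fails.
            subprocess.run(cmd, check=True)
    return commands


# Hypothetical input list; replace with your own images and captions.
pairs = [("source/cat.png", "jumping cat")]
for cmd in run_batch(pairs, dry_run=True):
    print(" ".join(cmd))
```

Set `dry_run=False` to actually launch the runs.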

Experiment on RefCOCO/RefCOCO+/RefCOCOg

  1. Please follow the instructions in lichengunc/refer to download the RefCOCO series datasets.
  2. After downloading all of them, organize the data under /PATH/TO/RefCOCO as follows:
├── images
│   └── mscoco
│       └── images
│           └── train2014
├── reclip_data
│   ├── refcoco+_dets_dict.json
│   ├── refcoco_dets_dict.json
│   └── refcocog_dets_dict.json
├── refcoco
│   ├── instances.json
│   ├── refs(google).p
│   └── refs(unc).p
├── refcoco+
│   ├── instances.json
│   └── refs(unc).p
└── refcocog
    ├── instances.json
    ├── refs(google).p
    └── refs(umd).p
  3. Run the prepared scripts:
bash scripts/refcoco.sh
bash scripts/refcoco+.sh
bash scripts/refcocog.sh
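
Before launching the scripts, it can help to confirm that the dataset directory matches the tree above. The following is a minimal sketch, not part of the repository. The relative paths are copied from the tree, and the helper name `missing_entries` is hypothetical.

```python
from pathlib import Path

# Relative paths copied from the directory tree in this README.
EXPECTED = [
    "images/mscoco/images/train2014",
    "reclip_data/refcoco+_dets_dict.json",
    "reclip_data/refcoco_dets_dict.json",
    "reclip_data/refcocog_dets_dict.json",
    "refcoco/instances.json",
    "refcoco/refs(google).p",
    "refcoco/refs(unc).p",
    "refcoco+/instances.json",
    "refcoco+/refs(unc).p",
    "refcocog/instances.json",
    "refcocog/refs(google).p",
    "refcocog/refs(umd).p",
]


def missing_entries(root):
    """Return the expected relative paths that do not exist under root."""
    root = Path(root)
    return [rel for rel in EXPECTED if not (root / rel).exists()]
```

For example, `missing_entries("/PATH/TO/RefCOCO")` returns an empty list when every listed file and folder is in place, and otherwise lists what still needs to be downloaded or moved.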

We used the code from CLIP-ES to generate the class activation maps (CAMs) of the CLIP models. Thanks to the authors for their great work!

BibTeX

@InProceedings{Xie_2024_CVPR,
    author    = {Xie, Jinheng and Deng, Songhe and Li, Bing and Liu, Haozhe and Huang, Yawen and Zheng, Yefeng and Schmidhuber, Jurgen and Ghanem, Bernard and Shen, Linlin and Shou, Mike Zheng},
    title     = {Tune-An-Ellipse: CLIP Has Potential to Find What You Want},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2024},
    pages     = {13723-13732}
}
