[ICML 2024] Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models
deep-learning similarity-score image-text-matching zero-shot-classification large-language-models visual-prompting vision-language-model visual-text-alignment textual-prompting
-
Updated
Sep 3, 2024 - Python