# image-text

Here are 40 public repositories matching this topic...

salesforce / ALBEF

Code for ALBEF: a new vision-language pre-training method

representation-learning weakly-supervised-learning image-text vision-and-language contrastive-learning

Updated Sep 20, 2022
Python

Sense-GVT / DeCLIP

Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm

multi-model clip big-model zero-shot self-supervised image-text vision-language-pretraining

Updated Sep 19, 2022
Python

google / imageinwords

Data release for the ImageInWords (IIW) paper.

evaluation dataset image-captioning dataset-generation image-to-text image-descriptions image-text human-annotation t2i i2t detailed-descriptions detailed-annotations

Updated Nov 17, 2024
JavaScript

X-PLUG / mPLUG

mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)

pytorch transformer vqa image-captioning visual-language image-text multimodal pretraining image-text-retrieval

Updated May 8, 2023
Python

miccunifi / QualiCLIP

Quality-Aware Image-Text Alignment for Opinion-Unaware Image Quality Assessment

computer-vision deep-learning image-processing image-quality clip iqa image-text image-quality-assessment blind-image-quality-assessment low-level-vision image-degradation self-supervised-learning ranking-loss biqa vision-language nr-iqa no-reference-image-quality-assessment opinion-unaware opinion-unaware-nr-iqa

Updated Mar 10, 2025
Python

labyrinth7x / Deep-Cross-Modal-Projection-Learning-for-Image-Text-Matching

Deep Cross-Modal Projection Learning for Image-Text Matching

image-text

Updated Sep 2, 2020
Python

glami / glami-1m

The largest multilingual image-text classification dataset. It contains fashion products.

multilingual natural-language-processing computer-vision deep-learning fashion text-classification dataset classification image-classification image-to-text image-text multimodal text-to-image-generation multi-modal-deep-learning image-text-classification multilingual-image-text-classification

Updated Jun 8, 2023
Jupyter Notebook

TheoCoombes / crawlingathome

A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.

machine-learning dataset dataset-generation clip image-text dall-e

Updated Mar 21, 2023
Python

zhangming8 / ocr_algo_server

OCR text recognition algorithm service

python ocr image-text text-recognize

Updated Feb 6, 2021
C++

antonlukin / poster-editor

Wrapper for PHP's GD Library for easy image manipulation. Support for scaling multi-line text, shapes, filters and smart resize.

php composer php-library image-processing php-gd intervention php-image image-text php-class poster-editor

Updated Feb 4, 2025
PHP

awsaf49 / flickr-dataset

Download flickr8k, flickr30k image caption datasets

image flickr dataset clip captioning-images image-text flickr8k flickr30k siglip

Updated Feb 6, 2024

zabir-nabil / imagebert-keras

Keras implementation of ImageBERT from Microsoft

keras image-text imagebert

Updated Jan 28, 2020

HuangRunHua / LiveTextWithImage

WWDC22: Enabling Live Text interactions with images in SwiftUI

swift image-processing wwdc image-text swiftui swiftui-example swiftui-demo wwdc22 live-text

Updated Jun 10, 2022
Swift

TheoCoombes / crawlingathome-server

A server powering LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.

machine-learning dataset dataset-generation clip image-text dall-e

Updated Mar 21, 2023
Python

Thisisus7 / ING-VP

An Interactive Game-based Vision Planning benchmark

game benchmark image-text multimodal lmm llm mllm

Updated Feb 24, 2025
Python

dvlab-research / TagCLIP

segmentation clip zero-shot image-text

Updated Sep 3, 2024
Python

fatemeh-mohseni-AI / most-repeated-vocabulary-IELTS

This project is a FastAPI-based web application designed to analyze Cambridge IELTS PDFs (Books 1-18) for the most and least repeated words. It can handle both regular text-based PDFs and scanned image-based PDFs by converting them to images and extracting text using OCR (Optical Character Recognition).

ielts image-text fast-api

Updated Aug 16, 2024
Python

waittim / ConVIRT-Colab

Contrastive Learning Representations for Images and Text Pairs. Colab implementation of ConVIRT for transfer learning with insufficient data volume.

colab image-text contrastive-learning

Updated Jan 15, 2022
Jupyter Notebook

reshalfahsi / image-captioning-mobilenet-llama3

Image Captioning With MobileNet-LLaMA 3

nlp cnn pytorch transformer image-captioning image-text flickr8k-dataset mobilenetv3 pytorch-lightning kv-cache rotary-position-embedding grouped-query-attention rms-norm llama3

Updated Jun 23, 2024
Jupyter Notebook

leeyunjai / image2text

caption generator using lavis and argostranslate

captions caption image-analysis captioning-images img2txt image-text caption-generation caption-generator blip2

Updated Mar 21, 2023
Python
