Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
-
Updated
Feb 3, 2023 - Jupyter Notebook
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
Meshed-Memory Transformer for Image Captioning. CVPR 2020
A modular library built on top of Keras and TensorFlow to generate a caption in natural language for any input image.
awesome grounding: A curated list of research papers in visual grounding
A neural network to generate captions for an image using CNN and RNN with BEAM Search.
Automatic image captioning model based on Caffe, using features from bottom-up attention.
This repository contains my solutions to the assignments for Stanford's CS231n "Convolutional Neural Networks for Visual Recognition" (Spring 2020).
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019
[DEPRECATED] A Neural Network based generative model for captioning images using Tensorflow
📸 Generates hashtags for Instagram posts. Upload your photo and it will suggest the relevant #hashtags for you. 🏷️
Adds SPICE metric to coco-caption evaluation server codes
Semantic Propositional Image Caption Evaluation
Learning to ground explanations of affect for visual art.
A simple implementation of neural image caption generator
CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022
Implementation of Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space
EMNLP 2018. Learning to Describe Differences Between Pairs of Similar Images. Harsh Jhamtani, Taylor Berg-Kirkpatrick.
Deep CNN-LSTM for Generating Image Descriptions 😈
Aid for blinds. This AI will describe the surrounding, it will tell who is in front of him (if that person is a known person to AI using Facial Recognition) and it will also help him to know what is written (Optical Character Recognition)
A tool for downloading from public image boards (which allow scraping) / preview your images & tags / edit your images & tags. Additional tabs for downloading other desired code repositories as well as S.O.T.A. diffusion and auto-tag/caption models for your purposes. Custom datasets can be added!
Add a description, image, and links to the captioning-images topic page so that developers can more easily learn about it.
To associate your repository with the captioning-images topic, visit your repo's landing page and select "manage topics."