You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The objective is to process by generating textual description from an image – based on the objects and actions in the image. Using generative models so that it creates novel sentences. Pipeline type models uses two separate learning process, one for language modelling and other for image recognition. It first identifies objects in image and prov…
Image Captioning is a task where each image must be understood properly and are able generate suitable caption with proper grammatical structure. Here it is a hybrid system which uses multilayer CNN (Convolutional Neural Network) for generating keywords.
Exercise on captioning images in the Neural Networks for Computer Vision course. Using the Flickr8K dataset, and simple encoder-decoder architecture. Evaluation based on Cross-Entropy loss and 4-gram Bleu score.