#

caption-task

Here is 1 public repository matching this topic...

microsoft / UniVL

An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"

video localization caption alignment segmentation coin multimodality joint multimodal-sentiment-analysis pretrain pretraining msrvtt video-text-retrieval video-text video-language youcookii retrieval-task caption-task

Updated Jul 25, 2024
Python

Improve this page

Add a description, image, and links to the caption-task topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the caption-task topic, visit your repo's landing page and select "manage topics."