Build software better, together

thaolmk54 / hcrn-videoqa

Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)

vqa question-answering tgif-qa videoqa

Updated Jul 25, 2024
Python

doc-doc / NExT-QA

Star

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)

video-understanding videoqa vision-language video-question-answering multi-object-interaction causal-temporal-action-reasoning

Updated Jul 25, 2024
Python

doc-doc / NExT-GQA

Star

Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)

videoqa video-grounding video-question-answering video-language-understanding trustworthy-vqa visual-evidence-grounding

Updated Jul 1, 2024
Python

mmazab / LifeQA

Star

Data and PyTorch code for the LifeQA LREC 2020 paper.

nlp machine-learning natural-language-processing youtube research computer-vision deep-learning pytorch dataset videos question-answering real-life videoqa video-question-answering lrec2020 lrec lifeqa

Updated Jun 21, 2024
Python

doc-doc / CoVGT

Star

Contrastive Video Question Answering via Video Graph Transformer (IEEE T-PAMI'23)

videoqa video-question-answering contrastive-learning dynamic-visual-graph video-language-understanding

Updated Mar 9, 2024
Python

engindeniz / vitis

Star

[ICCV 2023 CLVL Workshop] Zero-Shot and Few-Shot Video Question Answering with Multi-Modal Prompts

video-understanding zero-shot-learning multimodal-learning visual-question-answering few-shot-learning videoqa vision-language prompt-learning large-language-models

Updated Oct 10, 2023
Python

antoyang / just-ask

Star

[ICCV 2021 Oral + TPAMI] Just Ask: Learning to Answer Questions from Millions of Narrated Videos

vqa video-understanding weakly-supervised-learning multimodal-learning visual-question-answering question-generation vision-and-language videoqa pre-training video-question-answering

Updated Sep 29, 2023
Jupyter Notebook

antoyang / FrozenBiLM

Star

[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models

vqa video-understanding weakly-supervised-learning multimodal-learning visual-question-answering vision-and-language videoqa pre-training video-question-answering large-language-models

Updated Sep 24, 2023
Python

doc-doc / NExT-OE

Star

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)

videoqa vision-language video-comprehension multi-object-interaction causal-temporal-action-reasoning

Updated Jul 18, 2023
Python

YangLiu9208 / CMCIR

Star

[IEEE T-PAMI 2023] Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering

traffic vqa causality causal-inference causal videoqa causal-discovery

Updated Jul 6, 2023
Python

sail-sg / VGT

Star

Video Graph Transformer for Video Question Answering (ECCV'22)

videoqa video-question-answering temporal-dynamics graph-transformer video-language-understanding

Updated Jun 8, 2023
Python

MichiganNLP / wildqa

Star

WildQA website code

nlp machine-learning youtube research computer-vision deep-learning pytorch dataset videos question-answering in-the-wild coling videoqa video-question-answering natual-language-processing coling2022 wildqa

Updated May 10, 2023
HTML

yl3800 / IGV

Star

This repo contains code for Invariant Grounding for Video Question Answering

video generalization interpretable videoqa video-question-answering invariant-learning cvpr-2022 cvpr-oral-2022

Updated Mar 2, 2023
Python

MichiganNLP / lifeqa

Star

LifeQA website code

nlp machine-learning natural-language-processing youtube research computer-vision deep-learning pytorch dataset videos question-answering real-life videoqa video-question-answering lrec2020 lrec lifeqa

Updated Feb 3, 2023
HTML

jayleicn / TVQA

Star

[EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering

pytorch dataset videoqa tvqa

Updated Oct 25, 2022
Python

doc-doc / HQGA

Star

Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral)

videoqa vision-language video-question-answering conditional-graph-hierarchy

Updated Sep 17, 2022
Python

MILVLG / activitynet-qa

Star

An VideoQA dataset based on the videos from ActivityNet

dataset vqa activitynet videoqa

Updated Nov 22, 2020
Python

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

videoqa

Here are 17 public repositories matching this topic...

thaolmk54 / hcrn-videoqa

doc-doc / NExT-QA

doc-doc / NExT-GQA

mmazab / LifeQA

doc-doc / CoVGT

engindeniz / vitis

antoyang / just-ask

antoyang / FrozenBiLM

doc-doc / NExT-OE

YangLiu9208 / CMCIR

sail-sg / VGT

MichiganNLP / wildqa

yl3800 / IGV

MichiganNLP / lifeqa

jayleicn / TVQA

doc-doc / HQGA

MILVLG / activitynet-qa

Improve this page

Add this topic to your repo