GitHub - yl3800/IGV: This repo contains code for Invariant Grounding for Video Question Answering

Invariant Grounding for Video Question Answering 🔥

Overview

This repo contains source code for Invariant Grounding for Video Question Answering (CVPR 2022 Oral, Best Paper Finalists). In this work, propose a new learning framework, Invariant Grounding for VideoQA (IGV), to ground the question-critical scene, whose causal relations with answers are invariant across different interventions on the complement. With IGV, the VideoQA models are forced to shield the answering process from the negative influence of spurious correlations, which significantly improves the reasoning ability.

<

Installation

Main packages: PyTorch = 1.11
See requirements.txt for other packages.

Data Preparation

We use MSVD-QA as an example to help get farmiliar with the code. Please download the dataset in dataset.zip and the pre-computed features here

After downloading the data, please modify your data path and feature path in run.py.

Run IGV

Simply run train.sh to reproduce the results in the paper. We have saved our checkpoint here (acc 41.42% on MSVD-QA) for your references.

Reference

@InProceedings{Li_2022_CVPR,
    author    = {Li, Yicong and Wang, Xiang and Xiao, Junbin and Ji, Wei and Chua, Tat-Seng},
    title     = {Invariant Grounding for Video Question Answering},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2022},
    pages     = {2928-2937}
}

Acknowledgement

Our reproduction of the methods is based on the respective official repositories and NExT-QA, we thank the authors to release their code.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
dataloader		dataloader
figures		figures
log		log
networks		networks
utils		utils
.gitignore		.gitignore
README.md		README.md
dataset.zip		dataset.zip
requirements.txt		requirements.txt
run.py		run.py
train.sh		train.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Invariant Grounding for Video Question Answering 🔥

Overview

Installation

Data Preparation

Run IGV

Reference

Acknowledgement

About

Releases

Packages

Languages

yl3800/IGV

Folders and files

Latest commit

History

Repository files navigation

Invariant Grounding for Video Question Answering 🔥

Overview

Installation

Data Preparation

Run IGV

Reference

Acknowledgement

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages