An alternative EQA paradigm and informative benchmark + models (BMVC 2019, ViGIL 2019 spotlight)
benchmark machine-learning deep-neural-networks video navigation vqa question-answering visual-reasoning multimodal embodied cross-modality conditioning videonavqa
-
Updated
Jun 22, 2022 - Python