Real-world photo sequence question answering system (MemexQA). CVPR'18 and TPAMI'19
visual-question-answering
vision-and-language
multimodal-deep-learning
multimodal-datasets
multimodal-representation
memex-question-answering
memexqa-dataset
-
Updated
Jul 1, 2019 - Python