Skip to content

ZihaoW123/SOTA-Visdual-Dialog

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 

Repository files navigation

SOTA-Visual-Dialog

Advances in Visual Dialog Last update on 2022/10/16.

Table of Contents

Image-based Visual Dialog

Visual Dialog

  1. Visual Dialog, CVPR 2017, [code]

  2. Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model, NIPS 2017, [code]

  3. Are You Talking to Me? Reasoned Visual Dialog Generation through Adversarial Learning, CVPR 2018

  4. Image-Question-Answer Synergistic Network for Visual Dialog, CVPR 2019

  5. Reasoning Visual Dialogs with Structural and Partial Observations, CVPR, 2019, [code]

  6. Recursive Visual Attention in Visual Dialog, CVPR 2019, [code]

  7. Dual Visual Attention Network for Visual Dialog, IJCAI 2019

  8. Making History Matter: History-Advantage Sequence Training for Visual Dialog, ICCV 2019

  9. Granular Multimodal Attention Networks for Visual Dialog, ICCV Workshop 2019

  10. Multi-step Reasoning via Recurrent Dual Attention for Visual Dialog, ACL 2019

  11. Dual Attention Networks for Visual Reference Resolution in Visual Dialog, EMNLP 20219, []code

  12. DMRM: A Dual-channel Multi-hop Reasoning Model for Visual Dialog, AAAI 2020, [code]

  13. Modality-Balanced Models for Visual Dialogue, AAAI 2020

  14. DualVD: An Adaptive Dual Encoding Model for Deep Visual Understanding in Visual Dialogue, AAAI 2020, [code]

  15. Two Causal Principles for Improving Visual Dialog, CVPR 2020, [code]

  16. DAM: Deliberation, Abandon and Memory Networks for Generating Detailed and Non-repetitive Responses in Visual Dialogue, IJCAI 2020, [code]

  17. KBGN: Knowledge-Bridge Graph Network for Adaptive Vision-Text Reasoning in Visual Dialogue, ACM MM 2020

  18. Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline, ECCV 2020, [code]

  19. Visual Dialog: Light-weight Transformer for Many Inputs, ECCV 2020, [code]

  20. Multi-View Attention Network for Visual Dialog, ACL 2020, [code]

  21. History for Visual Dialog: Do we really need it?, ACL 2020, [code]

  22. VD-BERT: A Unified Vision and Dialog Transformer with BERT, EMNLP 2020, [code]

  23. GoG: Graph-over-Graph Network for Visual Dialog, ACL Findings 2021

  24. Multimodal Incremental Transformer for Visual Dialogue Generation, ACL Findings 2021

  25. Learning to Ground Visual Objects for Visual Dialog, EMNLP Findings 2021

  26. VU-BERT: A Unified framework for Visual Dialog, ICASSP 2022

  27. Improving Cross-Modal Understanding in Visual Dialog via Contrastive Learning, ICASSP 2022

  28. UTC: A Unified Transformer with Inter-Task Contrastive Learning for Visual Dialog, CVPR 2022

  29. VD-PCR: Improving visual dialog with pronoun coreference resolution, Pattern Recognition 2022, [code]

  30. Unsupervised and Pseudo-Supervised Vision-Language Alignment in Visual Dialog, ACM MM 2022

  31. Unified Multimodal Model with Unlikelihood Training for Visual Dialog, ACM MM 2022, [code]

ImageChat

Engaging Image Chat: Modeling Personality in Grounded Dialogue, ACL 2020, [code]

PhotoChat

PhotoChat: A Human-Human Dialogue Dataset with Photo Sharing Behavior for Joint Image-Text Modeling, ACL 2021, [code]

Chinese Multi-Modal Chat

MMChat: Multi-Modal Chat Dataset on Social Media Yinhe, LREC 2022, [code]

Video-based Visual Dialog

Bridging Text and Video: A Universal Multimodal Transformer for Video-Audio Scene-Aware Dialog, AAAI 2020, [code]

Other Resources

Acknowledgement

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published