Free-HeadGAN: Neural Talking Head Synthesis With Explicit Gaze Control.
Michail Christos Doukas, Evangelos Ververas, Viktoriia Sharmanska, Stefanos Zafeiriou.
TPAMI 2023. [PDF]
SPACE: Speech-driven Portrait Animation with Controllable Expression.
Siddharth Gururani, Arun Mallya, Ting-Chun Wang, Rafael Valle, Ming-Yu Liu.
ICCV 2023. [PDF] [Project]
Instant Volumetric Head Avatars.
Wojciech Zielonka, Timo Bolkart, Justus Thies.
CVPR 2023. [PDF] [Project] [Video]
LiP-Flow: Learning Inference-time Priors for Codec Avatars Via Normalizing Flows in Latent Space.
Emre Aksan, Shugao Ma, Akin Caliskan, Stanislav Pidhorskyi, Alexander Richard, Shih-En Wei, Jason Saragih, Otmar Hilliges.
ECCV 2022. [PDF]
Talking Head from Speech Audio using a Pre-trained Image Generator.
Mohammed M. Alghamdi, He Wang, Andrew J. Bulpitt, David C. Hogg.
ACM MM 2022. [PDF] [Code]
Speech Driven Tongue Animation.
Salvador Medina, Denis Tome, Carsten Stoll, Mark Tiede, Kevin Munhall, Alexander G. Hauptmann, Iain Matthews.
CVPR 2022. [PDF]
Talking Face Generation With Multilingual TTS.
Hyoung-Kyu Song, Sang Hoon Woo, Junhyeok Lee, Seungmin Yang, Hyunjae Cho, Youseong Lee, Dongho Choi, Kang-wook Kim.
CVPR 2022. [PDF]
Thin-Plate Spline Motion Model for Image Animation.
Jian Zhao, Hui Zhang.
CVPR 2022. [PDF]
Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation.
Xian Liu, Qianyi Wu, Hang Zhou, Yinghao Xu, Rui Qian, Xinyi Lin, Xiaowei Zhou, Wayne Wu, Bo Dai, Bolei Zhou.
CVPR 2022. [PDF]
Depth-Aware Generative Adversarial Network for Talking Head Video Generation.
Fa-Ting Hong, Longhao Zhang, Li Shen, Dan Xu
CVPR 2022. [PDF] [Code]
FaceFormer: Speech-Driven 3D Facial Animation with Transformers.
Yingruo Fan, Zhaojiang Lin, Jun Saito, Wenping Wang, Taku Komura.
CVPR 2022. [PDF]
Everybody's Talkin': Let Me Talk as You Want.
Linsen Song, Wayne Wu, Chen Qian, Ran He, Chen Change Loy.
TIFS 2022. [PDF] [Project]
SAFA: Structure Aware Face Animation.
Qiulin Wang, Lu Zhang, Bo Li.
3DV 2021. [PDF] [Code]
One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning.
Suzhen Wang, Lincheng Li, Yu Ding, Xin Yu.
AAAI 2022. [PDF]
Talking Head Generation with Audio and Speech Related Facial Action Units.
Sen Chen, Zhilei Liu, Jiaxing Liu, Zhengxiang Yan, Longbiao Wang.
BMVC 2021. [PDF]
Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation.
Yuanxun Lu, Jinxiang Chai, Xun Cao.
SIGGRAPH Asia 2021. [PDF]
DECA: Learning an Animatable Detailed 3D Face Model from In-The-Wild Images.
Yao Feng, Haiwen Feng, Michael J. Black, Timo Bolkart.
SIGGRAPH 2021. [PDF] [Code]
Towards Realistic Visual Dubbing with Heterogeneous Sources.
Tianyi Xie, Liucheng Liao, Cheng Bi, Benlai Tang, Xiang Yin, Jianfei Yang, Mingjie Wang, Jiali Yao, Yang Zhang, Zejun Ma.
ACM MM 2021. [PDF]
Learned Spatial Representations for Few-Shot Talking-Head Synthesis.
Moustafa Meshry, Saksham Suri, Larry S. Davis, Abhinav Shrivastava.
ICCV 2021. [PDF]
The Right To Talk: An Audio-Visual Transformer Approach.
Thanh-Dat Truong, Chi Nhan Duong, The De Vu, Hoang Anh Pham, Bhiksha Raj, Ngan Le, Khoa Luu.
ICCV 2021. [PDF]
Speech Drives Templates: Co-Speech Gesture Synthesis With Learned Templates.
Shenhan Qian, Zhi Tu, Yihao Zhi, Wen Liu, Shenghua Gao.
ICCV 2021. [PDF]
AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis.
Yudong Guo, Keyu Chen, Sen Liang, Yongjin Liu, Hujun Bao, Juyong Zhang.
ICCV 2021. [PDF] [Project] [Video]
FACIAL: Synthesizing Dynamic Talking Face with Implicit Attribute Learning.
Chenxu Zhang, Yifan Zhao, Yifei Huang, Ming Zeng, Saifeng Ni, Madhukar Budagavi, Xiaohu Guo.
ICCV 2021. [PDF]
MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement.
Alexander Richard, Michael Zollhoefer, Yandong Wen, Fernando de la Torre, Yaser Sheikh.
ICCV 2021. [PDF] [Video]
HeadGAN: Video-and-Audio-Driven Talking Head Synthesis.
Michail Christos Doukas, Stefanos Zafeiriou, Viktoriia Sharmanska.
ICCV 2021. [PDF]
Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion.
Suzhen Wang, Lincheng Li, Yu Ding, Changjie Fan, Xin Yu.
IJCAI 2021. [PDF]
Imitating Arbitrary Talking Style for Realistic Audio-Driven Talking Face Synthesis.
Haozhe Wu, Jia Jia, Haoyu Wang, Yishun Dou, Chao Duan, Qingshan Deng.
ACM MM 2021. [PDF] [Code]
LipSync3D: Data-Efficient Learning of Personalized 3D Talking Faces from Video using Pose and Lighting Normalization.
Avisek Lahiri, Vivek Kwatra, Christian Frueh, John Lewis, Chris Bregler.
CVPR 2021. [PDF]
Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation.
Hang Zhou, Yasheng Sun, Wayne Wu, Chen Change Loy, Xiaogang Wang, Ziwei Liu.
CVPR 2021. [PDF] [Project] [Code]
Everything's Talkin': Pareidolia Face Reenactment.
Linsen Song, Wayne Wu, Chaoyou Fu, Chen Qian, Chen Change Loy, Ran He.
CVPR 2021. [PDF] [Project] [Code]
One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing.
Ting-Chun Wang, Arun Mallya, Ming-Yu Liu.
CVPR 2021 (oral). [PDF] [Project]
Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation.
Lincheng Li, Suzhen Wang, Zhimeng Zhang, Yu Ding, Yixing Zheng, Xin Yu, Changjie Fan.
AAAI 2021. [PDF]
FLNet: Landmark Driven Fetching and Learning Network for Faithful Talking Facial Animation Synthesis.
Kuangxiao Gu, Yuqian Zhou, Thomas Huang.
AAAI 2020. [PDF] [Code]
MEAD: A Large-scale Audio-visual Dataset for Emotional Talking Face Generation.
Kaisiyuan Wang, Qianyi Wu, Linsen Song, Zhuoqian Yang, Wayne Wu, Chen Qian, Ran He, Yu Qiao, Chen Change Loy.
ECCV 2020. [PDF] [Project] [Code]
MakeItTalk: Speaker-Aware Talking-Head Animation.
Yang Zhou, Xintong Han, Eli Shechtman, Jose Echevarria, Evangelos Kalogerakis, Dingzeyu Li.
TOG 2020. [PDF] [Code]
Text-based Editing of Talking-head Video.
Ohad Fried, Ayush Tewari, Michael Zollhöfer, Adam Finkelstein, Eli Shechtman, Dan B Goldman Kyle Genova, Zeyu Jin, Christian Theobalt, Maneesh Agrawala.
TOG 2019. [PDF] [Project]
ATVGnet: Hierarchical Cross-Modal Talking Face Generationwith Dynamic Pixel-Wise Loss.
Lele Chen, Ross K. Maddox, Zhiyao Duan, Chenliang Xu.
CVPR 2019. [PDF] [Code]
Talking Face Generation by Adversarially Disentangled Audio-Visual Representation.
Hang Zhou, Yu Liu, Ziwei Liu, Ping Luo, Xiaogang Wang.
AAAI 2019. [PDF] [Code] [Project]
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time.
Sicheng Xu, Guojun Chen, Yu-Xiao Guo, Jiaolong Yang, Chong Li, Zhenyu Zang, Yizhong Zhang, Xin Tong, Baining Guo.
arxiv 2024. [PDF] [Project]