Awesome Talking Head Generation

A curated list of papers and resources for Talking Head Generation, including face animation, audio-driven synthesis, portrait animation, and related tasks.

Contributions welcome! Open an issue, pull request, or contact fatinghong@gmail.com. Discord: Fa-Ting Hong#6563

🔍 I'm actively looking for remote full-time positions and full-time postdoc opportunities in talking head generation, generative models, or related areas. If you're interested, feel free to reach out at fatinghong@gmail.com.

🔥 New: ACTalker — portrait video generation driven by audio and expression simultaneously. ICCV 2025

Related Groups

Datasets

VoxCeleb1 [Download link].
VoxCeleb2 [Download link].
FaceForensics++ [Download link].
CelebV [Download link].
TalkingHead-1KH [Download link].
LRW (Lip Reading in the Wild) [Download link].
MEAD [Download link].
CelebV-HQ [Download link].
CHDTF [Download link].
MultiTalk [Download link].
VFHQ [Download link].
Hallo3 [Download link].
AVSpeech [Download link].

Survey

Year	Paper	Venue	Links
2026	Talking-Head Generation in Practice: A Longitudinal Analysis 2021–2025	OpenReview
2025	Advancing Talking Head Generation: A Comprehensive Survey of Multi-Modal Methodologies, Datasets, Evaluation Metrics, and Loss Functions	arXiv
2025	Advancements in Talking Head Generation: A Comprehensive Review of Techniques, Metrics, and Challenges	The Visual Computer
2024	A Survey of Talking Head Synthesis: Portrait Generation, Driving Mechanisms, and Editing	ACM CSUR
2024	A Comprehensive Taxonomy and Analysis of Talking Head Synthesis	arXiv
2024	Audio-Driven Facial Animation with Deep Learning: A Survey	MDPI Information
2024	A Comparative Study of Perceptual Quality Metrics for Audio-driven Talking Head Videos	arXiv	Code
2023	From Pixels to Portraits: A Comprehensive Survey of Talking Head Generation Techniques and Applications	arXiv
2023	Talking Human Face Generation: A Survey	Expert Systems with Applications
2022	Human-Computer Interaction System: A Survey of Talking-Head Generation	MDPI Electronics
2020	What comprises a good talking-head video generation?: A Survey and Benchmark	arXiv

Image-driven

Year	Paper	Venue	Links
2026	PersonaLive! Expressive Portrait Image Animation for Live Streaming	CVPR 2026	Code
2026	PortraitDirector: A Hierarchical Disentanglement Framework for Controllable and Real-time Facial Reenactment	arXiv 2026
2025	HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation	CVPR 2025	Code · Project
2025	Robust Deepfake Detection for Electronic Know Your Customer Systems Using Registered Images	arXiv 2025
2025	Towards Interactive Intelligence for Digital Humans	arXiv 2025
2025	FlashPortrait: 6x Faster Infinite Portrait Animation with Adaptive Latent Prediction	arXiv 2025
2024	X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention	SIGGRAPH 2024	Code
2024	Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation	SIGGRAPH Asia 2024	Code
2024	LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control	arXiv 2024	Code · Project
2024	EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars	CVPR 2024	Code · Project
2024	Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation	CVPR 2024	Project
2023	Audio-Visual Face Reenactment	WACV 2023	Code · Project
2023	Cross-identity Video Motion Retargeting with Joint Transformation and Synthesis	WACV 2023	Code
2023	Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head Video Generation	ICCV 2023	Project · Code
2023	StyleLipSync: Style-based Personalized Lip-sync Video Generation	ICCV 2023	Code
2022	Depth-Aware Generative Adversarial Network for Talking Head Video Generation	CVPR 2022	Code · Project
2022	Thin-Plate Spline Motion Model for Image Animation	CVPR 2022	Code
2022	StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pretrained StyleGAN	ECCV 2022	Code · Project
2022	MegaPortraits: One-shot Megapixel Neural Head Avatars	ACM MM 2022	Project
2022	Structure-Aware Motion Transfer with Deformable Anchor Model	CVPR 2022	Code
2022	StyleMask: Disentangling the Style Space of StyleGAN2 for Neural Face Reenactment	FG 2023	Code
2022	Controllable Radiance Fields for Dynamic Face Synthesis	arXiv 2022
2022	Animatable 3D-Aware Face Image Generation for Video Avatars	NeurIPS 2022	Project
2022	Implicit Warping for Animation with Image Sets	NeurIPS 2022	Project
2022	HifiHead: One-Shot High Fidelity Neural Head Synthesis with 3D Control	IJCAI 2022
2022	Face Animation with Multiple Source Images	arXiv 2022
2022	MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation	arXiv 2022
2022	Compressing Video Calls using Synthetic Talking Heads	BMVC 2022	Project
2022	Finding Directions in GAN’s Latent Space for Neural Face Reenactment	BMVC 2022	Project · Code
2022	Latent Image Animator: Learning to Animate Images via Latent Space Navigation	ICLR 2022	Project · Code
2021	One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing	CVPR 2021 Oral	Project
2021	Sparse to Dense Motion Transfer for Face Image Animation	ICCV 2021
2021	SAFA: Structure Aware Face Animation	3DV 2021	Code
2021	Self-appearance-aided Differential Evolution for Motion Transfer	arXiv 2021
2021	PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering	ICCV 2021	Code
2021	FACEGAN: Facial Attribute Controllable rEenactment GAN	WACV 2021
2021	F3A-GAN: Facial Flow for Face Animation With Generative Adversarial Networks	IEEE TIP 2021
2021	FACIAL: Synthesizing Dynamic Talking Face with Implicit Attribute Learning	ICCV 2021
2021	Motion Representations for Articulated Animation	CVPR 2021	Code
2021	HeadGAN: One-shot Neural Head Synthesis and Editing	ICCV 2021	Project
2020	Mesh Guided One-shot Face Reenactment Using Graph Convolutional Networks	ACM Multimedia 2020	Code
2020	MarioNETte: Few-shot Face Reenactment Preserving Identity of Unseen Targets	AAAI 2020	Project
2020	Learning Identity-Invariant Motion Representations for Cross-ID Face Reenactment	CVPR 2020
2019	First order motion model for image animation	NeurIPS 2019	Code
2019	Few-Shot Adversarial Learning of Realistic Neural Talking Head Models	ICCV 2019	Code
2019	Animating Arbitrary Objects via Deep Motion Transfer	CVPR 2019 Oral	Code · Project
2019	Few-shot Video-to-Video Synthesis	NeurIPS 2019	Code · Project
2018	ReenactGAN: Learning to Reenact Faces via Boundary Transfer	ECCV 2018	Code
2018	X2Face: A network for controlling face generation by using images, audio, and pose codes	ECCV 2018	Code · Project
2016	Face2Face: Real-time face capture and reenactment of RGB videos	CVPR 2016

Audio-driven

Year	Paper	Venue	Links
2026	FunCineForge: A Unified Dataset Toolkit and Model for Zero-Shot Movie Dubbing in Diverse Cinematic Scenes	arXiv 2026
2026	TurboTalk: Progressive Distillation for One-Step Audio-Driven Talking Avatar Generation	arXiv 2026
2026	SEDTalker: Emotion-Aware 3D Facial Animation Using Frame-Level Speech Emotion Diarization	arXiv 2026	Code
2026	AUHead: Realistic Emotional Talking Head Generation via Action Units Control	ICLR 2026
2026	UniTalking: A Unified Audio-Video Framework for Talking Portrait Generation	CVPR 2026
2026	DiFlowDubber: Discrete Flow Matching for Automated Video Dubbing via Cross-Modal Alignment and Synchronization	CVPR 2026
2026	ActAvatar: Temporally-Aware Precise Action Control for Talking Avatars	CVPR 2026
2026	Cross-Modal Emotion Transfer for Emotion Editing in Talking Face Video	CVPR 2026
2026	MMFace-DiT: A Dual-Stream Diffusion Transformer for High-Fidelity Multimodal Face Generation	CVPR 2026
2025	OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models	arXiv 2025	Project
2025	Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modelling for Natural Talking Head Generation	ICCV 2025	Project
2025	OmniAvatar: Efficient Audio-Driven Avatar Video Generation with Adaptive Body Animation	arXiv 2025	Code · Project
2025	Teller: Real-Time Streaming Audio-Driven Portrait Animation with Autoregressive Motion Generation	CVPR 2025
2025	EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion	CVPR 2025	Project
2025	INFP: Audio-Driven Interactive Head Generation in Dyadic Conversations	CVPR 2025	Project
2025	Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation	ICLR 2025	Code
2025	Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency	ICLR 2025	Project
2025	DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation	ICLR 2025	Project · Code
2025	AnyTalk: Multi-modal Driven Multi-domain Talking Head Generation	AAAI 2025
2025	Occlusion-Insensitive Talking Head Video Generation via Facelet Compensation	AAAI 2025
2025	FixTalk: Taming Identity Leakage for High-Quality Talking Head Generation	ICCV 2025
2025	FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait	ICCV 2025	Project
2025	MoEE: Mixture of Emotion Experts for Audio-Driven Portrait Animation	CVPR 2025
2025	Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length	arXiv 2025	Code
2025	DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models	arXiv 2025
2025	GAIA: Zero-shot Talking Avatar Generation	arXiv 2025
2024	Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis	ICLR 2024	Project · Code
2024	Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions	arXiv 2024	Project · Code
2024	Style2Talker: High-Resolution Talking Head Generation with Emotion Style and Art Style	AAAI 2024
2024	Say Anything with Any Style	AAAI 2024
2024	MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting		Code
2024	VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time	NeurIPS 2024	Project
2024	THQA: A Perceptual Quality Assessment Database for Talking Heads	arXiv 2024	Code
2024	Talk3D: High-Fidelity Talking Portrait Synthesis via Personalized 3D Generative Prior	arXiv 2024	Code · Project
2024	EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis	arXiv 2024	Code · Project
2024	AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animations	arXiv 2024	Code
2024	FlowVQTalker: High-Quality Emotional Talking Face Generation through Normalizing Flow and Quantization	arXiv 2024
2024	FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio	arXiv 2024	Code
2024	Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation	arXiv 2024	Code
2024	EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditions	arXiv 2024	Code · Project
2024	RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network	arXiv 2024
2024	Emotional Conversation: Empowering Talking Faces with Cohesive Expression, Gaze and Pose Generation	arXiv 2024
2024	Make Your Actor Talk: Generalizable and High-Fidelity Lip Sync with Motion and Appearance Disentanglement	arXiv 2024
2024	FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model	arXiv 2024
2024	ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer	arXiv 2024
2024	Style-Preserving Lip Sync via Audio-Aware Style Reference	arXiv 2024
2024	EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation	arXiv 2024	Code · Project
2024	Latent Diffusion Transformer for Talking Video Synthesis	arXiv 2024	Code · Project
2024	IF-MDM: Implicit Face Motion Diffusion Model for High-Fidelity Realtime Talking Head Generation	arXiv 2024	Project
2024	Memory-Guided Diffusion for Expressive Talking Video Generation	arXiv 2024	Project · Code
2024	Highly Dynamic and Realistic Portrait Image Animation with Diffusion Transformer Networks	arXiv 2024
2024	VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization	arXiv 2024
2024	Towards Customizable One-Shot Audio-to-Talking Face Generation	arXiv 2024
2024	LatentSync: Audio Conditioned Latent Diffusion Models for Lip Sync	arXiv 2024	Code
2024	Media2Face: Co-speech Facial Animation Generation with Multi-Modality Guidance	SIGGRAPH 2024
2024	PersonaTalk: Bring Attention to Your Persona in Visual Dubbing	SIGGRAPH Asia 2024
2024	StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads	TPAMI 2024
2024	JEAN: Joint Expression and Audio-guided NeRF-based Talking Face Generation	BMVC 2024
2024	Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis	arXiv 2024
2024	JoyVASA: Portrait and Animal Image Animation with Diffusion-Based Audio-Driven Facial Dynamics	arXiv 2024	Code
2024	HelloMeme: Integrating Spatial Knitting Attentions to Embed High-Level Conditions in Diffusion Models	arXiv 2024	Code
2024	LaDTalk: Latent Denoising for Synthesizing Talking Head Videos with High Frequency Details	arXiv 2024
2023	Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation	arXiv 2023	Project
2023	DiffTalk: Crafting Diffusion Models for Generalized Talking Head Synthesis	arXiv 2023	Project · Code
2023	READ Avatars: Realistic Emotion-controllable Audio Driven Avatars	arXiv 2023
2023	DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder	arXiv 2023
2023	Emotionally Enhanced Talking Face Generation	arXiv 2023	Code
2023	Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert	CVPR 2023	Code
2023	StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator	CVPR 2023	Project · Code
2023	GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation	arXiv 2023	Project · Code
2023	MODA: Mapping-Once Audio-driven Portrait Animation with Dual Attentions	ICCV 2023
2023	VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior	arXiv 2023	Project · Code
2023	IP_LAP: Identity-Preserving Talking Face Generation with Landmark and Appearance Priors	CVPR 2023	Code
2023	HyperLips: Hyper Control Lips with High Resolution Decoder for Talking Face Generation	CVPR 2023	Code
2023	Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation	ICCV 2023	Project · Code
2023	SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Talking Head Animation	CVPR 2023	Project · Code
2023	DINet: Deformation Inpainting Network for Realistic Face Visually Dubbing on High Resolution Video	AAAI 2023	Code
2023	EMMN: Emotional Motion Memory Network for Audio-driven Emotional Talking Face Generation	ICCV 2023
2023	ToonTalker: Cross-Domain Face Reenactment	ICCV 2023
2023	High-fidelity Generalized Emotional Talking Face Generation with Multi-modal Emotion Space Learning	CVPR 2023
2023	DisCoHead: Audio-and-Video-Driven Talking Head Generation by Disentangled Control of Head Pose and Facial Expressions	ICASSP 2023	Code
2022	Expressive Talking Head Generation with Granular Audio-Visual Control	CVPR 2022
2022	Talking Face Generation with Multilingual TTS	CVPR 2022	Demo
2022	EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model	SIGGRAPH 2022
2022	SPACEx 🚀: Speech-driven Portrait Animation with Controllable Expression	arXiv 2022	Project
2022	Masked Lip-Sync Prediction by Audio-Visual Contextual Exploitation in Transformers	SIGGRAPH Asia 2022
2022	Memories are One-to-Many Mapping Alleviators in Talking Face Generation	arXiv 2022
2021	Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation	CVPR 2021	Code · Project
2021	Imitating Arbitrary Talking Style for Realistic Audio-Driven Talking Face Synthesis	ACM Multimedia 2021
2021	Audio-Driven Emotional Video Portraits	CVPR 2021	Code
2021	Talking Head Generation with Audio and Speech Related Facial Action Units	arXiv 2021
2021	Speech2Talking-Face: Inferring and Driving a Face with Synchronized Audio-Visual Representation	IJCAI 2021
2021	Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation	ACM TOG 2021	Code
2021	Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion	arXiv 2021
2020	A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild	ACM Multimedia 2020	Code · Project
2020	Talking-head Generation with Rhythmic Head Motion	ECCV 2020	Code
2020	MakeItTalk: Speaker-Aware Talking-Head Animation	SIGGRAPH Asia 2020	Code · Project
2020	Neural Voice Puppetry: Audio-driven Facial Reenactment	ECCV 2020	Code · Project
2020	MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation	ECCV 2020	Code · Project
2020	Realistic Speech-Driven Facial Animation with GANs	IJCV 2020
2019	Talking Face Generation by Adversarially Disentangled Audio-Visual Representation	AAAI 2019	Code
2019	Hierarchical Cross-modal Talking Face Generation with Dynamic Pixel-wise Loss	CVPR 2019	Code
2018	Lip Movements Generation at a Glance	ECCV 2018	Code
2018	VisemeNet: Audio-Driven Animator-Centric Speech Animation	SIGGRAPH 2018
2017	Synthesizing Obama: Learning Lip Sync From Audio	SIGGRAPH 2017	Project
2017	You Said That?: Synthesising Talking Faces From Audio	IJCV 2019	Code
2017	Audio-Driven Facial Animation by Joint End-to-End Learning of Pose and Emotion	SIGGRAPH 2017
2017	A Deep Learning Approach for Generalized Speech Animation	SIGGRAPH 2017
2016	Lip Reading in the Wild	ACCV 2016

NeRF & 3D

Year	Paper	Venue	Links
2026	MMTalker: Multiresolution 3D Talking Head Synthesis with Multimodal Feature Fusion	arXiv 2026
2026	EmoTaG: Emotion-Aware Talking Head Synthesis on Gaussian Splatting with Few-Shot Personalization	CVPR 2026
2026	GenFaceTalk: Generalizable One-Shot Talking-Head Generation for Diverse Styles	ICLR 2026
2026	FG-Portrait: 3D Flow Guided Editable Portrait Animation	CVPR 2026
2026	Giving Faces Their Feelings Back: Explicit Emotion Control for Feedforward Single-Image 3D Head Avatars	arXiv 2026
2026	3DRealHead: Few-Shot Detailed Head Avatar	arXiv 2026
2026	FHAvatar: Fast and High-Fidelity Reconstruction of Face-and-Hair Composable 3D Head Avatar from Few Casual Captures	arXiv 2026
2026	NBAvatar: Neural Billboards Avatars with Realistic Hand-Face Interaction	arXiv 2026
2026	OMG-Avatar: One-shot Multi-LOD Gaussian Head Avatar	arXiv 2026
2026	OFERA: Blendshape-driven 3D Gaussian Control for Occluded Facial Expression to Realistic Avatars in VR	arXiv 2026
2026	CAG-Avatar: Cross-Attention Guided Gaussian Avatars for High-Fidelity Head Reconstruction	arXiv 2026
2026	Uncertainty-Aware 3D Emotional Talking Face Synthesis with Emotion Prior Distillation	arXiv 2026
2026	3DXTalker: Unifying Identity, Lip Sync, Emotion, and Spatial Dynamics in Expressive 3D Talking Avatars	arXiv 2026
2025	IM-Portrait: Learning 3D-aware Video Diffusion for Photorealistic Talking Heads from Monocular Videos	CVPR 2025
2025	Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics	CVPR 2025
2025	GaussianSpeech: Audio-Driven Personalized 3D Gaussian Avatars	ICCV 2025	Code
2025	MemoryTalker: Personalized Speech-Driven 3D Facial Animation via Audio-Guided Stylization	ICCV 2025	Code
2025	CAFE-TALK: Generating 3D Talking Face	ICLR 2025
2025	InsTaG: Learning Personalized 3D Talking Head from Few-Second Video	CVPR 2025	Code
2025	Monocular and Generalizable Gaussian Talking Head Animation	CVPR 2025	Project
2025	DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations	CVPR 2025	Code
2025	VASA-3D: Lifelike Audio-Driven Gaussian Head Avatars from a Single Image	NeurIPS 2025
2025	PointTalk: Audio-Driven Dynamic Lip Point Cloud for 3D Gaussian-based Talking Head Synthesis	AAAI 2025
2025	PTalker: Personalized Speech-Driven 3D Talking Head Animation via Style Disentanglement	arXiv 2025
2025	SynergyWarpNet: Attention-Guided Cooperative Warping for Neural Portrait Animation	arXiv 2025
2025	From Autoencoders to CycleGAN: Robust Unpaired Face Manipulation via Adversarial Learning	arXiv 2025
2024	CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer	WACV 2024	Code
2024	3D-Aware Talking-Head Video Motion Transfer	WACV 2024
2024	TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting	ECCV 2024	Code
2024	GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting	ACM MM 2024	Code
2024	Generalizable and Animatable Gaussian Head Avatar	NeurIPS 2024	Code
2024	MimicTalk: Mimicking a Personalized and Expressive 3D Talking Face in Minutes	NeurIPS 2024
2024	EmoTalk3D: High-Fidelity Free-View Synthesis of Emotional 3D Talking Head	ECCV 2024
2024	Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis	CVPR 2024	Code
2024	GPAvatar: Generalizable and Precise Head Avatar from Image(s)	ICLR 2024	Code
2024	UniTalker: Scaling up Audio-Driven 3D Facial Animation through A Unified Model	arXiv 2024	Code
2023	Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis	ICCV 2023	Code
2023	EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation	ICCV 2023	Code
2023	CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior	CVPR 2023	Code
2023	GANHead: Towards Generative Animatable Neural Head Avatars	CVPR 2023	Code
2023	OTAvatar: One-shot Talking Face Avatar with Controllable Tri-plane Rendering	CVPR 2023	Code
2023	GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis	ICLR 2023	Code
2023	SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis	CVPR 2024	Code
2022	Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation	arXiv 2022
2022	HeadNeRF: A Real-time NeRF-based Parametric Head Model	CVPR 2022	Code · Project
2022	I M Avatar: Implicit Morphable Head Avatars from Videos	CVPR 2022	Code
2022	Realistic One-shot Mesh-based Head Avatars	ECCV 2022
2022	FNeVR: Neural Volume Rendering for Face Animation	arXiv 2022	Code
2022	3DFaceShop: Explicitly Controllable 3D-Aware Portrait Generation	arXiv 2022	Code · Project
2022	Generative Neural Texture Rasterization for 3D-Aware Head Avatars	arXiv 2022	Project
2022	NeRFInvertor: High Fidelity NeRF-GAN Inversion for Single-shot Real Image Animation	arXiv 2022
2022	Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis	ECCV 2022	Code
2021	DFA-NeRF: Personalized Talking Head Generation via Disentangled Face Attributes Neural Rendering	arXiv 2021
2021	NerFACE: Dynamic Neural Radiance Fields for Monocular 4D Facial Avatar Reconstruction	CVPR 2021 Oral	Code · Project
2021	AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis	ICCV 2021	Code · Code
2020	Disentangled and Controllable Face Image Generation via 3D Imitative-Contrastive Learning	CVPR 2020 Oral	Code
