_ | talking | _ |
AD-NeRF: Audio Driven Neural Radiance Fields for | talking | Head Synthesis |
AnyoneNet: Synchronized Speech and | talking | Head Generation for Arbitrary Persons |
Are You | talking | to Me? Reasoned Visual Dialog Generation Through Adversarial Learning |
Audio-driven | talking | face generation with diverse yet realistic facial animations |
Audio-Driven | talking | Face Video Generation With Dynamic Convolution Kernels |
Audio-Driven | talking | Video Frame Restoration |
Audio-visual selection process for the synthesis of photo-realistic | talking | -head animations |
Audio-Visual Unit Selection for the Synthesis of Photo-Realistic | talking | -Heads |
Audiovisual | talking | Head for Augmented Speech Generation: Models and Animations Based on a Real Speaker's Articulatory Data, An |
Bangla | talking | Calculator for Visually Impaired Students in Bangladesh |
Combining online and offline learning for tracking a | talking | face in video |
Compact Temporal Trajectory Representation for | talking | Face Video Compression |
Compression of MPEG-4 Facial Animation Parameters for Transmission of | talking | Heads |
Cptnet: Cascade Pose Transform Network for Single Image | talking | Head Animation |
Creating 3D speech-driven | talking | heads: a probabilistic network approach |
Czech Artificial Computerized | talking | Head George |
DAVD-Net: Deep Audio-Aided Video Decompression of | talking | Heads |
Defending Low-Bandwidth | talking | Head Videoconferencing Systems From Real-Time Puppeteering Attacks |
Depth-Aware Generative Adversarial Network for | talking | Head Video Generation |
Do-it-yourself photo realistic | talking | head creation system and method |
Dual-modality | talking | -metrics: 3D Visual-Audio Integrated Behaviometric Cues from Speakers |
Efficient Emotional Adaptation for Audio-Driven | talking | -Head Generation |
Efficient Region-Aware Neural Radiance Fields for High-Fidelity | talking | Portrait Synthesis |
EMMN: Emotional Motion Memory Network for Audio-driven Emotional | talking | Face Generation |
Expressive | talking | Head Generation with Granular Audio-Visual Control |
Expressive | talking | Head Video Encoding in StyleGAN2 Latent Space |
Face Analysis for the Synthesis of Photo-Realistic | talking | Heads |
FACIAL: Synthesizing Dynamic | talking | Face with Implicit Attribute Learning |
FakeTalkerDetect: Effective and Practical Realistic Neural | talking | Head Detection with a Highly Unbalanced Dataset |
Fast Viseme Recognition for | talking | Head Application |
Few-Shot Adversarial Learning of Realistic Neural | talking | Head Models |
Flow-guided One-shot | talking | Face Generation with a High-resolution Audio-visual Dataset |
Free-HeadGAN: Neural | talking | Head Synthesis With Explicit Gaze Control |
Hierarchical Cross-Modal | talking | Face Generation With Dynamic Pixel-Wise Loss |
High-Fidelity and Freely Controllable | talking | Head Video Generation |
High-Fidelity Generalized Emotional | talking | Face Generation with Multi-Modal Emotion Space Learning |
Hypermask: | talking | Head Projected Onto Moving Surface |
Identity-Preserving | talking | Face Generation with Landmark and Appearance Priors |
iface: A 3d Synthetic | talking | Face |
Implicit Identity Representation Conditioned Memory Compensation Network for | talking | Head Video Generation |
Implicit Memory-Based Variational Motion | talking | Face Generation |
Impostures of | talking | Face Systems Using Automatic Face Animation |
IV2 Multimodal Biometric Database (Including Iris, 2D, 3D, Stereoscopic, and | talking | Face Data), and the IV2-2007 Evaluation Campaign, The |
Learned Spatial Representations for Few-shot | talking | -Head Synthesis |
Learning Dynamic Facial Radiance Fields for Few-Shot | talking | Head Synthesis |
Learning Landmarks Motion from Speech for Speaker-agnostic 3d | talking | Heads Generation |
Learning to Recognise | talking | Faces |
Leveraging Real | talking | Faces via Self-Supervision for Robust Forgery Detection |
Lifelike | talking | faces for interactive services |
LipFormer: High-fidelity and Generalizable | talking | Face Generation with A Pre-learned Facial Codebook |
LipSync3D: Data-Efficient Learning of Personalized 3D | talking | Faces from Video using Pose and Lighting Normalization |
Look who is not | talking | : Assessing engagement levels in panel conversations |
Look who's | talking | : Speaker detection using video and audio correlation |
Mead: A Large-scale Audio-visual Dataset for Emotional | talking | -face Generation |
MetaPortrait: Identity-Preserving | talking | Head Generation with Fast Personalized Adaptation |
Modelling | talking | Head Behaviour |
Multimodal Inputs Driven | talking | Face Generation With Spatial-Temporal Dependency |
Multimodal Learning for Temporally Coherent | talking | Face Generation With Articulator Synergy |
new frame interpolation scheme for | talking | head sequences, A |
Non-Invasive Approach for Driving Virtual | talking | Heads from Real Facial Movements, A |
One-Shot Free-View Neural | talking | -Head Synthesis for Video Conferencing |
One-Shot High-Fidelity | talking | -Head Synthesis with Deformable Neural Radiance Field |
OTAvatar: One-Shot | talking | Face Avatar with Controllable Tri-Plane Rendering |
Partial linear regression for speech-driven | talking | head application |
Photo realistic | talking | head creation system and method |
Pose-Controllable | talking | Face Generation by Implicitly Modularized Audio-Visual Representation |
Progressive Disentangled Representation Learning for Fine-Grained Controllable | talking | Head Synthesis |
Reactive Memories: An Interactive | talking | -Head |
Realistic head motion synthesis for an image-based | talking | head |
Realistic | talking | Face Synthesis With Geometry-Aware Feature Transformation |
Recognizing | talking | faces from acoustic Doppler reflections |
SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image | talking | Face Animation |
Sample-based Synthesis of | talking | Heads |
SD-NeRF: Towards Lifelike | talking | Head Animation via Spatially-Adaptive Dual-Driven NeRFs |
Seeing What You Said: | talking | Face Generation Guided by a Lip Reading Expert |
Shape and appearance models of | talking | faces for model-based tracking |
Simulated natural movement of a computer-generated synthesized | talking | head |
Speech Driven | talking | Face Generation From a Single Image and an Emotion Condition |
Speech-Driven Expressive | talking | Lips with Conditional Sequential Generative Adversarial Networks |
StyleHEAT: One-Shot High-Resolution Editable | talking | Face Generation via Pre-trained StyleGAN |
Synthesizing a | talking | mouth |
Synthesizing Photo-Realistic 3D | talking | Head: Learning Lip Synchronicity and Emotion from Audio and Video |
Synthesizing | talking | Faces from Text and Audio: An Autoencoder and Sequence-to-Sequence Convolutional Neural Network |
| talking | About 3D Scenes: Integration of Image and Speech Understanding in a Hybrid Distributed System |
| talking | Cars, Doubtful Users: A Population Study in Virtual Reality |
| talking | Detection in Collaborative Learning Environments |
| talking | Face |
| talking | Face Generation via Learning Semantic and Temporal Synchronous Landmarks |
| talking | Face Generation with Multilingual TTS |
| talking | Face: Using Facial Feature Detection and Image Transformations for Visual Speech |
| talking | Faces: Technologies and Applications |
| talking | Head Generation with Probabilistic Audio-to-Visual Diffusion Priors |
| talking | Heads, Speech Driven Face Animation |
| talking | Heads: Detecting Humans and Recognizing Their Interactions |
| talking | Heads: Introducing the tool of 3D motion fields in the study of action |
| talking | pictures: Temporal grouping and dialog-supervised person recognition |
| talking | profile to distinguish identical twins, A |
| talking | to Machines |
| talking | With Hands 16.2M: A Large-Scale Dataset of Synchronized Body-Finger Motion and Audio for Conversational Motion Analysis and Synthesis |
| talking | with signs A simple method to detect nouns and numbers in a non-annotated signs language corpus |
| talking | With Your Hands: Scaling Hand Gestures and Recognition With CNNs |
| talking | -head Generation with Rhythmic Head Motion |
Three-Dimensional Facial Adaptation for MPEG-4 | talking | Heads |
Toward Fine-Grained | talking | Face Generation |
Towards a low bandwidth | talking | face using appearance models |
Towards Generating Ultra-High Resolution | talking | -Face Videos with Lip synchronization |
Towards MOOCs for Lipreading: Using Synthetic | talking | Heads to Train Humans in Lipreading at Scale |
VAST: Vivify Your | talking | Avatar via Zero-Shot Expressive Facial Style Transfer |
Ventriloquist-Net: Leveraging Speech Cues for Emotive | talking | Head Generation |
Viseme Classification for | talking | Head Application |
Walking and | talking | : A bilinear approach to multi-label action recognition |
Watching and | talking | : media content as social nexus |
What Are You | talking | About? Text-to-Image Coreference |
You Said That?: Synthesising | talking | Faces from Audio |
114 for talking