_ | voice | _ |
Active Defense Against | voice | Conversion Through Generative Adversarial Network |
Adversarial Continual Learning to Transfer Self-Supervised Speech Representations for | voice | Pathology Detection |
Affective Learning: Empathetic Agents with Emotional Facial and Tone of | voice | Expressions |
AI-Synthesized | voice | Detection Using Neural Vocoder Artifacts |
Analysis of Features and Metrics for Alignment in Text-Dependent | voice | Conversion |
Analytical Approach for | voice | Capacity Estimation Over WiFi Network Using ITU-T E-Model, An |
Aspects of | voice | Interaction on a Mobile Augmented Reality Application |
Audiovisual | voice | activity detection using off-the-shelf cameras |
Bimodal Expression of Emotion by Face and | voice | |
BIOMET: A Multimodal Person Authentication Database Including Face, | voice | , Fingerprint, Hand and Signature Modalities |
Causal reasoning for algorithmic fairness in | voice | controlled cyber-physical systems |
Challenging | voice | Dataset for Robotic Applications in Noisy Environments, A |
Color-based lips extraction applied to | voice | activity detection |
Combining speech energy and edge information for fast and efficient | voice | activity detection in noisy environments |
Comparative Analysis between Wavelets for the Identification of Pathological | voice | s |
Comparative evaluation of feature normalization techniques for | voice | password based speaker verification |
Comparisons of Visual Activity Primitives for | voice | Activity Detection |
Computational Auditory Scene Analysis Based | voice | Activity Detection |
Continuous Authentication With Touch Behavioral Biometrics and | voice | on Wearable Glasses |
Controlled Autoencoders to Generate Faces from | voice | s |
Cross-Modal Perceptionist: Can Face Geometry be Gleaned from | voice | s? |
Crossmodal Matching of Speakers Using Lip and | voice | Features in Temporally Non-overlapping Audio and Video Streams |
Cyclonic Process of the | voice | of the Sea Microseism Generation and Its Remote Monitoring |
Deep Cross-Modal Image- | voice | Retrieval in Remote Sensing |
Deep Neural Networks for Detecting Real Emotions Using Biofeedback and | voice | |
Design and optimization of a long-stroke compliant micropositioning stage driven by | voice | coil motor |
Detecting Aggression in | voice | Using Inverse Filtered Speech Features |
Development of a | voice | Virtual Assistant for the Geospatial Data Visualization Application on the Web |
Distinct Synthesizer Convolutional Tasnet for Singing | voice | Separation, A |
Dragon | voice | |
Dual Microphone | voice | Activity Detection Exploiting Interchannel Time and Level Differences |
Emotion Intensity and its Control for Emotional | voice | Conversion |
Energy and Computation Efficient Audio-Visual | voice | Activity Detection Driven by Event-Cameras |
Evaluation and analysis of a face and | voice | outdoor multi-biometric system |
Evaluation of Time and Frequency Domain-Based Methods for the Estimation of Harmonics-to-Noise-Ratios in | voice | Signals |
Face, Body, | voice | : Video Person-Clustering with Multiple Modalities |
Face-to-face Communicative Avatar Driven by | voice | |
Face- | voice | Authentication Based on 3D Face Models |
Fast Inter-Harmonic Reconstruction for Spectral Envelope Estimation in High-Pitched | voice | s |
Forging | voice | s and faces |
Functional Feature Selection by Weighted Projections in Pathological | voice | Detection |
GEN-RES-NET: A Novel Generative Model for Singing | voice | Separation |
Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for | voice | Research and Affective Computing, The |
Geospatial User Feedback: How to Raise Users' | voice | s and Collectively Build Knowledge at the Same Time |
Gesture-based interaction with | voice | feedback for a tour-guide robot |
Glottal and Vocal Tract Characteristics of | voice | Impersonators |
Hetero-Associative Memories for | voice | Signal and Image Processing |
Homogeneity Measure for Forensic | voice | Comparison: A Step Forward Reliability |
Hybrid Biometric Person Authentication Using Face and | voice | Features |
ICA-FX features for classification of singing | voice | and instrumental sound |
Identification of Electronic Disguised | voice | s in the Noisy Environment |
improved method for | voice | pathology detection by means of a HMM-based feature space transformation, An |
improved noise-robust | voice | activity detector based on hidden semi-Markov models, An |
Improvement and Evaluation of Time-Spread Echo Hiding Technology for Long-Distance | voice | Evacuation Systems |
Improvements of | voice | Timbre Control Based on Perceived Age in Singing Voice Conversion |
Improvements of | voice | Timbre Control Based on Perceived Age in Singing Voice Conversion |
Improving Parkinson's disease recognition through | voice | analysis using deep learning |
Inferring Emotions From Large-Scale Internet | voice | Data |
Inferring Emphasis for Real | voice | Data: An Attentive Multimodal Neural Network Approach |
Informed Group-Sparse Representation for Singing | voice | Separation |
Inner | voice | s: Reflexive Augmented Listening |
Interdependencies among | voice | Source Parameters in Emotional Speech |
Interference Reduction in Reverberant Speech Separation With Visual | voice | Activity Detection |
Introduction to Signal Processing for Singing- | voice | Analysis: High Notes in the Effort to Automate the Understanding of Vocals in Music, An |
Investigating Trust Factors in Human-Robot Shared Control: Implicit Gender Bias Around Robot | voice | |
Investigation of Normalised Time of Increasing Vocal Fold Contact as a Discriminator of Emotional | voice | Type |
Is synthetic | voice | detection research going into the right direction? |
Isolated spoken word recognition using packed-MFCC on padded- | voice | signal for unscripted languages |
Joint learning for | voice | based disease detection |
Language Modelization and Categorization for | voice | -Activated QA |
LDV Remote | voice | Acquisition and Enhancement |
Learning to Infer Public Emotions from Large-Scale Networked | voice | Data |
Learning Visual | voice | Activity Detection with an Automatically Annotated Dataset |
Light-weight Frequency Information Aware Neural Network Architecture for | voice | Spoofing Detection |
Lightweight | voice | Spoofing Detection Using Improved One-Class Learning and Knowledge Distillation |
LM-VC: Zero-Shot | voice | Conversion via Speech Generation Based on Language Models |
Local-Global Contrast for Learning | voice | -Face Representations |
Low-Complexity | voice | Activity Detector Using Periodicity and Energy Ratio |
Low-Complexity | voice | Detector for Mobile Environments |
LSTM Deep Neural Networks Postfiltering for Improving the Quality of Synthetic | voice | s |
Mandarin Language Learning System for Nasal | voice | User |
Mixture of Factor Analyzers Using Priors From Non-Parallel Speech for | voice | Conversion |
MSU-AVIS dataset: Fusing Face and | voice | Modalities for Biometric Recognition in Indoor Surveillance Videos |
Multi-speaker | voice | activity detection using a camera-assisted microphone array |
Multi-Task WaveRNN With an Integrated Architecture for Cross-Lingual | voice | Conversion |
Multimodal Biometrics of Lip Movements and | voice | using Kernel Fisher Discriminant Analysis |
Multipath fading effects on integrated video, | voice | and data transmission in hybrid-code BPSK-DS/CDMA systems |
Neural network vowel-recognition jointly using | voice | features and mouth shape image |
Neural | voice | Puppetry: Audio-driven Facial Reenactment |
New Method of Image Encryption/Decryption via | voice | Features, A |
New Sampling Method of Auto Focus for | voice | Coil Motor in Camera Modules, A |
Noise robust | voice | detector for speaker recognition |
Noise-Robust | voice | Activity Detector Based on Hidden Semi-Markov Models |
Novel Application of Real-time Face Tracking and Microphones Array to Pick up Human | voice | Remotely and Clearly, A |
Novel Modified Mel-DCT Filter Bank Structure With Application to | voice | Activity Detection, A |
Novel Transducer: From Lip Motion to | voice | Message, A |
On Learning Associations of Faces and | voice | s |
One-Class Learning Towards Synthetic | voice | Spoofing Detection |
Open | voice | Command Interface Kit, An |
Places Speaking with Their Own | voice | s. A Case Study from the Gra.fo Archives |
portability evaluation of Brazilian Portuguese | voice | s produced with MARY TTS, A |
Presentation and Short Discussion of rVAD-fast, a Fast | voice | Activity Detector, A |
Prosodic, Spectral and | voice | Quality Feature Selection Using a Long-Term Stopping Criterion for Audio-Based Emotion Recognition |
Put-That-There: | voice | and gesture at the graphics interface |
Quick and efficient definition of hangbefore and hangover criteria for | voice | activity detection |
Random Cycle Loss and Its Application to | voice | Conversion |
real-time accompaniment system based on sung | voice | recognition, A |
RealVAD: A Real-World Dataset and A Method for | voice | Activity Detection by Body Motion Analysis |
Remote Sensing of Infrasound Signals of the | voice | of the Sea during the Evolution of Typhoons |
ResMax: Detecting | voice | Spoofing Attacks with Residual Network and Max Feature Map |
Restricted Boltzmann Machine-Based | voice | Conversion for Nonparallel Corpus |
Rhythm Modeling for | voice | Conversion |
Robust face- | voice | based speaker identity verification using multilevel fusion |
Robust Visual | voice | Activity Detection Using Long Short-Term Memory Recurrent Neural Network |
Robust | voice | Activity Detection Algorithm Based on Feature of Frequency Modulation of Harmonics and Its DSP Implementation |
Robust | voice | activity detection directed by noise classification |
S-VVAD: Visual | voice | Activity Detection by Motion Segmentation |
Score-Level Fusion of Face and | voice | Using Particle Swarm Optimization and Belief Functions |
See the Silence: Improving Visual-Only | voice | Activity Detection by Optical Flow and RGB Fusion |
Seeing | voice | s and Hearing Faces: Cross-Modal Biometric Matching |
Seeking the Shape of Sound: An Adaptive Framework for Learning | voice | -Face Association |
Semantics-Consistent Representation Learning for Remote Sensing Image- | voice | Retrieval |
Separation and Classification of Harmonic Sounds for Singing | voice | Detection |
Simultaneous-Speaker | voice | Activity Detection and Localization Using Mid-Fusion of SVM and HMMs |
Skip Attention Mechanism for Monaural Singing | voice | Separation, A |
Slice-based architecture for biometrics: Prototype illustration on privacy preserving | voice | verification |
Spatial Bias in Vision-Based | voice | Activity Detection |
Spectro-Temporal Attention-Based | voice | Activity Detection |
Speech enhancement for in-vehicle | voice | control systems using wavelet analysis and blind source separation |
Speech-to-Singing | voice | Conversion: The Challenges and Strategies for Improving Vocal Conversion Processes |
Speech2Face: Learning the Face Behind a | voice | |
SSM: A Novel Method to Recognize the Fundamental Frequency in | voice | Signals |
Study of | voice | Source and Vocal Tract Filter Based Features in Cognitive Load Classification, A |
StyleVC: Non-Parallel | voice | Conversion with Adversarial Style Generalization |
Substitution of Vocal Folds for | voice | Generation by Means of Intra-Oral Pulse Generator |
SVSNet: An End-to-End Speaker | voice | Similarity Assessment Model |
Text-independent | voice | conversion using deep neural network based phonetic level features |
Toward Visual | voice | Activity Detection for Unconstrained Videos |
Towards the creation of reliable | voice | control system based on a fuzzy approach |
Usability and Functionality Assessment of an Oculus Rift in Immersive and Interactive Systems Using | voice | Commands |
Using HMD for Immersive Training of | voice | -Based Operation of Small Unmanned Ground Vehicles |
V2C: Visual | voice | Cloning |
V2S: | voice | to Sign Language Translation System for Malaysian Deaf People |
Visual | voice | activity detection based on spatiotemporal information and bag of words |
Visual | voice | Activity Detection in the Wild |
Visual | voice | Activity Detection Using Frontal versus Profile Views |
Visual | voice | activity detection with optical flow |
| voice | activity detection and speaker localization using audiovisual cues |
| voice | Activity Detection Based on Complex Exponential Atomic Decomposition and Likelihood Ratio Test |
| voice | Activity Detection by Upper Body Motion Analysis and Unsupervised Domain Adaptation |
| voice | Activity Detection Using an Adaptive Context Attention Model |
| voice | Activity Detection Using Wavelet-Based Multiresolution Spectrum and Support Vector Machines and Audio Mixing Algorithm |
| voice | Activity Detection Via Noise Reducing Using Non-Negative Sparse Coding |
| voice | Activity Detection: Merging Source and Filter-based Information |
| voice | biometrics using linear Gaussian model |
| voice | Communication-Augmented Simulation Framework for Aircraft Trajectory Simulation, A |
| voice | Conversion for Whispered Speech Synthesis |
| voice | Conversion Using Learnable Similarity-guided Masked Autoencoder |
| voice | disguise by mimicry: deriving statistical articulometric evidence to evaluate claimed impersonation |
| voice | Instruction Learning System Using PL-T-SOM, A |
| voice | Interaction for Augmented Reality Navigation Interfaces with Natural Language Understanding |
| voice | of Leadership: Models and Performances of Automatic Analysis in Online Speeches, The |
| voice | Pleasantness of Female Voices and the Assessment of Physical Characteristics |
| voice | Pleasantness of Female Voices and the Assessment of Physical Characteristics |
| voice | Transformation by Mapping the Features at Syllable Level |
| voice | -Assisted Image Labeling for Endoscopic Ultrasound Classification Using Neural Networks |
| voice | -Bandwidth Visual Communication Through Logmaps: The Telecortex |
VoViT: Low Latency Graph-Based Audio-Visual | voice | Separation Transformer |
168 for voice