_ | asr | _ |
Accuracy Analysis of Generalized Pronunciation Variant Selection in | asr | Systems |
ASQ: An Ultra-Low Bit Rate | asr | -Oriented Speech Quantization Method |
AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV- | asr | |
CIF-Based Speech Segmentation Method for Streaming E2E | asr | , A |
Class-Based Parametric Approximation to Histogram Equalization for | asr | |
Combining Several | asr | Outputs in a Graph-Based SLU System |
Creating speaker independent | asr | system through prosody modification based data augmentation |
DNN Uncertainty Propagation Using GMM-Derived Uncertainty Features for Noise Robust | asr | |
E2E- | asr | -Based Iteratively-Trained Timestamp Estimator, An |
Effect of Prosody Modification on Children's | asr | |
Impact of the Approaches Involved on Word-Graph Derivation from the | asr | System |
Information Distance-Based Subvector Clustering for | asr | Parameter Quantization |
Integrating Phonological Knowledge in | asr | Systems for Spanish Language |
Speaker Dependent | asr | s for Huastec and Western-Huastec Nahuatl Languages |
Spelling-Aware Word-Based End-to-End | asr | |
SSCFormer: Push the Limit of Chunk-Wise Conformer for Streaming | asr | Using Sequentially Sampled Chunks and Chunked Causal Convolution |
UniEnc-CASSNAT: An Encoder-Only Non-Autoregressive | asr | for Speech SSL Models |
17 for asr