24.1.16 Signal Processing, Speech Recognition, Speech Analysis

Chapter Contents (Back)
These are mostly included since they are in the full ToC for journals that are taken completely. See also Emotion Recognition, from Other Than Faces.

Dragon Voice,
2005 Speech Recognition
WWW Link. Vendor, Speech Recognition. Developed from the original Dragon speech system.

Hanson, A.R., Riseman, E.M., Fisher, E.,
Context in word recognition,
PR(8), No. 1, January 1976, pp. 35-45.
Elsevier DOI 0309
BibRef

de Mori, R., Laface, P., Makhonine, V.A., Mezzalama, M.,
A syntactic procedure for the recognition of glottal pulses in continuous speech,
PR(9), No. 4, 1977, pp. 181-189.
Elsevier DOI 0309
BibRef

Maroy, J.P., Berthod, M.,
Natural language understanding by a robot: A pattern recognition problem,
PR(10), No. 2, 1978, pp. 63-71.
Elsevier DOI 0309
BibRef

Pal, S.K., Datta, A.K., Majumder, D.D.[D. Dutta],
A self-supervised vowel recognition system,
PR(12), No. 1, 1980, pp. 27-34.
Elsevier DOI 0309
BibRef

Pathak, A.[Amita], Pal, S.K.[Sankar K.],
On the convergence of 'A self-supervised vowel recognition system',
PR(20), No. 2, 1987, pp. 237-244.
Elsevier DOI 0309
BibRef

de Mori, R.[Renato], Giordano, G.[Giovanna],
Algorithms for syllabic hypothesization in continuous speech,
PR(14), No. 1-6, 1981, pp. 245-260.
Elsevier DOI 0309
BibRef

Tanaka, E.[Eiichi], Toyama, T.[Takanori], Kawai, S.[Sachiko],
High speed error correction of phoneme sequences,
PR(19), No. 5, 1986, pp. 407-412.
Elsevier DOI 0309
BibRef

Lee, L.S., Tseng, C.Y., Chen, K.J., Huang, J., Hwang, C.H., Ting, P.Y., Lin, L.J., Chen, C.C.,
A Mandarin dictation machine based upon a hierarchical recognition approach and Chinese natural language analysis,
PAMI(12), No. 7, July 1990, pp. 695-704.
IEEE DOI 0401
BibRef

Kenny, P., Lennig, M., Mermelstein, P.,
Speaker adaptation in a large-vocabulary Gaussian HMM recognizer,
PAMI(12), No. 9, September 1990, pp. 917-920.
IEEE DOI 0401
BibRef

Casacuberta, F.,
Some relations among stochastic finite state networks used in automatic speech recognition,
PAMI(12), No. 7, July 1990, pp. 691-695.
IEEE DOI 0401
BibRef

Yannakoudakis, E.J., Tsomokos, I., Hutton, P.J.,
n-Grams and their implication to natural language understanding,
PR(23), No. 5, 1990, pp. 509-528.
Elsevier DOI 0401
BibRef

Ney, H.[Hermann],
A comparative study of two search strategies for connected word recognition: dynamic programming and heuristic search,
PAMI(14), No. 5, May 1992, pp. 586-595.
IEEE DOI 0401
BibRef

Ney, H.[Hermann],
Stochastic Modelling: From Pattern Classification to Speech Recognition and Translation,
ICPR00(Vol III: 21-28).
IEEE DOI 0009
BibRef

Wu, J.X.[Jian-Xiong], Chan, C.[Chorkin],
Isolated word recognition by neural network models with cross-correlation coefficients for speech dynamics,
PAMI(15), No. 11, November 1993, pp. 1174-1185.
IEEE DOI 0401
BibRef

Liu, L.C.[Lih-Cherng], Chiou, D.[Denis], Wang, H.C.[Hsiao-Chuan],
A speech recognition method based on feature distributions,
PR(24), No. 8, 1991, pp. 717-722.
Elsevier DOI 0401
BibRef

Pinkowski, B.[Ben],
Multiscale fourier descriptors for classifying semivowels in spectrograms,
PR(26), No. 10, October 1993, pp. 1593-1602.
Elsevier DOI 0401
BibRef

Pinkowski, B.[Ben],
Principal Component Analysis of Speech Spectrogram Images,
PR(30), No. 5, May 1997, pp. 777-787.
Elsevier DOI 9705
BibRef

Chen, W.Y.[Wen-Yuan], Liao, Y.F.[Yuan-Fu], Chen, S.H.[Sin-Horng],
Speech recognition with hierarchical recurrent neural networks,
PR(28), No. 6, June 1995, pp. 795-805.
Elsevier DOI 0401
BibRef

Huo, Q.A.[Qi-Ang], Chan, C.[Chorkin],
Contextual vector quantization for speech recognition with discrete hidden Markov model,
PR(28), No. 4, April 1995, pp. 513-517.
Elsevier DOI 0401
BibRef

Pham, T.D.[Tuan D.], Wagner, M.[Michael],
A geostatistical model for linear prediction analysis of speech,
PR(31), No. 12, December 1998, pp. 1981-1991.
Elsevier DOI 0401
BibRef

Lee, T.[Tan], Ching, P.C., Chan, L.W.[Lai-Wan],
Isolated word recognition using modular recurrent neural networks,
PR(31), No. 6, June 1998, pp. 751-760.
Elsevier DOI 0401
BibRef

Han, J.Q.[Ji-Qing], Gao, W.[Wen],
Robust telephone speech recognition based on channel compensation,
PR(32), No. 6, June 1999, pp. 1061-1067.
Elsevier DOI 0401
BibRef

Deng, S.[Shiwen], Han, J.Q.[Ji-Qing],
Sparse Decomposition for Signal Periodic Model Over Complex Exponential Dictionary,
SPLetters(23), No. 12, December 2016, pp. 1858-1861.
IEEE DOI 1612
signal representation BibRef
And:
Voice Activity Detection Based on Complex Exponential Atomic Decomposition and Likelihood Ratio Test,
ICPR10(89-92).
IEEE DOI 1008
BibRef

Lewis, M.A.[Michael A.], Ramachandran, R.P.[Ravi P.],
Cochannel speaker count labelling based on the use of cepstral and pitch prediction derived features,
PR(34), No. 2, February 2001, pp. 499-507.
Elsevier DOI 0011
BibRef

Kant, S.[Shri], Verma, N.[Neelam],
An Effective Source Recognition Algorithm: Extraction of Significant Binary Words,
PRL(21), No. 11, October 2000, pp. 981-988. 0010
BibRef

Kwong, S., He, Q.H., Man, K.F., Tang, K.S.,
A maximum model distance approach for HMM-based speech recognition,
PR(31), No. 3, March 1998, pp. 219-229.
Elsevier DOI 0401
BibRef

He, Q.H., Kwong, S., Man, K.F., Tang, K.S.,
An improved maximum model distance approach for HMM-based speech recognition systems,
PR(33), No. 10, October 2000, pp. 1749-1758.
Elsevier DOI 0006
BibRef

Wu, C.H., Chen, Y.J., Yan, G.L.,
Integration of phonetic and prosodic information for robust utterance verification,
VISP(147), No. 1, February 2000, pp. 55. 0005
BibRef

Kim, W.[Wooil], Kang, S.[Sunmee], Ko, H.S.[Han-Seok],
Spectral subtraction based on phonetic dependency and masking effects,
VISP(147), No. 5, October 2000, pp. 423-427. 0101
BibRef

Hussain, A., Campbell, D.R.,
Intelligibility improvements using binaural diverse sub-band processing applied to speech corrupted with automobile noise,
VISP(148), No. 2, April 2001, pp. 127-132. 0106
BibRef

Bohez, E.L.J.[Erik L.J.], Senevirathne, T.R.,
Speech recognition using fractals,
PR(34), No. 11, November 2001, pp. 2227-2243.
Elsevier DOI 0108
BibRef

Chen, S.H., Wang, J.F.,
Application of wavelet transforms for C/V segmentation on Mandarin speech signals,
VISP(148), No. 2, April 2001, pp. 133-139. 0106
BibRef

Mouria-Beji, F.[Fériel],
A hierarchical Bayesian model for continuous speech recognition,
PRL(23), No. 7, May 2002, pp. 773-781.
Elsevier DOI 0203
BibRef

Chen, F.K., Yang, J.F., Yan, Y.L.,
Candidate scheme for fast ACELP search,
VISP(149), No. 1, February 2002, pp. 10-16.
IEEE Top Reference. 0205
Algebraic code excited linear prediction. Speech coding. BibRef

Liu, J.W.[Jing-Wei], Cheng, Q.S.[Qian-Sheng], Zheng, Z.G.[Zhong-Guo], Qian, M.P.[Min-Ping],
A DTW-based probability model for speaker feature analysis and data mining,
PRL(23), No. 11, September 2002, pp. 1271-1276.
Elsevier DOI 0206
BibRef

Huang, C.S.[Chao-Shih], Wang, H.C.[Hsiao-Chuan],
Bandwidth-adjusted LPC analysis for robust speech recognition,
PRL(24), No. 9-10, June 2003, pp. 1583-1587.
Elsevier DOI 0304
BibRef

Juang, Y.T.[Yau-Tarng], Huang, K.C.[Kuo-Chang], Ding, I.J.[Ing-Jr],
Speaker adaptation based on MAP estimation using fuzzy controller,
PRL(24), No. 15, November 2003, pp. 2807-2813.
Elsevier DOI 0308
BibRef

Ding, I.J.[Ing-Jr],
Incremental MLLR speaker adaptation by fuzzy logic control,
PR(40), No. 11, November 2007, pp. 3110-3119.
Elsevier DOI 0707
Speech recognition; Speaker adaptation; Hidden Markov model; Maximum likelihood linear regression; T-S fuzzy logic controller BibRef

Li, T.F.[Tze Fen],
Speech Recognition of Mandarin Monosyllables,
PR(36), No. 11, November 2003, pp. 2713-2721.
Elsevier DOI 0309
BibRef

Farooq, O., Datta, S.,
Wavelet based robust sub-band features for phoneme recognition,
VISP(151), No. 3, June 2004, pp. 187-193.
IEEE Abstract. 0409
BibRef

Ricotti, L.P.,
Multitapering and a wavelet variant of MFCC in speech recognition,
VISP(152), No. 1, February 2005, pp. 29-35.
IEEE Abstract. 0501
BibRef

Chen, K.[Ke],
On the use of different speech representations for speaker modeling,
SMC-C(35), No. 3, August 2005, pp. 301-314.
IEEE DOI 0508
BibRef

Zhong, W., Li, S., Tai, H.M.,
Signal subspace approach for narrowband noise reduction in speech,
VISP(152), No. 6, December 2005, pp. 800-805.
DOI Link 0512
BibRef

Chen, B.[Berlin],
Exploring the use of latent topical information for statistical Chinese spoken document retrieval,
PRL(27), No. 1, 1 January 2006, pp. 9-18.
Elsevier DOI 0512
BibRef

Chen, B.[Berlin], Chen, Y.T.[Yi-Ting],
Extractive spoken document summarization for information retrieval,
PRL(29), No. 4, 1 March 2008, pp. 426-437.
Elsevier DOI 0711
Extractive summarization; Information retrieval; Topical mixture model; Spoken documents; Speech recognition BibRef

Wan, C.[Chunru], Liu, M.C.[Ming-Chun],
Content-based audio retrieval with relevance feedback,
PRL(27), No. 2, 15 January 2006, pp. 85-92.
Elsevier DOI 0512
BibRef

Radhakrishnan, R.[Regunathan], Divakaran, A.[Ajay], Xiong, Z.Y.[Zi-You], Otsuka, I.[Isao],
A Content-Adaptive Analysis and Representation Framework for Audio Event Discovery from 'Unscripted' Multimedia,
JASP(2006), 2006, pp. 1-24.
DOI Link 0603
BibRef

Chu, W.T.[Wei-Ta], Cheng, W.H.[Wen-Huang], Wu, J.L.[Ja-Ling],
Semantic Context Detection Using Audio Event Fusion,
JASP(2006), 2006, pp. 1-12.
WWW Link. 0603
BibRef

Liu, J.W.[Jing-Wei], Wang, Z.Y.[Zuo-Ying], Xiao, X.[Xi],
A hybrid SVM/DDBHMM decision fusion modeling for robust continuous digital speech recognition,
PRL(28), No. 8, 1 June 2007, pp. 912-920.
Elsevier DOI 0704
Speech recognition; Gaussian mixture model; Duration distribution based hidden Markov model (DDBHMM); Support vector machine BibRef

Leavitt, N.,
Two technologies vie for recognition in speech market,
Computer(36), No. 6, June 2003, pp. 13-16.
IEEE DOI 0306
BibRef

Paulson, L.D.,
Speech Recognition Moves from Software to Hardware,
Computer(39), No. 11, November 2006, pp. 15-18.
IEEE DOI 0611
BibRef

Stavrakoudis, D.G., Theocharis, J.B.,
Pipelined Recurrent Fuzzy Neural Networks for Nonlinear Adaptive Speech Prediction,
SMC-B(37), No. 5, October 2007, pp. 1305-1320.
IEEE DOI 0711
BibRef

Araujo, L.[Lourdes], Serrano, J.I.[J. Ignacio],
Highly accurate error-driven method for noun phrase detection,
PRL(29), No. 4, 1 March 2008, pp. 547-557.
Elsevier DOI 0711
Noun phrase detection; Evolutionary programming; Grammar induction; Information retrieval BibRef

Zhang, Y.X.[Yong-Xin], Scordilis, M.S.[Michael S.],
Effective online unsupervised adaptation of Gaussian mixture models and its application to speech classification,
PRL(29), No. 6, 15 April 2008, pp. 735-744.
Elsevier DOI 0803
Gaussian mixture model; Speech classification; Online adaptation; Unsupervised adaptation BibRef

O'Shaughnessy, D.[Douglas],
Invited paper: Automatic speech recognition: History, methods and challenges,
PR(41), No. 10, October 2008, pp. 2965-2979.
Elsevier DOI 0808
Automatic speech recognition; Hidden Markov models; Adaptation; Compensation; Pattern recognition; Spectral representation BibRef

Zeng, J.[Jia], Xie, L.[Lei], Liu, Z.Q.[Zhi-Qiang],
Type-2 fuzzy Gaussian mixture models,
PR(41), No. 12, December 2008, pp. 3636-3643.
Elsevier DOI 0810
BibRef
Earlier: A1, A3, Only:
Type-2 fuzzy hidden markov models to phoneme recognition,
ICPR04(I: 192-195).
IEEE DOI 0409
Type-2 fuzzy sets; Gaussian mixture models; Hidden Markov models BibRef

Chen, B.[Berlin], Liu, S.H.[Shih-Hung], Chu, F.H.[Fang-Hui],
Training data selection for improving discriminative training of acoustic models,
PRL(30), No. 13, 1 October 2009, pp. 1228-1235.
Elsevier DOI 0909
Continuous speech recognition; Discriminative training; Acoustic models; Data selection; Phone accuracy; Entropy BibRef

Kang, S.W.[Sang-Woo], Kim, H.[Harksoo], Seo, J.Y.[Jung-Yun],
A reliable multidomain model for speech act classification,
PRL(31), No. 1, 1 January 2010, pp. 71-74.
Elsevier DOI 1001
Speech act classification; Dialogue domain detection; Multidomain dialogue BibRef

Kang, S.W.[Sang-Woo], Seo, J.Y.[Jung-Yun],
Two-phase reanalysis model for understanding user intention,
PRL(42), No. 1, 2014, pp. 35-39.
Elsevier DOI 1404
Natural language processing BibRef

Milone, D.H.[Diego H.], di Persia, L.E.[Leandro E.], Torres, M.E.[Maria E.],
Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees,
PR(43), No. 4, April 2010, pp. 1577-1589.
Elsevier DOI 1002
Sequence learning; EM algorithm; Wavelets; Speech recognition BibRef

Lu, Y.[Yong], Wu, H.Y.[Hai-Yang], Zhou, L.[Lin], Wu, Z.Y.[Zhen-Yang],
Multi-environment model adaptation based on vector Taylor series for robust speech recognition,
PR(43), No. 9, September 2010, pp. 3093-3099.
Elsevier DOI 1006
Model adaptation; Vector Taylor series; Multi-environment model; Speech recognition BibRef

Kay, S.,
A New Approach to Fourier Synthesis With Application to Neural Encoding and Speech Classification,
SPLetters(17), No. 10, October 2010, pp. 855-858.
IEEE DOI 1008
BibRef

Kay, S.,
A New Proof of the Neyman-Pearson Theorem Using the EEF and the Vindication of Sir R. Fisher,
SPLetters(19), No. 8, August 2012, pp. 451-454.
IEEE DOI 1208
BibRef

Hong, H., Zhao, Z., Wang, X., Tao, Z.,
Detection of Dynamic Structures of Speech Fundamental Frequency in Tonal Languages,
SPLetters(17), No. 10, October 2010, pp. 843-846.
IEEE DOI 1008
BibRef

Scanzio, S.[Stefano], Cumani, S.[Sandro], Gemello, R.[Roberto], Mana, F.[Franco], Laface, P.,
Parallel implementation of Artificial Neural Network training for speech recognition,
PRL(31), No. 11, 1 August 2010, pp. 1302-1309.
Elsevier DOI 1008
Artificial Neural Network; Block Back-propagation; Focused Attention Back-Propagation; GPU; CUDA; Fast Training BibRef

Heracleous, P.[Panikos], Badin, P.[Pierre], Bailly, G.[Gerard], Hagita, N.[Norihiro],
A pilot study on augmented speech communication based on Electro-Magnetic Articulography,
PRL(32), No. 8, 1 June 2011, pp. 1119-1125.
Elsevier DOI 1101
Augmented speech; Electro-Magnetic Articulography (EMA); Automatic speech recognition; Hidden Markov model (HMMs); Fusion; Noise robustness BibRef

Chen, B.[Berlin], Chen, W.H.[Wei-Hau], Lin, S.H.[Shih-Hsiang], Chu, W.Y.[Wen-Yi],
Robust speech recognition using spatial-temporal feature distribution characteristics,
PRL(32), No. 7, 1 May 2011, pp. 919-926.
Elsevier DOI 1101
Speech recognition; Noise robustness; Histogram equalization; Spatial-temporal distribution characteristics; Aurora-2 BibRef

Zamani, B.[Behzad], Akbari, A.[Ahmad], Nasersharif, B.[Babak], Jalalvand, A.[Azarakhsh],
Optimized discriminative transformations for speech features based on minimum classification error,
PRL(32), No. 7, 1 May 2011, pp. 948-955.
Elsevier DOI 1101
Minimum classification error; Principal Component Analysis; Linear Discriminant Analysis; Feature transformation; Hidden Markov Model BibRef

Lo, H.Y., Wang, J.C., Wang, H.M., Lin, S.D.,
Cost-Sensitive Multi-Label Learning for Audio Tag Annotation and Retrieval,
MultMed(13), No. 3, 2011, pp. 518-529.
IEEE DOI 1106
BibRef

Lu, L., Ghoshal, A., Renals, S.,
Regularized Subspace Gaussian Mixture Models for Speech Recognition,
SPLetters(18), No. 7, July 2011, pp. 419-422.
IEEE DOI 1101
BibRef

Lu, L., Renals, S.,
Probabilistic Linear Discriminant Analysis for Acoustic Modeling,
SPLetters(21), No. 6, June 2014, pp. 702-706.
IEEE DOI 1404
Analytical models BibRef

Remes, U., Palomaki, K.J., Raiko, T., Honkela, A., Kurimo, M.,
Missing-Feature Reconstruction With a Bounded Nonlinear State-Space Model,
SPLetters(18), No. 10, October 2011, pp. 563-566.
IEEE DOI 1109
Speech recognition. BibRef

He, Y., Han, J.,
Gaussian Specific Compensation for Channel Distortion in Speech Recognition,
SPLetters(18), No. 10, October 2011, pp. 599-602.
IEEE DOI 1109
BibRef

Roupakia, Z., Gales, M.,
Kernel Eigenvoices (Revisited) for Large-Vocabulary Speech Recognition,
SPLetters(18), No. 12, December 2011, pp. 709-712.
IEEE DOI 1112
BibRef

Kim, S.[Seonho], Yoon, J.[Juntae], Seo, J.Y.[Jung-Yun], Park, S.[Seog],
Improving Korean verb-verb morphological disambiguation using lexical knowledge from unambiguous unlabeled data and selective web counts,
PRL(33), No. 1, 1 January 2012, pp. 62-70.
Elsevier DOI 1112
POS tagging; Verb-verb morphological disambiguation; Unlabeled corpora; Automatic annotation; Web counts; Hard example-based selective sampling BibRef

Geller, T.[Tom],
Talking to Machines,
CACM(55), No. 4, April 2012, pp. 14-16.
DOI Link 1204
Voice recognition programs like Siri are now capable of understanding spoken commands, recognizing a conversation's context, and answering questions in a personable manner. BibRef

Norrenbrock, C.R., Hinterleitner, F., Heute, U., Moller, S.,
Instrumental Assessment of Prosodic Quality for Text-to-Speech Signals,
SPLetters(19), No. 5, May 2012, pp. 255-258.
IEEE DOI 1204
BibRef

Seon, C.N.[Choong-Nyoung], Kim, H.[Harksoo], Seo, J.Y.[Jung-Yun],
A statistical prediction model of speakers' intentions using multi-level features in a goal-oriented dialog system,
PRL(33), No. 10, 15 July 2012, pp. 1397-1404.
Elsevier DOI 1205
Speech act prediction; Concept sequence prediction; Multi-level feature BibRef

Kang, S.W.[Sang-Woo], Ko, Y.J.[Young-Joong], Seo, J.Y.[Jung-Yun],
Hierarchical speech-act classification for discourse analysis,
PRL(34), No. 10, 15 July 2013, pp. 1119-1124.
Elsevier DOI 1306
Natural language processing; Discourse analysis; Speech act classification; Hierarchical structure; Dialogue system BibRef

Dehzangi, O.[Omid], Ma, B.[Bin], Chng, E.S.[Eng Siong], Li, H.Z.[Hai-Zhou],
Discriminative feature extraction for speech recognition using continuous output codes,
PRL(33), No. 13, 1 October 2012, pp. 1703-1709.
Elsevier DOI 1208
BibRef
Earlier:
Fuzzy rule selection using Iterative Rule Learning for speech data classification,
ICPR08(1-4).
IEEE DOI 0812
Speech recognition; Feature transformation; Generalized discriminant analysis; Output coding BibRef

Schroder, M.[Marc], Bevacqua, E.[Elisabetta], Cowie, R.[Roddy], Eyben, F.[Florian], Gunes, H.[Hatice], Heylen, D.[Dirk], ter Maat, M.[Mark], McKeown, G.[Gary], Pammi, S.[Sathish], Pantic, M.[Maja], Pelachaud, C.[Catherine], Schuller, B.[Bjorn], de Sevin, E.[Etienne], Valstar, M.F.[Michel F.], Wollmer, M.[Martin],
Building Autonomous Sensitive Artificial Listeners,
AffCom(3), No. 2, 2012, pp. 165-183.
IEEE DOI 1208
BibRef

Furui, S., Deng, L., Gales, M., Ney, H., Tokuda, K.,
Fundamental Technologies in Modern Speech Recognition,
SPMag(29), No. 3, 2012, pp. 16-17.
IEEE DOI 1210
From the Guest Editors. Survey of speech recognition, intro to special issue BibRef

Saon, G., Chien, J.T.,
Large-Vocabulary Continuous Speech Recognition Systems: A Look at Some Recent Advances,
SPMag(29), No. 3, 2012, pp. 18-33.
IEEE DOI 1210
Survey, Speech Recognition. BibRef

Wang, H.P.[Hai-Peng], Leung, C.C.[Cheung-Chi], Lee, T.[Tan], Ma, B.[Bin], Li, H.Z.[Hai-Zhou],
Shifted-Delta MLP Features for Spoken Language Recognition,
SPLetters(20), No. 1, January 2013, pp. 15-18.
IEEE DOI 1212
BibRef

Edwards, J.,
Researchers Push Speech Recognition Toward the Mainstream,
SPMag(30), No. 1, 2012, pp. 8-11.
IEEE DOI 1212
[Special Reports] BibRef

Das, B.[Biswajit], Mandal, S.[Sandipan], Mitra, P.[Pabitra], Basu, A.[Anupam],
Aging speech recognition with speaker adaptation techniques: Study on medium vocabulary continuous Bengali speech,
PRL(34), No. 3, 1 February 2013, pp. 335-343.
Elsevier DOI 1301
Aging speech recognition; Vocal tract length normalization (VTLN); Maximum likelihood linear transform (MLLT); Maximum likelihood linear regression (MLLR); Maximum a posteriori (MAP); Maximum mutual information estimation (MMIE) BibRef

Keefer, R., Liu, Y., Bourbakis, N.,
The Development and Evaluation of an Eyes-Free Interaction Model for Mobile Reading Devices,
HMS(43), No. 1, January 2013, pp. 76-91.
IEEE DOI 1301
Voice user interface. BibRef

Siniscalchi, S.M., Yu, D.[Dong], Deng, L.[Li], Lee, C.H.[Chin-Hui],
Speech Recognition Using Long-Span Temporal Patterns in a Deep Network Model,
SPLetters(20), No. 3, March 2013, pp. 201-204.
IEEE DOI 1303
BibRef

Hutchinson, B.[Brian], Deng, L.[Li], Yu, D.[Dong],
Tensor Deep Stacking Networks,
PAMI(35), No. 8, 2013, pp. 1944-1957.
IEEE DOI 1307
Closed-form solutions; Deep learning; handwriting image classification; BibRef

O'Shaughnessy, D., Deng, L., Li, H.,
Speech Information Processing: Theory and Applications,
PIEEE(100), No. 5, May 2013, pp. 1034-1037.
IEEE DOI 1305
[Scanning the Issue], Introduction to special issue. BibRef

O'Shaughnessy, D.,
Acoustic Analysis for Automatic Speech Recognition,
PIEEE(100), No. 5, May 2013, pp. 1038-1053.
IEEE DOI 1305
BibRef

Fosler-Lussier, E., He, Y., Jyothi, P., Prabhavalkar, R.,
Conditional Random Fields in Speech, Audio, and Language Processing,
PIEEE(100), No. 5, May 2013, pp. 1054-1075.
IEEE DOI 1305
BibRef

Hermansky, H.,
Multistream Recognition of Speech: Dealing With Unknown Unknowns,
PIEEE(100), No. 5, May 2013, pp. 1076-1088.
IEEE DOI 1305
BibRef

Lee, C.H., Siniscalchi, S.M.,
An Information-Extraction Approach to Speech Processing: Analysis, Detection, Verification, and Recognition,
PIEEE(100), No. 5, May 2013, pp. 1089-1115.
IEEE DOI 1305
BibRef

He, X., Deng, L.,
Speech-Centric Information Processing: An Optimization-Oriented Approach,
PIEEE(100), No. 5, May 2013, pp. 1116-1135.
IEEE DOI 1305
BibRef

Young, S., Gasic, M., Thomson, B., Williams, J.D.,
POMDP-Based Statistical Spoken Dialog Systems: A Review,
PIEEE(100), No. 5, May 2013, pp. 1160-1179.
IEEE DOI 1305
Survey, Speech. BibRef

Zhou, B.,
Statistical Machine Translation for Speech: A Perspective on Structures, Learning, and Decoding,
PIEEE(100), No. 5, May 2013, pp. 1180-1202.
IEEE DOI 1305
BibRef

Li, W.F.[Wei-Feng], Zhou, Y.C.[Yi-Cong], Poh, N., Zhou, F.[Fei], Liao, Q.M.[Qing-Min],
Feature Denoising Using Joint Sparse Representation for In-Car Speech Recognition,
SPLetters(20), No. 7, 2013, pp. 681-684.
IEEE DOI cepstral analysis 1307
BibRef

Bengio, Y.[Yoshua], Courville, A.[Aaron], Vincent, P.[Pascal],
Representation Learning: A Review and New Perspectives,
PAMI(35), No. 8, 2013, pp. 1798-1828.
IEEE DOI Survey, Learning. 1307
Neural networks; Speech recognition; Boltzmann machine; Deep learning; representation learning; unsupervised learning BibRef

Hermansky, H., Cohen, J.R., Stern, R.M.,
Perceptual Properties of Current Speech Recognition Technology,
PIEEE(101), No. 9, 2013, pp. 1968-1985.
IEEE DOI 1309
Auditory system BibRef

Kolossa, D., Zeiler, S., Saeidi, R., Astudillo, R.F.[R. Fernandez],
Noise-Adaptive LDA: A New Approach for Speech Recognition Under Observation Uncertainty,
SPLetters(20), No. 11, 2013, pp. 1018-1021.
IEEE DOI 1310
speech recognition BibRef

Saeidi, R., Astudillo, R.F., Kolossa, D.,
Uncertain LDA: Including Observation Uncertainties in Discriminative Transforms,
PAMI(38), No. 7, July 2016, pp. 1479-1488.
IEEE DOI 1606
Estimation BibRef

Cho, J.W., Park, H.M.,
An Efficient HMM-Based Feature Enhancement Method With Filter Estimation for Reverberant Speech Recognition,
SPLetters(20), No. 12, 2013, pp. 1199-1202.
IEEE DOI 1311
Bayes methods BibRef

Lee, L.M.[Lee-Min], Jean, F.R.,
Adaptation of Hidden Markov Models for Recognizing Speech of Reduced Frame Rate,
Cyber(43), No. 6, 2013, pp. 2114-2121.
IEEE DOI 1312
hidden Markov models BibRef

Kim, K.T.[Kyung-Tae], Lin, K.H.[Kai-Hsiang], Walther, D.B.[Dirk B.], Hasegawa-Johnson, M.A.[Mark A.], Huang, T.S.[Tomas S.],
Automatic detection of auditory salience with optimized linear filters derived from human annotation,
PRL(38), No. 1, 2014, pp. 78-85.
Elsevier DOI 1402
Auditory salience BibRef

Huang, X.D.[Xue-Dong], Baker, J.[James], Reddy, R.[Raj],
A Historical Perspective of Speech Recognition,
CACM(57), No. 1, January 2014, pp. 94-103.
DOI Link 1402
Survey, Speech Recognition. What do we know now that we did not know 40 years ago? BibRef

Shi, Y.Z.[Yong-Zhe], Zhang, W.Q.[Wei-Qiang], Cai, M.[Meng], Liu, J.[Jia],
Efficient One-Pass Decoding with NNLM for Speech Recognition,
SPLetters(21), No. 4, April 2014, pp. 377-381.
IEEE DOI 1403
decoding BibRef

Zhang, W.B.[Wei-Bin], Fung, P.,
Efficient Sparse Banded Acoustic Models for Speech Recognition,
SPLetters(21), No. 3, March 2014, pp. 280-283.
IEEE DOI 1403
covariance matrices BibRef

Triefenbach, F., Demuynck, K., Martens, J.P.,
Large Vocabulary Continuous Speech Recognition With Reservoir-Based Acoustic Models,
SPLetters(21), No. 3, March 2014, pp. 311-315.
IEEE DOI 1403
error statistics BibRef

Diez, M.[Mireia], Varona, A.[Amparo], Penagarikano, M.[Mikel], Rodriguez-Fuentes, L.J.[Luis Javier], Bordel, G.[German],
On the Complementarity of Phone Posterior Probabilities for Improved Speaker Recognition,
SPLetters(21), No. 6, June 2014, pp. 649-652.
IEEE DOI 1404
BibRef
Earlier: A1, A3, A2, A4, A5:
On the Use of Dot Scoring for Speaker Diarization,
IbPRIA11(612-619).
Springer DOI 1106
audio databases BibRef

Räsänen, O.[Okko], Laine, U.K.[Unto K.],
A method for noise-robust context-aware pattern discovery and recognition from categorical sequences,
PR(45), No. 1, 2012, pp. 606-616.
Elsevier DOI 1410
Speech recognition BibRef

Liu, N.H.[Ning-Han],
Effective Results Ranking for Mobile Query by Singing/Humming Using a Hybrid Recommendation Mechanism,
MultMed(16), No. 5, August 2014, pp. 1407-1420.
IEEE DOI 1410
audio signal processing BibRef

Schneiderman, R.,
Accuracy, Apps Advance Speech Recognition,
SPMag(32), No. 1, January 2015, pp. 12-125.
IEEE DOI 1502
Special Reports. Commercialization BibRef

Ban, S.M., Kim, H.S.,
Weight-Space Viterbi Decoding Based Spectral Subtraction for Reverberant Speech Recognition,
SPLetters(22), No. 9, September 2015, pp. 1424-1428.
IEEE DOI 1503
Decoding BibRef

Sakano, T.[Toshihiro], Kobayashi, Y.[Yosuke], Kondo, K.[Kazuhiro],
A Speech Intelligibility Estimation Method Using a Non-reference Feature Set,
IEICE(E98-D), No. 1, January 2015, pp. 21-28.
WWW Link. 1503
BibRef

Khaldi, K.[Kais], Boudraa, A.O.[Abdel-Ouahab], Torresani, B.[Bruno], Chonavel, T.[Thierry],
HHT-based audio coding,
SIViP(9), No. 1, January 2015, pp. 107-115.
Springer DOI 1503
BibRef

Richardson, F., Reynolds, D., Dehak, N.,
Deep Neural Network Approaches to Speaker and Language Recognition,
SPLetters(22), No. 10, October 2015, pp. 1671-1675.
IEEE DOI 1506
feature extraction BibRef

Espi, M.[Miquel], Fujimoto, M.[Masakiyo], Nakatani, T.[Tomohiro],
Acoustic Event Detection in Speech Overlapping Scenarios Based on High-Resolution Spectral Input and Deep Learning,
IEICE(E98-D), No. 10, October 2015, pp. 1799-1807.
WWW Link. 1511
BibRef

Savchenko, A.V.[Andrey V.], Savchenko, L.V.[Liudmila V.],
Towards the creation of reliable voice control system based on a fuzzy approach,
PRL(65), No. 1, 2015, pp. 145-151.
Elsevier DOI 1511
Signal processing BibRef

Trentin, E.[Edmondo],
Maximum-likelihood normalization of features increases the robustness of neural-based spoken human-computer interaction,
PRL(66), No. 1, 2015, pp. 71-80.
Elsevier DOI 1511
Feature normalization BibRef

Suh, Y.J.[Young-Joo], Kim, H.[Hoirin],
Probabilistic Class Histogram Equalization Based on Posterior Mean Estimation for Robust Speech Recognition,
SPLetters(22), No. 12, December 2015, pp. 2421-2424.
IEEE DOI 1512
maximum likelihood estimation BibRef

Sangeetha, J., Jothilakshmi, S.,
Automatic continuous speech recogniser for Dravidian languages using the auto associative neural network,
IJCVR(6), No. 1-2, 2016, pp. 113-126.
DOI Link 1601
BibRef

Wang, X.Y.[Xiao-Yun], Yamamoto, S.[Seiichi],
Speech Recognition of English by Japanese Using Lexicon Represented by Multiple Reduced Phoneme Sets,
IEICE(E98-D), No. 12, December 2015, pp. 2271-2279.
WWW Link. 1601
BibRef

Tohidypour, H.R.[Hamid Reza], Banitalebi-Dehkordi, A.[Amin],
Speech frame recognition based on less shift sensitive wavelet filter banks,
SIViP(10), No. 4, April 2016, pp. 633-637.
WWW Link. 1604
BibRef

Chung, Y.J.[Yong-Joo],
Vector Taylor series based model adaptation using noisy speech trained hidden Markov models,
PRL(75), No. 1, 2016, pp. 36-40.
Elsevier DOI 1604
Noisy speech recognition BibRef

Ansari, J.A., Sathyamurthy, A., Balasubramanyam, R.,
An Open Voice Command Interface Kit,
HMS(46), No. 3, June 2016, pp. 467-473.
IEEE DOI 1605
Hardware BibRef

Cho, B.J., Kwon, H., Cho, J.W., Kim, C., Stern, R.M., Park, H.M.,
A Subband-Based Stationary-Component Suppression Method Using Harmonics and Power Ratio for Reverberant Speech Recognition,
SPLetters(23), No. 6, June 2016, pp. 780-784.
IEEE DOI 1606
maximum likelihood estimation BibRef

Lee, H.Y., Cho, J.W., Kim, M., Park, H.M.,
DNN-Based Feature Enhancement Using DOA-Constrained ICA for Robust Speech Recognition,
SPLetters(23), No. 8, August 2016, pp. 1091-1095.
IEEE DOI 1608
direction-of-arrival estimation BibRef

Ren, H., Yan, Y.,
Structural Optimization and Online Evolutionary Learning for Spoken Dialog Management,
SPLetters(23), No. 7, July 2016, pp. 1013-1017.
IEEE DOI 1608
Monte Carlo methods BibRef

Khoubrouy, S.A., Hansen, J.H.L.,
Microphone Array Processing Strategies for Distant-Based Automatic Speech Recognition,
SPLetters(23), No. 10, October 2016, pp. 1344-1348.
IEEE DOI 1610
microphone arrays BibRef

Lamberti, F., Manuri, F., Paravati, G., Piumatti, G., Sanna, A.,
Using Semantics to Automatically Generate Speech Interfaces for Wearable Virtual and Augmented Reality Applications,
HMS(47), No. 1, February 2017, pp. 152-164.
IEEE DOI 1702
augmented reality BibRef

Fredes, J., Novoa, J., King, S., Stern, R.M., Yoma, N.B.,
Locally Normalized Filter Banks Applied to Deep Neural-Network-Based Robust Speech Recognition,
SPLetters(24), No. 4, April 2017, pp. 377-381.
IEEE DOI 1704
cepstral analysis BibRef

Shahnawazuddin, S., Sinha, R., Pradhan, G.,
Pitch-Normalized Acoustic Features for Robust Children's Speech Recognition,
SPLetters(24), No. 8, August 2017, pp. 1128-1132.
IEEE DOI 1708
feature extraction, spectral analysis, speech recognition, time-frequency analysis, SMAC features, adaptive-cepstral truncation, additive noise, low-order cepstral coefficients, normalized first central spectral moments, pitch variations, pitch-normalized acoustic feature, robust automatic speech recognition, robust children speech recognition, severe pitch mismatch ASR task, spectral moment time-frequency distribution augmented by low-order cepstral, spectral smoothening approach, Additive noise, Hidden Markov models, Mel frequency cepstral coefficient, Robustness, Speech, Automatic speech recognition (ASR), deep neural network (DNN), pitch-adaptive features, spectral smoothening, subspace, Gaussian, mixture, model, (SGMM) BibRef

Ganapathy, S.,
Multivariate Autoregressive Spectrogram Modeling for Noisy Speech Recognition,
SPLetters(24), No. 9, September 2017, pp. 1373-1377.
IEEE DOI 1708
Discrete cosine transforms, Estimation, Feature extraction, Noise measurement, Spectrogram, Speech, Speech recognition, Feature extraction, Riesz envelopes, multivariate autoregressive (MAR) models, speech, recognition BibRef


Wu, C., Ng, R.W.M., Torralba, O.S., Hain, T.,
Analysing acoustic model changes for active learning in automatic speech recognition,
WSSIP17(1-5)
IEEE DOI 1707
Acoustics, Adaptation models, Analytical models, Computational modeling, Data models, Hidden Markov models, Measurement, Active learning, confidence measures, data selection, speaker, adaptation BibRef

Kacprzak, S.,
Spoken language clustering in the i-vectors space,
WSSIP17(1-5)
IEEE DOI 1707
Clustering algorithms, Data visualization, Impurities, NIST, Speech, Training, Training data, i-vectors, language clustering, language, recognition BibRef

Pironkov, G., Dupont, S., Dutoit, T.,
Speaker-aware Multi-Task Learning for automatic speech recognition,
ICPR16(2900-2905)
IEEE DOI 1705
Acoustics, Automatic speech recognition, Feature extraction, Machine learning, Speech, Training BibRef

Zhao, Y., Zhao, R.[Rui], Wang, X.Y.[Xiao-Yang], Ji, Q.,
Multilingual articulatory features augmentation learning,
ICPR16(2895-2899)
IEEE DOI 1705
Dictionaries, Encoding, Feature extraction, Mel frequency cepstral coefficient, Semantics, Speech, Speech recognition, latent attribute learning, multilingual articulatory features, phone recognition, sparse coding, speech, attributes BibRef

Zhang, S., Liu, W.[Wen], Qin, Y.,
Wake-up-word spotting using end-to-end deep neural network system,
ICPR16(2878-2883)
IEEE DOI 1705
Computational modeling, Computer architecture, Hidden Markov models, Logic gates, Neural networks, Speech recognition, Training, CTC, LSTM, RNN, Wake-up-Word system, speech, recognition BibRef

Zhang, S.[Shilei], Qin, Y.,
Rapid feature space MLLR speaker adaptation for deep neural network acoustic modeling,
ICPR16(2889-2894)
IEEE DOI 1705
Acoustics, Adaptation models, Data models, Hidden Markov models, Standards, Training, Transforms, Deep Neural Networks, FMLLR, bilinear models, rapid, speaker, adaptation BibRef

Zheng, H.[Huadi], Cai, W., Zhou, T.[Tianyan], Zhang, S.[Shilei], Li, M.,
Text-independent voice conversion using deep neural network based phonetic level features,
ICPR16(2872-2877)
IEEE DOI 1705
Covariance matrices, Data mining, Data models, Feature extraction, Speech, Training, Training data, Gaussian mixture model, deep neural network, phoneme posterior probability, voice, conversion BibRef

Ogawa, T., Mallidi, S.H., Dupoux, E., Cohen, J., Feldman, N.H., Hermansky, H.,
A new efficient measure for accuracy prediction and its application to multistream-based unsupervised adaptation,
ICPR16(2222-2227)
IEEE DOI 1705
Estimation, Monitoring, Noise measurement, Reliability, Speech, Time measurement, Training BibRef

Zhang, B.[Bo], Gan, Y.[Yuqin], Song, Y.[Yan], Tang, B.[Benlai],
Application of pronunciation knowledge on phoneme recognition by LSTM neural network,
ICPR16(2906-2911)
IEEE DOI 1705
Automata, Dictionaries, Hidden Markov models, Linear programming, Neural networks, Speech, Training, connectionist temporal classification, phoneme recognition, pronunciation, knowledge BibRef

Mzah, Y., Ahfir, M., Jaidane, M.,
Late pre-dereverberation for speech intelligibility enhancement in public address systems,
ISIVC16(291-296)
IEEE DOI 1704
Position measurement BibRef

Montalvo, A.[Ana], Calvo, J.R.[José Ramón],
Discriminative Capacity and Phonetic Information of Bottleneck Features in Speech,
CIARP16(134-141).
Springer DOI 1703
BibRef

Asadullah, Shaukat, A., Ali, H., Akram, U.,
Automatic Urdu Speech Recognition using Hidden Markov Model,
ICIVC16(135-139)
IEEE DOI 1610
cepstral analysis BibRef

Ondáš, S., Juhár, J.,
Towards human-machine dialog in Slovak,
WSSIP16(1-4)
IEEE DOI 1608
hidden Markov models BibRef

Conka, D., Viszlay, P., Juhár, J.,
Fuzzy clustering in HMM-based triphone classes of 2DLDA in Slovak LVCSR,
WSSIP16(1-4)
IEEE DOI 1608
fuzzy set theory BibRef

Kacur, J., Kozicka, R., Vargic, R.,
Semi-tight covariance matrices implementation in MASPER HMM training procedure,
WSSIP16(1-4)
IEEE DOI 1608
covariance matrices BibRef

Kacur, J., Trnovsky, T., Vargic, R.,
Discriminative training of HMM using MASPER procedure,
WSSIP15(93-96)
IEEE DOI 1603
hidden Markov models BibRef

Calvo, M.[Marcos], Hurtado, L.F.[Lluís F.], García, F.[Fernando], Sanchis, E.[Emilio],
Combining Several ASR Outputs in a Graph-Based SLU System,
CIARP15(551-558).
Springer DOI 1511
speech BibRef

Rohrbach, A.[Anna], Rohrbach, M.[Marcus], Schiele, B.[Bernt],
The Long-Short Story of Movie Description,
GCPR15(209-221).
Springer DOI 1511
BibRef

Rohrbach, A.[Anna], Rohrbach, M.[Marcus], Tandon, N.[Niket], Schiele, B.[Bernt],
A dataset for Movie Description,
CVPR15(3202-3212)
IEEE DOI 1510
BibRef

Zhao, H.Q.[Han-Qing], Qin, Z.C.[Zeng-Chang], Wang, Y.[Yiyu], Wang, Y.X.[Yu-Xiao],
A Bag-of-phonemes Model for Homeplace Classification of Mandarin Speakers,
IbPRIA15(683-690).
Springer DOI 1506
BibRef

Yakubu, M.A.[M. Abukari], Maddage, N.C.[Namunu C.], Atrey, P.K.[Pradeep K.],
Audio Secret Management Scheme Using Shamir's Secret Sharing,
MMMod15(I: 396-407).
Springer DOI 1501
BibRef

Bello, C.[Claudia], Ribas, D.[Dayana], Calvo, J.R.[José R.], Ferrer, C.A.[Carlos A.],
From Speech Quality Measures to Speaker Recognition Performance,
CIARP14(199-206).
Springer DOI 1411
BibRef

Oropeza-Rodríguez, J.L.[José Luis], Suárez-Guerra, S.[Sergio], Jiménez-Hernández, M.[Mario],
The Place Theory as an Alternative Solution in Automatic Speech Recognition Tasks,
CIARP14(167-174).
Springer DOI 1411
BibRef

Diez, M., Varona, A., Penagarikano, M., Rodriguez-Fuentes, L.J., Bordel, G.,
On the Projection of PLLRs for Unbounded Feature Distributions in Spoken Language Recognition,
SPLetters(21), No. 9, September 2014, pp. 1073-1077.
IEEE DOI 1406
Decoding BibRef

Diez, M.[Mireia], Varona, A.[Amparo], Penagarikano, M.[Mike], Rodriguez-Fuentes, L.J.[Luis Javier], Bordel, G.[German],
Optimizing PLLR Features for Spoken Language Recognition,
ICPR14(779-784)
IEEE DOI 1412
Acoustics BibRef

Swietojanski, P., Ghoshal, A., Renals, S.,
Convolutional Neural Networks for Distant Speech Recognition,
SPLetters(21), No. 9, September 2014, pp. 1120-1124.
IEEE DOI 1406
Acoustics BibRef

Missaoui, I.[Ibrahim], Lachiri, Z.[Zied],
Gabor Filterbank Features for Robust Speech Recognition,
ICISP14(665-671).
Springer DOI 1406
BibRef

Carletti, V.[Vincenzo], Foggia, P.[Pasquale], Percannella, G.[Gennaro], Saggese, A.[Alessia], Strisciuglio, N.[Nicola], Vento, M.[Mario],
Audio surveillance using a bag of aural words classifier,
AVSS13(81-86)
IEEE DOI 1311
Computer architecture BibRef

Hurtado, L.F.[Lluís F.], Calvo, M.[Marcos], Gómez, J.A.[Jon Ander], García, F.[Fernando], Sanchis, E.[Emilio],
A Phonetic-Based Approach to Query-by-Example Spoken Term Detection,
CIARP13(I:504-511).
Springer DOI 1311
BibRef

Chaloupka, J.[Josef], Nouza, J.[Jan], Kucharova, M.[Michaela],
Using Various Types of Multimedia Resources to Train System for Automatic Transcription of Czech Historical Oral Archives,
MM4CH13(228-237).
Springer DOI 1309
BibRef

Nouza, J.[Jan], Cerva, P.[Petr], Silovsky, J.[Jan],
Dealing with Bilingualism in Automatic Transcription of Historical Archive of Czech Radio,
MM4CH13(238-246).
Springer DOI 1309
BibRef

Chan, K.Y.[Kit Yan], Nordholm, S.[Sven], Yiu, C.K.F.[Cedric K.F.],
Multichannel filters for speech recognition using a particle swarm optimization,
ICARCV12(937-942).
IEEE DOI 1304
BibRef

Zhao, Y.[Yue], Xu, X.N.[Xiao-Na], Yang, G.[Guosheng],
Unsupervised Tibetan speech features Learning based on Dynamic Bayesian Networks,
ICPR12(2319-2322).
WWW Link. 1302
BibRef

Nour-Eddine, L.[Lachachi], Abdelkader, A.[Adla],
Reduced Universal Background Model for Speech Recognition and Identification System,
MCPR12(303-312).
Springer DOI 1208
BibRef

Pérez Maldonado, Y.[Yara], Caballero Morales, S.O.[Santiago Omar], Cruz Ortega, R.O.[Roberto Omar],
GA Approaches to HMM Optimization for Automatic Speech Recognition,
MCPR12(313-322).
Springer DOI 1208
BibRef

Amrous, A.I.[Anissa Imen], Debyeche, M.[Mohamed],
Robust Arabic Multi-stream Speech Recognition System in Noisy Environment,
ICISP12(571-578).
Springer DOI 1208
BibRef

Touazi, A.[Azzedine], Debyeche, M.[Mohamed],
New Encoding Algorithm for Distributed Speech Recognition Based on DTFS Transform,
ICISP12(547-554).
Springer DOI 1208
BibRef

Im, J.H., Lee, S.Y.,
Unified Training of Feature Extractor and HMM Classifier for Speech Recognition,
SPLetters(19), No. 2, February 2012, pp. 111-114.
IEEE DOI 1201
BibRef

Ghigi, F.[Fabrizio], Tamarit, V.[Vicent], Martínez-Hinarejos, C.D.[Carlos D.], Benedí, J.M.[José-Miguel],
Active Learning for Dialogue Act Labelling,
IbPRIA11(652-659).
Springer DOI 1106
BibRef

Swietojanski, P.[Pawel], Wielgat, R.[Robert], Zielinski, T.[Tomasz],
Automatic Selection of Pareto-Optimal Topologies of Hidden Markov Models Using Multicriteria Evolutionary Algorithms,
EvoIASP11(224-233).
Springer DOI 1104
Applied to speech recognition. BibRef

Ravinder, K.[Kumar],
Comparison of HMM and DTW for Isolated Word Recognition System of Punjabi Language,
CIARP10(244-252).
Springer DOI 1011
BibRef

Meng, L.[Lu], Xiang, J.[Jing], Zhao, D.[Dazhe], Zhao, H.[Hong],
A New Application of MEG and DTI on Word Recognition,
ICPR10(2472-2475).
IEEE DOI 1008
BibRef

Duan, Q.S.[Quan-Sheng], Kang, S.Y.[Shi-Yin], Wu, Z.Y.[Zhi-Yong], Cai, L.H.[Lian-Hong], Shuang, Z.W.[Zhi-Wei], Qin, Y.[Yong],
Comparison of Syllable/Phone HMM Based Mandarin TTS,
ICPR10(4496-4499).
IEEE DOI 1008
BibRef

O'Gorman, L.[Lawrence],
Latency in Speech Feature Analysis for Telepresence Event Coding,
ICPR10(4464-4467).
IEEE DOI 1008
BibRef

Zhang, S.L.[Shi-Lei], Shi, Q.[Qin], Qin, Y.[Yong],
Modeling Syllable-Based Pronunciation Variation for Accented Mandarin Speech Recognition,
ICPR10(1606-1609).
IEEE DOI 1008
BibRef

Zhang, S.L.[Shi-Lei], Zhang, S.W.[Shu-Wu], Xu, B.[Bo],
A Two-level Method for Unsupervised Speaker-based Audio Segmentation,
ICPR06(IV: 298-301).
IEEE DOI 0609
BibRef

Krajewski, J.[Jarek], Batliner, A.[Anton], Kessel, S.[Silke],
Comparing Multiple Classifiers for Speech-Based Detection of Self-Confidence: A Pilot Study,
ICPR10(3716-3719).
IEEE DOI 1008
BibRef

Nolazco-Flores, J.A.[Juan A.], Aceves L., R.A.[Roberto A.], Garcia-Perera, L.P.[L. Paola],
Speech Magnitude-Spectrum Information-Entropy (MSIE) for Automatic Speech Recognition in Noisy Environments,
ICPR10(4364-4367).
IEEE DOI 1008
BibRef

Kelly, F.[Finnian], Harte, N.[Naomi],
Auditory Features Revisited for Robust Speech Recognition,
ICPR10(4456-4459).
IEEE DOI 1008
BibRef

Xie, Z.Q.[Zhao-Qiang], Miao, Z.J.[Zhen-Jiang],
Tone Recognition of Isolated Mandarin Syllables,
ICISP10(412-418).
Springer DOI 1006
BibRef

Alotaibi, Y.A.[Yousef Ajami], Alghamdi, M.[Mansour], Alotaiby, F.[Fahad],
Speech Recognition System of Arabic Alphabet Based on a Telephony Arabic Corpus,
ICISP10(122-129).
Springer DOI 1006
BibRef

Lu, G.[Gao], Yu, H.Z.[Hong-Zhi], Li, Y.H.[Yong-Hong], Zhang, R.S.[Rui-Shan],
Study on SAMPA_ST for Lhasa Tibetan and realization of automatic labelling system,
IASP10(133-137).
IEEE DOI 1004
BibRef

Chen, X.Y.[Xiao-Ying], Jin, H.M.[Hui-Min], Yu, H.Z.[Hong-Zhi],
Acoustic research on long and short vowels in Tibetan Lhasa dialect,
IASP10(561-564).
IEEE DOI 1004
BibRef

Sahu, V.P.[Ved Prakash], Mishra, H.K.[Harendra Kumar], Sekhar, C.C.[C. Chandra],
Variational Bayes Adapted GMM Based Models for Audio Clip Classification,
PReMI09(513-518).
Springer DOI 0912
BibRef

Kacur, J., Rozinaj, G.,
Adding Voicing Features into Speech Recognition Based on HMM in Slovak,
WSSIP09(1-4).
IEEE DOI 0906
BibRef

Verteletskaya, E., Sakhnov, K., Simak, B.,
Pitch Detection Algorithms and Voiced/Unvoiced Classification for Noisy Speech,
WSSIP09(1-5).
IEEE DOI 0906
BibRef

Vlaj, D., Kos, M., Grasic, M., Kacic, Z.,
Influence of Hangover and Hangbefore Criteria on Automatic Speech Recognition,
WSSIP09(1-4).
IEEE DOI 0906
BibRef

Hanžl, V.[Václav], Pollák, P.[Petr],
Accuracy Analysis of Generalized Pronunciation Variant Selection in ASR Systems,
COST08(399-408).
Springer DOI 0810
BibRef

Camarena-Ibarrola, A.[Antonio], Chávez, E.[Edgar], Tellez, E.S.[Eric Sadit],
Robust Radio Broadcast Monitoring Using a Multi-Band Spectral Entropy Signature,
CIARP09(587-594).
Springer DOI 0911
BibRef

Mantilla-Caeiros, A.[Alfredo], Miyatake, M.N.[Mariko Nakano], Perez-Meana, H.[Hector],
Isolate Speech Recognition Based on Time-Frequency Analysis Methods,
CIARP09(297-304).
Springer DOI 0911
BibRef

Veronková, J.[Jitka], Palková, Z.[Zdena],
Perception of Czech in Noise: Stability of Vowels,
COST08(149-161).
Springer DOI 0810
BibRef

Skarnitzl, R.[Radek],
Challenges in Segmenting the Czech Lateral Liquid,
COST08(162-172).
Springer DOI 0810
BibRef

Machac, P.[Pavel],
Implications of Acoustic Variation for the Segmentation of the Czech Trill r,
COST08(173-181).
Springer DOI 0810
BibRef

Jorschick, A.B.[Annett B.],
Voicing in Labial Plosives in Czech,
COST08(182-189).
Springer DOI 0810
BibRef

Volín, J.[Jan],
Normalization of the Vocalic Space,
COST08(190-200).
Springer DOI 0810
BibRef

Rajnoha, J.[Josef], Pollák, P.[Petr],
Czech Spontaneous Speech Collection and Annotation: The Database of Technical Lectures,
COST08(377-385).
Springer DOI 0810
BibRef

Janda, J.[Jan],
Quantitative Analysis of the Relative Local Speech Rate,
COST08(368-376).
Springer DOI 0810
BibRef

Zhang, B.[Bo], Zhuang, X.[Xin], Huang, P.[Pan], Feng, C.[Chen], Zhao, J.[Jie],
Application of Uni-Directional Microphone Array for Identifying English Pronunciation Errors,
CISP09(1-5).
IEEE DOI 0910
BibRef

Kuremoto, T., Komoto, T., Kobayashi, K., Obayashi, M.,
A Voice Instruction Learning System Using PL-T-SOM,
CISP09(1-6).
IEEE DOI 0910
BibRef

Espi, M., Takeuchi, Y.,
Substitution of Vocal Folds for Voice Generation by Means of Intra-Oral Pulse Generator,
CISP09(1-5).
IEEE DOI 0910
BibRef

Orhan, Z., Gormez, Z.,
Evaluation of the Concatenative Turkish Text-to-Speech System,
CISP09(1-5).
IEEE DOI 0910
BibRef

Cai, Y.[Yu], Yuan, J.P.[Jian-Ping], Hou, C.[Chaohuan], Yang, J.[Jun], Wu, B.[Bian],
Harmonic Enhancement with Noise Reduction of Speech Signal by Comb Filtering,
CISP09(1-4).
IEEE DOI 0910
BibRef

Li, W.F.[Wei-Feng], Billard, A., Bourlard, H.,
Keyword Detection for Spontaneous Speech,
CISP09(1-5).
IEEE DOI 0910
BibRef

Zhang, X.Y.[Xin-Yi], Yao, J.X.[Jian-Xiao], He, Q.A.[Qi-Ang],
Research of STRAIGHT Spectrogram and Difference Subspace Algorithm for Speech Recognition,
CISP09(1-4).
IEEE DOI 0910
BibRef

Lu, X.[Xugang], Matsuda, S., Unoki, M., Nakamura, S.,
Temporal Modulation Normalization for Robust Speech Feature Extraction and Recognition,
CISP09(1-4).
IEEE DOI 0910
BibRef

Jun, Y.Z.[Yue Zhen], Lei, W.[Wang], Hao, W.[Wang],
A New Parameter of Speech Character Based on the Bloomfield's Model,
CISP09(1-4).
IEEE DOI 0910
BibRef

Qasemi Zadeh, B.[Behrang], Shen, J.[Jiali], O'Neill, I.[Ian], Miller, P.[Paul], Hanna, P.[Philip], Stewart, D.[Darryl], Wang, H.B.[Hong-Bin],
A Speech Based Approach to Surveillance Video Retrieval,
AVSBS09(336-339).
IEEE DOI 0909
BibRef

Cristani, M., Pesarin, A., Drioli, C., Tavano, A., Perina, A., Murino, V.,
Auditory dialog analysis and understanding by generative modelling of interactional dynamics,
CVPR4HB09(103-109).
IEEE DOI 0906
BibRef

Gosztolya, G.[Gábor], Bánhalmi, A.[András], Tóth, L.[László],
Using One-Class Classification Techniques in the Anti-phoneme Problem,
IbPRIA09(433-440).
Springer DOI 0906
BibRef

Chen, J.B.[Jin-Biao], Zhang, S.Q.[Shi-Qing],
Manifold learning-based phoneme recognition,
IASP09(308-312).
IEEE DOI 0904
BibRef

Mahdhaoui, A.[Ammar], Chetouani, M.[Mohamed], Zong, C.[Cong],
Motherese detection based on segmental and supra-segmental features,
ICPR08(1-4).
IEEE DOI 0812
parent-infant interactions. BibRef

Zeng, Z.[Zhi], Li, X.[Xin], Ma, X.H.[Xiao-Hong], Ji, Q.A.[Qi-Ang],
Adaptive context recognition based on audio signal,
ICPR08(1-4).
IEEE DOI 0812
BibRef

Luo, L.[Li], Lu, P.F.[Peng-Fei], Wang, Z.F.[Zeng-Fu],
A real-time accompaniment system based on sung voice recognition,
ICPR08(1-4).
IEEE DOI 0812
BibRef

Pesarin, A., Cristani, M., Murino, V., Drioli, C., Perina, A., Tavano, A.,
A statistical signature for automatic dialogue classification,
ICPR08(1-4).
IEEE DOI 0812
BibRef

Choi, H.[Heeyoul], Gutierrez-Osuna, R.[Ricardo], Choi, S.J.[Seung-Jin], Choe, Y.[Yoonsuck],
Kernel oriented discriminant analysis for speaker-independent phoneme spaces,
ICPR08(1-4).
IEEE DOI 0812
BibRef

Terry, L.[Louis], Katsaggelos, A.K.[Aggelos K.],
A phone-viseme dynamic Bayesian network for audio-visual automatic speech recognition,
ICPR08(1-4).
IEEE DOI 0812
BibRef

Krajewski, J.[Jarek], Batliner, A.[Anton], Wieland, R.[Rainer],
Multiple classifier applied on predicting microsleep from speech,
ICPR08(1-4).
IEEE DOI 0812
BibRef

Banerjee, P.[Pratyush], Garg, G.[Gaurav], Mitra, P.[Pabitra], Basu, A.[Anupam],
Application of triphone clustering in acoustic modeling for continuous speech recognition in Bengali,
ICPR08(1-4).
IEEE DOI 0812
BibRef

Bouzid, A.[Aďcha], Ellouze, N.[Noureddine],
Voicing Detection in Noisy Speech Signal,
ICISP08(544-551).
Springer DOI 0807
BibRef

Türkmen, H.I.[H. Irem], Karsligil, M.E.[M. Elif],
Reconstruction of Dysphonic Speech by MELP,
CIARP08(767-774).
Springer DOI 0809
BibRef

Hain, T.[Thomas], Burget, L.[Lukas], Dines, J.[John], Garau, G.[Giulia], Karafiat, M.[Martin], van Leeuwen, D.[David], Lincoln, M.[Mike], Wan, V.[Vincent],
The 2007 AMI(DA) System for Meeting Transcription,
MTPH07(xx-yy).
Springer DOI 0705
BibRef

Lamel, L., Bilinski, E., Gauvain, J.L., Adda, G., Barras, C., Zhu, X.,
The LIMSI RT07 Lecture Transcription System,
MTPH07(xx-yy).
Springer DOI 0705
BibRef

Fiscus, J.G.[Jonathan G.], Ajot, J.[Jerome], Garofolo, J.S.[John S.],
The Rich Transcription 2007 Meeting Recognition Evaluation,
MTPH07(xx-yy).
Springer DOI 0705
BibRef

Stolcke, A.[Andreas], Anguera, X.[Xavier], Boakye, K.[Kofi], Çetin, Ö.[Özgür], Janin, A.[Adam], Magimai-Doss, M.[Mathew], Wooters, C.[Chuck], Zheng, J.[Jing],
The SRI-ICSI Spring 2007 Meeting and Lecture Recognition System,
MTPH07(xx-yy).
Springer DOI 0705
BibRef

Huang, J.[Jing], Marcheret, E.[Etienne], Visweswariah, K.[Karthik], Libal, V.[Vit], Potamianos, G.[Gerasimos],
The IBM Rich Transcription 2007 Speech-to-Text Systems for Lecture Meetings,
MTPH07(xx-yy).
Springer DOI 0705
BibRef

Wölfel, M.[Matthias], Stüker, S.[Sebastian], Kraft, F.[Florian],
The ISL RT-07 Speech-to-Text System,
MTPH07(xx-yy).
Springer DOI 0705
BibRef

Schuller, B.[Björn], Wöllmer, M.[Martin], Moosmayr, T.[Tobias], Ruske, G.[Günther], Rigoll, G.[Gerhard],
Switching Linear Dynamic Models for Noise Robust In-Car Speech Recognition,
DAGM08(xx-yy).
Springer DOI 0806
BibRef

Patil, H.A.[Hemant A.], Basu, T.K.,
Cepstral Domain Teager Energy for Identifying Perceptually Similar Languages,
PReMI07(455-462).
Springer DOI 0712
BibRef

Manwani, N.[Naresh], Mitra, S.K.[Suman K.], Joshi, M.V.,
Spoken Language Identification for Indian Languages Using Split and Merge EM Algorithm,
PReMI07(463-468).
Springer DOI 0712
BibRef

Rao, K.S.[K. Sreenivasa], Laskar, R.H., Koolagudi, S.G.[Shashidhar G.],
Voice Transformation by Mapping the Features at Syllable Level,
PReMI07(479-486).
Springer DOI 0712
BibRef

García, F.[Fernando], Sanchis, E.[Emilio], Hurtado, L.F.[Lluís F.], Segarra, E.[Encarna],
Adaptive Training for Robust Spoken Language Understanding,
CIARP15(519-526).
Springer DOI 1511
BibRef

Pastor, J.[Joan], Hurtado, L.F.[Lluís F.], Segarra, E.[Encarna], Sanchis, E.[Emilio],
Language Modelization and Categorization for Voice-Activated QA,
CIARP11(475-482).
Springer DOI 1111
BibRef

García, F.[Fernando], Hurtado, L.F.[Lluís F.], Sanchis, E.[Emilio], Segarra, E.[Encarna],
An Active Learning Approach for Statistical Spoken Language Understanding,
CIARP11(565-572).
Springer DOI 1111
BibRef

Hurtado, L.F.[Lluís F.], Griol, D.[David], Sanchis, E.[Emilio], Segarra, E.[Encarna],
A Statistical User Simulation Technique for the Improvement of a Spoken Dialog System,
CIARP07(743-752).
Springer DOI 0711
BibRef
Earlier: A2, A1, A4, A3:
A Dialog Management Methodology Based on Neural Networks and Its Application to Different Domains,
CIARP08(643-650).
Springer DOI 0809
BibRef

Oropeza Rodríguez, J.L.[José Luis], Suárez Guerra, S.[Sergio], Sánchez Fernández, L.P.[Luis Pastor],
Using Adaptive Filter to Increase Automatic Speech Recognition Rate in a Digit Corpus,
CIARP07(78-87).
Springer DOI 0711
BibRef

Várallyay, G.[György],
SSM: A Novel Method to Recognize the Fundamental Frequency in Voice Signals,
CIARP07(88-95).
Springer DOI 0711
BibRef

Simőes, C.[Carla], Teixeira, C.[Carlos], Dias, M.[Miguel], Braga, D.[Daniela], Calado, A.[António],
European Portuguese Accent in Acoustic Models for Non-native English Speakers,
CIARP07(734-742).
Springer DOI 0711
BibRef

Smeaton, A.F.[Alan F.], McHugh, M.[Mike],
Towards event detection in an audio-based sensor network,
VSSN05(87-94).
WWW Link. 0511
BibRef

Esposito, A.[Anna], Stejskal, V.[Vojtech], Smékal, Z.[Zdenek], Bourbakis, N.[Nikolaos],
The Significance of Empty Speech Pauses: Cognitive and Algorithmic Issues,
BVAI07(542-554).
Springer DOI 0710
BibRef

Hernández, I.[Igmar], García, P.[Paola], Nolazco, J.[Juan], Buera, L.[Luis], Lleida, E.[Eduardo],
Robust Automatic Speech Recognition Using PD-MEEMLIN,
IbPRIA07(II: 1-8).
Springer DOI 0706
BibRef

Chung, Y.J.[Yong-Joo], Bae, K.S.[Keun-Sung],
Data-Driven Jacobian Adaptation in a Multi-model Structure for Noisy Speech Recognition,
IbPRIA07(II: 452-459).
Springer DOI 0706
BibRef

Cano, S.[Sergio], Suaste, I.[Israel], Escobedo, D.[Daniel], Reyes-García, C.A.[Carlos A.], Ekkel, T.[Taco],
A Combined Classifier of Cry Units with New Acoustic Attributes,
CIARP06(416-425).
Springer DOI 0611
BibRef

Huerta-Hernández, L.D.[Luis D.], Reyes-García, C.A.[Carlos A.],
On the Processing of Fuzzy Patterns for Text Independent Phonetic Speech Segmentation,
CIARP06(437-445).
Springer DOI 0611
BibRef

Alghassi, H., Tafazoli, S., Lawrence, P.,
The Audio Surveillance Eye,
AVSBS06(106-106).
IEEE DOI 0611
BibRef

Yuan, L.C.[Li-Chi], Chen, Z.G.[Zhi-Gang],
A Novel Statistical Model for Speech Recognition and POS Tagging,
AVSBS06(61-61).
IEEE DOI 0611
BibRef

Yin, B.[Bo], Ambikairajah, E.[Eliathamby], Chen, F.[Fang],
Combining Cepstral and Prosodic Features in Language Identification,
ICPR06(IV: 254-257).
IEEE DOI 0609
BibRef

Zouari, L.[Leila], Chollet, G.[Gerard],
Efficient Gaussian Mixture for Speech Recognition,
ICPR06(IV: 294-297).
IEEE DOI 0609
BibRef

Vinciarelli, A.[Alessandro],
Sociometry Based Multiparty Audio Recordings Summarization,
ICPR06(II: 1154-1157).
IEEE DOI 0609
BibRef

Wang, J.C.[Jia-Ching], Wang, J.F.[Jhing-Fa], Lin, C.B.[Cai-Bei], Jian, K.T.[Kun-Ting], Kuok, W.H.[Wai-He],
Content-Based Audio Classification Using Support Vector Machines and Independent Component Analysis,
ICPR06(IV: 157-160).
IEEE DOI 0609
BibRef

Huang, R.Q.[Rong-Qing], Ma, C.X.[Chang-Xue],
Toward A Speaker-Independent Real-Time Affect Detection System,
ICPR06(I: 1204-1207).
IEEE DOI 0609
BibRef

Wang, L.[Liang], Ambikairajah, E.[Eliathamby], Choi, E.H.C.[Eric H.C.],
Multi-lingual Phoneme Recognition and Language Identification Using Phonotactic Information,
ICPR06(IV: 245-248).
IEEE DOI 0609
BibRef

Kruger, S.E.[Sven E.], Schaffoner, M.[Martin], Katz, M.[Marcel], Andelic, E.[Edin], Wendemuth, A.[Andreas],
Mixture of Support Vector Machines for HMM based Speech Recognition,
ICPR06(IV: 326-329).
IEEE DOI 0609
BibRef

Andelic, E.[Edin], Schaffoner, M.[Martin], Katz, M.[Marcel], Kruger, S.E.[Sven E.],
A Hybrid HMM-Based Speech Recognizer Using Kernel-Based Discriminants as Acoustic Models,
ICPR06(II: 1158-1161).
IEEE DOI 0609
BibRef

Halavati, R.[Ramin], Shouraki, S.B.[Saeed Bagheri], Tajik, H.[Hossein], Cholakian, A.[Arpineh], Razaghpour, M.[Mina],
A Novel Approach to Very Fast and Noise Robust, Isolated Word Speech Recognition,
ICPR06(III: 190-193).
IEEE DOI 0609
BibRef

Lin, H.[Hui], Ou, Z.J.[Zhi-Jian],
Switching Auxiliary Chains for Speech Recognition based on Dynamic Bayesian Networks,
ICPR06(IV: 258-261).
IEEE DOI 0609
BibRef

Maier, A.[Andreas], Hacker, C.[Christian], Noth, E.[Elmar], Nkenke, E.[Emeka], Haderlein, T.[Tino], Rosanowski, F.[Frank], Schuster, M.[Maria],
Intelligibility of Children with Cleft Lip and Palate: Evaluation by Speech Recognition Techniques,
ICPR06(IV: 274-277).
IEEE DOI 0609
BibRef

Zioko, B.[Bartosz], Manandhar, S.[Suresh], Wilson, R.C.[Richard C.],
Phoneme segmentation of speech,
ICPR06(IV: 282-285).
IEEE DOI 0609
BibRef

Choi, E.H.C.[Eric H. C.],
A Noise Robust Front-end for Speech Recognition Using Hough Transform and Cumulative Distribution Mapping,
ICPR06(IV: 286-289).
IEEE DOI 0609
BibRef

Liu, M.[Ming], Huang, T.S.[Thomas S.],
A Bayesian Predictive Method for Automatic Speech Segmentation,
ICPR06(IV: 290-293).
IEEE DOI 0609
BibRef

Haas, J.[Jürgen], Gallwitz, F.[Florian], Horndasch, A.[Axel], Huber, R.[Richard], Warnke, V.[Volker],
Telephone-Based Speech Dialog Systems,
DAGM05(125).
Springer DOI 0509
BibRef

Maier, A.[Andreas], Hacker, C.[Christian], Steidl, S.[Stefan], Nöth, E.[Elmar], Niemann, H.[Heinrich],
Robust Parallel Speech Recognition in Multiple Energy Bands,
DAGM05(133).
Springer DOI 0509
BibRef

Hacker, C.[Christian], Cincarek, T.[Tobias], Gruhn, R.[Rainer], Steidl, S.[Stefan], Nöth, E.[Elmar], Niemann, H.[Heinrich],
Pronunciation Feature Extraction,
DAGM05(141).
Springer DOI 0509
BibRef

Ivanecky, J.[Jozef], Fischer, J.[Julia], Mast, M.[Marion], Kunzmann, S.[Siegfried], Ross, T.[Thomas], Fischer, V.[Volker],
Multi-lingual and Multi-modal Speech Processing and Applications,
DAGM05(149).
Springer DOI 0509
BibRef

Dai, H.S.[Hai-Sheng], Zhu, X.Y.[Xiao-Yan], Luo, Y.P.[Yu-Pin], Yang, S.[Shiyuan],
An Utterance Verification Algorithm in Keyword Spotting System,
IbPRIA05(II:555).
Springer DOI 0509
BibRef

Rodríguez, L.J.[Luis Javier], Torres, M.I.[M. Inés],
A Clustering Algorithm for the Fast Match of Acoustic Conditions in Continuous Speech Recognition,
IbPRIA05(II:562).
Springer DOI 0509
BibRef

Sánchez, J.A.[Joan Andreu], Benedí, J.M.[José Miguel], Linares, D.[Diego],
Performance of a SCFG-Based Language Model with Training Data Sets of Increasing Size,
IbPRIA05(II:586).
Springer DOI 0509
BibRef

Nolazco-Flores, J.A.[Juan A.], Salgado-Garza, L.R.[Luis R.], Peńa-Díaz, M.[Marco],
Speaker Dependent ASRs for Huastec and Western-Huastec Náhuatl Languages,
IbPRIA05(II:595).
Springer DOI 0509
BibRef

Ortiz, D.[Daniel], Varea, I.G.[Ismael García], Casacuberta, F.[Francisco],
A General Framework to Deal with the Scaling Problem in Phrase-Based Statistical Machine Translation,
IbPRIA07(II: 314-322).
Springer DOI 0706
BibRef

Tomás, J.[Jesús], Lloret, J.[Jaime], Casacuberta, F.[Francisco],
Phrase-Based Statistical Machine Translation Using Approximate Matching,
IbPRIA07(I: 475-482).
Springer DOI 0706
BibRef
Earlier:
Phrase-Based Alignment Models for Statistical Machine Translation,
IbPRIA05(II:605).
Springer DOI 0509
BibRef

García-Varea, I.[Ismael], Ortiz, D.[Daniel], Nevado, F.[Francisco], Gómez, P.A.[Pedro A.], Casacuberta, F.[Francisco],
Automatic Segmentation of Bilingual Corpora: A Comparison of Different Techniques,
IbPRIA05(II:614).
Springer DOI 0509
BibRef

Andrés, J.[Jesús], Navarro, J.R.[José R.], Juan, A.[Alfons], Casacuberta, F.[Francisco],
Word Translation Disambiguation Using Multinomial Classifiers,
IbPRIA05(II:622).
Springer DOI 0509
BibRef

Ribadas, F.J.[Francisco Jose], Vilares, M.[Manuel], Vilares, J.[Jesus],
Semantic Similarity Between Sentences Through Approximate Tree Matching,
IbPRIA05(II:638).
Springer DOI 0509
BibRef

Chen, K.[Ke],
Speaker Modeling with Various Speech Representations,
ICBA04(592-599).
Springer DOI 0505
BibRef

Sit, C.H.[Chin-Hung], Mak, M.W.[Man-Wai], Kung, S.Y.[Sun-Yuan],
Maximum Likelihood and Maximum a Posteriori Adaptation for Distributed Speaker Recognition Systems,
ICBA04(640-647).
Springer DOI 0505
BibRef

Gutkin, A., King, S.,
Structural representation of speech for phonetic classification,
ICPR04(III: 438-441).
IEEE DOI 0409
BibRef

Demirekler, M., Karahan, F., Ciloglu, T.,
Fusing length and voicing information, and HMM decision using a Bayesian causal tree against insufficient training data,
ICPR00(Vol III: 102-105).
IEEE DOI 0403
BibRef

Kashino, K., Kurozumi, T., Murase, H.,
Feature fluctuation absorption for a quick audio retrieval from long recordings,
ICPR00(Vol III: 98-101).
IEEE DOI 0403
BibRef

Garcia-Varea, I., Sanchis, A., Casacuberta, F.,
A new approach to speech-input statistical translation,
ICPR00(Vol III: 90-93).
IEEE DOI 0403
BibRef

Gravier, G., Sigelle, M., Chollet, G.,
A Markov random field model for automatic speech recognition,
ICPR00(Vol III: 254-257).
IEEE DOI 0403
BibRef

Ruiz, N., Rosa, M., Lopez, F., Martinez, D., Mata, R.,
New algorithm for searching minimum bit rate wavelet representations with application to multiresolution-based perceptual audio coding,
ICPR00(Vol III: 286-289).
IEEE DOI 0403
BibRef

Steidl, S.[Stefan], Stemmer, G.[Georg], Hacker, C.[Christian], Nöth, E.[Elmar], Niemann, H.[Heinrich],
Improving Children's Speech Recognition by HMM Interpolation with an Adults' Speech Recognizer,
DAGM03(600-607).
Springer DOI 0310
BibRef

Stephenson, T.A., Magimai-Doss, M., Bourlard, H.,
Mixed bayesian networks with auxiliary variables for automatic speech recognition,
ICPR02(IV: 293-296).
IEEE DOI 0211
BibRef

Bourlard, H.,
Some recent advances in speech recognition with potential applications in other statistical pattern recognition areas,
ICPR02(III: 727-727).
IEEE DOI 0211
BibRef

Tanaka, K., Kojima, H., Fujimura, N., Itoh, Y.,
Constructing speech processing systems on universal phonetic codes accompanied with reference acoustic models,
ICPR02(III: 728-731).
IEEE DOI 0211
BibRef

Katz, M., Meier, H.G., Dolfing, H., Klakow, D.,
Robustness of linear discriminant analysis in automatic speech recognition,
ICPR02(III: 371-374).
IEEE DOI 0211
BibRef

Lefevre, S., Maillard, B., Vincent, N.,
A two level classifier process for audio segmentation,
ICPR02(III: 891-894).
IEEE DOI 0211
BibRef

de Stefano, C., Della Cioppa, A., Marcelli, A.,
An investigation on MPEG audio segmentation by evolutionary algorithms,
ICDAR01(952-956).
IEEE DOI 0109
BibRef

Nouza, J.,
Feature selection methods for hidden Markov model-based speech recognition,
ICPR96(II: 186-190).
IEEE DOI 0509
BibRef

Vande Wouwer, G., Scheunders, P., van Dyck, D.,
Wavelet-FILVQ classifier for speech analysis,
ICPR96(IV: 214-218).
IEEE DOI 0509
BibRef

Uma, S., Sridhar, V., Krishna, G.,
Time-normalization techniques for speaker-independent isolated word recognition,
ICPR92(III:537-540).
IEEE DOI 9208
BibRef

Rieck, S., Schukat-Talamazzini, E.G., Niemann, H.,
Speaker adaptation using semi-continuous hidden Markov models,
ICPR92(III:541-544).
IEEE DOI 9208
BibRef

He, H.Y.[Hai-Yan], Wen, C.Y.[Cheng-Yi],
ART2-based multiple MLPs neural network for speaker-independent recognition of isolated words,
ICPR92(II:590-593).
IEEE DOI 9208
BibRef

Edmonds, E.A., Pan, L.Y., O'Brien, S.M.,
Automatic feature extraction from spectrograms for acoustic-phonetic analysis,
ICPR92(II:701-704).
IEEE DOI 9208
BibRef

Ishikawa, Y., Nakajima, K.,
A real time connected word recognition system,
ICPR90(II: 215-217).
IEEE DOI 9008
BibRef

Chapter on New Unsorted Entries, and Other Miscellaneous Papers continues in
Speech Analysis, other than Recognition .


Last update:Sep 18, 2017 at 11:34:11