24.1.4 Signal Processing, Speech Papers

Most of these entries are included because they appear in the full tables of contents of journals that are indexed completely.

Hanson, A.R., Riseman, E.M., Fisher, E.,
Context in word recognition,
PR(8), No. 1, January 1976, pp. 35-45.
WWW Version. 0309 BibRef

de Mori, R., Laface, P., Makhonine, V.A., Mezzalama, M.,
A syntactic procedure for the recognition of glottal pulses in continuous speech,
PR(9), No. 4, 1977, pp. 181-189.
WWW Version. 0309 BibRef

Maroy, J.P., Berthod, M.,
Natural language understanding by a robot: A pattern recognition problem,
PR(10), No. 2, 1978, pp. 63-71.
WWW Version. 0309 BibRef

Pal, S.K., Datta, A.K., Majumder, D.D.[D. Dutta],
A self-supervised vowel recognition system,
PR(12), No. 1, 1980, pp. 27-34.
WWW Version. 0309 BibRef

Pathak, A.[Amita], Pal, S.K.[Sankar K.],
On the convergence of 'A self-supervised vowel recognition system',
PR(20), No. 2, 1987, pp. 237-244.
WWW Version. 0309 BibRef

de Mori, R.[Renato], Giordano, G.[Giovanna],
Algorithms for syllabic hypothesization in continuous speech,
PR(14), No. 1-6, 1981, pp. 245-260.
WWW Version. 0309 BibRef

Howard, Jr., J.H.[James H.],
Feature selection in human auditory perception,
PR(15), No. 5, 1982, pp. 397-403.
WWW Version. 0309 BibRef

Thomason, M.G., Granum, E., Blake, R.E.,
Experiments in dynamic programming inference of Markov networks with strings representing speech data,
PR(19), No. 5, 1986, pp. 343-352.
WWW Version. 0309 BibRef

Tanaka, E.[Eiichi], Toyama, T.[Takanori], Kawai, S.[Sachiko],
High speed error correction of phoneme sequences,
PR(19), No. 5, 1986, pp. 407-412.
WWW Version. 0309 BibRef

Lee, L.S., Tseng, C.Y., Chen, K.J., Huang, J., Hwang, C.H., Ting, P.Y., Lin, L.J., Chen, C.C.,
A Mandarin dictation machine based upon a hierarchical recognition approach and Chinese natural language analysis,
PAMI(12), No. 7, July 1990, pp. 695-704.
IEEE Abstract. IEEE Top Reference.
WWW Version. 0401 BibRef

Kenny, P., Lennig, M., Mermelstein, P.,
Speaker adaptation in a large-vocabulary Gaussian HMM recognizer,
PAMI(12), No. 9, September 1990, pp. 917-920.
IEEE Abstract. IEEE Top Reference.
WWW Version. 0401 BibRef

Casacuberta, F.,
Some relations among stochastic finite state networks used in automatic speech recognition,
PAMI(12), No. 7, July 1990, pp. 691-695.
IEEE Abstract. IEEE Top Reference.
WWW Version. 0401 BibRef

Yannakoudakis, E.J., Tsomokos, I., Hutton, P.J.,
n-Grams and their implication to natural language understanding,
PR(23), No. 5, 1990, pp. 509-528.
WWW Version. 0401 BibRef

Hochberg, J., Mniszewski, S.M., Calleja, T., Papcun, G.J.,
A default hierarchy for pronouncing English,
PAMI(13), No. 9, September 1991, pp. 957-964.
IEEE Abstract. IEEE Top Reference.
WWW Version. 0401 BibRef

Carlson, B.A., Clements, M.A.,
A computationally compact divergence measure for speech processing,
PAMI(13), No. 12, December 1991, pp. 1255-1260.
IEEE Abstract. IEEE Top Reference.
WWW Version. 0401 BibRef

Ney, H.[Hermann],
A comparative study of two search strategies for connected word recognition: dynamic programming and heuristic search,
PAMI(14), No. 5, May 1992, pp. 586-595.
IEEE Abstract. IEEE Top Reference.
WWW Version. 0401 BibRef

Ney, H.[Hermann],
Stochastic Modelling: From Pattern Classification to Speech Recognition and Translation,
ICPR00(Vol III: 21-28).
IEEE DOI Reference
HTML Version. 0009 BibRef

Wu, J.X.[Jian-Xiong], Chan, C.[Chorkin],
Isolated word recognition by neural network models with cross-correlation coefficients for speech dynamics,
PAMI(15), No. 11, November 1993, pp. 1174-1185.
IEEE Abstract. IEEE Top Reference.
WWW Version. 0401 BibRef

Liu, L.C.[Lih-Cherng], Chiou, D.[Denis], Wang, H.C.[Hsiao-Chuan],
A speech recognition method based on feature distributions,
PR(24), No. 8, 1991, pp. 717-722.
WWW Version. 0401 BibRef

Pinkowski, B.[Ben],
Multiscale Fourier descriptors for classifying semivowels in spectrograms,
PR(26), No. 10, October 1993, pp. 1593-1602.
WWW Version. 0401 BibRef

Pinkowski, B.,
Principal Component Analysis of Speech Spectrogram Images,
PR(30), No. 5, May 1997, pp. 777-787.
WWW Version. 9705 BibRef

Chen, W.Y.[Wen-Yuan], Liao, Y.F.[Yuan-Fu], Chen, S.H.[Sin-Horng],
Speech recognition with hierarchical recurrent neural networks,
PR(28), No. 6, June 1995, pp. 795-805.
WWW Version. 0401 BibRef

Huo, Q.A.[Qi-Ang], Chan, C.[Chorkin],
Contextual vector quantization for speech recognition with discrete hidden Markov model,
PR(28), No. 4, April 1995, pp. 513-517.
WWW Version. 0401 BibRef

Pham, T.D.[Tuan D.], Wagner, M.[Michael],
A geostatistical model for linear prediction analysis of speech,
PR(31), No. 12, December 1998, pp. 1981-1991.
WWW Version. 0401 BibRef

Lee, T.[Tan], Ching, P.C., Chan, L.W.[Lai-Wan],
Isolated word recognition using modular recurrent neural networks,
PR(31), No. 6, June 1998, pp. 751-760.
WWW Version. 0401 BibRef

Tacer, B.[Berkant], Loughlin, P.J.[Patrick J.],
Non-stationary signal classification using the joint moments of time-frequency distributions,
PR(31), No. 11, November 1998, pp. 1635-1641.
WWW Version. 0401 BibRef

Han, J.[Jiqing], Gao, W.[Wen],
Robust telephone speech recognition based on channel compensation,
PR(32), No. 6, June 1999, pp. 1061-1067.
WWW Version. 0401 BibRef

Lewis, M.A.[Michael A.], Ramachandran, R.P.[Ravi P.],
Cochannel speaker count labelling based on the use of cepstral and pitch prediction derived features,
PR(34), No. 2, February 2001, pp. 499-507.
WWW Version. 0011 BibRef

Kant, S.[Shri], Verma, N.[Neelam],
An Effective Source Recognition Algorithm: Extraction of Significant Binary Words,
PRL(21), No. 11, October 2000, pp. 981-988. 0010 BibRef

Kwong, S., He, Q.H., Man, K.F., Tang, K.S.,
A maximum model distance approach for HMM-based speech recognition,
PR(31), No. 3, March 1998, pp. 219-229.
WWW Version. 0401 BibRef

He, Q.H., Kwong, S., Man, K.F., Tang, K.S.,
An improved maximum model distance approach for HMM-based speech recognition systems,
PR(33), No. 10, October 2000, pp. 1749-1758.
WWW Version. 0006 BibRef

Li, M., McAllister, H.G., Black, N.D., de Perez, T.A.,
Wavelet-based nonlinear AGC method for hearing aid loudness compensation,
VISP(147), No. 6, December 2000, pp. 502-507. 0101 BibRef

Gray, P., Hollier, M.P., Massara, R.E.,
Non-intrusive speech-quality assessment using vocal-tract models,
VISP(147), No. 6, December 2000, pp. 493-501. 0101 BibRef

Wu, C.H., Chen, Y.J., Yan, G.L.,
Integration of phonetic and prosodic information for robust utterance verification,
VISP(147), No. 1, February 2000, pp. 55. 0005 BibRef

Kim, W.[Wooil], Kang, S.[Sunmee], Ko, H.S.[Han-Seok],
Spectral subtraction based on phonetic dependency and masking effects,
VISP(147), No. 5, October 2000, pp. 423-427. 0101 BibRef

Hussain, A., Campbell, D.R.,
Intelligibility improvements using binaural diverse sub-band processing applied to speech corrupted with automobile noise,
VISP(148), No. 2, April 2001, pp. 127-132. 0106 BibRef

Bohez, E.L.J.[Erik L.J.], Senevirathne, T.R.,
Speech recognition using fractals,
PR(34), No. 11, November 2001, pp. 2227-2243.
WWW Version. 0108 BibRef

Sarkar, S., Poor, H.V.,
Multirate signal processing on finite fields,
VISP(148), No. 4, August 2001, pp. 254-262. 0201 BibRef

Chen, S.H., Wang, J.F.,
Application of wavelet transforms for C/V segmentation on Mandarin speech signals,
VISP(148), No. 2, April 2001, pp. 133-139. 0106 BibRef

Mouria-Beji, F.[Fériel],
A hierarchical Bayesian model for continuous speech recognition,
PRL(23), No. 7, May 2002, pp. 773-781.
HTML Version. 0203 BibRef

Chen, F.K., Yang, J.F., Yan, Y.L.,
Candidate scheme for fast ACELP search,
VISP(149), No. 1, February 2002, pp. 10-16.
IEEE Top Reference. 0205 Algebraic code excited linear prediction. Speech coding. BibRef

Mumolo, E.[Enzo],
Spectral domain texture analysis for speech enhancement,
PR(35), No. 10, October 2002, pp. 2181-2191.
WWW Version. 0206 BibRef

Liu, J.W.[Jing-Wei], Cheng, Q.S.[Qian-Sheng], Zheng, Z.G.[Zhong-Guo], Qian, M.[Minping],
A DTW-based probability model for speaker feature analysis and data mining,
PRL(23), No. 11, September 2002, pp. 1271-1276.
HTML Version. 0206 BibRef

Ding, Z.O., McLoughlin, I.V., Tan, E.C.,
Extension of proposal of standards for intelligibility tests of Chinese speech: CDRT-tone,
VISP(150), No. 1, February 2003, pp. 1-5.
IEEE Top Reference. 0304 BibRef

Huang, C.S.[Chao-Shih], Wang, H.C.[Hsiao-Chuan],
Bandwidth-adjusted LPC analysis for robust speech recognition,
PRL(24), No. 9-10, June 2003, pp. 1583-1587.
WWW Version. 0304 BibRef

Juang, Y.T.[Yau-Tarng], Huang, K.C.[Kuo-Chang], Ding, I.J.[Ing-Jr],
Speaker adaptation based on MAP estimation using fuzzy controller,
PRL(24), No. 15, November 2003, pp. 2807-2813.
WWW Version. 0308 BibRef

Ding, I.J.[Ing-Jr],
Incremental MLLR speaker adaptation by fuzzy logic control,
PR(40), No. 11, November 2007, pp. 3110-3119.
WWW Version. 0707 Speech recognition; Speaker adaptation; Hidden Markov model; Maximum likelihood linear regression; T-S fuzzy logic controller BibRef

Li, T.F.[Tze Fen],
Speech Recognition of Mandarin Monosyllables,
PR(36), No. 11, November 2003, pp. 2713-2721.
WWW Version. 0309 BibRef

Farooq, O., Datta, S.,
Wavelet based robust sub-band features for phoneme recognition,
VISP(151), No. 3, June 2004, pp. 187-193.
IEEE Abstract. IEEE Top Reference. 0409 BibRef

de Lamare, R.C., Alcaim, A.,
Strategies to improve the performance of very low bit rate speech coders and application to a variable rate 1.2 kb/s codec,
VISP(152), No. 1, February 2005, pp. 74-86.
IEEE Abstract. IEEE Top Reference. 0501 BibRef

Ricotti, L.P.,
Multitapering and a wavelet variant of MFCC in speech recognition,
VISP(152), No. 1, February 2005, pp. 29-35.
IEEE Abstract. IEEE Top Reference. 0501 BibRef

Chen, K.[Ke],
On the use of different speech representations for speaker modeling,
SMC-C(35), No. 3, August 2005, pp. 301-314.
IEEE DOI Reference 0508 BibRef

Vera-Candeas, P., Ruiz-Reyes, N., Rosa-Zurera, M., Lopez-Ferreras, F., Curpian-Alonso, J.,
New matching pursuit based sinusoidal modelling method for audio coding,
VISP(151), No. 1, February 2004, pp. 21-28.
IEEE Abstract. IEEE Top Reference. 0403 BibRef

Vera-Candeas, P.[Pedro], Ruiz-Reyes, N.[Nicolás], Rosa-Zurera, M.[Manuel], Cuevas-Martinez, J.C.[Juan C.], López-Ferreras, F.[Francisco],
Adaptive Signal Models for Wide-Band Speech and Audio Compression,
IbPRIA05(II:571).
Springer DOI Reference 0509 BibRef

Zhong, W., Li, S., Tai, H.M.,
Signal subspace approach for narrowband noise reduction in speech,
VISP(152), No. 6, December 2005, pp. 800-805.
WWW Version. 0512 BibRef

Chen, B.[Berlin],
Exploring the use of latent topical information for statistical Chinese spoken document retrieval,
PRL(27), No. 1, 1 January 2006, pp. 9-18.
WWW Version. 0512 BibRef

Chen, B.[Berlin], Chen, Y.T.[Yi-Ting],
Extractive spoken document summarization for information retrieval,
PRL(29), No. 4, 1 March 2008, pp. 426-437.
WWW Version. 0711 Extractive summarization; Information retrieval; Topical mixture model; Spoken documents; Speech recognition BibRef

Wan, C.[Chunru], Liu, M.C.[Ming-Chun],
Content-based audio retrieval with relevance feedback,
PRL(27), No. 2, 15 January 2006, pp. 85-92.
WWW Version. 0512 BibRef

Li, C., Li, S., Zhang, D., Chen, G.,
Cryptanalysis of a data security protection scheme for VoIP,
VISP(153), No. 1, February 2006, pp. 1-10.
WWW Version. 0602 BibRef

Radhakrishnan, R.[Regunathan], Divakaran, A.[Ajay], Xiong, Z.[Ziyou], Otsuka, I.[Isao],
A Content-Adaptive Analysis and Representation Framework for Audio Event Discovery from 'Unscripted' Multimedia,
JASP(2006), 2006, pp. 1-24.
WWW Version. 0603 BibRef

Chu, W.T.[Wei-Ta], Cheng, W.H.[Wen-Huang], Wu, J.L.[Ja-Ling],
Semantic Context Detection Using Audio Event Fusion,
JASP(2006), 2006, pp. 1-12.
WWW Version. 0603 BibRef

Sandler, M., Black, D.,
Scalable audio coding for compression and loss resilient streaming,
VISP(153), No. 3, June 2006, pp. 331-339.
WWW Version. 0608 BibRef

Chang, J.H.[Joon-Hyuk], Gazor, S.[Saeed], Kim, N.S.[Nam Soo], Mitra, S.K.[Sanjit K.],
Multiple statistical models for soft decision in noisy speech enhancement,
PR(40), No. 3, March 2007, pp. 1123-1134.
WWW Version. 0611 Speech enhancement; DCT; Multiple statistical model; Gaussian; Laplacian; Gamma; GOF; PSFM; SAP; PESQ BibRef

Liu, J.W.[Jing-Wei], Wang, Z.Y.[Zuo-Ying], Xiao, X.[Xi],
A hybrid SVM/DDBHMM decision fusion modeling for robust continuous digital speech recognition,
PRL(28), No. 8, 1 June 2007, pp. 912-920.
WWW Version. 0704 Speech recognition; Gaussian mixture model; Duration distribution based hidden Markov model (DDBHMM); Support vector machine BibRef

Guido, R.C.[Rodrigo Capobianco], Pereira, J.C.[Jose Carlos], Slaets, J.F.W.[Jan Frans Willem],
Introduction to the Special Issue: Advances on pattern recognition for speech and audio processing,
PRL(28), No. 11, 1 August 2007, pp. 1283-1284.
WWW Version. 0706 BibRef

Leavitt, N.,
Two technologies vie for recognition in speech market,
Computer(36), No. 6, June 2003, pp. 13-16.
IEEE DOI Reference 0306 BibRef

Paulson, L.D.,
Speech Recognition Moves from Software to Hardware,
Computer(39), No. 11, November 2006, pp. 15-18.
IEEE DOI Reference 0611 BibRef

Stavrakoudis, D.G., Theocharis, J.B.,
Pipelined Recurrent Fuzzy Neural Networks for Nonlinear Adaptive Speech Prediction,
SMC-B(37), No. 5, October 2007, pp. 1305-1320.
IEEE DOI Reference 0711 BibRef

Frankel, J.[Joe], King, S.[Simon],
Factoring Gaussian precision matrices for linear dynamic models,
PRL(28), No. 16, December 2007, pp. 2264-2272.
WWW Version. 0711 Linear dynamic model; Error distribution; Precision matrix; Speech. BibRef

Chouireb, F.[Fatima], Guerti, M.[Mhania],
Towards a high quality Arabic speech synthesis system based on neural networks and residual excited vocal tract model,
SIViP(2), No. 1, January 2008, pp. 73-87.
Springer DOI Reference 0712 BibRef

Araujo, L.[Lourdes], Serrano, J.I.[J. Ignacio],
Highly accurate error-driven method for noun phrase detection,
PRL(29), No. 4, 1 March 2008, pp. 547-557.
WWW Version. 0711 Noun phrase detection; Evolutionary programming; Grammar induction; Information retrieval BibRef

Zhang, Y.X.[Yong-Xin], Scordilis, M.S.[Michael S.],
Effective online unsupervised adaptation of Gaussian mixture models and its application to speech classification,
PRL(29), No. 6, 15 April 2008, pp. 735-744.
WWW Version. 0803 Gaussian mixture model; Speech classification; Online adaptation; Unsupervised adaptation BibRef

Baluja, S.[Shumeet], Covell, M.[Michele],
Waveprint: Efficient wavelet-based audio fingerprinting,
PR(41), No. 11, November 2008, pp. 3467-3480.
WWW Version. 0808 Audio retrieval; Applications; Image/video retrieval; Pattern analysis BibRef

O'Shaughnessy, D.[Douglas],
Invited paper: Automatic speech recognition: History, methods and challenges,
PR(41), No. 10, October 2008, pp. 2965-2979.
WWW Version. 0808 Automatic speech recognition; Hidden Markov models; Adaptation; Compensation; Pattern recognition; Spectral representation BibRef

Zeng, J.[Jia], Xie, L.[Lei], Liu, Z.Q.A.[Zhi-Qi-Ang],
Type-2 fuzzy Gaussian mixture models,
PR(41), No. 12, December 2008, pp. 3636-3643.
WWW Version. 0810 BibRef
Earlier: A1, A3, Only:
Type-2 fuzzy hidden Markov models to phoneme recognition,
ICPR04(I: 192-195).
IEEE DOI Reference 0409 Type-2 fuzzy sets; Gaussian mixture models; Hidden Markov models BibRef


Bouzid, A.[Aïcha], Ellouze, N.[Noureddine],
Voicing Detection in Noisy Speech Signal,
ICISP08(544-551).
Springer DOI Reference 0807 BibRef

Kukharchik, P., Kheidorov, I., Bovbel, E., Ladeev, D.,
Speech Signal Processing Based on Wavelets and SVM for Vocal Tract Pathology Detection,
ICISP08(192-199).
Springer DOI Reference 0807 BibRef

Türkmen, H.I.[H. Irem], Karsligil, M.E.[M. Elif],
Reconstruction of Dysphonic Speech by MELP,
CIARP08(767-774).
Springer DOI Reference 0809 BibRef

Hain, T.[Thomas], Burget, L.[Lukas], Dines, J.[John], Garau, G.[Giulia], Karafiat, M.[Martin], van Leeuwen, D.[David], Lincoln, M.[Mike], Wan, V.[Vincent],
The 2007 AMI(DA) System for Meeting Transcription,
MTPH07(xx-yy).
Springer DOI Reference 0705 BibRef

Lamel, L., Bilinski, E., Gauvain, J.L., Adda, G., Barras, C., Zhu, X.,
The LIMSI RT07 Lecture Transcription System,
MTPH07(xx-yy).
Springer DOI Reference 0705 BibRef

Fiscus, J.G.[Jonathan G.], Ajot, J.[Jerome], Garofolo, J.S.[John S.],
The Rich Transcription 2007 Meeting Recognition Evaluation,
MTPH07(xx-yy).
Springer DOI Reference 0705 BibRef

Stolcke, A.[Andreas], Anguera, X.[Xavier], Boakye, K.[Kofi], Çetin, Ö.[Özgür], Janin, A.[Adam], Magimai-Doss, M.[Mathew], Wooters, C.[Chuck], Zheng, J.[Jing],
The SRI-ICSI Spring 2007 Meeting and Lecture Recognition System,
MTPH07(xx-yy).
Springer DOI Reference 0705 BibRef

Huang, J.[Jing], Marcheret, E.[Etienne], Visweswariah, K.[Karthik], Libal, V.[Vit], Potamianos, G.[Gerasimos],
The IBM Rich Transcription 2007 Speech-to-Text Systems for Lecture Meetings,
MTPH07(xx-yy).
Springer DOI Reference 0705 BibRef

Wölfel, M.[Matthias], Stüker, S.[Sebastian], Kraft, F.[Florian],
The ISL RT-07 Speech-to-Text System,
MTPH07(xx-yy).
Springer DOI Reference 0705 BibRef

Schuller, B.[Björn], Wöllmer, M.[Martin], Moosmayr, T.[Tobias], Ruske, G.[Günther], Rigoll, G.[Gerhard],
Switching Linear Dynamic Models for Noise Robust In-Car Speech Recognition,
DAGM08(xx-yy).
Springer DOI Reference 0806 BibRef

Patil, H.A.[Hemant A.], Basu, T.K.,
Cepstral Domain Teager Energy for Identifying Perceptually Similar Languages,
PReMI07(455-462).
Springer DOI Reference 0712 BibRef

Manwani, N.[Naresh], Mitra, S.K.[Suman K.], Joshi, M.V.,
Spoken Language Identification for Indian Languages Using Split and Merge EM Algorithm,
PReMI07(463-468).
Springer DOI Reference 0712 BibRef

Rao, K.S.[K. Sreenivasa], Laskar, R.H., Koolagudi, S.G.[Shashidhar G.],
Voice Transformation by Mapping the Features at Syllable Level,
PReMI07(479-486).
Springer DOI Reference 0712 BibRef

Nagesha, Kumar, G.H.[G. Hemantha],
Signal Resampling Technique Combining Level Crossing and Auditory Features,
PReMI07(447-454).
Springer DOI Reference 0712 BibRef

Hurtado, L.F.[Lluís F.], Griol, D.[David], Sanchis, E.[Emilio], Segarra, E.[Encarna],
A Statistical User Simulation Technique for the Improvement of a Spoken Dialog System,
CIARP07(743-752).
Springer DOI Reference 0711 BibRef

Oropeza Rodríguez, J.L.[José Luis], Suárez Guerra, S.[Sergio], Sánchez Fernández, L.P.[Luis Pastor],
Using Adaptive Filter to Increase Automatic Speech Recognition Rate in a Digit Corpus,
CIARP07(78-87).
Springer DOI Reference 0711 BibRef

Várallyay, G.[György],
SSM: A Novel Method to Recognize the Fundamental Frequency in Voice Signals,
CIARP07(88-95).
Springer DOI Reference 0711 BibRef

Ohara, M.[Masatoshi], Utsumi, A.[Akira], Yamazoe, H.[Hirotake], Abe, S.[Shinji], Katayama, N.[Noriaki],
Attention Monitoring for Music Contents Based on Analysis of Signal-Behavior Structures,
ACCV07(I: 292-302).
Springer DOI Reference 0711 BibRef

Simões, C.[Carla], Teixeira, C.[Carlos], Dias, M.[Miguel], Braga, D.[Daniela], Calado, A.[António],
European Portuguese Accent in Acoustic Models for Non-native English Speakers,
CIARP07(734-742).
Springer DOI Reference 0711 BibRef

Smeaton, A.F.[Alan F.], McHugh, M.[Mike],
Towards event detection in an audio-based sensor network,
VSSN05(87-94).
WWW Version. 0511 BibRef

Esposito, A.[Anna], Stejskal, V.[Vojtech], Smékal, Z.[Zdenek], Bourbakis, N.[Nikolaos],
The Significance of Empty Speech Pauses: Cognitive and Algorithmic Issues,
BVAI07(542-554).
Springer DOI Reference 0710 BibRef

Hernández, I.[Igmar], García, P.[Paola], Nolazco, J.[Juan], Buera, L.[Luis], Lleida, E.[Eduardo],
Robust Automatic Speech Recognition Using PD-MEEMLIN,
IbPRIA07(II: 1-8).
Springer DOI Reference 0706 BibRef

Chung, Y.J.[Yong-Joo], Bae, K.S.[Keun-Sung],
Data-Driven Jacobian Adaptation in a Multi-model Structure for Noisy Speech Recognition,
IbPRIA07(II: 452-459).
Springer DOI Reference 0706 BibRef

Expósito, J.E.M.[J. E. Muñoz], Reyes, N.R.[N. Ruiz], Galán, S.G.[S. Garcia], Candeas, P.V.[P. Vera],
Speech/Music Classification Based on Distributed Evolutionary Fuzzy Logic for Intelligent Audio Coding,
IbPRIA07(II: 556-563).
Springer DOI Reference 0706 BibRef

Ferrer, C.A.[Carlos A.], González, E.[Eduardo], Hernández-Díaz, M.E.[María E.],
Evaluation of Time and Frequency Domain-Based Methods for the Estimation of Harmonics-to-Noise-Ratios in Voice Signals,
CIARP06(406-415).
Springer DOI Reference 0611 BibRef

Cano, S.[Sergio], Suaste, I.[Israel], Escobedo, D.[Daniel], Reyes-García, C.A.[Carlos A.], Ekkel, T.[Taco],
A Combined Classifier of Cry Units with New Acoustic Attributes,
CIARP06(416-425).
Springer DOI Reference 0611 BibRef

Huerta-Hernández, L.D.[Luis D.], Reyes-García, C.A.[Carlos A.],
On the Processing of Fuzzy Patterns for Text Independent Phonetic Speech Segmentation,
CIARP06(437-445).
Springer DOI Reference 0611 BibRef

Alghassi, H., Tafazoli, S., Lawrence, P.,
The Audio Surveillance Eye,
AVSBS06(106-106).
IEEE DOI Reference 0611 BibRef

Yuan, L.[Lichi], Chen, Z.G.[Zhi-Gang],
A Novel Statistical Model for Speech Recognition and POS Tagging,
AVSBS06(61-61).
IEEE DOI Reference 0611 BibRef

Yin, B.[Bo], Ambikairajah, E.[Eliathamby], Chen, F.[Fang],
Combining Cepstral and Prosodic Features in Language Identification,
ICPR06(IV: 254-257).
WWW Version. 0609 BibRef

Leila, Chollet, G.[Gerard],
Efficient Gaussian Mixture for Speech Recognition,
ICPR06(IV: 294-297).
WWW Version. 0609 BibRef

Vinciarelli, A.[Alessandro],
Sociometry Based Multiparty Audio Recordings Summarization,
ICPR06(II: 1154-1157).
WWW Version. 0609 BibRef

Wang, J.C.[Jia-Ching], Wang, J.F.[Jhing-Fa], Lin, C.B.[Cai-Bei], Jian, K.T.[Kun-Ting], Kuok, W.H.[Wai-He],
Content-Based Audio Classification Using Support Vector Machines and Independent Component Analysis,
ICPR06(IV: 157-160).
WWW Version. 0609 BibRef

Huang, R.Q.[Rong-Qing], Ma, C.X.[Chang-Xue],
Toward A Speaker-Independent Real-Time Affect Detection System,
ICPR06(I: 1204-1207).
WWW Version. 0609 BibRef

Wang, L.[Liang], Ambikairajah, E.[Eliathamby], Choi, E.H.C.[Eric H.C.],
Multi-lingual Phoneme Recognition and Language Identification Using Phonotactic Information,
ICPR06(IV: 245-248).
WWW Version. 0609 BibRef

Kruger, S.E.[Sven E.], Schaffoner, M.[Martin], Katz, M.[Marcel], Andelic, E.[Edin], Wendemuth, A.[Andreas],
Mixture of Support Vector Machines for HMM based Speech Recognition,
ICPR06(IV: 326-329).
WWW Version. 0609 BibRef

Zhang, S.L.[Shi-Lei], Zhang, S.W.[Shu-Wu], Xu, B.[Bo],
A Two-level Method for Unsupervised Speaker-based Audio Segmentation,
ICPR06(IV: 298-301).
WWW Version. 0609 BibRef

Pao, T.L.[Tsang-Long], Chen, Y.T.[Yu-Te], Yeh, J.H.[Jun-Heng], Li, P.J.[Pei-Jia],
Mandarin Emotional Speech Recognition Based on SVM and NN,
ICPR06(I: 1096-1100).
WWW Version. 0609 BibRef

Andelic, E.[Edin], Schaffoner, M.[Martin], Katz, M.[Marcel], Kruger, S.E.[Sven E.],
A Hybrid HMM-Based Speech Recognizer Using Kernel-Based Discriminants as Acoustic Models,
ICPR06(II: 1158-1161).
WWW Version. 0609 BibRef

You, M.[Mingyu], Chen, C.[Chun], Bu, J.J.[Jia-Jun], Liu, J.[Jia], Tao, J.H.[Jian-Hua],
Emotional Speech Analysis on Nonlinear Manifold,
ICPR06(III: 91-94).
WWW Version. 0609 BibRef

Halavati, R.[Ramin], Shouraki, S.B.[Saeed Bagheri], Tajik, H.[Hossein], Cholakian, A.[Arpineh], Razaghpour, M.[Mina],
A Novel Approach to Very Fast and Noise Robust, Isolated Word Speech Recognition,
ICPR06(III: 190-193).
WWW Version. 0609 BibRef

Lin, H.[Hui], Ou, Z.J.[Zhi-Jian],
Switching Auxiliary Chains for Speech Recognition based on Dynamic Bayesian Networks,
ICPR06(IV: 258-261).
WWW Version. 0609 BibRef

Li, W.H.[Wei-Hong], Liu, M.[Ming], Zhu, Z.G.[Zhi-Gang], Huang, T.S.[Thomas S.],
LDV Remote Voice Acquisition and Enhancement,
ICPR06(IV: 262-265).
WWW Version. 0609 BibRef

Maier, A.[Andreas], Hacker, C.[Christian], Noth, E.[Elmar], Nkenke, E.[Emeka], Haderlein, T.[Tino], Rosanowski, F.[Frank], Schuster, M.[Maria],
Intelligibility of Children with Cleft Lip and Palate: Evaluation by Speech Recognition Techniques,
ICPR06(IV: 274-277).
WWW Version. 0609 BibRef

Ziółko, B.[Bartosz], Manandhar, S.[Suresh], Wilson, R.C.[Richard C.],
Phoneme segmentation of speech,
ICPR06(IV: 282-285).
WWW Version. 0609 BibRef

Choi, E.H.C.[Eric H. C.],
A Noise Robust Front-end for Speech Recognition Using Hough Transform and Cumulative Distribution Mapping,
ICPR06(IV: 286-289).
WWW Version. 0609 BibRef

Liu, M.[Ming], Huang, T.S.[Thomas S.],
A Bayesian Predictive Method for Automatic Speech Segmentation,
ICPR06(IV: 290-293).
WWW Version. 0609 BibRef

Xue, W.[Wei], Du, S.[Sidan], Fang, C.Z.[Cheng-Zhi], Ye, Y.[Yingxian],
Voice Activity Detection Using Wavelet-Based Multiresolution Spectrum and Support Vector Machines and Audio Mixing Algorithm,
CVHCI06(78-88).
Springer DOI Reference 0605 BibRef

Haas, J.[Jürgen], Gallwitz, F.[Florian], Horndasch, A.[Axel], Huber, R.[Richard], Warnke, V.[Volker],
Telephone-Based Speech Dialog Systems,
DAGM05(125).
Springer DOI Reference 0509 BibRef

Maier, A.[Andreas], Hacker, C.[Christian], Steidl, S.[Stefan], Nöth, E.[Elmar], Niemann, H.[Heinrich],
Robust Parallel Speech Recognition in Multiple Energy Bands,
DAGM05(133).
Springer DOI Reference 0509 BibRef

Hacker, C.[Christian], Cincarek, T.[Tobias], Gruhn, R.[Rainer], Steidl, S.[Stefan], Nöth, E.[Elmar], Niemann, H.[Heinrich],
Pronunciation Feature Extraction,
DAGM05(141).
Springer DOI Reference 0509 BibRef

Ivanecky, J.[Jozef], Fischer, J.[Julia], Mast, M.[Marion], Kunzmann, S.[Siegfried], Ross, T.[Thomas], Fischer, V.[Volker],
Multi-lingual and Multi-modal Speech Processing and Applications,
DAGM05(149).
Springer DOI Reference 0509 BibRef

Dai, H.S.[Hai-Sheng], Zhu, X.Y.[Xiao-Yan], Luo, Y.P.[Yu-Pin], Yang, S.[Shiyuan],
An Utterance Verification Algorithm in Keyword Spotting System,
IbPRIA05(II:555).
Springer DOI Reference 0509 BibRef

Rodríguez, L.J.[Luis Javier], Torres, M.I.[M. Inés],
A Clustering Algorithm for the Fast Match of Acoustic Conditions in Continuous Speech Recognition,
IbPRIA05(II:562).
Springer DOI Reference 0509 BibRef

García-Perera, L.P.[L. Paola], Nolazco-Flores, J.A.[Juan A.], Mex-Perera, C.[Carlos],
Cryptographic-Speech-Key Generation Architecture Improvements,
IbPRIA05(II:579).
Springer DOI Reference 0509 BibRef

Sánchez, J.A.[Joan Andreu], Benedí, J.M.[José Miguel], Linares, D.[Diego],
Performance of a SCFG-Based Language Model with Training Data Sets of Increasing Size,
IbPRIA05(II:586).
Springer DOI Reference 0509 BibRef

Nolazco-Flores, J.A.[Juan A.], Salgado-Garza, L.R.[Luis R.], Peña-Díaz, M.[Marco],
Speaker Dependent ASRs for Huastec and Western-Huastec Náhuatl Languages,
IbPRIA05(II:595).
Springer DOI Reference 0509 BibRef

Ortiz, D.[Daniel], Varea, I.G.[Ismael García], Casacuberta, F.[Francisco],
A General Framework to Deal with the Scaling Problem in Phrase-Based Statistical Machine Translation,
IbPRIA07(II: 314-322).
Springer DOI Reference 0706 BibRef

Tomás, J.[Jesús], Lloret, J.[Jaime], Casacuberta, F.[Francisco],
Phrase-Based Statistical Machine Translation Using Approximate Matching,
IbPRIA07(I: 475-482).
Springer DOI Reference 0706 BibRef
Earlier:
Phrase-Based Alignment Models for Statistical Machine Translation,
IbPRIA05(II:605).
Springer DOI Reference 0509 BibRef

García-Varea, I.[Ismael], Ortiz, D.[Daniel], Nevado, F.[Francisco], Gómez, P.A.[Pedro A.], Casacuberta, F.[Francisco],
Automatic Segmentation of Bilingual Corpora: A Comparison of Different Techniques,
IbPRIA05(II:614).
Springer DOI Reference 0509 BibRef

Andrés, J.[Jesús], Navarro, J.R.[José R.], Juan, A.[Alfons], Casacuberta, F.[Francisco],
Word Translation Disambiguation Using Multinomial Classifiers,
IbPRIA05(II:622).
Springer DOI Reference 0509 BibRef

Ribadas, F.J.[Francisco Jose], Vilares, M.[Manuel], Vilares, J.[Jesus],
Semantic Similarity Between Sentences Through Approximate Tree Matching,
IbPRIA05(II:638).
Springer DOI Reference 0509 BibRef

Welk, M.[Martin], Bergmeister, A.[Achim], Weickert, J.[Joachim],
Denoising of Audio Data by Nonlinear Diffusion,
ScaleSpace05(598-609).
WWW Version. 0505 BibRef

Chen, K.[Ke],
Speaker Modeling with Various Speech Representations,
ICBA04(592-599).
WWW Version. 0505 BibRef

Sit, C.H.[Chin-Hung], Mak, M.W.[Man-Wai], Kung, S.Y.[Sun-Yuan],
Maximum Likelihood and Maximum a Posteriori Adaptation for Distributed Speaker Recognition Systems,
ICBA04(640-647).
WWW Version. 0505 BibRef

Gutkin, A., King, S.,
Structural representation of speech for phonetic classification,
ICPR04(III: 438-441).
IEEE DOI Reference 0409 BibRef

Cristani, M., Bicego, M., Murino, V.,
On-line adaptive background modelling for audio surveillance,
ICPR04(II: 399-402).
IEEE DOI Reference 0409 BibRef

Demirekler, M., Karahan, F., Ciloglu, T.,
Fusing length and voicing information, and HMM decision using a Bayesian causal tree against insufficient training data,
ICPR00(Vol III: 102-105).
IEEE DOI Reference 0403 BibRef

Kashino, K., Kurozumi, T., Murase, H.,
Feature fluctuation absorption for a quick audio retrieval from long recordings,
ICPR00(Vol III: 98-101).
IEEE DOI Reference 0403 BibRef

Garcia-Varea, I., Sanchis, A., Casacuberta, F.,
A new approach to speech-input statistical translation,
ICPR00(Vol III: 90-93).
IEEE DOI Reference 0403 BibRef

Gravier, G., Sigelle, M., Chollet, G.,
A Markov random field model for automatic speech recognition,
ICPR00(Vol III: 254-257).
IEEE DOI Reference 0403 BibRef

Ruiz, N., Rosa, M., Lopez, F., Martinez, D., Mata, R.,
New algorithm for searching minimum bit rate wavelet representations with application to multiresolution-based perceptual audio coding,
ICPR00(Vol III: 286-289).
IEEE DOI Reference 0403 BibRef

Steidl, S.[Stefan], Stemmer, G.[Georg], Hacker, C.[Christian], Nöth, E.[Elmar], Niemann, H.[Heinrich],
Improving Children's Speech Recognition by HMM Interpolation with an Adults' Speech Recognizer,
DAGM03(600-607).
HTML Version. 0310 BibRef

Stephenson, T.A., Magimai-Doss, M., Bourlard, H.,
Mixed Bayesian networks with auxiliary variables for automatic speech recognition,
ICPR02(IV: 293-296).
IEEE DOI Reference 0211 BibRef

Bourlard, H.,
Some recent advances in speech recognition with potential applications in other statistical pattern recognition areas,
ICPR02(III: 727-727).
IEEE DOI Reference 0211 BibRef

Tanaka, K., Kojima, H., Fujimura, N., Itoh, Y.,
Constructing speech processing systems on universal phonetic codes accompanied with reference acoustic models,
ICPR02(III: 728-731).
IEEE DOI Reference 0211 BibRef

Katz, M., Meier, H.G., Dolfing, H., Klakow, D.,
Robustness of linear discriminant analysis in automatic speech recognition,
ICPR02(III: 371-374).
IEEE DOI Reference 0211 BibRef

Lefevre, S., Maillard, B., Vincent, N.,
A two level classifier process for audio segmentation,
ICPR02(III: 891-894).
IEEE DOI Reference 0211 BibRef

de Stefano, C., Della Cioppa, A., Marcelli, A.,
An investigation on MPEG audio segmentation by evolutionary algorithms,
ICDAR01(952-956).
IEEE DOI Reference 0109 BibRef

Nouza, J.,
Feature selection methods for hidden Markov model-based speech recognition,
ICPR96(II: 186-190).
IEEE DOI Reference 0509 BibRef

VandeWouwer, G., Scheunders, P., VanDyck, D.,
Wavelet-FILVQ classifier for speech analysis,
ICPR96(IV: 214-218).
IEEE DOI Reference 0509 BibRef

Uma, S., Sridhar, V., Krishna, G.,
Time-normalization techniques for speaker-independent isolated word recognition,
ICPR92(III:537-540).
IEEE DOI Reference 9208 BibRef

Rieck, S., Schukat-Talamazzini, E.G., Niemann, H.,
Speaker adaptation using semi-continuous hidden Markov models,
ICPR92(III:541-544).
IEEE DOI Reference 9208 BibRef

He, H.Y.[Hai-Yan], Wen, C.Y.[Cheng-Yi],
ART2-based multiple MLPs neural network for speaker-independent recognition of isolated words,
ICPR92(II:590-593).
IEEE DOI Reference 9208 BibRef

Edmonds, E.A., Pan, L.Y., O'Brien, S.M.,
Automatic feature extraction from spectrograms for acoustic-phonetic analysis,
ICPR92(II:701-704).
IEEE DOI Reference 9208 BibRef

Ishikawa, Y., Nakajima, K.,
A real time connected word recognition system,
ICPR90(II: 215-217).
IEEE DOI Reference 9008 BibRef

Chapter on New Unsorted Entries, and Other Miscellaneous Papers continues in
Speaker Verification, Speaker Identification.


Last update: Jan 1, 2009 at 17:09:16