24.1.15.1 Speech Analysis, other than Recognition

Chapter Contents (Back)
Speech. Not so much what is said, but other analysis

Howard, Jr., J.H.[James H.],
Feature selection in human auditory perception,
PR(15), No. 5, 1982, pp. 397-403.
Elsevier DOI 0309
BibRef

Thomason, M.G., Granum, E., Blake, R.E.,
Experiments in dynamic programming inference of Markov networks with strings representing speech data,
PR(19), No. 5, 1986, pp. 343-352.
Elsevier DOI 0309
BibRef

Hochberg, J., Mniszewski, S.M., Calleja, T., Papcun, G.J.,
A default hierarchy for pronouncing English,
PAMI(13), No. 9, September 1991, pp. 957-964.
IEEE DOI 0401
BibRef

Carlson, B.A., Clements, M.A.,
A computationally compact divergence measure for speech processing,
PAMI(13), No. 12, December 1991, pp. 1255-1260.
IEEE DOI 0401
BibRef

Tacer, B.[Berkant], Loughlin, P.J.[Patrick J.],
Non-stationary signal classification using the joint moments of time-frequency distributions,
PR(31), No. 11, November 1998, pp. 1635-1641.
Elsevier DOI 0401
BibRef

Li, M., McAllister, H.G., Black, N.D., de Perez, T.A.,
Wavelet-based nonlinear AGC method for hearing aid loudness compensation,
VISP(147), No. 6, December 2000, pp. 502-507. 0101
BibRef

Gray, P., Hollier, M.P., Massara, R.E.,
Non-intrusive speech-quality assessment using vocal-tract models,
VISP(147), No. 6, December 2000, pp. 493-501. 0101
BibRef

Sarkar, S., Poor, H.V.,
Multirate signal processing on finite fields,
VISP(148), No. 4, August 2001, pp. 254-262. 0201
BibRef

Mumolo, E.[Enzo],
Spectral domain texture analysis for speech enhancement,
PR(35), No. 10, October 2002, pp. 2181-2191.
Elsevier DOI 0206
BibRef

Ding, Z.O., McLoughlin, I.V., Tan, E.C.,
Extension of proposal of standards for intelligibility tests of Chinese speech: CDRT-tone,
VISP(150), No. 1, February 2003, pp. 1-5.
IEEE Top Reference. 0304
BibRef

de Lamare, R.C., Alcaim, A.,
Strategies to improve the performance of very low bit rate speech coders and application to a variable rate 1.2 kb/s codec,
VISP(152), No. 1, February 2005, pp. 74-86.
IEEE Abstract. 0501
BibRef

Vera-Candeas, P., Ruiz-Reyes, N., Rosa-Zurera, M., Lopez-Ferreras, F., Curpian-Alonso, J.,
New matching pursuit based sinusoidal modelling method for audio coding,
VISP(151), No. 1, February 2004, pp. 21-28.
IEEE Abstract. 0403
BibRef

Vera-Candeas, P.[Pedro], Ruiz-Reyes, N.[Nicolás], Rosa-Zurera, M.[Manuel], Cuevas-Martinez, J.C.[Juan C.], López-Ferreras, F.[Francisco],
Adaptive Signal Models for Wide-Band Speech and Audio Compression,
IbPRIA05(II:571).
Springer DOI 0509
BibRef

Li, C., Li, S., Zhang, D., Chen, G.,
Cryptanalysis of a data securityp protection scheme for VoIP,
VISP(153), No. 1, February 2006, pp. 1-10.
DOI Link 0602
BibRef

Sandler, M., Black, D.,
Scalable audio coding for compression and loss resilient streaming,
VISP(153), No. 3, June 2006, pp. 331-339.
DOI Link 0608
BibRef

Guido, R.C.[Rodrigo Capobianco], Pereira, J.C.[Jose Carlos], Slaets, J.F.W.[Jan Frans Willem],
Introduction to the Special Issue: Advances on pattern recognition for speech and audio processing,
PRL(28), No. 11, 1 August 2007, pp. 1283-1284.
Elsevier DOI 0706
BibRef

Chang, J.H.[Joon-Hyuk], Gazor, S.[Saeed], Kim, N.S.[Nam Soo], Mitra, S.K.[Sanjit K.],
Multiple statistical models for soft decision in noisy speech enhancement,
PR(40), No. 3, March 2007, pp. 1123-1134.
Elsevier DOI 0611
Speech enhancement; DCT; Multiple statistical model; Gaussian; Laplacian; Gamma; GOF; PSFM; SAP; PESQ BibRef

Frankel, J.[Joe], King, S.[Simon],
Factoring Gaussian precision matrices for linear dynamic models,
PRL(28), No. 16, December 2007, pp. 2264-2272.
Elsevier DOI 0711
Linear dynamic model; Error distribution; Precision matrix Speech. BibRef

Arias-Londono, J.D.[Julian D.], Godino-Llorente, J.I.[Juan I.], Saenz-Lechon, N.[Nicolas], Osma-Ruiz, V.[Victor], Castellanos-Dominguez, C.G.[Cesar German],
An improved method for voice pathology detection by means of a HMM-based feature space transformation,
PR(43), No. 9, September 2010, pp. 3100-3112.
Elsevier DOI 1006
Pathological voice; Hidden Markov models; Minimum classification error; Dynamic feature space transformation BibRef

Mahdi, A.E.[Abdulhussain E.], Picovici, D.[Dorel],
New single-ended objective measure for non-intrusive speech quality evaluation,
SIViP(4), No. 1, March 2010, pp. xx-yy.
Springer DOI 1003
BibRef

Shafiee, S.[Soheil], Almasganj, F.[Farshad], Vazirnezhad, B.[Bahram], Jafari, A.[Ayyoob],
A two-stage speech activity detection system considering fractal aspects of prosody,
PRL(31), No. 9, 1 July 2010, pp. 936-948.
Elsevier DOI 1004
Speech activity detection; Prosody; Fractal dimension BibRef

Yoon, J.Y.[Jae-Yul], Park, H.[Hochong],
Improving the Speech Quality of VoIP by Packet Prioritization,
SPLetters(18), No. 12, December 2011, pp. 725-728.
IEEE DOI 1112
BibRef

Dennis, J., Tran, H.D., Li, H.,
Spectrogram Image Feature for Sound Event Classification in Mismatched Conditions,
SPLetters(18), No. 2, February 2011, pp. 130-133.
IEEE DOI 1101
BibRef

Liang, Y.[Yuan], Liu, X.L.[Xiang-Long], Lou, Y.H.[Yi-Hua], Shan, B.S.[Bao-Song],
An improved noise-robust voice activity detector based on hidden semi-Markov models,
PRL(32), No. 7, 1 May 2011, pp. 1044-1053.
Elsevier DOI 1101
Voice activity detection; State duration; Observation distribution; Hidden semi-Markov model; Likelihood ratio test; Forward variable BibRef

Liu, X.L.[Xiang-Long], Liang, Y.[Yuan], Lou, Y.H.[Yi-Hua], Li, H.[He], Shan, B.S.[Bao-Song],
Noise-Robust Voice Activity Detector Based on Hidden Semi-Markov Models,
ICPR10(81-84).
IEEE DOI 1008
BibRef

Mohanty, M.N.[Mihir Narayan], Jena, B.[Bhagyalaxmi],
Analysis of stressed human speech,
IJCVR(2), No. 2, 2011, pp. 180-187.
DOI Link 1109
BibRef

Lopez-Moreno, I., Ramos, D., Gonzalez-Dominguez, J., Gonzalez-Rodriguez, J.,
Von Mises-Fisher Models in the Total Variability Subspace for Language Recognition,
SPLetters(18), No. 12, December 2011, pp. 705-708.
IEEE DOI 1112
BibRef

Jelassi, S.[Sofiene], Rubino, G.[Gerardo],
A study of artificial speech quality assessors of VoIP calls subject to limited bursty packet losses,
JIVP(2011), No. 1 2011, pp. xx-yy.
DOI Link 1203
BibRef

Ben Aicha, A.[Anis], Ben Jebara, S.[Sofia],
Reduction of musical residual noise using perceptual tools with classic speech denoising techniques,
SIViP(6), No. 1, March 2012, pp. 85-97.
WWW Link. 1203
BibRef

Pulakka, H., Laaksonen, L., Myllyla, V., Yrttiaho, Y., Alku, P.,
Conversational Evaluation of Speech Bandwidth Extension Using a Mobile Handset,
SPLetters(19), No. 4, April 2012, pp. 203-206.
IEEE DOI 1203
BibRef

Liang, S.[Shan], Liu, W.J.[Wen-Ju], Jiang, W.[Wei],
Integrating Binary Mask Estimation With MRF Priors of Cochleagram for Speech Separation,
SPLetters(19), No. 10, October 2012, pp. 627-630.
IEEE DOI 1209
BibRef

Esch, T., Rungeler, M., Heese, F., Vary, P.,
Estimation of Rapidly Time-Varying Harmonic Noise for Speech Enhancement,
SPLetters(19), No. 10, October 2012, pp. 659-662.
IEEE DOI 1209
BibRef

Safavi, S., Hanani, A., Russell, M., Jancovic, P., Carey, M.J.,
Contrasting the Effects of Different Frequency Bands on Speaker and Accent Identification,
SPLetters(19), No. 12, December 2012, pp. 829-832.
IEEE DOI 1212
BibRef

Safavi, S., Khan, U.A.,
Revisiting Finite-Time Distributed Algorithms via Successive Nulling of Eigenvalues,
SPLetters(22), No. 1, January 2015, pp. 54-57.
IEEE DOI 1410
directed graphs BibRef

Wu, Z.Z.[Zhi-Zheng], Kinnunen, T., Chng, E.S.[Eng Siong], Li, H.Z.[Hai-Zhou],
Mixture of Factor Analyzers Using Priors From Non-Parallel Speech for Voice Conversion,
SPLetters(19), No. 12, December 2012, pp. 914-917.
IEEE DOI 1212
BibRef

Valero, X., Alias, F.,
Gammatone Cepstral Coefficients: Biologically Inspired Features for Non-Speech Audio Classification,
MultMed(14), No. 6, 2012, pp. 1684-1689.
IEEE DOI 1212
BibRef

Weninger, F.[Felix], Krajewski, J.[Jarek], Batliner, A.[Anton], Schuller, B.[Björn],
The Voice of Leadership: Models and Performances of Automatic Analysis in Online Speeches,
AffCom(3), No. 4 2012, pp. 496-508.
IEEE DOI 1302
BibRef

Gerkmann, T., Krawczyk, M.,
MMSE-Optimal Spectral Amplitude Estimation Given the STFT-Phase,
SPLetters(20), No. 2, February 2013, pp. 129-132.
IEEE DOI 1302
BibRef

Kim, H.G.[Han-Gyu], Jang, G.J.[Gil-Jin], Park, J.S.[Jeong-Sik], Kim, J.H.[Ji-Hwan], Oh, Y.H.[Yung-Hwan],
Particle filtering based pitch sequence correction for monaural speech segregation,
IJIST(23), No. 1, March 2013, pp. 64-70.
DOI Link 1303
BibRef

Dessein, A., Cont, A.,
An Information-Geometric Approach to Real-Time Audio Segmentation,
SPLetters(20), No. 4, April 2013, pp. 331-334.
IEEE DOI 1303
BibRef

Drugman, T.,
Residual Excitation Skewness for Automatic Speech Polarity Detection,
SPLetters(20), No. 4, April 2013, pp. 387-390.
IEEE DOI 1303
BibRef

Yadav, J., Rao, K.S.,
Detection of Vowel Offset Point From Speech Signal,
SPLetters(20), No. 4, April 2013, pp. 299-302.
IEEE DOI 1303
BibRef

Mohammadiha, N., Martin, R., Leijon, A.,
Spectral Domain Speech Enhancement Using HMM State-Dependent Super-Gaussian Priors,
SPLetters(20), No. 3, March 2013, pp. 253-256.
IEEE DOI 1303
BibRef

Taal, C.H., Jensen, J., Leijon, A.,
On Optimal Linear Filtering of Speech for Near-End Listening Enhancement,
SPLetters(20), No. 3, March 2013, pp. 225-228.
IEEE DOI 1303
BibRef

Teng, P., Jia, Y.,
Voice Activity Detection Via Noise Reducing Using Non-Negative Sparse Coding,
SPLetters(20), No. 5, May 2013, pp. 475-478.
IEEE DOI 1304
BibRef

Romoli, L., Cecchi, S., Piazza, F.,
A Combined Approach for Channel Decorrelation in Stereo Acoustic Echo Cancellation Exploiting Time-Varying Frequency Shifting,
SPLetters(20), No. 7, 2013, pp. 717-720.
IEEE DOI 1307
BibRef

Szurley, J., Bertrand, A., Moonen, M.,
On the Use of Time-Domain Widely Linear Filtering for Binaural Speech Enhancement,
SPLetters(20), No. 7, 2013, pp. 649-652.
IEEE DOI 1307
speech enhancement BibRef

Sarria-Paja, M., Falk, T.H.,
Whispered Speech Detection in Noise Using Auditory-Inspired Modulation Spectrum Features,
SPLetters(20), No. 8, 2013, pp. 783-786.
IEEE DOI 1307
Gaussian processes BibRef

Ramirez, M.A.,
Intra-Predictive Switched Split Vector Quantization of Speech Spectra,
SPLetters(20), No. 8, 2013, pp. 791-794.
IEEE DOI 1307
Gaussian processes BibRef

Ying, D., Yan, Y.,
Robust and Fast Localization of Single Speech Source Using a Planar Array,
SPLetters(20), No. 9, 2013, pp. 909-912.
IEEE DOI 1308
Concave cost function BibRef

Moller, S., Heusdens, R.,
Objective Estimation of Speech Quality for Communication Systems,
PIEEE(101), No. 9, 2013, pp. 1955-1967.
IEEE DOI 1309
Prediction models BibRef

Mowlaee, P., Saeidi, R.,
Iterative Closed-Loop Phase-Aware Single-Channel Speech Enhancement,
SPLetters(20), No. 12, 2013, pp. 1235-1239.
IEEE DOI 1311
Delays BibRef

Kulmer, J., Mowlaee, P.,
Phase Estimation in Single Channel Speech Enhancement Using Phase Decomposition,
SPLetters(22), No. 5, May 2015, pp. 598-602.
IEEE DOI 1411
Harmonic analysis BibRef

Ganapathy, S., Pelecanos, J.,
Enhancing Frequency Shifted Speech Signals in Single Side-Band Communication,
SPLetters(20), No. 12, 2013, pp. 1231-1234.
IEEE DOI 1311
radio receivers BibRef

Traa, J., Smaragdis, P.,
A Wrapped Kalman Filter for Azimuthal Speaker Tracking,
SPLetters(20), No. 12, 2013, pp. 1257-1260.
IEEE DOI 1311
Approximation methods BibRef

Hu, P.F.[Peng-Fei], Liu, W.[Wenju], Jiang, W.[Wei], Yang, Z.[Zhanlei],
Latent topic model for audio retrieval,
PR(47), No. 3, 2014, pp. 1138-1143.
Elsevier DOI 1312
Topic model BibRef

Drugman, T.,
Maximum Phase Modeling for Sparse Linear Prediction of Speech,
SPLetters(21), No. 2, February 2014, pp. 185-189.
IEEE DOI 1402
filtering theory BibRef

Xu, Y.[Yong], Du, J.[Jun], Dai, L.R.[Li-Rong], Lee, C.H.[Chin-Hui],
An Experimental Study on Speech Enhancement Based on Deep Neural Networks,
SPLetters(21), No. 1, January 2014, pp. 65-68.
IEEE DOI 1402
BibRef

Jin, Y.G.[Yu Gwang], Shin, J.W.[Jong Won], Kim, N.S.[Nam Soo],
Spectro-Temporal Filtering for Multichannel Speech Enhancement in Short-Time Fourier Transform Domain,
SPLetters(21), No. 3, March 2014, pp. 352-355.
IEEE DOI 1403
Fourier transforms BibRef

Kwon, K.[Kisoo], Shin, J.W.[Jong Won], Kim, N.S.[Nam Soo],
NMF-Based Speech Enhancement Using Bases Update,
SPLetters(22), No. 4, April 2015, pp. 450-454.
IEEE DOI 1411
matrix decomposition BibRef

Arsikere, H., Lulich, S.M., Alwan, A.,
Estimating Speaker Height and Subglottal Resonances Using MFCCs and GMMs,
SPLetters(21), No. 2, February 2014, pp. 159-162.
IEEE DOI 1402
Gaussian processes BibRef

He, L., Zhang, J., Liu, Q., Yin, H., Lech, M.,
Automatic Evaluation of Hypernasality and Consonant Misarticulation in Cleft Palate Speech,
SPLetters(21), No. 10, October 2014, pp. 1298-1301.
IEEE DOI 1407
Accuracy BibRef

Nathwani, K., Pandit, P., Hegde, R.M.,
Group Delay Based Methods for Speaker Segregation and its Application in Multimedia Information Retrieval,
MultMed(15), No. 6, 2013, pp. 1326-1339.
IEEE DOI 1309
Correlation BibRef

Xie, D.[Danhui], Zhang, W.B.[Wei-Bin],
Estimating Speech Spectral Amplitude Based on the Nakagami Approximation,
SPLetters(21), No. 11, November 2014, pp. 1375-1379.
IEEE DOI 1408
Gaussian distribution BibRef

Drugman, T., Stylianou, Y.,
Fast Inter-Harmonic Reconstruction for Spectral Envelope Estimation in High-Pitched Voices,
SPLetters(21), No. 11, November 2014, pp. 1418-1422.
IEEE DOI 1408
harmonic analysis BibRef

Drugman, T., Stylianou, Y., Kida, Y., Akamine, M.,
Voice Activity Detection: Merging Source and Filter-based Information,
SPLetters(23), No. 2, February 2016, pp. 252-256.
IEEE DOI 1602
filtering theory BibRef

Zheng, C.S.[Cheng-Shi], Peng, R.H.[Ren-Hua], Li, J.[Jian], Li, X.D.[Xiao-Dong],
A Constrained MMSE LP Residual Estimator for Speech Dereverberation in Noisy Environments,
SPLetters(21), No. 12, December 2014, pp. 1462-1466.
IEEE DOI 1410
least mean squares methods BibRef

Sarma, B.D., Prasanna, S.R.M.,
Analysis of Vocal Tract Constrictions using Zero Frequency Filtering,
SPLetters(21), No. 12, December 2014, pp. 1481-1485.
IEEE DOI 1410
filtering theory BibRef

Kim, M., Smaragdis, P.,
Mixtures of Local Dictionaries for Unsupervised Speech Enhancement,
SPLetters(22), No. 3, March 2015, pp. 293-297.
IEEE DOI 1410
Dictionaries BibRef

Kleijn, W.B., Hendriks, R.C.,
A Simple Model of Speech Communication and its Application to Intelligibility Enhancement,
SPLetters(22), No. 3, March 2015, pp. 303-307.
IEEE DOI 1410
Auditory system BibRef

Ko, Y.J.[Young-Joong],
New feature weighting approaches for speech-act classification,
PRL(51), No. 1, 2015, pp. 107-111.
Elsevier DOI 1412
Natural language processing BibRef

Degottex, G.,
A Time Regularization Technique for Discrete Spectral Envelopes Through Frequency Derivative,
SPLetters(22), No. 7, July 2015, pp. 978-982.
IEEE DOI 1412
Cepstral analysis BibRef

Mysore, G.J.,
Can we Automatically Transform Speech Recorded on Common Consumer Devices in Real-World Environments into Professional Production Quality Speech?: A Dataset, Insights, and Challenges,
SPLetters(22), No. 8, August 2015, pp. 1006-1010.
IEEE DOI 1502
audio recording BibRef

Nordholm, S., Kellermann, W., Doclo, S., Valimaki, V., Makino, S., Hershey, J.,
Signal Processing Techniques for Assisted Listening,
SPMag(32), No. 2, March 2015, pp. 16-17.
IEEE DOI 1503
From the Guest Editors. Acoustic signal processing BibRef

Doclo, S., Kellermann, W., Makino, S., Nordholm, S.E.,
Multichannel Signal Enhancement Algorithms for Assisted Listening Devices: Exploiting spatial diversity using multiple microphones,
SPMag(32), No. 2, March 2015, pp. 18-30.
IEEE DOI 1503
audio signal processing BibRef

Kowalczyk, K., Thiergart, O., Taseska, M., Del Galdo, G., Pulkki, V., Habets, E.A.P.,
Parametric Spatial Sound Processing: A flexible and efficient solution to sound scene acquisition, modification, and reproduction,
SPMag(32), No. 2, March 2015, pp. 31-42.
IEEE DOI 1503
audio signal processing BibRef

Kleijn, W.B., Crespo, J.B., Hendriks, R.C., Petkov, P., Sauert, B., Vary, P.,
Optimizing Speech Intelligibility in a Noisy Environment: A unified view,
SPMag(32), No. 2, March 2015, pp. 43-54.
IEEE DOI 1503
speech enhancement BibRef

Gerkmann, T., Krawczyk-Becker, M., Le Roux, J.,
Phase Processing for Single-Channel Speech Enhancement: History and recent advances,
SPMag(32), No. 2, March 2015, pp. 55-66.
IEEE DOI 1503
array signal processing BibRef

Wouters, J., McDermott, H.J., Francart, T.,
Sound Coding in Cochlear Implants: From electric pulses to hearing,
SPMag(32), No. 2, March 2015, pp. 67-80.
IEEE DOI 1503
acoustic signal processing BibRef

Betlehem, T., Zhang, W.[Wen], Poletti, M.A., Abhayapala, T.D.,
Personal Sound Zones: Delivering interface-free audio to multiple listeners,
SPMag(32), No. 2, March 2015, pp. 81-91.
IEEE DOI 1503
audio signal processing BibRef

Valimaki, V., Franck, A., Ramo, J., Gamper, H., Savioja, L.,
Assisted Listening Using a Headset: Enhancing audio perception in real, augmented, and virtual environments,
SPMag(32), No. 2, March 2015, pp. 92-99.
IEEE DOI 1503
audio signal processing BibRef

Sunder, K., He, J.J.[Jian-Jun], Tan, E.L.[Ee Leng], Gan, W.S.[Woon-Seng],
Natural Sound Rendering for Headphones: Integration of signal processing techniques,
SPMag(32), No. 2, March 2015, pp. 100-113.
IEEE DOI 1503
audio signal processing BibRef

Falk, T.H., Parsa, V., Santos, J.F., Arehart, K., Hazrati, O., Huber, R., Kates, J.M., Scollie, S.,
Objective Quality and Intelligibility Prediction for Users of Assistive Listening Devices: Advantages and limitations of existing tools,
SPMag(32), No. 2, March 2015, pp. 114-124.
IEEE DOI 1503
hearing aids BibRef

Saeedi, J.[Jamal], Ahadi, S.M.[Seyed Mohammad], Faez, K.[Karim],
Robust voice activity detection directed by noise classification,
SIViP(9), No. 3, March 2015, pp. 561-572.
WWW Link. 1503
BibRef

Ozawa, K.[Kenji], Tsukahara, S.[Shota], Kinoshita, Y.[Yuichiro], Morise, M.[Masanori],
Instantaneous Evaluation of the Sense of Presence in Audio-Visual Content,
IEICE(E98-D), No. 1, January 2015, pp. 49-57.
WWW Link. 1503
BibRef

Ozawa, K.[Kenji], Tsukahara, S.[Shota], Kinoshita, Y.[Yuichiro], Morise, M.[Masanori],
Development of an Estimation Model for Instantaneous Presence in Audio-Visual Content,
IEICE(E99-D), No. 1, January 2016, pp. 120-127.
WWW Link. 1601
BibRef

Yao, X., Jitsuhiro, T., Miyajima, C., Kitaoka, N., Takeda, K.,
Modeling of Physical Characteristics of Speech under Stress,
SPLetters(22), No. 10, October 2015, pp. 1801-1805.
IEEE DOI 1506
Atmospheric modeling BibRef

Adiga, N., Prasanna, S.R.M.,
Detection of Glottal Activity Using Different Attributes of Source Information,
SPLetters(22), No. 11, November 2015, pp. 2107-2111.
IEEE DOI 1509
feature extraction BibRef

Tong, R.J.[Ren-Jie], Bao, G.Z.[Guang-Zhao], Ye, Z.F.[Zhong-Fu],
A Higher Order Subspace Algorithm for Multichannel Speech Enhancement,
SPLetters(22), No. 11, November 2015, pp. 2004-2008.
IEEE DOI 1509
AWGN BibRef

Tong, R.J.[Ren-Jie], Ye, Z.F.[Zhong-Fu],
Supplementations to the Higher Order Subspace Algorithm for Suppression of Spatially Colored Noise,
SPLetters(24), No. 5, May 2017, pp. 668-672.
IEEE DOI 1704
Colored noise BibRef

Meenakshi, G.N., Ghosh, P.K.,
Robust Whisper Activity Detection Using Long-Term Log Energy Variation of Sub-Band Signal,
SPLetters(22), No. 11, November 2015, pp. 1859-1863.
IEEE DOI 1509
signal detection BibRef

Hsu, C.C.[Chung-Chien], Cheong, K.M.[Kah-Meng], Chi, T.S.[Tai-Shih], Tsao, Y.[Yu],
Robust Voice Activity Detection Algorithm Based on Feature of Frequency Modulation of Harmonics and Its DSP Implementation,
IEICE(E98-D), No. 10, October 2015, pp. 1808-1817.
WWW Link. 1511
BibRef

Tavares, R., Coelho, R.,
Speech Enhancement with Nonstationary Acoustic Noise Detection in Time Domain,
SPLetters(23), No. 1, January 2016, pp. 6-10.
IEEE DOI 1601
speech enhancement BibRef

Lachachi, N.E.[Nour-Eddine], Adla, A.[Abdelkader],
Two approaches-based L2-SVMs reduced to MEB problems for dialect identification,
IJCVR(6), No. 1-2, 2016, pp. 1-18.
DOI Link 1601
BibRef

Gholami-Boroujeny, S.[Shiva], Fallatah, A.[Anwar], Heffernan, B.P.[Brian P.], Dajani, H.R.[Hilmi R.],
Neural network-based adaptive noise cancellation for enhancement of speech auditory brainstem responses,
SIViP(10), No. 1, February 2016, pp. 389-395.
Springer DOI 1601
BibRef

Luo, Y.[You], Bao, G.Z.[Guang-Zhao], Xu, Y.F.[Yang-Fei], Ye, Z.F.[Zhong-Fu],
Supervised Monaural Speech Enhancement Using Complementary Joint Sparse Representations,
SPLetters(23), No. 2, February 2016, pp. 237-241.
IEEE DOI 1602
BibRef

Braun, S., Habets, E.A.P.,
Online Dereverberation for Dynamic Scenarios Using a Kalman Filter With an Autoregressive Model,
SPLetters(23), No. 12, December 2016, pp. 1741-1745.
IEEE DOI 1612
Fourier transforms BibRef

Chakrabarty, S., Habets, E.A.P.,
On the Numerical Instability of an LCMV Beamformer for a Uniform Linear Array,
SPLetters(23), No. 2, February 2016, pp. 272-276.
IEEE DOI 1602
Fourier transforms BibRef

Cherkassky, D., Gannot, S.,
New Insights into the Kalman Filter Beamformer: Applications to Speech and Robustness,
SPLetters(23), No. 3, March 2016, pp. 376-380.
IEEE DOI 1603
Kalman filters BibRef

Chung, H., Plourde, E., Champagne, B.,
Discriminative Training of NMF Model Based on Class Probabilities for Speech Enhancement,
SPLetters(23), No. 4, April 2016, pp. 502-506.
IEEE DOI 1604
Convergence BibRef

Helmrich, C.R., Edler, B.,
Audio Coding Using Overlap and Kernel Adaptation,
SPLetters(23), No. 5, May 2016, pp. 590-594.
IEEE DOI 1604
audio coding BibRef

Eyben, F., Scherer, K.R., Schuller, B.W., Sundberg, J., André, E., Busso, C., Devillers, L.Y., Epps, J., Laukka, P., Narayanan, S.S., Truong, K.P.,
The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing,
AffCom(7), No. 2, April 2016, pp. 190-202.
IEEE DOI 1606
Frequency measurement BibRef

Wang, J., Shang, Y., Jiang, S., Gowda, D., Lv, K.,
Whispered Speech Detection Using Fusion of Group-Delay-Based Subband Modulation Spectrum and Correntropy Features,
SPLetters(23), No. 8, August 2016, pp. 1042-1046.
IEEE DOI 1608
entropy BibRef

Wang, S.S., Chern, A., Tsao, Y., Hung, J.W., Lu, X., Lai, Y.H., Su, B.,
Wavelet Speech Enhancement Based on Nonnegative Matrix Factorization,
SPLetters(23), No. 8, August 2016, pp. 1101-1105.
IEEE DOI 1608
Fourier transforms BibRef

López-Oller, D., Gomez, A.M., Pérez-Córdoba, J.L., Sánchez, V.,
An Error Mitigation Technique for Erasure Channels Based on a Wavelet Representation of the Speech Excitation Signal,
MultMed(18), No. 7, July 2016, pp. 1245-1256.
IEEE DOI 1608
Haar transforms BibRef

Strasser, F., Puder, H.,
Correlation Detection for Adaptive Feedback Cancellation in Hearing Aids,
SPLetters(23), No. 7, July 2016, pp. 979-983.
IEEE DOI 1608
Acoustics BibRef

Park, J., Jin, Y.G., Hwang, S., Shin, J.W.,
Dual Microphone Voice Activity Detection Exploiting Interchannel Time and Level Differences,
SPLetters(23), No. 10, October 2016, pp. 1335-1339.
IEEE DOI 1610
acoustic signal detection BibRef

Petkov, P.N., Stylianou, Y.,
Adaptive Gain Control for Enhanced Speech Intelligibility Under Reverberation,
SPLetters(23), No. 10, October 2016, pp. 1434-1438.
IEEE DOI 1610
adaptive control BibRef

Kobayashi, K.[Kazuhiro], Toda, T.[Tomoki], Nakano, T.[Tomoyasu], Goto, M.[Masataka], Nakamura, S.[Satoshi],
Improvements of Voice Timbre Control Based on Perceived Age in Singing Voice Conversion,
IEICE(E99-D), No. 11, November 2016, pp. 2767-2777.
WWW Link. 1611
BibRef

Wang, Y., Zhao, S., Li, J., Kuang, J.,
Speech Bandwidth Extension Using Recurrent Temporal Restricted Boltzmann Machines,
SPLetters(23), No. 12, December 2016, pp. 1877-1881.
IEEE DOI 1612
Boltzmann machines BibRef

Prathosh, A.P., P, S., Ramakrishnan, A.G., Kumar Ghosh, P.,
Cumulative Impulse Strength for Epoch Extraction,
SPLetters(23), No. 4, April 2016, pp. 424-428.
IEEE DOI 1604
speech processing BibRef

Vignolo, L.D.[Leandro D.], Prasanna, S.R.M.[S.R. Mahadeva], Dandapat, S.[Samarendra], Rufiner, H.L.[H. Leonardo], Milone, D.H.[Diego H.],
Feature optimisation for stress recognition in speech,
PRL(84), No. 1, 2016, pp. 1-7.
Elsevier DOI 1612
Evolutionary algorithms BibRef

Sun, P., Qin, J.,
Low-Rank and Sparsity Analysis Applied to Speech Enhancement Via Online Estimated Dictionary,
SPLetters(23), No. 12, December 2016, pp. 1862-1866.
IEEE DOI 1612
expectation-maximisation algorithm BibRef

Jukic, A., van Waterschoot, T., Doclo, S.,
Adaptive Speech Dereverberation Using Constrained Sparse Multichannel Linear Prediction,
SPLetters(24), No. 1, January 2017, pp. 101-105.
IEEE DOI 1702
minimisation BibRef

Jiao, Y., Berisha, V., Liss, J., Hsu, S.C., Levy, E., McAuliffe, M.,
Articulation Entropy: An Unsupervised Measure of Articulatory Precision,
SPLetters(24), No. 4, April 2017, pp. 485-489.
IEEE DOI 1704
Acoustic measurements BibRef

Airaksinen, M., Bollepalli, B., Pohjalainen, J., Alku, P.,
Glottal Vocoding With Frequency-Warped Time-Weighted Linear Prediction,
SPLetters(24), No. 4, April 2017, pp. 446-450.
IEEE DOI 1704
speech coding BibRef

Chetupalli, S.R., Sreenivas, T.V.,
Joint Bayesian Estimation of Time-Varying LP Parameters and Excitation for Speech,
SPLetters(24), No. 4, April 2017, pp. 357-361.
IEEE DOI 1704
Gaussian processes BibRef

Chollet, M., Scherer, S.,
Assessing Public Speaking Ability from Thin Slices of Behavior,
FG17(310-316)
IEEE DOI 1707
Feature extraction, Interviews, Public speaking, Speech, Training, Videos, Visualization BibRef

de-la-Calle-Silos, F., Stern, R.M.,
Synchrony-Based Feature Extraction for Robust Automatic Speech Recognition,
SPLetters(24), No. 8, August 2017, pp. 1158-1162.
IEEE DOI 1708
feature extraction, speech recognition, auditory-nerve activity, auditory-nerve firings, automatic speech recognition system robustness enhancement, feature extraction schemes, generalized synchrony detector, multiple standard speech databases, noise removal, noise suppression, putative synchrony, robust automatic speech recognition, synchrony-based feature extraction, temporal pattern model application, temporal patterns, Databases, Feature extraction, Frequency synchronization, Mel frequency cepstral coefficient, Robustness, Speech, Speech recognition, Auditory modeling, auditory synchrony, feature extraction, physiological modeling, robust, speech, recognition BibRef

Zhang, Q., Chen, Z., Yin, F.,
Speaker Tracking Based on Distributed Particle Filter in Distributed Microphone Networks,
SMCS(47), No. 9, September 2017, pp. 2433-2443.
IEEE DOI 1708
Bayes methods, Cybernetics, Estimation, Kalman filters, Microphones, Particle filters, Reverberation, Average consensus filter, distributed microphone networks, distributed particle filter (DPF), multiple-hypothesis model, speaker tracking. BibRef

Ávila, F.R., Tcheou, M.P., Biscainho, L.W.P.,
Audio Soft Declipping Based on Constrained Weighted Least Squares,
SPLetters(24), No. 9, September 2017, pp. 1348-1352.
IEEE DOI 1708
Cost function, Discrete cosine transforms, Frequency-domain analysis, Nonlinear distortion, Predistortion, Speech, Audio declipping, nonlinear signal processing, sparsity, weighted least squares (WLS) BibRef

Huang, Z.[Zhen], Siniscalchi, S.M.[Sabato Marco], Lee, C.H.[Chin-Hui],
Hierarchical Bayesian combination of plug-in maximum a posteriori decoders in deep neural networks-based speech recognition and speaker adaptation,
PRL(98), No. 1, 2017, pp. 1-7.
Elsevier DOI 1710
System, combination BibRef

Reddy, C.K.A.[C. Karadagur Ananda], Shankar, N., Bhat, G.S.[G. Shreedhar], Charan, R., Panahi, I.,
An Individualized Super-Gaussian Single Microphone Speech Enhancement for Hearing Aid Users With Smartphone as an Assistive Device,
SPLetters(24), No. 11, November 2017, pp. 1601-1605.
IEEE DOI 1710
hearing aids, maximum likelihood estimation, signal denoising, BibRef


Zhang, L., Chen, J.[Jiaxu], Luo, Y.[You], Fu, J.F.[Jia-Fei], Ye, Z.F.[Zhong-Fu],
Supervised single-channel speech dereverberation and denoising using a two-stage processing,
ICIVC17(818-822)
IEEE DOI 1708
Adaptive filters, Noise measurement, Speech, non-negative matrix factorization, room impulse response, speech dereverberation and denoising, two-stage, processing BibRef

Bedoui, A., Ben Jebara, S.,
On the use of opening phase slopes of the glottal signal to characterize unilateral vocal folds paralysis,
ISIVC16(41-46)
IEEE DOI 1704
Estimation BibRef

Ben Ali, F., Djaziri-Larbi, S.,
A very low bit rate codec for wide band speech based on a long-term perceptual harmonic plus noise model,
ISIVC16(71-76)
IEEE DOI 1704
Bit rate BibRef

Ferreira, A.,
Implantation of voicing on whispered speech using frequency-domain parametric modelling of source and filter information,
ISIVC16(159-166)
IEEE DOI 1704
Estimation BibRef

Pozzebon, A.[Alessandro], Biliotti, F.[Francesca], Calamai, S.[Silvia],
Places Speaking with Their Own Voices. A Case Study from the Gra.fo Archives,
EuroMed16(II: 232-239).
Springer DOI 1611
BibRef

Vlaj, D., Kos, M., Kacic, Z.,
Quick and efficient definition of hangbefore and hangover criteria for voice activity detection,
WSSIP16(1-4)
IEEE DOI 1608
speech processing BibRef

Ballesteros L, D.M.[Dora M.], Renza, D.[Diego], Camacho, S.[Steven],
High Scrambling Degree in Audio Through Imitation of an Unintelligible Signal,
MCPR16(251-259).
Springer DOI 1608
BibRef

Onchis, D.M.[Darian M.], Real, P.[Pedro],
On Homotopy Continuation for Speech Restoration,
CTIC16(152-156).
Springer DOI 1608
BibRef

Dubey, M.L., Shultz, P.F., Kenyon, G.T.,
Learning phase-rich features from streaming auditory images,
Southwest16(73-76)
IEEE DOI 1605
Convolution BibRef

Nagy, G.[George], Nagy, N.[Naomi],
Tongue in Cheek,
CIAP15(I:332-342).
Springer DOI 1511
For phonetics, linguistics. BibRef

Montalvo, A.[Ana], Costa, Y.M.G.[Yandre M. G.], Calvo, J.R.[José Ramón],
Language Identification Using Spectrogram Texture,
CIARP15(543-550).
Springer DOI 1511
BibRef

Aizezi, Y.[Yasen], Jamal, A.[Anwar], Mamat, D.[Dilxat], Abdurexit, R.[Ruxianguli], Ubul, K.[Kurban],
Analytical Method and Research of Uyghur Language Chunks Based on Digital Forensics,
ISCA15(258-266).
Springer DOI 1511
BibRef

Hammami, N., Bedda, M., Farah, N., Mansouri, S.,
R-Letter disorder diagnosis (R-LDD): Arabic speech database development for automatic diagnosis of childhood speech disorders (Case study),
ISCV15(1-7)
IEEE DOI 1506
acoustic signal processing BibRef

Nakajima, J.[Jiro], Kimura, A.[Akisato], Sugimoto, A.[Akihiro], Kashino, K.[Kunio],
Visual Attention Driven by Auditory Cues,
MMMod15(II: 74-86).
Springer DOI 1501
BibRef

Ishikura, K.[Kazumasa], Uemura, A.[Aiko], Katto, J.[Jiro],
Live Version Identification with Audio Scene Detection,
MMMod15(I: 408-417).
Springer DOI 1501
BibRef

Xie, S.B.[Song-Bo], Yang, Y.H.[Yu-Hong], Hu, R.M.[Rui-Min], Wang, Y.[Yanye], Yu, H.J.[Hong-Jiang], Dong, S.L.[Shao-Long], Gao, L.[Li], Yang, C.[Cheng],
Signal-Aware Parametric Quality Model for Audio and Speech over IP Networks,
MMMod15(I: 487-497).
Springer DOI 1501
BibRef

Xue, L.[Like], Su, F.[Feng],
Auditory Scene Classification with Deep Belief Network,
MMMod15(I: 348-359).
Springer DOI 1501
BibRef

Tu, M.[Ming], Xie, X.[Xiang], Na, X.Y.[Xing-Yu],
Computational Auditory Scene Analysis Based Voice Activity Detection,
ICPR14(797-802)
IEEE DOI 1412
Feature extraction BibRef

Lu, T.[Tong], Weng, Y.B.[Yang-Bing], Wang, G.Y.[Gong-You],
Audiotory Movie Summarization by Detecting Scene Changes and Sound Events,
ICPR14(756-760)
IEEE DOI 1412
Awards activities BibRef

Nguyen-Son, H.Q.[Hoang-Quoc], Hoang, A.T.[Anh-Tu], Tran, M.T.[Minh-Triet], Yoshiura, H.[Hiroshi], Sonehara, N.[Noboru], Echizen, I.[Isao],
Anonymizing Temporal Phrases in Natural Language Text to be Posted on Social Networking Services,
IWDW13(437-451).
Springer DOI 1407
BibRef

Maka, T.[Tomasz], Dziurzanski, P.[Piotr],
Feature contours fusion for determining segment boundaries in audio data,
WSSIP14(111-114) 1406
Educational institutions BibRef

Souza, D.[Danilo], Saturnino, L.[Levi], Maciel, A.M.A.[Alexandre M.A.],
A portability evaluation of Brazilian Portuguese voices produced with MARY TTS,
WSSIP14(95-98) 1406
BibRef

Frid, A.[Alex], Lavner, Y.[Yizhar],
Spectral and textural features for automatic classification of fricatives using SVM,
WSSIP14(99-102) 1406
Auditory system BibRef

Savchenko, A.V.[Andrey V.],
Semi-automated Speaker Adaptation: How to Control the Quality of Adaptation?,
ICISP14(638-646).
Springer DOI 1406
BibRef

Merazka, F.[Fatiha],
Wideband Speech Encryption Based Arnold Cat Map for AMR-WB G.722.2 Codec,
ICISP14(658-664).
Springer DOI 1406
BibRef

Souli, S.[Sameh], Lachiri, Z.[Zied], Kuznietsov, A.[Alexander],
Using Three Reassigned Spectrogram Patches and Log-Gabor Filter for Audio Surveillance Application,
CIARP13(I:527-534).
Springer DOI 1311
BibRef

Joseph, S.M.[Shijo M.], Babu, A.P.[Anto P.],
Continuous speech coding using coiflets wavelet,
ICSIPR13(253-257).
IEEE DOI 1304
BibRef

Nivedita, D.[Deshpande], Kavita, T.[Thakur], Zadgaonkar, A.S.,
First degree heart block determination from speech analysis,
ICSIPR13(103-106).
IEEE DOI 1304
BibRef

Sadjadi, S.O., Hansen, J.H.L.,
Unsupervised Speech Activity Detection Using Voicing Measures and Perceptual Spectral Flux,
SPLetters(20), No. 3, March 2013, pp. 197-200.
IEEE DOI 1303
BibRef

Zhang, L.[Long], Li, H.F.[Hai-Feng], Ma, L.[Lin],
An adaptive unsupervised clustering of pronunciation errors for automatic pronunciation error detection,
ICPR12(1521-1525).
WWW Link. 1302
BibRef

Rosales-Pérez, A.[Alejandro], Reyes-García, C.A.[Carlos A.], Gonzalez, J.A.[Jesus A.], Arch-Tirado, E.[Emilio],
Infant Cry Classification Using Genetic Selection of a Fuzzy Model,
CIARP12(212-219).
Springer DOI 1209
BibRef

González, D.C.[Diana Cristina], Ling, L.L.[Lee Luan], oViolaro, F.[Fábio],
Analysis of the Multifractal Nature of Speech Signals,
CIARP12(740-748).
Springer DOI 1209
BibRef

Tanveer, S.[Saad], Muhammad, A.[Aslam], Martinez-Enriquez, A.M., Escalada-Imaz, G.,
Phonetic Unification of Multiple Accents for Spanish and Arabic Languages,
MCPR12(323-333).
Springer DOI 1208
BibRef

Falek, L.[Leila], Teffahi, H.[Hocine], Djeradi, A.[Amar],
Methodology for Acoustic Characterization of a Labial Constraint in Speech Production,
ICISP12(131-141).
Springer DOI 1208
BibRef

Krum, D.M.[David M.], Suma, E.A.[Evan A.], Bolas, M.[Mark],
Spatial misregistration of virtual human audio: Implications of the precedence effect,
3DUI12(147-148).
IEEE DOI 1204
BibRef

Yang, Y.J.[Ying-Jie], Zhang, H.H.[Huan-Huan], Guo, X.[Xiue],
A pitch tracking method mixing ACF and AMDF algorithms based on correlations,
IASP11(553-556).
IEEE DOI 1112
autocorrelation functions; average magnitude difference functions. Speech BibRef

Guo, S.[Shuni], Gao, L.[Lu], Yu, H.Z.[Hong-Zhi],
Research on Lhasa Tibetan prosodic model of journalese based on respiratory signal,
IASP11(26-30).
IEEE DOI 1112
BibRef

Resmi, K., Kumar, S.[Satish], Sardana, H.K., Chhabra, R.[Radhika],
Graphical Speech Training system for hearing impaired,
ICIIP11(1-6).
IEEE DOI 1112
BibRef

Gómez, J.A.[Jon Ander], Calvo, M.[Marcos],
Improvements on Automatic Speech Segmentation at the Phonetic Level,
CIARP11(557-564).
Springer DOI 1111
BibRef

Le, P.N.[Phu Ngoc], Epps, J.[Julien], Choi, E.H.C.[Eric H.C.], Ambikairajah, E.[Eliathamby],
A Study of Voice Source and Vocal Tract Filter Based Features in Cognitive Load Classification,
ICPR10(4516-4519).
IEEE DOI 1008
BibRef

Stark, M.[Michael], Wohlmayr, M.[Michael], Pernkopf, F.[Franz],
Single Channel Speech Separation Using Source-Filter Representation,
ICPR10(826-829).
IEEE DOI 1008
BibRef

Stadelmann, T.[Thilo], Wang, Y.H.[Ying-Hui], Smith, M.[Matthew], Ewerth, R.[Ralph], Freisleben, B.[Bernd],
Rethinking Algorithm Design and Development in Speech Processing,
ICPR10(4476-4479).
IEEE DOI 1008
BibRef

Gonzalez-Caravaca, G.[Guillermo], Toledano, D.T.[Doroteo Torre], Puertas, M.[Maria],
Phone-Conditioned Suboptimal Wiener Filtering,
ICPR10(4480-4483).
IEEE DOI 1008
BibRef

Sepehr, H.[Hamid], Nooralahiyan, A.Y.[Amir Y.], Brennan, P.V.[Paul V.],
Improving Performance of a Noise Reduction Algorithm by Switching the Analysis Filter Bank,
ICISP10(262-271).
Springer DOI 1006
for speech BibRef

Kos, M., Grasic, M., Vlaj, D., Kacic, Z.,
On-Line Speech/Music Segmentation for Broadcast News Domain,
WSSIP09(1-4).
IEEE DOI 0906
BibRef

Grasic, M., Kos, M., Vlaj, D., Kacic, Z.,
The Influence of Speech/Non-Speech Segmentation on On-Line and Off-Line Speaker Segmentation Accuracy,
WSSIP09(1-4).
IEEE DOI 0906
BibRef

Zuta, V.[Vivien],
Voice Pleasantness of Female Voices and the Assessment of Physical Characteristics,
COST08(116-125).
Springer DOI 0810
BibRef

Pignotti, A.[Alessio], Marcozzi, D.[Daniele], Cifani, S.[Simone], Squartini, S.[Stefano], Piazza, F.[Francesco],
A Blind Source Separation Based Approach for Speech Enhancement in Noisy and Reverberant Environment,
COST08(356-367).
Springer DOI 0810
BibRef

Stadelmann, T., Heinzl, S., Unterberger, M., Freisleben, B.,
WebVoice: A Toolkit for Perceptual Insights into Speech Processing,
CISP09(1-5).
IEEE DOI 0910
BibRef

Tang, Y.B.[Yi-Bin], Huang, R.[Rong], Wu, Z.Y.[Zhen-Yang],
A 2.4kbps Multiband Characteristic Waveform Interpolation Speech Coding Algorithm,
CISP09(1-4).
IEEE DOI 0910
BibRef

Zou, X.[Xia], Zhang, X.W.[Xiong-Wei],
A 450bps Speech Coding Algorithm Based on Multi-Mode Matrix Quantization,
CISP09(1-3).
IEEE DOI 0910
BibRef

Kuhnapfel, T.[Thorsten], Tan, T.[Tele], Venkatesh, S.[Svertha], Igel, B.[Burkhard],
Distributed Audio Network for Speech Enhancement in Challenging Noise Backgrounds,
AVSBS09(308-313).
IEEE DOI 0909
BibRef

Kuhnapfel, T.[Thorsten], Tan, T.[Tele], Venkatesh, S.[Svetha], Nordholm, S.E.[Sven Erik], Igel, B.[Burkhard],
Adaptive speech enhancement with varying noise backgrounds,
ICPR08(1-4).
IEEE DOI 0812
BibRef

Li, X.K.[Xiao-Kun], Deng, Y.[Yunbin],
Combining speech energy and edge information for fast and efficient voice activity detection in noisy environments,
ICPR08(1-4).
IEEE DOI 0812
BibRef

Kukharchik, P., Kheidorov, I., Bovbel, E., Ladeev, D.,
Speech Signal Processing Based on Wavelets and SVM for Vocal Tract Pathology Detection,
ICISP08(192-199).
Springer DOI 0807
BibRef

Nagesha, Kumar, G.H.[G. Hemantha],
Signal Resampling Technique Combining Level Crossing and Auditory Features,
PReMI07(447-454).
Springer DOI 0712
BibRef

Ferrer, C.A.[Carlos A.], González, E.[Eduardo], Hernández-Díaz, M.E.[María E.],
Evaluation of Time and Frequency Domain-Based Methods for the Estimation of Harmonics-to-Noise-Ratios in Voice Signals,
CIARP06(406-415).
Springer DOI 0611
BibRef

Li, W.H.[Wei-Hong], Liu, M.[Ming], Zhu, Z.G.[Zhi-Gang], Huang, T.S.[Thomas S.],
LDV Remote Voice Acquisition and Enhancement,
ICPR06(IV: 262-265).
IEEE DOI 0609
BibRef

Xue, W.[Wei], Du, S.[Sidan], Fang, C.Z.[Cheng-Zhi], Ye, Y.X.[Ying-Xian],
Voice Activity Detection Using Wavelet-Based Multiresolution Spectrum and Support Vector Machines and Audio Mixing Algorithm,
CVHCI06(78-88).
Springer DOI 0605
BibRef

García-Perera, L.P.[L. Paola], Nolazco-Flores, J.A.[Juan A.], Mex-Perera, C.[Carlos],
Cryptographic-Speech-Key Generation Architecture Improvements,
IbPRIA05(II:579).
Springer DOI 0509
BibRef

Welk, M.[Martin], Bergmeister, A.[Achim], Weickert, J.[Joachim],
Denoising of Audio Data by Nonlinear Diffusion,
ScaleSpace05(598-609).
Springer DOI 0505
BibRef

Cristani, M., Bicego, M., Murino, V.,
On-line adaptive background modelling for audio surveillance,
ICPR04(II: 399-402).
IEEE DOI 0409
BibRef

Chapter on New Unsorted Entries, and Other Miscellaneous Papers continues in
Speech Synthesis, Synthetic Speech .


Last update:Nov 11, 2017 at 13:31:57