24.1.15.3 Speech Synthesis, Synthetic Speech

Chapter Contents (Back)
Speech. Speech Synthesis. Synthesis, Speech.

Yeh, C.Y., Hwang, S.H.,
Efficient text analyser with prosody generator-driven approach for Mandarin text-to-speech,
VISP(152), No. 6, December 2005, pp. 793-799.
DOI Link 0512
BibRef

Chouireb, F.[Fatima], Guerti, M.[Mhania],
Towards a high quality Arabic speech synthesis system based on neural networks and residual excited vocal tract model,
SIViP(2), No. 1, January 2008, pp. 73-87.
Springer DOI 0712
BibRef

Elfitri, I., Gunel, B., Kondoz, A.M.,
Multichannel Audio Coding Based on Analysis by Synthesis,
PIEEE(99), No. 4, April 2011, pp. 657-670.
IEEE DOI 1103
Part of 3-D display series. BibRef

Jung, C.S.[Chi-Sang], Joo, Y.S.[Young-Sun], Kang, H.G.[Hong-Goo],
Waveform Interpolation-Based Speech Analysis/Synthesis for HMM-Based TTS Systems,
SPLetters(19), No. 12, December 2012, pp. 809-812.
IEEE DOI 1212
BibRef

Carmona, J.L., Barker, J., Gomez, A.M., Ma, N.[Ning],
Speech Spectral Envelope Enhancement by HMM-Based Analysis/Resynthesis,
SPLetters(20), No. 6, 2013, pp. 563-566.
IEEE DOI speech enhancement 1307
BibRef

Tokuda, K., Nankaku, Y., Toda, T., Zen, H., Yamagishi, J., Oura, K.,
Speech Synthesis Based on Hidden Markov Models,
PIEEE(100), No. 5, May 2013, pp. 1234-1252.
IEEE DOI 1305
BibRef

Ling, Z., Kang, S., Zen, H., Senior, A., Schuster, M., Qian, X., Meng, H., Deng, L.,
Deep Learning for Acoustic Modeling in Parametric Speech Generation: A systematic review of existing techniques and future trends,
SPMag(32), No. 3, May 2015, pp. 35-52.
IEEE DOI 1504
Acoustic signal detection BibRef

Bordel, G., Penagarikano, M., Rodriguez-Fuentes, L.J., Alvarez, A., Varona, A.,
Probabilistic Kernels for Improved Text-to-Speech Alignment in Long Audio Tracks,
SPLetters(23), No. 1, January 2016, pp. 126-129.
IEEE DOI 1601
Acoustics BibRef

Ninh, D.K.[Duy Khanh], Yamashita, Y.[Yoichi],
F0 Parameterization of Glottalized Tones in HMM-Based Speech Synthesis for Hanoi Vietnamese,
IEICE(E98-D), No. 12, December 2015, pp. 2280-2289.
WWW Link. 1601
BibRef

Erro, D.,
Two-Band Radial Postfiltering in Cepstral Domain with Application to Speech Synthesis,
SPLetters(23), No. 2, February 2016, pp. 202-206.
IEEE DOI 1602
filtering theory BibRef

Hu, Y.J., Ling, Z.H.,
DBN-based Spectral Feature Representation for Statistical Parametric Speech Synthesis,
SPLetters(23), No. 3, March 2016, pp. 321-325.
IEEE DOI 1603
belief networks BibRef

Tsiaras, V., Maia, R., Diakoloukas, V., Stylianou, Y., Digalakis, V.,
Global Variance in Speech Synthesis With Linear Dynamical Models,
SPLetters(23), No. 8, August 2016, pp. 1057-1061.
IEEE DOI 1608
speech synthesis BibRef

Wang, F.Z.[Fang-Zhou], Nagano, H.[Hidehisa], Kashino, K.[Kunio], Igarashi, T.[Takeo],
Visualizing Video Sounds With Sound Word Animation to Enrich User Experience,
MultMed(19), No. 2, February 2017, pp. 418-429.
IEEE DOI 1702
BibRef

Sharma, B., Prasanna, S.R.M.,
Enhancement of Spectral Tilt in Synthesized Speech,
SPLetters(24), No. 4, April 2017, pp. 382-386.
IEEE DOI 1704
speech enhancement BibRef

Singh, R.[Rita], Jiménez, A.[Abelino], Řland, A.[Anders],
Voice disguise by mimicry: deriving statistical articulometric evidence to evaluate claimed impersonation,
IET-Bio(6), No. 4, July 2017, pp. 282-289.
DOI Link 1707
BibRef

Lee, K.S.,
Restricted Boltzmann Machine-Based Voice Conversion for Nonparallel Corpus,
SPLetters(24), No. 8, August 2017, pp. 1103-1107.
IEEE DOI 1708
Boltzmann machines, probability, speaker recognition, OGI VOICES corpus, conversion function, linear transformation, parallel training corpus. BibRef

Reddy, M.K., Rao, K.S.,
Robust Pitch Extraction Method for the HMM-Based Speech Synthesis System,
SPLetters(24), No. 8, August 2017, pp. 1133-1137.
IEEE DOI 1708
feature extraction, hidden Markov models, speech synthesis, wavelet transforms, CMU Arctic and Keele databases, HMM-based speech synthesis system, continuous wavelet transform coefficients, hidden Markov model-based HTS, pitch estimation, pitch tracking, robust pitch extraction method, speech representation, BibRef

Liu, Z.C., Ling, Z.H., Dai, L.R.,
Statistical Parametric Speech Synthesis Using Generalized Distillation Framework,
SPLetters(25), No. 5, May 2018, pp. 695-699.
IEEE DOI 1805
Fourier transforms, acoustic signal processing, learning (artificial intelligence), recurrent neural nets, speech synthesis BibRef

Drugman, T., Huybrechts, G., Klimkov, V., Moinet, A.,
Traditional Machine Learning for Pitch Detection,
SPLetters(25), No. 11, November 2018, pp. 1745-1749.
IEEE DOI 1811
acoustic signal processing, estimation theory, feature extraction, learning (artificial intelligence), speech synthesis BibRef

Arik, S.Ö., Jun, H., Diamos, G.,
Fast Spectrogram Inversion Using Multi-Head Convolutional Neural Networks,
SPLetters(26), No. 1, January 2019, pp. 94-98.
IEEE DOI 1901
audio signal processing, feedforward neural nets, interpolation, iterative methods, learning (artificial intelligence), speech synthesis BibRef

Masuyama, Y., Yatabe, K., Oikawa, Y.,
Griffin-Lim Like Phase Recovery via Alternating Direction Method of Multipliers,
SPLetters(26), No. 1, January 2019, pp. 184-188.
IEEE DOI 1901
acoustic signal processing, iterative methods, optimisation, subjective test, objective measure, ADMM, signal recovery, STFT-based speech synthesis BibRef

Kwon, O., Jang, I., Ahn, C., Kang, H.,
An Effective Style Token Weight Control Technique for End-to-End Emotional Speech Synthesis,
SPLetters(26), No. 9, September 2019, pp. 1383-1387.
IEEE DOI 1909
Speech synthesis, Spectrogram, Training, Decoding, Aerospace electronics, Acoustics, Vocoders, emotion weight values BibRef


Wong, A., Xu, A., Dudek, G.,
Investigating Trust Factors in Human-Robot Shared Control: Implicit Gender Bias Around Robot Voice,
CRV19(195-200)
IEEE DOI 1908
Robots, Measurement, Task analysis, Drones, Graphical user interfaces, Uncertainty, Psychology, trust, gender bias BibRef

Yang, M., Zhang, D., Tao, J.,
Reducing Tongue Shape Dimensionality from Hundreds of Available Resources Using Autoencoder,
ICPR18(2875-2880)
IEEE DOI 1812
Tongue, Shape, Strain, Dimensionality reduction, Training, Image reconstruction, Noise reduction, vocal tract, tongue shape, neural network BibRef

Xiao, L., Wang, Z.,
Dense Convolutional Recurrent Neural Network for Generalized Speech Animation,
ICPR18(633-638)
IEEE DOI 1812
Feature extraction, Animation, Acoustics, Decoding, Visualization, Logic gates, Hidden Markov models BibRef

Shah, N.J.[Nirmesh J.], Patil, H.A.[Hemant A.],
Analysis of Features and Metrics for Alignment in Text-Dependent Voice Conversion,
PReMI17(299-307).
Springer DOI 1711
BibRef

Rybárová, R., Drozd, I., Rozinaj, G.,
GUI for interactive speech synthesis,
WSSIP16(1-4)
IEEE DOI 1608
XML BibRef

Coto-Jiménez, M.[Marvin], Goddard-Close, J.[John],
LSTM Deep Neural Networks Postfiltering for Improving the Quality of Synthetic Voices,
MCPR16(280-289).
Springer DOI 1608
BibRef

Vasek, M., Rozinaj, G., Rybárová, R.,
Letter-To-Sound conversion for speech synthesizer,
WSSIP16(1-4)
IEEE DOI 1608
speech processing BibRef

Rybarová, R., del Corral, G., Rozinaj, G.,
Diphone spanish text-to-speech synthesizer,
WSSIP15(121-124)
IEEE DOI 1603
natural language processing BibRef

Verma, R., Sarkar, P., Rao, K.S.,
Conversion of neutral speech to storytelling style speech,
ICAPR15(1-6)
IEEE DOI 1511
natural language processing BibRef

Wang, Y.[Yang], Tao, J.H.[Jian-Hua], Yang, M.H.[Ming-Hao], Li, Y.[Ya],
Extended Decision Tree with or Relationship for HMM-Based Speech Synthesis,
ACPR13(225-229)
IEEE DOI 1408
decision trees BibRef

Gao, L.[Lu], Yu, H.Z.[Hong-Zhi], Zhang, J.H.[Jins-Huang], Fang, H.P.[Hua-Ping],
Research on HMM_based speech synthesis for Lhasa dialect,
IASP11(429-433).
IEEE DOI 1112
BibRef

Chakraborty, R.[Rupayan], Garain, U.[Utpal],
Role of Synthetically Generated Samples on Speech Recognition in a Resource-Scarce Language,
ICPR10(1618-1621).
IEEE DOI 1008
BibRef

Rao, K.S.[K. Sreenivasa], Maity, S.[Sudhamay], Taru, A.[Amol], Koolagudi, S.G.[Shashidhar G.],
Unit Selection Using Linguistic, Prosodic and Spectral Distance for Developing Text-to-Speech System in Hindi,
PReMI09(531-536).
Springer DOI 0912
BibRef

Bahrampour, A.[Anvar], Barkhoda, W.[Wafa], Azami, B.Z.[Bahram Zahir],
Implementation of Three Text to Speech Systems for Kurdish Language,
CIARP09(321-328).
Springer DOI 0911
BibRef

Shirbahadurkar, S.D., Bormane, D.S.,
Marathi Language Speech Synthesizer Using Concatenative Synthesis Strategy (Spoken in Maharashtra, India),
ICMV09(181-185).
IEEE DOI 0912
BibRef

Tucková, J.[Jana], Holub, J.[Jan], Dubeda, T.[Tomáš],
Technical and Phonetic Aspects of Speech Quality Assessment: The Case of Prosody Synthesis,
COST08(126-132).
Springer DOI 0810
BibRef

Bauer, D.[Dominik], Kannampuzha, J.[Jim], Kröger, B.J.[Bernd J.],
Articulatory Speech Re-synthesis: Profiting from Natural Acoustic Speech Data,
COST08(344-355).
Springer DOI 0810
BibRef

Gu, H.Y.[Hung-Yan], Cai, C.L.[Chen-Lin], Cai, S.F.[Song-Fong],
An HNM-Based Speaker-Nonspecific Timbre Transformation Scheme for Speech Synthesis,
CISP09(1-5).
IEEE DOI 0910
BibRef

Chapter on New Unsorted Entries, and Other Miscellaneous Papers continues in
Speaker Verification, Speaker Identification .


Last update:Nov 7, 2019 at 15:08:56