-
1
-
-
85009139544
-
Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
-
Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, and Tadashi Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis, " in Proc. of Interspeech, 1999, pp. 2347-2350.
-
(1999)
Proc. of Interspeech
, pp. 2347-2350
-
-
Yoshimura, T.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
2
-
-
67651002140
-
Statistical parametric speech synthesis
-
Heiga Zen, Keiichi Tokuda, and Alan W. Black, "Statistical parametric speech synthesis, " Speech Communication, vol. 51, no. 11, pp. 1039-1064, 2009.
-
(2009)
Speech Communication
, vol.51
, Issue.11
, pp. 1039-1064
-
-
Zen, H.1
Tokuda, K.2
Black, A.W.3
-
3
-
-
84876687945
-
Speech synthesis based on hidden markov models
-
May
-
Keiichi Tokuda, Yoshihiko Nankaku, Tomoki Toda, Heiga Zen, Junichi Yamagishi, and Keiichiro Oura, "Speech synthesis based on hidden markov models, " Proceedings of the IEEE, vol. 101, no. 5, pp. 1234-1252, May 2013.
-
(2013)
Proceedings of the IEEE
, vol.101
, Issue.5
, pp. 1234-1252
-
-
Tokuda, K.1
Nankaku, Y.2
Toda, T.3
Zen, H.4
Yamagishi, J.5
Oura, K.6
-
4
-
-
84890490547
-
Statistical parametric speech synthesis using deep neural networks
-
May
-
Heiga Zen, Andrew Senior, and Mike Schuster, "Statistical parametric speech synthesis using deep neural networks, " in Proc. of ICASSP, May 2013, pp. 7962-7966.
-
(2013)
Proc. of ICASSP
, pp. 7962-7966
-
-
Zen, H.1
Senior, A.2
Schuster, M.3
-
5
-
-
85032750981
-
Deep learning for acoustic modeling in parametric speech generation: A systematic review of existing techniques and future trends
-
May
-
Zhen-Hua Ling, Shi-Yin Kang, Heiga Zen, Andrew Senior, Mike Schuster, Xiao-Jun Qian, Helen Meng, and Li Deng, "Deep learning for acoustic modeling in parametric speech generation: A systematic review of existing techniques and future trends, " Signal Processing Magazine, IEEE, vol. 32, no. 3, pp. 35-52, May 2015.
-
(2015)
Signal Processing Magazine, IEEE
, vol.32
, Issue.3
, pp. 35-52
-
-
Ling, Z.1
Kang, S.2
Zen, H.3
Senior, A.4
Schuster, M.5
Qian, X.6
Meng, H.7
Deng, L.8
-
6
-
-
0032673049
-
Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
-
Hideki Kawahara, Ikuyo Masuda-Katsuse, and Alain De Cheveigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds, " Speech communication, vol. 27, no. 3, pp. 187-207, 1999.
-
(1999)
Speech Communication
, vol.27
, Issue.3
, pp. 187-207
-
-
Kawahara, H.1
Masuda-Katsuse, I.2
De Cheveigne, A.3
-
7
-
-
84874199000
-
Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system straight
-
Hideki Kawahara, Jo Estill, and Osamu Fujimura, "Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system straight, " in MAVEBA, 2001.
-
(2001)
MAVEBA
-
-
Kawahara, H.1
Estill, J.2
Fujimura, O.3
-
8
-
-
0032638660
-
On phase perception in speech
-
Mar
-
Harald Pobloth and W. Bastiaan Kleijn, "On phase perception in speech, " in Proc. of ICASSP, Mar 1999, vol. 1, pp. 29-32 vol. 1.
-
(1999)
Proc. of ICASSP
, vol.1
, pp. 29-32
-
-
Pobloth, H.1
Bastiaan Kleijn, W.2
-
9
-
-
84959096758
-
Phase perception of the glottal excitation of vocoded speech
-
Dresden, September
-
Tuomo Raitio, Lauri Juvela, Antti Suni, Martti Vainio, and Paavo Alku, "Phase perception of the glottal excitation of vocoded speech, " in Proc. of Interspeech, Dresden, September 2015, pp. 254-258.
-
(2015)
Proc. of Interspeech
, pp. 254-258
-
-
Raitio, T.1
Juvela, L.2
Suni, A.3
Vainio, M.4
Alku, P.5
-
10
-
-
84867209230
-
HMM-based Finnish text-to-speech system utilizing glottal inverse filtering
-
Brisbane, Australia, September
-
Tuomo Raitio, Antti Suni, Hannu Pulakka, Martti Vainio, and Paavo Alku, "HMM-based Finnish text-to-speech system utilizing glottal inverse filtering, " in Proc. of Interspeech, Brisbane, Australia, September 2008, pp. 1881-1884.
-
(2008)
Proc. of Interspeech
, pp. 1881-1884
-
-
Raitio, T.1
Suni, A.2
Pulakka, H.3
Vainio, M.4
Alku, P.5
-
11
-
-
77957744515
-
HMMbased speech synthesis utilizing glottal inverse filtering
-
January
-
Tuomo Raitio, Antti Suni, Junichi Yamagishi, Hannu Pulakka, Jani Nurminen, Martti Vainio, and Paavo Alku, "HMMbased speech synthesis utilizing glottal inverse filtering, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 19, no. 1, pp. 153-165, January 2011.
-
(2011)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.19
, Issue.1
, pp. 153-165
-
-
Raitio, T.1
Suni, A.2
Yamagishi, J.3
Pulakka, H.4
Nurminen, J.5
Vainio, M.6
Alku, P.7
-
12
-
-
84865755765
-
The GlottHMM speech synthesis entry for Blizzard Challenge 2010
-
Kyoto, Japan, September
-
Antti Suni, Tuomo Raitio, Martti Vainio, and Paavo Alku, "The GlottHMM speech synthesis entry for Blizzard Challenge 2010, " in Blizzard Challenge 2010 Workshop, Kyoto, Japan, September 2010.
-
(2010)
Blizzard Challenge 2010 Workshop
-
-
Suni, A.1
Raitio, T.2
Vainio, M.3
Alku, P.4
-
13
-
-
84911869827
-
Voice source modelling using deep neural networks for statistical parametric speech synthesis
-
Lisbon, Portugal, September
-
Tuomo Raitio, Heng Lu, John Kane, Antti Suni, Martti Vainio, Simon King, and Paavo Alku, "Voice source modelling using deep neural networks for statistical parametric speech synthesis, " in 22nd European Signal Processing Conference (EUSIPCO), Lisbon, Portugal, September 2014.
-
(2014)
22nd European Signal Processing Conference (EUSIPCO)
-
-
Raitio, T.1
Lu, H.2
Kane, J.3
Suni, A.4
Vainio, M.5
King, S.6
Alku, P.7
-
14
-
-
84910068090
-
Deep neural network based trainable voice source model for synthesis of speech with varying vocal effort
-
Singapore, September
-
Tuomo Raitio, Antti Suni, Lauri Juvela, Martti Vainio, and Paavo Alku, "Deep neural network based trainable voice source model for synthesis of speech with varying vocal effort, " in Proc. of Interspeech, Singapore, September 2014, pp. 1969-1973.
-
(2014)
Proc. of Interspeech
, pp. 1969-1973
-
-
Raitio, T.1
Suni, A.2
Juvela, L.3
Vainio, M.4
Alku, P.5
-
15
-
-
84890547237
-
Synthesis and perception of breathy, normal, and Lombard speech in the presence of noise
-
March
-
Tuomo Raitio, Antti Suni, Martti Vainio, and Paavo Alku, "Synthesis and perception of breathy, normal, and Lombard speech in the presence of noise, " Computer Speech & Language, vol. 28, no. 2, pp. 648-664, March 2014.
-
(2014)
Computer Speech & Language
, vol.28
, Issue.2
, pp. 648-664
-
-
Raitio, T.1
Suni, A.2
Vainio, M.3
Alku, P.4
-
16
-
-
84942607168
-
A deep generative architecture for postfiltering in statistical parametric speech synthesis
-
Nov
-
Ling-Hui Chen, Tuomo Raitio, Cassia Valentini-Botinhao, Zhen-Hua Ling, and Junichi Yamagishi, "A deep generative architecture for postfiltering in statistical parametric speech synthesis, " Audio, Speech, and Language Processing, IEEE/ACM Transactions on, vol. 23, no. 11, pp. 2003-2014, Nov 2015.
-
(2015)
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
, vol.23
, Issue.11
, pp. 2003-2014
-
-
Chen, L.1
Raitio, T.2
Valentini-Botinhao, C.3
Ling, Z.4
Yamagishi, J.5
-
17
-
-
84898074254
-
Quasi closed phase glottal inverse filtering analysis with weighted linear prediction
-
March
-
Manu Airaksinen, Tuomo Raitio, Brad Story, and Paavo Alku, "Quasi closed phase glottal inverse filtering analysis with weighted linear prediction, " Audio, Speech, and Language Processing, IEEE/ACM Transactions on, vol. 22, no. 3, pp. 596-607, March 2014.
-
(2014)
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
, vol.22
, Issue.3
, pp. 596-607
-
-
Airaksinen, M.1
Raitio, T.2
Story, B.3
Alku, P.4
-
18
-
-
0026881384
-
Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering
-
Eurospeech '91
-
Paavo Alku, "Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering, " Speech Communication, vol. 11, no. 2-3, pp. 109-118, 1992, Eurospeech '91.
-
(1992)
Speech Communication
, vol.11
, Issue.2-3
, pp. 109-118
-
-
Alku, P.1
-
19
-
-
84890448428
-
The GlottHMM entry for Blizzard Challenge 2011: Utilizing source unit selection in HMM-based speech synthesis for improved excitation generation
-
Turin, Italy, September
-
Antti Suni, Tuomo Raitio, Martti Vainio, and Paavo Alku, "The GlottHMM entry for Blizzard Challenge 2011: Utilizing source unit selection in HMM-based speech synthesis for improved excitation generation, " in Blizzard Challenge 2011 Workshop, Turin, Italy, September 2011.
-
(2011)
Blizzard Challenge 2011 Workshop
-
-
Suni, A.1
Raitio, T.2
Vainio, M.3
Alku, P.4
-
20
-
-
84856294347
-
Glottal inverse filtering analysis of human voice production - A review of estimation and parameterization methods of the glottal excitation and their applications. (invited article)
-
Paavo Alku, "Glottal inverse filtering analysis of human voice production-a review of estimation and parameterization methods of the glottal excitation and their applications. (invited article), " Sadhana-Academy Proceedings in Engineering Sciences, vol. 36, no. 5, pp. 623-650, 2011.
-
(2011)
Sadhana-Academy Proceedings in Engineering Sciences
, vol.36
, Issue.5
, pp. 623-650
-
-
Alku, P.1
-
21
-
-
0016495091
-
Linear prediction: A tutorial review
-
Apr
-
John Makhoul, "Linear prediction: A tutorial review, " Proceedings of the IEEE, vol. 63, no. 4, pp. 561-580, Apr 1975.
-
(1975)
Proceedings of the IEEE
, vol.63
, Issue.4
, pp. 561-580
-
-
Makhoul, J.1
-
22
-
-
84882383984
-
Formant frequency estimation of high-pitched vowels using weighted linear predictiona)
-
Paavo Alku, Jouni Pohjalainen, Martti Vainio, Anne-Maria Laukkanen, and Brad Story, "Formant frequency estimation of high-pitched vowels using weighted linear predictiona), " The Journal of the Acoustical Society of America, vol. 134, no. 2, 2013.
-
(2013)
The Journal of the Acoustical Society of America
, vol.134
, Issue.2
-
-
Alku, P.1
Pohjalainen, J.2
Vainio, M.3
Laukkanen, A.4
Story, B.5
-
23
-
-
70450198169
-
Glottal closure and opening instant detection from speech signals
-
Thomas Drugman and Thierry Dutoit, "Glottal closure and opening instant detection from speech signals., " in Proc. of Interspeech, 2009, pp. 2891-2894.
-
(2009)
Proc. of Interspeech
, pp. 2891-2894
-
-
Drugman, T.1
Dutoit, T.2
-
24
-
-
84863419425
-
Detection of glottal closure instants from speech signals: A quantitative review
-
March
-
Thomas Drugman, Mark Thomas, Jon Gudnason, Patrick Naylor, and Thierry Dutoit, "Detection of glottal closure instants from speech signals: A quantitative review, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 3, pp. 994-1006, March 2012.
-
(2012)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.20
, Issue.3
, pp. 994-1006
-
-
Drugman, T.1
Thomas, M.2
Gudnason, J.3
Naylor, P.4
Dutoit, T.5
-
26
-
-
84857819132
-
Theano: A CPU and GPU math expression compiler
-
Oral Presentation, June
-
James Bergstra, Olivier Breuleux, Frédéric Bastien, Pascal Lamblin, Razvan Pascanu, Guillaume Desjardins, Joseph Turian, David Warde-Farley, and Yoshua Bengio, "Theano: a CPU and GPU math expression compiler, " in Proc. of the Python for Scientific Computing Conference (SciPy), June 2010, Oral Presentation.
-
(2010)
Proc. of the Python for Scientific Computing Conference (SciPy)
-
-
Bergstra, J.1
Breuleux, O.2
Bastien, F.3
Lamblin, P.4
Pascanu, R.5
Desjardins, G.6
Turian, J.7
Warde-Farley, D.8
Bengio, Y.9
-
27
-
-
84897544737
-
Theano: New features and speed improvements
-
Frédéric Bastien, Pascal Lamblin, Razvan Pascanu, James Bergstra, Ian J. Goodfellow, Arnaud Bergeron, Nicolas Bouchard, and Yoshua Bengio, "Theano: new features and speed improvements, " Deep Learning and Unsupervised Feature Learning NIPS 2012 Workshop, 2012.
-
(2012)
Deep Learning and Unsupervised Feature Learning NIPS 2012 Workshop
-
-
Bastien, F.1
Lamblin, P.2
Pascanu, R.3
Bergstra, J.4
Goodfellow, I.J.5
Bergeron, A.6
Bouchard, N.7
Bengio, Y.8
-
28
-
-
85133720638
-
The HMM-based speech synthesis system version 2. 0
-
Bonn, Germany, August
-
Heiga Zen, Takashi Nose, Junichi Yamagishi, Shinji Sako, Takashi Masuko, Alan W. Black, and Keiichi Tokuda, "The HMM-based speech synthesis system version 2. 0, " in Proc. of ISCA SSW6, Bonn, Germany, August 2007, pp. 294-299.
-
(2007)
Proc. of ISCA SSW6
, pp. 294-299
-
-
Zen, H.1
Nose, T.2
Yamagishi, J.3
Sako, S.4
Masuko, T.5
Black, A.W.6
Tokuda, K.7
-
29
-
-
84973276298
-
Methods for subjective determination of transmission quality
-
800 ITU-T SG12 Geneva, Switzerland, Aug.
-
"Methods for Subjective Determination of Transmission Quality, " Recommendation P. 800, ITU-T SG12, Geneva, Switzerland, Aug. 1996.
-
(1996)
Recommendation
-
-
|