SCOPUS Information Search Platform

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

2010, Pages 4614-4617

An autoencoder neural-network based low-dimensionality approach to excitation modeling for HMM-based text-to-speech

Authors (3): Vishnubhotla, Srikanth (a); Fernandez, Raul (b); Ramabhadran, Bhuvana (b)

a UNIVERSITY OF MARYLAND (United States)

b IBM T J WATSON RESEARCH CENTER (United States)

Author keywords

Autoencoders; Excitation modeling; Hidden Markov models; Neural networks; Speech synthesis

Indexed keywords

DIGITAL STORAGE; HIDDEN MARKOV MODELS; LEARNING SYSTEMS; NEURAL NETWORKS

AUTO ENCODERS; DATA DRIVEN; EXCITATION MODELING; HIDDEN-MARKOV MODELS; HMM-BASED; HMM-TTS; LOW DIMENSIONALITY; NETWORK-BASED; NEURAL-NETWORKS; TEXT TO SPEECH

SPEECH SYNTHESIS

EID: 78049412607
PISSN: 15206149
EISSN: None
Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2010.5495546
Document Type: Conference Paper

Times cited: 16

References (10)

1
- 0011510419
- Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
- T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Eurospeech, 1999, pp. 1223-1226.
- (1999) Eurospeech , pp. 1223-1226
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

2
- 84856248349
- A trainable excitation model for HMM-based speech synthesis
- R. Maia, T. Toda, H. Zen, Y. Nankaku, and K. Tokuda, "A trainable excitation model for HMM-based speech synthesis," in Interspeech, 2007, pp. 1909-1912.
- (2007) Interspeech , pp. 1909-1912
- Maia, R.¹ Toda, T.² Zen, H.³ Nankaku, Y.⁴ Tokuda, K.⁵

3
- 84883320815
- Fundamentals and recent advances in HMM-based speech synthesis
- K. Tokuda and H. Zen, "Fundamentals and recent advances in HMM-based speech synthesis," in Interspeech, 2009.
- (2009) Interspeech
- Tokuda, K.¹ Zen, H.²

4
- 51449085178
- On the state definition for a trainable excitation model in HMM-based speech synthesis
- R. Maia, T. Toda, K. Tokuda, S. Sakai, and S. Nakamura, "On the state definition for a trainable excitation model in HMM-based speech synthesis," in ICASSP, 2008, pp. 3965-3968.
- (2008) ICASSP , pp. 3965-3968
- Maia, R.¹ Toda, T.² Tokuda, K.³ Sakai, S.⁴ Nakamura, S.⁵

5
- 82155160991
- Towards an improved modeling of the glottal source in statistical parametric speech synthesis
- J. P. Cabral, S. Renals, K. Richmond, and J. Yamagishi, "Towards an improved modeling of the glottal source in statistical parametric speech synthesis," in Proc. Sixth ISCA Workshop on Speech Synth., 2007, pp. 131-136.
- (2007) Proc. Sixth ISCA Workshop on Speech Synth. , pp. 131-136
- Cabral, J.P.¹ Renals, S.² Richmond, K.³ Yamagishi, J.⁴

6
- 67650793794
- Using a pitch-synchronous residual codebook for hybrid HMM/frame selection speech synthesis
- T. Drugman, A. Moinet, T. Dutoit, and G. Wilfart, "Using a pitch-synchronous residual codebook for hybrid HMM/frame selection speech synthesis," in ICASSP, 2009, pp. 3793-3796.
- (2009) ICASSP , pp. 3793-3796
- Drugman, T.¹ Moinet, A.² Dutoit, T.³ Wilfart, G.⁴

7
- 85009096905
- An automatic pitch marking method using wavelet transform
- M. Sakamoto and T. Saito, "An automatic pitch marking method using wavelet transform," in ICSLP, 2000, vol. 3, pp. 650-653.
- (2000) ICSLP , vol.3 , pp. 650-653
- Sakamoto, M.¹ Saito, T.²

8
- 33746600649
- Reducing the dimensionality of data with neural networks
- G. E. Hinton and R. R. Salakhutdinov, "Reducing the dimensionality of data with neural networks," Science, vol. 313, no. 5786, pp. 504-507, July 2006.
- (2006) Science , vol.313 , Issue.5786 , pp. 504-507
- Hinton, G.E.¹ Salakhutdinov, R.R.²

9
- 84862612564
- On contrastive divergence learning
- M. Á. Carreira-Perpiñán and G.E. Hinton, "On contrastive divergence learning," in Artificial Intelligence and Statistics, 2005, pp. 33-41.
- (2005) Artificial Intelligence and Statistics , pp. 33-41
- Carreira-Perpiñán, M.Á.¹ Hinton, G.E.²

10
- 84878381939
- "Speech Signal Processing Toolkit (SPTK)," Version 3.2, http://sp-tk.sourceforge.net/.
- Speech Signal Processing Toolkit (SPTK)

* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.