SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2009, Pages 4025-4028

Trajectory training considering global variance for HMM-based speech synthesis

(2) Toda, Tomoki a Young, Steve b

a NARA INSTITUTE OF SCIENCE AND TECHNOLOGY (Japan)

b UNIVERSITY OF CAMBRIDGE (United Kingdom)

Author keywords

Global varian; Hidden Markov models; Speech synthesis; Training criterion; Trajectory likelihood

Indexed keywords

CLOSED FORM SOLUTIONS; GLOBAL VARIAN; HMM-BASED SPEECH SYNTHESIS; NATURAL SPEECH; NOVEL METHODS; OPTIMIZATION CRITERIA; PARAMETER OPTIMIZATION; STATISTICAL MODELING; SYNTHESIS OPTIMIZATION; SYNTHETIC SPEECH; TRAINING CRITERION; TRAINING METHODS; UNIFIED FRAMEWORK;

ACOUSTICS; COMPUTATIONAL GRAMMARS; SIGNAL PROCESSING; SPEECH SYNTHESIS; STRUCTURAL OPTIMIZATION; TRAJECTORIES;

HIDDEN MARKOV MODELS;

EID: 67650826181 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2009.4960511 Document Type: Conference Paper

Times cited : (18)

References (12)

1
- 85009139544
- Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
- Budapest, Hungary, Sep
- T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura. Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis. Proc. EUROSPEECH, pp. 2347-2350, Budapest, Hungary, Sep. 1999.
- (1999) Proc. EUROSPEECH , pp. 2347-2350
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

2
- 0033708106
- Speech parameter generation algorithms for HMM-based speech synthesis
- Istanbul, Turkey, June
- K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura. Speech parameter generation algorithms for HMM-based speech synthesis. Proc. ICASSP, pp. 1315-1318, Istanbul, Turkey, June 2000.
- (2000) Proc. ICASSP , pp. 1315-1318
- Tokuda, K.¹ Yoshimura, T.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

3
- 33749573927
- Reformulating the HMM as a trajetory model by imposing explicit relationships between static and dynamic feature vector sequences
- H. Zen, K. Tokuda, and T. Kitamura. Reformulating the HMM as a trajetory model by imposing explicit relationships between static and dynamic feature vector sequences. Computer Speech and Language, Vol. 21, pp. 153-173, 2007.
- (2007) Computer Speech and Language , vol.21 , pp. 153-173
- Zen, H.¹ Tokuda, K.² Kitamura, T.³

4
- 33846429403
- Minimum generation error training for HMM-based speech synthesis
- Toulouse, France, May
- Y.-J. Wu and R.H. Wang. Minimum generation error training for HMM-based speech synthesis. Proc. ICASSP, pp. 89-92, Toulouse, France, May 2006.
- (2006) Proc. ICASSP , pp. 89-92
- Wu, Y.-J.¹ Wang, R.H.²

5
- 38549096029
- A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
- May
- T. Toda and K. Tokuda. A speech parameter generation algorithm considering global variance for HMM-based speech synthesis. IEICE Transactions, Vol. E90-D, No. 5, pp. 816-824, May 2007.
- (2007) IEICE Transactions , vol.E90-D , Issue.5 , pp. 816-824
- Toda, T.¹ Tokuda, K.²

6
- 51449106803
- Minimum generation error criterion considering global/local variance for HMM-based speech synthesis
- Las Vegas, USA, Mar
- Y.-J.Wu, H. Zen, Y. Nankaku and K. Tokuda. Minimum generation error criterion considering global/local variance for HMM-based speech synthesis. Proc. ICASSP, pp. 4621-4624, Las Vegas, USA, Mar. 2008.
- (2008) Proc. ICASSP , pp. 4621-4624
- Wu, Y.J.¹ Zen, H.² Nankaku, Y.³ Tokuda, K.⁴

7
- 70349220142
- Master Thesis in Japanese, Nagoya Institute of Technology
- K. Nakamura. Model training considering global variance for HMMbased speech synthesis. Master Thesis (in Japanese), Nagoya Institute of Technology, 2007.
- (2007) Model training considering global variance for HMMbased speech synthesis
- Nakamura, K.¹

8
- 0036522887
- Multi-space probability distribution HMM
- K. Tokuda, T. Masuko, N. Miyazaki, and T. Kobayashi. Multi-space probability distribution HMM. IEICE Trans. Inf. and Syst., Vol. E85-D, No. 3, pp. 455-464, 2002.
- (2002) IEICE Trans. Inf. and Syst , vol.E85-D , Issue.3 , pp. 455-464
- Tokuda, K.¹ Masuko, T.² Miyazaki, N.³ Kobayashi, T.⁴

9
- 85131821539
- Mel-generalized cepstral analysis - a unified approach to speech spectral estimation
- Yokohama, Japan, Sep
- K. Tokuda, T. Kobayashi, T. Masuko, and S. Imai. Mel-generalized cepstral analysis - a unified approach to speech spectral estimation. Proc. ICSLP, pp. 1043-1045, Yokohama, Japan, Sep. 1994.
- (1994) Proc. ICSLP , pp. 1043-1045
- Tokuda, K.¹ Kobayashi, T.² Masuko, T.³ Imai, S.⁴

10
- 33646773080
- CMU ARCTIC databases for speech synthesis
- Technical Report, CMU-LTI-03-177, Language Technologies Institute, Carnegie Mellon University
- J. Kominek and A. W. Black. CMU ARCTIC databases for speech synthesis. Technical Report, CMU-LTI-03-177, Language Technologies Institute, Carnegie Mellon University, 2003.
- (2003)
- Kominek, J.¹ Black, A.W.²

11
- 70349214044
- http://www.speech.cs.cmu.edu/flite/index.html

12
- 0032673049
- 0 extraction: Possible role of a repetitive structure in sounds
- 0 extraction: possible role of a repetitive structure in sounds. Speech Communication, Vol. 27, No. 3-4, pp. 187-207, 1999.
- (1999) Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² de Cheveigńe, A.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.