SCOPUS 정보 검색 플랫폼

7th ISCA Workshop on Speech Synthesis, SSW 2010

Volumn , Issue , 2010, Pages 334-339

Comparison of Formant Enhancement Methods for HMM-Based Speech Synthesis

(5) Raitio, Tuomo a Suni, Antti b Pulakka, Hannu a Vainio, Martti b Alku, Paavo a

a AALTO UNIVERSITY (Finland)

b UNIVERSITY OF HELSINKI (Finland)

Author keywords

formant enhancement; hidden Markov model; over smoothing; speech synthesis

Indexed keywords

SPEECH SYNTHESIS;

ALL-POLE MODELING; FORMANT ENHANCEMENT; HIDDEN MARKOV MODEL-BASED SPEECH SYNTHESIS; HIDDEN-MARKOV MODELS; MODEL TRAINING; OVER-SMOOTHING; PERFORMANCE; SPECTRAL ENVELOPES; SPECTRAL MODELING; SPEECH SOUNDS;

HIDDEN MARKOV MODELS;

EID: 84865718521 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (21)

References (26)

1
- 85031628788
- An algorithm for speech parameter generation from continuous mixture HMMs with dynamic features
- Tokuda, K., Masuko, T., Yamada, T., Kobayashi, T. and Imai, S., “An algorithm for speech parameter generation from continuous mixture HMMs with dynamic features”, in Proc. Eurospeech, 1:757-760, 1995.
- (1995) Proc. Eurospeech , vol.1 , pp. 757-760
- Tokuda, K.¹ Masuko, T.² Yamada, T.³ Kobayashi, T.⁴ Imai, S.⁵

2
- 85009139544
- Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
- Sep
- Yoshimura, T., Tokuda, K., Masuko, T., Kobayashi, T. and Kitamura, T., “Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis”, in Proc. Eurospeech, 2374-2350, Sep. 1999.
- (1999) Proc. Eurospeech , pp. 2374-2350
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

3
- 84966348891
- An HMM-based speech synthesis system applied to English
- Sep
- Tokuda, K., Zen, H. and Black, A. W., “An HMM-based speech synthesis system applied to English”, in Proc. 2002 IEEE Workshop on Speech Synthesis, 227-230, Sep. 2002.
- (2002) Proc. 2002 IEEE Workshop on Speech Synthesis , pp. 227-230
- Tokuda, K.¹ Zen, H.² Black, A. W.³

4
- 67651002140
- Statistical parametric speech synthesis
- Zen, H., Tokuda, K. and Black, A. W., “Statistical parametric speech synthesis”, Speech Commun., 51(11):1039-1064, 2009.
- (2009) Speech Commun , vol.51 , Issue.11 , pp. 1039-1064
- Zen, H.¹ Tokuda, K.² Black, A. W.³

5
- 0016495091
- Linear prediction: A tutorial review
- Apr
- Makhoul, J., “Linear prediction: A tutorial review”, in Proc. of the IEEE, 63(4):561-580, Apr. 1975.
- (1975) Proc. of the IEEE , vol.63 , Issue.4 , pp. 561-580
- Makhoul, J.¹

6
- 85016140477
- An adaptive algorithm for mel-cepstral analysis of speech
- Fukada, T., Tokuda, K., Kobayashi, T., Imai, S., “An adaptive algorithm for mel-cepstral analysis of speech”, in Proc. ICASSP, 137-140, 1992.
- (1992) Proc. ICASSP , pp. 137-140
- Fukada, T.¹ Tokuda, K.² Kobayashi, T.³ Imai, S.⁴

7
- 85009231267
- Trajectory modeling based on HMMs with the explicit relationship between static and dynamic features
- Sep
- Tokuda, K., Zen, H. and Kitamura, T., “Trajectory modeling based on HMMs with the explicit relationship between static and dynamic features”, In Proc. Eurospeech, 865-868. Sep. 2003.
- (2003) Proc. Eurospeech , pp. 865-868
- Tokuda, K.¹ Zen, H.² Kitamura, T.³

8
- 33846429403
- Minimum generation error training for HMM-based speech synthesis
- Wu, Y.-J. and Wang, R.-H., “Minimum generation error training for HMM-based speech synthesis”, in Proc. ICASSP, 89-92, 2006.
- (2006) Proc. ICASSP , pp. 89-92
- Wu, Y.-J.¹ Wang, R.-H.²

9
- 38549096029
- A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
- May
- Toda, T. and Tokuda, K., “A speech parameter generation algorithm considering global variance for HMM-based speech synthesis”, IEICE Trans. Inf. & Syst., E90-D(5):816-824, May 2007.
- (2007) IEICE Trans. Inf. & Syst , vol.E90-D , Issue.5 , pp. 816-824
- Toda, T.¹ Tokuda, K.²

10
- 34547553049
- A study on conditional parameter generation from HMM based on maximum likelihood criterion
- [in Japanese]
- Masuko, T., Tokuda, K. and Kobayashi, T., “A study on conditional parameter generation from HMM based on maximum likelihood criterion”, in Proc. Autumn Meeting of ASJ, 209-210, 2003. [in Japanese]
- (2003) Proc. Autumn Meeting of ASJ , pp. 209-210
- Masuko, T.¹ Tokuda, K.² Kobayashi, T.³

11
- 67650851754
- USTC system for Blizzard Challenge 2006 an improved HMM-based speech synthesis method
- Ling, Z.-H., Wu Y.-J., Wang Y.-P., Qin, L. and Wang, R.-H., “USTC system for Blizzard Challenge 2006 an improved HMM-based speech synthesis method”, Blizzard Challenge Workshop, 2006.
- (2006) Blizzard Challenge Workshop
- Ling, Z.-H.¹ Wu, Y.-J.² Wang, Y.-P.³ Qin, L.⁴ Wang, R.-H.⁵

12
- 0029219433
- Adaptive postfiltering for quality enhancement of coded speech
- Jan
- Chen, J.-H. and Gersho, A., “Adaptive postfiltering for quality enhancement of coded speech”, IEEE Trans. on Speech and Audio Processing, 3(1):59-71, Jan. 1995.
- (1995) IEEE Trans. on Speech and Audio Processing , vol.3 , Issue.1 , pp. 59-71
- Chen, J.-H.¹ Gersho, A.²

13
- 85133720638
- The HMM-based speech synthesis system (HTS) version 2.0
- Aug
- Zen, H., Nose, T., Yamagishi, J., Sako, S., Masuko, T., Black, A. W. and Tokuda, K., “The HMM-based speech synthesis system (HTS) version 2.0”, in Sixth ISCA Workshop on Speech Synthesis, 294-299, Aug. 2007.
- (2007) Sixth ISCA Workshop on Speech Synthesis , pp. 294-299
- Zen, H.¹ Nose, T.² Yamagishi, J.³ Sako, S.⁴ Masuko, T.⁵ Black, A. W.⁶ Tokuda, K.⁷

14
- 79952258981
- Apr. Online
- HTS, “HMM-based speech synthesis system”, Apr. 2009. Online: http://hts.sp.nitech.ac.jp
- (2009) HMM-based speech synthesis system

15
- 34547505386
- Ph.D Thesis, University of Science and Technology of China, [in Chinese]
- Wu, Y.-J., “Research on HMM-based Speech Synthesis”, Ph.D Thesis, University of Science and Technology of China, 2006. [in Chinese]
- (2006) Research on HMM-based Speech Synthesis
- Wu, Y.-J.¹

16
- 67650797364
- Postfiltering for HMM-based speech synthesis using mel-LSPs
- [in Japanese]
- Oura, K., Zen, H., Nankaku, Y., Lee, A. and Tokuda, K., “Postfiltering for HMM-based speech synthesis using mel-LSPs”, Proc. Autumn Meeting of ASJ, pp. 367-368, 2007. [in Japanese]
- (2007) Proc. Autumn Meeting of ASJ , pp. 367-368
- Oura, K.¹ Zen, H.² Nankaku, Y.³ Lee, A.⁴ Tokuda, K.⁵

17
- 0002557614
- Line spectrum pair (LSP) and speech data compression
- Soong, F. K. and Juang, B.-H., “Line spectrum pair (LSP) and speech data compression”, Proc. ICASSP, 9:37-40, 1984.
- (1984) Proc. ICASSP , vol.9 , pp. 37-40
- Soong, F. K.¹ Juang, B.-H.²

18
- 0003757962
- 2nd ed., Springer-Verlag
- Flanagan, J. L., “Speech Analysis, Synthesis and Perception”, 2nd ed., Springer-Verlag, 1972.
- (1972) Speech Analysis, Synthesis and Perception
- Flanagan, J. L.¹

19
- 84867209230
- HMM-based Finnish text-to-speech system utilizing glottal inverse filtering
- Raitio, T., Suni, A., Pulakka, H., Vainio, M. and Alku, P., “HMM-based Finnish text-to-speech system utilizing glottal inverse filtering”, Proc. Interspeech, 2008.
- (2008) Proc. Interspeech
- Raitio, T.¹ Suni, A.² Pulakka, H.³ Vainio, M.⁴ Alku, P.⁵

20
- 85133445050
- HMM-based speech synthesis utilizing glottal inverse filtering
- (in press)
- Raitio, T., Suni, A., Yamagishi, J., Pulakka, H., Nurminen, J., Vainio, M. and Alku, P., “HMM-based speech synthesis utilizing glottal inverse filtering”, IEEE Trans. Audio, Speech, and Language Processing, (in press)
- IEEE Trans. Audio, Speech, and Language Processing
- Raitio, T.¹ Suni, A.² Yamagishi, J.³ Pulakka, H.⁴ Nurminen, J.⁵ Vainio, M.⁶ Alku, P.⁷

21
- 0032875050
- A method for generating natural-sounding speech stimuli for cognitive brain research
- Alku, P., Tiitinen, H. and Näätänen, R., “A method for generating natural-sounding speech stimuli for cognitive brain research”, Clinical Neurophysiology, 110:1329-1333, 1999.
- (1999) Clinical Neurophysiology , vol.110 , pp. 1329-1333
- Alku, P.¹ Tiitinen, H.² Näätänen, R.³

22
- 0026881384
- Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering
- Jun
- Alku, P., “Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering”, Speech Commun. 11(2-3):109-118, Jun. 1992.
- (1992) Speech Commun , vol.11 , Issue.2-3 , pp. 109-118
- Alku, P.¹

23
- 0030101058
- A revision of Zwicker's loudness model
- Moore, B. C. J. and Glasberg, B. R., “A revision of Zwicker's loudness model”, ACTA Acustica, 82:335-345, 1996.
- (1996) ACTA Acustica , vol.82 , pp. 335-345
- Moore, B. C. J.¹ Glasberg, B. R.²

24
- 34547533934
- Hidden semi-Markov model based speech synthesis
- Oct
- Zen, H., Tokuda, K., Masuko, T., Kobayashi, T. and Kitamura, T., “Hidden semi-Markov model based speech synthesis”, Proc. Interspeech, 2:1397-1400, Oct. 2004.
- (2004) Proc. Interspeech , vol.2 , pp. 1397-1400
- Zen, H.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

25
- 53049107776
- Accent and prominence in Finnish speech synthesis
- Oct
- Vainio, M., Suni, A. and Sirjola, P., “Accent and prominence in Finnish speech synthesis”, Proc. of the 10th International Conference on Speech and Computer, 309-312, Oct. 2005.
- (2005) Proc. of the 10th International Conference on Speech and Computer , pp. 309-312
- Vainio, M.¹ Suni, A.² Sirjola, P.³

26
- 0003450846
- Recommendation ITU-T P.800 International Telecommunication Union, Aug
- Recommendation ITU-T P.800 “Methods for subjective determination of transmission quality”, International Telecommunication Union, Aug. 1996.
- (1996) Methods for subjective determination of transmission quality

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.