SCOPUS 정보 검색 플랫폼

6th ISCA Workshop on Speech Synthesis, SSW 2007

Volumn , Issue , 2007, Pages 131-136

An Excitation Model for HMM-Based Speech Synthesis Based on Residual Modeling

(5) Maia, Ranniery a,b Toda, Tomoki a,c Zen, Heiga d Nankaku, Yoshihiko d Tokuda, Keiichi a,d

a NATIONAL INSTITUTE OF INFORMATION AND COMMUNICATIONS TECHNOLOGY (Japan)

b ADVANCED TELECOMMUNICATIONS RESEARCH INSTITUTE INTERNATIONAL (Japan)

c NARA INSTITUTE OF SCIENCE AND TECHNOLOGY (Japan)

d NAGOYA INSTITUTE OF TECHNOLOGY (Japan)

Author keywords

[No Author keywords available]

Indexed keywords

EXCITED STATES; SPEECH CODING; SPEECH SYNTHESIS;

ANALYSIS BY SYNTHESIS; EXCITATION MODELING; HMM-BASED; HMM-BASED SPEECH SYNTHESIS; PULSE TRAIN; RESIDUAL MODEL; SPEECH SYNTHESIZER; STATE-DEPENDENT; WAVEFORM GENERATION; WHITE NOISE SEQUENCE;

WHITE NOISE;

EID: 78649297510 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (69)

References (19)

1
- 85009139544
- Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
- T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, “Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis,” in Proc. of EUROSPEECH, 1999.
- (1999) Proc. of EUROSPEECH
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

2
- 4544291748
- Speaking style adaptation using context clustering decision tree for HMM-based speech synthesis
- J. Yamagishi, M. Tachibana, T. Masuko, and T. Kobayashi, “Speaking style adaptation using context clustering decision tree for HMM-based speech synthesis,” in Proc. of ICASSP, 2004.
- (2004) Proc. of ICASSP
- Yamagishi, J.¹ Tachibana, M.² Masuko, T.³ Kobayashi, T.⁴

3
- 84966348891
- An HMM-based speech synthesis system applied to English
- K. Tokuda, H. Zen, and A. W. Black, “An HMM-based speech synthesis system applied to English,” in Proc. of IEEE Workshop in Speech Synthesis, 2002.
- (2002) Proc. of IEEE Workshop in Speech Synthesis
- Tokuda, K.¹ Zen, H.² Black, A. W.³

4
- 85009097254
- Mixed-excitation for HMM-based speech synthesis
- T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, “Mixed-excitation for HMM-based speech synthesis,” in Proc. of EUROSPEECH, 2001.
- (2001) Proc. of EUROSPEECH
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

5
- 85133716335
- A 2.4 kbits/s MELP candidate for the U.S. Fdereal Standard
- A. McCree, K. Truong, E. George, T. Barnwell, and V. Viswanathan, “A 2.4 kbits/s MELP candidate for the U.S. Fdereal Standard,” in Proc. of ICASSP, 2006.
- (2006) Proc. of ICASSP
- McCree, A.¹ Truong, K.² George, E.³ Barnwell, T.⁴ Viswanathan, V.⁵

6
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds
- Apr
- H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, “Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds,” Speech Communication, vol. 27, Apr. 1999.
- (1999) Speech Communication , vol.27
- Kawahara, H.¹ Masuda-Katsuse, I.² de Cheveigné, A.³

7
- 33846405723
- Details of the Nitech HMM-based speech synthesis for Blizzard Challenge 2005
- H. Zen, T. Toda, M. Nakamura, and K. Tokuda, “Details of the Nitech HMM-based speech synthesis for Blizzard Challenge 2005,” IEICE Trans. on Inf. and Systems, vol. E90-D, no. 1, 2007.
- (2007) IEICE Trans. on Inf. and Systems , vol.E90-D , Issue.1
- Zen, H.¹ Toda, T.² Nakamura, M.³ Tokuda, K.⁴

8
- 33846406459
- Two-band excitation for HMM-based speech synthesis
- S. J. Kim and M. Hahn, “Two-band excitation for HMM-based speech synthesis,” IEICE Trans. Inf. & Syst., vol. E90-D, 2007.
- (2007) IEICE Trans. Inf. & Syst , vol.E90-D
- Kim, S. J.¹ Hahn, M.²

9
- 34547542349
- Improving the Arabic HMM based speech synthesis quality
- O. Abdel-Hamid, S. Abdou, and M. Rashwan, “Improving the Arabic HMM based speech synthesis quality,” in Proc. of ICSLP, 2006.
- (2006) Proc. of ICSLP
- Abdel-Hamid, O.¹ Abdou, S.² Rashwan, M.³

10
- 0348152138
- Wiley-Interscience
- W. Chu, Speech Coding Algorithms. Wiley-Interscience, 2003.
- (2003) Speech Coding Algorithms
- Chu, W.¹

11
- 85089837272
- Analytic generation of synthesis units by closed loop training for totally speaker driven text to speech system (TOS drive TTS)
- M. Akamine and T. Kagoshima, “Analytic generation of synthesis units by closed loop training for totally speaker driven text to speech system (TOS drive TTS),” in Proc. ICSLP, 1998.
- (1998) Proc. ICSLP
- Akamine, M.¹ Kagoshima, T.²

12
- 0025543906
- Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
- Dec
- E. Moulines and F. Charpentier, “Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones,” Speech Communication, vol. 9, Dec. 1990.
- (1990) Speech Communication , vol.9
- Moulines, E.¹ Charpentier, F.²

13
- 85133663448
- Mixed excitation for HMM-based speech synthesis based on state-dependent filtering
- R. Maia, T. Toda, H. Zen, Y. Nankaku, and K. Tokuda, “Mixed excitation for HMM-based speech synthesis based on state-dependent filtering,” in Proc. of Spring Meeting of the Acoust. Society of Japan, 2007.
- (2007) Proc. of Spring Meeting of the Acoust. Society of Japan
- Maia, R.¹ Toda, T.² Zen, H.³ Nankaku, Y.⁴ Tokuda, K.⁵

14
- 0003597650
- Kluwer Academics
- L. B. Jackson, Digital filters and signal processing. Kluwer Academics, 1996.
- (1996) Digital filters and signal processing
- Jackson, L. B.¹

15
- 0003874959
- Springer-Verlag
- J. D. Markel and A. H. Gray, Jr., Linear prediction of speech. Springer-Verlag, 1986.
- (1986) Linear prediction of speech
- Markel, J. D.¹ Gray, A. H.²

16
- 84908144695
- The use of Fast Fourier Transform for the estimation of power spectra: a method based on time averaging over short, modified periodograms
- June
- P. Welch, “The use of Fast Fourier Transform for the estimation of power spectra: a method based on time averaging over short, modified periodograms,” IEEE Trans. Audio and Electroacoustics, vol. 15, June 1967.
- (1967) IEEE Trans. Audio and Electroacoustics , vol.15
- Welch, P.¹

17
- 85133653256
- http://festvox.org/cmu arctic.

18
- 85016140477
- An adaptive algorithm for mel-cepstral analysis of speech
- T. Fukada, K. Tokuda, T. Kobayashi, and S. Imai, “An adaptive algorithm for mel-cepstral analysis of speech,” in Proc. of ICASSP, 1992.
- (1992) Proc. of ICASSP
- Fukada, T.¹ Tokuda, K.² Kobayashi, T.³ Imai, S.⁴

19
- 33947629275
- Residual conversion versus prediction on voice morphing systems
- H. Duxans and A. Bonafonte, “Residual conversion versus prediction on voice morphing systems,” in Proc. of ICASSP, 2006.
- (2006) Proc. of ICASSP
- Duxans, H.¹ Bonafonte, A.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.