SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2010, Pages 4610-4613

Simple methods for improving speaker-similarity of HMM-based speech synthesis

(2) Yamagishi, Junichi a King, Simon a

a UNIVERSITY OF EDINBURGH (United Kingdom)

Author keywords

HMM; HTS; Speech synthesis; TTS

Indexed keywords

SIGNAL PROCESSING; SPEECH RECOGNITION;

'CURRENT; FREQUENCY WARPING; HMM; HMM-BASED SPEECH SYNTHESIS; HTS; LOGARITHMIC SCALING; SAMPLING RATES; SIMPLE METHOD; TTS; WAVEFORMS;

SPEECH SYNTHESIS;

EID: 78049403515 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2010.5495562 Document Type: Conference Paper

Times cited : (14)

References (18)

1
- 67651002140
- Statistical parametric speech synthesis
- Nov.
- H. Zen, K. Tokuda, and A. W. Black, "Statistical parametric speech synthesis," Speech Communication, vol. 51, no. 11, pp. 1039-1064, Nov. 2009.
- (2009) Speech Communication , vol.51 , Issue.11 , pp. 1039-1064
- Zen, H.¹ Tokuda, K.² Black, A.W.³

2
- 67650790758
- The Blizzard Challenge 2008
- V. Karaiskos, S. King, R. A. J. Clark, and C. Mayo, "The Blizzard Challenge 2008," in Proc. Blizzard Challenge Workshop, Brisbane, Australia, Sep. 2008.
- Proc. Blizzard Challenge Workshop, Brisbane, Australia, Sep. 2008
- Karaiskos, V.¹ King, S.² Clark, R.A.J.³ Mayo, C.⁴

3
- 67650819492
- The HTS-2008 system: Yet another evaluation of the speaker-adaptive HMM-based speech synthesis system in the 2008 Blizzard Challenge
- Sep.
- J. Yamagishi, H. Zen, Y.-J. Wu, T. Toda, and K. Tokuda, "The HTS-2008 system: Yet another evaluation of the speaker-adaptive HMM-based speech synthesis system in the 2008 Blizzard Challenge," in Proc. Blizzard Challenge 2008, Brisbane, Australia, Sep. 2008.
- (2008) Proc. Blizzard Challenge 2008, Brisbane, Australia
- Yamagishi, J.¹ Zen, H.² Wu, Y.-J.³ Toda, T.⁴ Tokuda, K.⁵

4
- 0002648826
- A model of loudness summation
- E. Zwicker and B. Scharf, "A model of loudness summation," Psych. Rev., vol. 72, pp. 2-26, 1965.
- (1965) Psych. Rev. , vol.72 , pp. 2-26
- Zwicker, E.¹ Scharf, B.²

5
- 78049361102
- Incorporation of mixed excitation model and postfilter into HMM-based text-to-speech synthesis
- Aug. in Japanese
- T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Incorporation of mixed excitation model and postfilter into HMM-based text-to-speech synthesis," IEICE Trans., vol. J87-D-II, no. 8, pp. 1565-1571, Aug. 2004, (in Japanese).
- (2004) IEICE Trans. , vol.J87-D-II , Issue.8 , pp. 1565-1571
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

6
- 85011187169
- Analysis of voice fundamental frequency contours for declarative sentences of Japanese
- Oct.
- H. Fujisaki and K. Hirose, "Analysis of voice fundamental frequency contours for declarative sentences of Japanese," J. Acoust. Soc. Japan (E), vol. 5, no. 4, pp. 233-242, Oct. 2000.
- (2000) J. Acoust. Soc. Japan (E) , vol.5 , Issue.4 , pp. 233-242
- Fujisaki, H.¹ Hirose, K.²

7
- 70450161300
- Thousands of voices for HMM-based speech synthesis
- Sep.
- J. Yamagishi et al., "Thousands of voices for HMM-based speech synthesis," in Proc. Interspeech 2009, Brighton, U.K., Sep. 2009, pp. 420-423.
- (2009) Proc. Interspeech 2009, Brighton, U.K. , pp. 420-423
- Yamagishi, J.¹

8
- 0000133998
- An analysis of transformations
- G. E. P. Box and D. R. Cox, "An analysis of transformations," Journal of the Royal Statistical Society. Series B (Methodological), vol. 26, no. 2, pp. 211-252, 1964.
- (1964) Journal of the Royal Statistical Society. Series B (Methodological) , vol.26 , Issue.2 , pp. 211-252
- Box, G.E.P.¹ Cox, D.R.²

9
- 0001310760
- Spectral estimation of speech based on mel-cepstral representation
- Aug. in Japanese
- K. Tokuda, T. Kobayashi, T. Fukada, H. Saito, and S. Imai, "Spectral estimation of speech based on mel-cepstral representation," IEICE Trans. Fundamentals, vol. J74-A, no. 8, pp. 1240-1248, Aug. 1991, in Japanese.
- (1991) IEICE Trans. Fundamentals , vol.J74-A , Issue.8 , pp. 1240-1248
- Tokuda, K.¹ Kobayashi, T.² Fukada, T.³ Saito, H.⁴ Imai, S.⁵

10
- 78049412356
- Recursive calculation of mel-cepstrum from LP coefficients
- Apr.
- K. Tokuda, T. Kobayashi, and S. Imai, "Recursive calculation of mel-cepstrum from LP coefficients," in Technical Report of Nagoya Institute of Technology, Apr. 1994.
- (1994) Technical Report of Nagoya Institute of Technology
- Tokuda, K.¹ Kobayashi, T.² Imai, S.³

11
- 0016938506
- Auditory filter shapes derived with noise stimuli
- Mar.
- R. Patterson, "Auditory filter shapes derived with noise stimuli," Journal of the Acoustical Society of America, vol. 76, pp. 640-654, Mar. 1982.
- (1982) Journal of the Acoustical Society of America , vol.76 , pp. 640-654
- Patterson, R.¹

12
- 0001481529
- Bark and ERB bilinear transforms
- Jul.
- J. O. Smith III and J. S. Abel, "Bark and ERB bilinear transforms," IEEE Trans. on Speech Audio Process., vol. 7, no. 6, pp. 697-708, Jul. 1999.
- (1999) IEEE Trans. on Speech Audio Process. , vol.7 , Issue.6 , pp. 697-708
- Smith III, J.O.¹ Abel, J.S.²

13
- 84898967346
- Gaussianization
- Nov.
- S. S. Chen and R. A. Gopinath, "Gaussianization," in NIPS 2000, Nov. 2000, pp. 423-429.
- (2000) NIPS 2000 , pp. 423-429
- Chen, S.S.¹ Gopinath, R.A.²

14
- 85030493378
- Synthesis of regional English using a keyword lexicon
- Sep.
- S. Fitt and S. Isard, "Synthesis of regional English using a keyword lexicon," in Proc. Eurospeech 1999, vol. 2, Sep. 1999, pp. 823-826.
- (1999) Proc. Eurospeech 1999 , vol.2 , pp. 823-826
- Fitt, S.¹ Isard, S.²

15
- 34047123652
- Multisyn: Open-domain unit selection for the Festival speech synthesis system
- R. A. J. Clark, K. Richmond, and S. King, "Multisyn: Open-domain unit selection for the Festival speech synthesis system," Speech Communication, vol. 49, no. 4, pp. 317-330, 2007.
- (2007) Speech Communication , vol.49 , Issue.4 , pp. 317-330
- Clark, R.A.J.¹ Richmond, K.² King, S.³

16
- 33846405723
- Details of Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005
- Jan.
- H. Zen, T. Toda, M. Nakamura, and K. Tokuda, "Details of Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005," IEICE Trans. Inf. & Syst., vol. E90-D, no. 1, pp. 325-333, Jan. 2007.
- (2007) IEICE Trans. Inf. & Syst. , vol.E90-D , Issue.1 , pp. 325-333
- Zen, H.¹ Toda, T.² Nakamura, M.³ Tokuda, K.⁴

17
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds," Speech Communication, vol. 27, pp. 187-207, 1999.
- (1999) Speech Communication , vol.27 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² Cheveigné, A.³

18
- 33846429403
- Minimum generation error training for HMM-based speech synthesis
- May
- Y. Wu and R.-H. Wang, "Minimum generation error training for HMM-based speech synthesis," in Proc. ICASSP 2006, May 2006, pp. 89-92.
- (2006) Proc. ICASSP 2006 , pp. 89-92
- Wu, Y.¹ Wang, R.-H.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.