SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2010, Pages 4606-4609

Improved modeling for F0 generation and V/U decision in HMM-based TTS

(6) Zhang, Qingqing a,b Soong, Frank a Qian, Yao a Yan, Zhijie a Pan, Jielin b Yan, Yonghong b

a MICROSOFT RESEARCH ASIA (China)

b INSTITUTE OF ACOUSTICS (China)

Author keywords

F0 generation; HMM based TTS; V U decision model; Voicing strength

Indexed keywords

EXTRACTION; PROBABILITY DENSITY FUNCTION; SIGNAL PROCESSING;

DECISION MODELING; F0 GENERATION; F0 MODELING; HMM-BASED; HMM-BASED TTS; NEW APPROACHES; SYNTHESIZED SPEECH; TRAINING DATA; V/U DECISION MODEL; VOICING STRENGTH;

ERRORS;

EID: 78049409326 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2010.5495561 Document Type: Conference Paper

Times cited : (25)

References (15)

1
- 0033708106
- Speech Parameter Generation Algorithms for HMM-based Speech Synthesis
- K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech Parameter Generation Algorithms for HMM-based Speech Synthesis", Proc. of ICASSP, 2000.
- Proc. of ICASSP, 2000
- Tokuda, K.¹ Yoshimura, T.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

2
- 0032673049
- Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. M. Katsuse, and A. D. Cheveigne, "Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based f0 extraction: possible role of a repetitive structure in sounds", Speech Communication, vol. 27, no. 3-4, pp. 187-207, 1999.
- (1999) Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
- Kawahara, H.¹ Katsuse, I.M.² Cheveigne, A.D.³

3
- 84867223798
- Robustness of HMM-based Speech Synthesis
- J. Yamagishi, Z. Ling, and S. King, "Robustness of HMM-based Speech Synthesis", Proc. of InterSpeech, 2008.
- Proc. of InterSpeech, 2008
- Yamagishi, J.¹ Ling, Z.² King, S.³

4
- 11144317887
- Robust F0 estimation of speech signal using harmonicity measure based on instantaneous frequency
- Dec.
- D. Arifianto, T. Tanaka, T. Masuko, and T. Kobayashi, "Robust F0 estimation of speech signal using harmonicity measure based on instantaneous frequency", IEICE Trans. Inf. & Syst., vol. E87-D, no. 12, pp. 2812-2820, Dec. 2004.
- (2004) IEICE Trans. Inf. & Syst. , vol.E87-D , Issue.12 , pp. 2812-2820
- Arifianto, D.¹ Tanaka, T.² Masuko, T.³ Kobayashi, T.⁴

5
- 84928118106
- Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of F0 and periodicity
- H. Kawahara, H. Katayose, A. Cheveigńe, and R. Patterson, "Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of F0 and periodicity", Proc .of EuroSpeech, 1999.
- Proc.of EuroSpeech, 1999
- Kawahara, H.¹ Katayose, H.² Cheveigńe, A.³ Patterson, R.⁴

6
- 0001455934
- A robust algorithm for pitch tracking (RAPT)
- W. Kleijn and K. Paliwal, Eds. Elsevier
- D. Talkin, "A robust algorithm for pitch tracking (RAPT)", in Speech Coding and Synthesis,W. Kleijn and K. Paliwal, Eds. Elsevier, 1995, pp. 495-518.
- (1995) Speech Coding and Synthesis , pp. 495-518
- Talkin, D.¹

7
- 33749573927
- Reformulating the HMM as a Trajectory Model by Imposing Explicit Relationships between static and Dynamic Feature Vector Sequences
- H. Zen, K. Tokuda, and T. Kitamura, "Reformulating the HMM as a Trajectory Model by Imposing Explicit Relationships between static and Dynamic Feature Vector Sequences," Computer Speech & Language, vol. 21, no. 1, pp. 153-173, 2007.
- (2007) Computer Speech & Language , vol.21 , Issue.1 , pp. 153-173
- Zen, H.¹ Tokuda, K.² Kitamura, T.³

8
- 34547517493
- Full HMM Training for Minimizing Generation Error in Synthesis
- Y. Wu, R. Wang, and F. Soong, "Full HMM Training for Minimizing Generation Error in Synthesis," in Proc. ICASSP,2007.
- Proc. ICASSP,2007
- Wu, Y.¹ Wang, R.² Soong, F.³

9
- 67650823157
- Probablistic modelling of f0 in unvoiced regions in hmm based speech synthesis
- K. Yu, T. Toda, M. Gasic, S. Keizer, F. Mairesse, B. Thomson and S. Young, "Probablistic modelling of f0 in unvoiced regions in hmm based speech synthesis," ICASSP 2009, Taipei, Taiwan, April 19-24.
- ICASSP 2009, Taipei, Taiwan, April 19-24
- Yu, K.¹ Toda, T.² Gasic, M.³ Keizer, S.⁴ Mairesse, F.⁵ Thomson, B.⁶ Young, S.⁷

10
- 0037567970
- Pitch pattern generation using multi-space probability distribution hmm
- T. Masuko, K. Tokuda, N. Miyazaki, and T. Kobayashi, "Pitch pattern generation using multi-space probability distribution hmm," IEICE Trans., vol. J83-D-II, no. 7, pp. 1600-1609, 2000
- (2000) IEICE Trans. , vol.J83-D-II , Issue.7 , pp. 1600-1609
- Masuko, T.¹ Tokuda, K.² Miyazaki, N.³ Kobayashi, T.⁴

11
- 70450169782
- A Minimum V/U Error Approach to F0 Genearation in HMM-based TTS
- Y. Qian, F. k. Soong, M. Wang, Z. Wu, "A Minimum V/U Error Approach to F0 Genearation in HMM-based TTS", In Proc. InterSpeech 2009.
- Proc. InterSpeech 2009
- Qian, Y.¹ Soong, F.K.² Wang, M.³ Wu, Z.⁴

12
- 78049378328
- Voiced/Unvoiced Decision Algorithm for HMM-based Speech Synthesis
- Sept.
- S. Kang, Z. Shuang, Q. Duan, Y. Qin, L. Cai, "Voiced/Unvoiced Decision Algorithm for HMM-based Speech Synthesis", in Proc. of Interspeech2009, Brighton, UK, Sept. 2009.
- (2009) Proc. of Interspeech2009, Brighton, UK
- Kang, S.¹ Shuang, Z.² Duan, Q.³ Qin, Y.⁴ Cai, L.⁵

13
- 84905283451
- New methods in continuous Mandarin speech recognition
- C.J. Chen, R.A. Gopinath, M.D. Monkowski,M.A. Picheny, K. Shen, "New methods in continuous Mandarin speech recognition", In Proc. of Eurospeech 1997, pp. 1543-1546.
- Proc. of Eurospeech 1997 , pp. 1543-1546
- Chen, C.J.¹ Gopinath, R.A.² Monkowski, M.D.³ Picheny, M.A.⁴ Shen, K.⁵

14
- 0033906251
- MDL-based Context-Dependent Sub-word Modeling for Speech Recognition
- K. Shinoda, and T. Watanable, "MDL-based Context-Dependent Sub-word Modeling for Speech Recognition", J. Acoust. Soc. Jpn(E), vol.21, no.2, pp.79-86, 2000.
- (2000) J. Acoust. Soc. Jpn(E) , vol.21 , Issue.2 , pp. 79-86
- Shinoda, K.¹ Watanable, T.²

15
- 79952258981
- "HMM-based Speech Synthesis System (HTS)," http://hts.sp. nitech.ac.jp. 2009
- HMM-based Speech Synthesis System (HTS)

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.