SCOPUS Information Search Platform

4th ITRW on Speech Synthesis, SSW 2001

Volume , Issue , 2001, Pages

A Concatenative Mandarin TTS System without Prosody Model and Prosody Modification

Author keywords

[No Author keywords available]

Indexed keywords

FINAL DECISION; MULTI-TIER; NON-UNIFORM; PROSODY MODELING; PROSODY PREDICTIONS; SELECTION SCHEME; SPEECH CORPORA; SYNTHESISED; TTS SYSTEMS; UNIT SELECTION;

EID: 85039405250 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (14)

References (16)

1
- 0025543906
- Pitch-Synchronous Waveform Processing Techniques for Text-to-Speech Synthesis Using Diphones
- Moulines, E. and Charpentier, F., “Pitch-Synchronous Waveform Processing Techniques for Text-to-Speech Synthesis Using Diphones”, Speech Communication 9 (1990), 453-467.
- (1990) Speech Communication , vol.9 , pp. 453-467
- Moulines, E.¹ Charpentier, F.²

2
- 0141588508
- The Klattalk text-to-speech conversion system
- Klatt, D.H., “The Klattalk text-to-speech conversion system”, Proc. ICASSP'82, 1982, 1589-1592.
- (1982) Proc. ICASSP'82 , pp. 1589-1592
- Klatt, D.H.¹

3
- 0022896756
- Acoustic characteristics and the underlying rules of intonation of the common Japanese used by radio and TV announcers
- Fujisaki, H., Hirose, K., Takahashi, N. and Morikawa, H., “Acoustic characteristics and the underlying rules of intonation of the common Japanese used by radio and TV announcers”, Proc. ICASSP'86, 1986, 2039-2042.
- (1986) Proc. ICASSP'86 , pp. 2039-2042
- Fujisaki, H.¹ Hirose, K.² Takahashi, N.³ Morikawa, H.⁴

4
- 0028405296
- Assignment of segmental duration in text-to-speech synthesis
- Van Santen, J., “Assignment of segmental duration in text-to-speech synthesis”, Computer Speech and Language, Vol.8, 1994, 95-129.
- (1994) Computer Speech and Language , vol.8 , pp. 95-129
- Van Santen, J.¹

5
- 21844469662
- A study of pitch pattern generation using HMM-based statistical information
- Fukada, T., Komori, Y., Aso, T. and Ohora, Y., “A study of pitch pattern generation using HMM-based statistical information”, Proc. ICSLP'94, 1994, 723-726.
- (1994) Proc. ICSLP'94 , pp. 723-726
- Fukada, T.¹ Komori, Y.² Aso, T.³ Ohora, Y.⁴

6
- 0032665603
- A dynamical system model for generating fundamental frequency for speech synthesis
- Ross, K.N. and Ostendorf, M., “A dynamical system model for generating fundamental frequency for speech synthesis”, IEEE transactions on speech and audio processing, Vol.7, No. 3, 1999, 295-309.
- (1999) IEEE transactions on speech and audio processing , vol.7 , Issue.3 , pp. 295-309
- Ross, K.N.¹ Ostendorf, M.²

7
- 85009107944
- Using Bayesian belief networks for model duration in text-to-speech systems
- Goubanova, O. and Taylor, P., “Using Bayesian belief networks for model duration in text-to-speech systems”, Proc. ICSLP'2000, 2000.
- (2000) Proc. ICSLP'2000
- Goubanova, O.¹ Taylor, P.²

8
- 0035121063
- Statistical prosodic modeling: from corpus design to parameter estimation
- Bellegarda, J.R., Silverman, K.E.A., Lenzo, K. and Anderson, V., “Statistical prosodic modeling: from corpus design to parameter estimation”, IEEE transactions on speech and audio processing, Vol.9, No.1, 2001, 52-66.
- (2001) IEEE transactions on speech and audio processing , vol.9 , Issue.1 , pp. 52-66
- Bellegarda, J.R.¹ Silverman, K.E.A.² Lenzo, K.³ Anderson, V.⁴

9
- 0032073761
- An RNN-based prosodic information synthesizer for Mandarin text-to-speech
- Chen, S., Hwang, S. and Wang, Y., “An RNN-based prosodic information synthesizer for Mandarin text-to-speech”, IEEE transactions on speech and audio processing, Vol.6, No.3, 1998, 226-239.
- (1998) IEEE transactions on speech and audio processing , vol.6 , Issue.3 , pp. 226-239
- Chen, S.¹ Hwang, S.² Wang, Y.³

10
- 17344374779
- Tree-based unit selection for English speech synthesis
- Wang, W. J., Campbell, W. N., Iwahashi, N. and Sagisaka, Y., “Tree-based unit selection for English speech synthesis”, ICASSP'93, vol.2, 191-194.
- ICASSP'93 , vol.2 , pp. 191-194
- Wang, W. J.¹ Campbell, W. N.² Iwahashi, N.³ Sagisaka, Y.⁴

11
- 0031642265
- Automatic generation of synthesis units for trainable text-to-speech systems
- Hon, H., Acero, A., Huang, S., Liu, J. and Plumpe, M., “Automatic generation of synthesis units for trainable text-to-speech systems”, ICASSP'98, vol.1, 293-296.
- ICASSP'98 , vol.1 , pp. 293-296
- Hon, H.¹ Acero, A.² Huang, S.³ Liu, J.⁴ Plumpe, M.⁵

12
- 0001208125
- Optimizing selection of units from speech database for concatenative synthesis
- Black, A. and Campbell, N., “Optimizing selection of units from speech database for concatenative synthesis”, ICASSP'96, 373-376, 1996.
- (1996) ICASSP'96 , pp. 373-376
- Black, A.¹ Campbell, N.²

13
- 0003840408
- Research on perception of juncture between syllables in Chinese
- Chu, M., Tang, D., Si, H., Tian, X. and Lu, S., “Research on perception of juncture between syllables in Chinese”, Chinese Journal of Acoustics, Vol.17, No.2, 143-152.
- Chinese Journal of Acoustics , vol.17 , Issue.2 , pp. 143-152
- Chu, M.¹ Tang, D.² Si, H.³ Tian, X.⁴ Lu, S.⁵

14
- 0034855169
- Segmenting unrestricted Chinese text into prosodic words instead of lexical words
- Qian, Y., Chu, M., Peng, H., “Segmenting unrestricted Chinese text into prosodic words instead of lexical words”, Proc. ICASSP2001, 2001.
- (2001) Proc. ICASSP2001
- Qian, Y.¹ Chu, M.² Peng, H.³

15
- 84871624073
- Multilingual text-to-speech synthesis: the Bell labs approach
- Sproat, R., editor, Multilingual text-to-speech synthesis: the Bell labs approach, Kluwer Academic Publishers, 1998, 17-20.
- (1998) Multilingual text-to-speech synthesis: the Bell labs approach , pp. 17-20
- Sproat, R.¹

16
- 0004056285
- Spoken Language Processing (draft), chapter 4
- Huang, X.D., Acero, A., Hon, H. and Meredith, S., Spoken Language Processing (draft), chapter 4.
- Spoken Language Processing (draft)
- Huang, X.D.¹ Acero, A.² Hon, H.³ Meredith, S.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.