메뉴 건너뛰기




Volumn , Issue , 2001, Pages

A Concatenative Mandarin TTS System without Prosody Model and Prosody Modification

Author keywords

[No Author keywords available]

Indexed keywords

FINAL DECISION; MULTI-TIER; NON-UNIFORM; PROSODY MODELING; PROSODY PREDICTIONS; SELECTION SCHEME; SPEECH CORPORA; SYNTHESISED; TTS SYSTEMS; UNIT SELECTION;

EID: 85039405250     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (14)

References (16)
  • 1
    • 0025543906 scopus 로고
    • Pitch-Synchronous Waveform Processing Techniques for Text-to-Speech Synthesis Using Diphones
    • Moulines, E. and Charpentier, F., “Pitch-Synchronous Waveform Processing Techniques for Text-to-Speech Synthesis Using Diphones”, Speech Communication 9 (1990), 453-467.
    • (1990) Speech Communication , vol.9 , pp. 453-467
    • Moulines, E.1    Charpentier, F.2
  • 2
    • 0141588508 scopus 로고
    • The Klattalk text-to-speech conversion system
    • Klatt, D.H., “The Klattalk text-to-speech conversion system”, Proc. ICASSP'82, 1982, 1589-1592.
    • (1982) Proc. ICASSP'82 , pp. 1589-1592
    • Klatt, D.H.1
  • 3
    • 0022896756 scopus 로고
    • Acoustic characteristics and the underlying rules of intonation of the common Japanese used by radio and TV announcers
    • Fujisaki, H., Hirose, K., Takahashi, N. and Morikawa, H., “Acoustic characteristics and the underlying rules of intonation of the common Japanese used by radio and TV announcers”, Proc. ICASSP'86, 1986, 2039-2042.
    • (1986) Proc. ICASSP'86 , pp. 2039-2042
    • Fujisaki, H.1    Hirose, K.2    Takahashi, N.3    Morikawa, H.4
  • 4
    • 0028405296 scopus 로고
    • Assignment of segmental duration in text-to-speech synthesis
    • Van Santen, J., “Assignment of segmental duration in text-to-speech synthesis”, Computer Speech and Language, Vol.8, 1994, 95-129.
    • (1994) Computer Speech and Language , vol.8 , pp. 95-129
    • Van Santen, J.1
  • 5
    • 21844469662 scopus 로고
    • A study of pitch pattern generation using HMM-based statistical information
    • Fukada, T., Komori, Y. Aso,T. and Ohora, Y., “A study of pitch pattern generation using HMM-based statistical information”, Proc. ICSLP'94, 1994, 723-726.
    • (1994) Proc. ICSLP'94 , pp. 723-726
    • Fukada, T.1    Komori, Y.2    Aso, T.3    Ohora, Y.4
  • 6
    • 0032665603 scopus 로고    scopus 로고
    • A dynamical system model for generating fundamental frequency for speech synthesis
    • Ross, K.N. and Ostendorf, M., “A dynamical system model for generating fundamental frequency for speech synthesis”, IEEE transactions on speech and audio processing, Vol.7, No. 3, 1999, 295-309.
    • (1999) IEEE transactions on speech and audio processing , vol.7 , Issue.3 , pp. 295-309
    • Ross, K.N.1    Ostendorf, M.2
  • 7
    • 85009107944 scopus 로고    scopus 로고
    • Using Bayesian belief networks for model duration in text-to-speech systems
    • Goubanova, O. and Taylor, P., “Using Bayesian belief networks for model duration in text-to-speech systems”, Proc. ICSLP'2000, 2000.
    • (2000) Proc. ICSLP'2000
    • Goubanova, O.1    Taylor, P.2
  • 9
    • 0032073761 scopus 로고    scopus 로고
    • An RNN-based prosodic information synthesizer for Mandarin text-to-speech
    • Chen, S., Hwang, S. and Wang, Y., “An RNN-based prosodic information synthesizer for Mandarin text-to-speech”, IEEE transactions on speech and audio processing, Vol.6, No.3, 1998, 226-239.
    • (1998) IEEE transactions on speech and audio processing , vol.6 , Issue.3 , pp. 226-239
    • Chen, S.1    Hwang, S.2    Wang, Y.3
  • 10
    • 17344374779 scopus 로고    scopus 로고
    • Tree-based unit selection for English speech synthesis
    • Wang, W. J., Campbell, W. N., Iwahashi, N. and Sagisaka, Y., “Tree-based unit selection for English speech synthesis”, ICASSP'93, vol.2, 191-194.
    • ICASSP'93 , vol.2 , pp. 191-194
    • Wang, W. J.1    Campbell, W. N.2    Iwahashi, N.3    Sagisaka, Y.4
  • 11
    • 0031642265 scopus 로고    scopus 로고
    • Automatic generation of synthesis units for trainable text-to-speech systems
    • Hon, H., Acero, A., Huang, S., Liu, J. and Plumpe, M., “Automatic generation of synthesis units for trainable text-to-speech systems”, ICASSP'98, vol.1, 293-296.
    • ICASSP'98 , vol.1 , pp. 293-296
    • Hon, H.1    Acero, A.2    Huang, S.3    Liu, J.4    Plumpe, M.5
  • 12
    • 0001208125 scopus 로고    scopus 로고
    • Optimizing selection of units from speech database for concatenative synthesis
    • Black, A. and Campbell, N., “Optimizing selection of units from speech database for concatenative synthesis”, ICASSP'96, 373-376, 1996.
    • (1996) ICASSP'96 , pp. 373-376
    • Black, A.1    Campbell, N.2
  • 13
    • 0003840408 scopus 로고    scopus 로고
    • Research on perception of juncture between syllables in Chinese
    • Chu, M., Tang, D., Si, H., Tian, X. and Lu, S., “Research on perception of juncture between syllables in Chinese”, Chinese Journal of Acoustics, Vol.17, No.2, 143-152.
    • Chinese Journal of Acoustics , vol.17 , Issue.2 , pp. 143-152
    • Chu, M.1    Tang, D.2    Si, H.3    Tian, X.4    Lu, S.5
  • 14
    • 0034855169 scopus 로고    scopus 로고
    • Segmenting unrestricted Chinese text into prosodic words instead of lexical words
    • Qian, Y., Chu, M., Peng, H., “Segmenting unrestricted Chinese text into prosodic words instead of lexical words”, Proc. ICASSP2001, 2001.
    • (2001) Proc. ICASSP2001
    • Qian, Y.1    Chu, M.2    Peng, H.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.