Volume 14, Issue 4, 2006, Pages 1099-1108

The IBM expressive text-to-speech synthesis system for American English

Author keywords

Corpus driven text to speech (TTS); Expressive speech synthesis; Prosodic phonology; Text to speech (TTS); Tones and break indices (ToBI)

Indexed keywords

CORPUS DRIVEN TEXT TO SPEECH (TTS); EXPRESSIVE SPEECH SYNTHESIS; PROSODIC PHONOLOGY; TEXT TO SPEECH (TTS); TONES AND BREAK INDICES (TOBI);

EID: 34047275265     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2006.876123     Document Type: Article
Times cited: 110

References (28)
  • 1
    • 0012925721
    • Perceptual bandwidth
    • Mar
    • B. Reeves and C. Nass, "Perceptual bandwidth," Commun. ACM, vol. 43, no. 3, pp. 65-70, Mar. 2000.
    • (2000) Commun. ACM , vol.43 , Issue.3 , pp. 65-70
    • Reeves, B.1    Nass, C.2
  • 2
    • 34047248476
    • Multi-layered extensions to the speech synthesis markup language for describing expressiveness
    • Geneva, Switzerland
    • E. Eide, R. Bakis, W. Hamza, and J. Pitrelli, "Multi-layered extensions to the speech synthesis markup language for describing expressiveness," in Proc. Eurospeech, Geneva, Switzerland, 2003, pp. 1645-1648.
    • (2003) Proc. Eurospeech , pp. 1645-1648
    • Eide, E.1    Bakis, R.2    Hamza, W.3    Pitrelli, J.4
  • 5
    • 84985926077
    • Segment selection in the L&H RealSpeak laboratory TTS system
    • Beijing, China
    • G. Coorman, J. Fackrell, P. Rutten, and B. Van Coile, "Segment selection in the L&H RealSpeak laboratory TTS system," in Proc. ICSLP, Beijing, China, 2000, pp. 395-398.
    • (2000) Proc. ICSLP , pp. 395-398
    • Coorman, G.1    Fackrell, J.2    Rutten, P.3    Van Coile, B.4
  • 6
    • 0004131347
    • Trainable speech synthesis
    • Ph.D. dissertation, Cambridge Univ. Eng. Dept, Cambridge, U.K
    • R. E. Donovan, "Trainable speech synthesis," Ph.D. dissertation, Cambridge Univ. Eng. Dept., Cambridge, U.K., 1996.
    • (1996)
    • Donovan, R.E.1
  • 8
    • 0039885315
    • Context dependent vector quantization for continuous speech recognition
    • Minneapolis, MN
    • L. R. Bahl, P. V. deSouza, P. S. Gopalakrishnan, and M. A. Picheny, "Context dependent vector quantization for continuous speech recognition," in Proc. ICASSP, Minneapolis, MN, 1993, pp. 632-635.
    • (1993) Proc. ICASSP , pp. 632-635
    • Bahl, L.R.1    deSouza, P.V.2    Gopalakrishnan, P.S.3    Picheny, M.A.4
  • 9
    • 85135181226
    • Improvements in an HMM-based speech synthesiser
    • R. Donovan and P. Woodland, "Improvements in an HMM-based speech synthesiser," in Proc. Eurospeech, 1995, pp. 573-576.
    • (1995) Proc. Eurospeech , pp. 573-576
    • Donovan, R.1    Woodland, P.2
  • 10
    • 85133526552
    • Automatically Clustering Similar Units for Unit Selection in Speech Synthesis
    • A. W. Black and P. Taylor, "Automatically Clustering Similar Units for Unit Selection in Speech Synthesis," in Proc. Eurospeech, 1997, pp. 601-604.
    • (1997) Proc. Eurospeech , pp. 601-604
    • Black, A.W.1    Taylor, P.2
  • 11
    • 80051612889
    • A new distance measure for costing spectral discontinuities in concatenative speech synthesisers
    • Perthshire, U.K
    • R. E. Donovan, "A new distance measure for costing spectral discontinuities in concatenative speech synthesisers," in Proc. 4th ISCA Tutorial and Research Workshop on Speech Synthesis, Perthshire, U.K., 2001, pp. 59-62.
    • (2001) Proc. 4th ISCA Tutorial and Research Workshop on Speech Synthesis , pp. 59-62
    • Donovan, R.E.1
  • 12
    • 34047249700
    • Intrinsic phone durations are speaker-specific
    • Denver, CO
    • H. R. Pfitzinger, "Intrinsic phone durations are speaker-specific," in Proc. ICSLP, vol. 2, Denver, CO, 2002, pp. 1113-1116.
    • (2002) Proc. ICSLP , vol.2 , pp. 1113-1116
    • Pfitzinger, H.R.1
  • 13
    • 33745197584
    • Reconciling pronunciation differences between the front-end and back-end in the IBM speech synthesis system
    • Jeju, South Korea, Oct
    • W. Hamza, R. Bakis, and E. Eide, "Reconciling pronunciation differences between the front-end and back-end in the IBM speech synthesis system," in Proc. ICSLP, Jeju, South Korea, Oct. 2004, pp. 2561-2564.
    • (2004) Proc. ICSLP , pp. 2561-2564
    • Hamza, W.1    Bakis, R.2    Eide, E.3
  • 14
    • 85009274956
    • Data-driven segment preselection in the IBM trainable speech synthesis system
    • Denver, CO
    • W. Hamza and R. Donovan, "Data-driven segment preselection in the IBM trainable speech synthesis system," in Proc. ICSLP, Denver, CO, 2002, pp. 2609-2612.
    • (2002) Proc. ICSLP , pp. 2609-2612
    • Hamza, W.1    Donovan, R.2
  • 15
    • 0029765811
    • Unit selection in a concatenative speech synthesis system using a large speech database
    • Atlanta, GA
    • A. Hunt and A. Black, "Unit selection in a concatenative speech synthesis system using a large speech database," in Proc. ICASSP, Atlanta, GA, 1996, pp. 373-376.
    • (1996) Proc. ICASSP , pp. 373-376
    • Hunt, A.1    Black, A.2
  • 16
    • 34047274359
    • Re-defining intonation from selected units for nonuniform units based speech synthesis
    • Leuven, Belgium
    • B. Bozkurt, T. Dutoit, and V. Pagel, "Re-defining intonation from selected units for nonuniform units based speech synthesis," in Proc. SPS-IEEE Benelux Signal Process. Symp., Leuven, Belgium, 2002, pp. 141-144.
    • (2002) Proc. SPS-IEEE Benelux Signal Process. Symp , pp. 141-144
    • Bozkurt, B.1    Dutoit, T.2    Pagel, V.3
  • 18
    • 85009247888
    • Expressive speech synthesis: Using a concatenative synthesizer
    • Denver, CO
    • M. Bulut, S. Narayanan, and A. Syrdal, "Expressive speech synthesis: using a concatenative synthesizer," in Proc. ICSLP, Denver, CO, 2002, pp. 1265-1268.
    • (2002) Proc. ICSLP , pp. 1265-1268
    • Bulut, M.1    Narayanan, S.2    Syrdal, A.3
  • 19
    • 84966356293
    • Preservation, identification, and use of emotion in a text-to-speech system
    • Santa Monica, CA, Sep
    • E. Eide, "Preservation, identification, and use of emotion in a text-to-speech system," in Proc. IEEE Workshop on Speech Synthesis, Santa Monica, CA, Sep. 2002.
    • (2002) Proc. IEEE Workshop on Speech Synthesis
    • Eide, E.1
  • 21
    • 0030355540
    • Generating F0 contours from ToBI labels using linear regression
    • Philadelphia, PA
    • A. W. Black and A. J. Hunt, "Generating F0 contours from ToBI labels using linear regression," in Proc. ICSLP, Philadelphia, PA, 1996, pp. 1385-1388.
    • (1996) Proc. ICSLP , pp. 1385-1388
    • Black, A.W.1    Hunt, A.J.2
  • 22
    • 85030872484
    • Evaluation of prosodic transcription labeling reliability in the ToBI framework
    • Yokohama, Japan, Sep
    • J. F. Pitrelli, M. E. Beckman, and J. Hirschberg, "Evaluation of prosodic transcription labeling reliability in the ToBI framework," in Proc. ICSLP, vol. I, Yokohama, Japan, Sep. 1994, pp. 123-126.
    • (1994) Proc. ICSLP , vol.1 , pp. 123-126
    • Pitrelli, J.F.1    Beckman, M.E.2    Hirschberg, J.3
  • 23
    • 85009080611
    • Inter-transcriber reliability of ToBI prosodic labeling
    • Beijing, China
    • A. K. Syrdal and J. McGory, "Inter-transcriber reliability of ToBI prosodic labeling," in Proc. ICSLP, Beijing, China, 2000, pp. 235-238.
    • (2000) Proc. ICSLP , pp. 235-238
    • Syrdal, A.K.1    McGory, J.2
  • 25
    • 0343353984
    • Prosody recognition from speech utterances using acoustic and linguistic based models of prosodic events
    • A. Conkie, G. Riccardi, and R. C. Rose, "Prosody recognition from speech utterances using acoustic and linguistic based models of prosodic events," in Proc. Eurospeech, 1999, pp. 523-526.
    • (1999) Proc. Eurospeech , pp. 523-526
    • Conkie, A.1    Riccardi, G.2    Rose, R.C.3
  • 26
    • 21844471192
    • ToBI or Not ToBI
    • Aix-en-Provence, France
    • C. W. Wightman, "ToBI or Not ToBI," in Proc. Speech Prosody, Aix-en-Provence, France, pp. 25-29.
    • Proc. Speech Prosody , pp. 25-29
    • Wightman, C.W.1
  • 27
    • 0035156005
    • Automatic ToBI prediction and alignment to speed manual labeling of prosody
    • A. K. Syrdal, J. Hirschberg, J. McGory, and M. Beckman, "Automatic ToBI prediction and alignment to speed manual labeling of prosody," Speech Commun., vol. 33, no. 1-2, pp. 135-151, 2001.
    • (2001) Speech Commun , vol.33 , Issue.1-2 , pp. 135-151
    • Syrdal, A.K.1    Hirschberg, J.2    McGory, J.3    Beckman, M.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.