메뉴 건너뛰기




Volumn 5237 LNCS, Issue , 2008, Pages 125-136

Multimodal unit selection for 2D audiovisual text-to-speech synthesis

Author keywords

[No Author keywords available]

Indexed keywords

AUDIOVISUAL; INTERACTIVE COMPUTER SYSTEMS; MACHINE LEARNING; SPEECH SYNTHESIS;

EID: 57949116211     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-540-85853-9_12     Document Type: Conference Paper
Times cited : (10)

References (20)
  • 4
    • 84872004031 scopus 로고    scopus 로고
    • Sample-Based Synthesis of Photo-Realistic Talking Heads
    • Cosatto, E., Graf, H.P.: Sample-Based Synthesis of Photo-Realistic Talking Heads. Computer Animation, 103-110 (1998)
    • (1998) Computer Animation , vol.103-110
    • Cosatto, E.1    Graf, H.P.2
  • 5
    • 0034271782 scopus 로고    scopus 로고
    • Photo-realistic talking-heads from image samples
    • Cosatto, E., Graf, H.P.: Photo-realistic talking-heads from image samples. IEEE Transactions on multimedia 2, 152-163 (2000)
    • (2000) IEEE Transactions on multimedia , vol.2 , pp. 152-163
    • Cosatto, E.1    Graf, H.P.2
  • 7
    • 57949097441 scopus 로고    scopus 로고
    • Visual Speech Synthesis by Morphing Visemes (MikeTalk). MIT AI Lab
    • Ezzat, T., Poggio, T.: Visual Speech Synthesis by Morphing Visemes (MikeTalk). MIT AI Lab, A.I Memo 1658 (1999)
    • (1999) A.I Memo , vol.1658
    • Ezzat, T.1    Poggio, T.2
  • 8
    • 0036989560 scopus 로고    scopus 로고
    • Association for Computing Machinery's Special Interest Group on Graphics and Interactive Techniques
    • Ezzat, T., Geiger, G., Poggio, T.: Trainable videorealistic speech animation. Association for Computing Machinery's Special Interest Group on Graphics and Interactive Techniques 21, 388-398 (2002)
    • (2002) Trainable videorealistic speech animation , vol.21 , pp. 388-398
    • Ezzat, T.1    Geiger, G.2    Poggio, T.3
  • 10
    • 57949092690 scopus 로고    scopus 로고
    • Text-to-Audio Visual Speech Synthesizer
    • Goyal, U.K., Kapoor, A., Kalra, P.: Text-to-Audio Visual Speech Synthesizer. Virtual Worlds, 256-269 (2000)
    • (2000) Virtual Worlds , pp. 256-269
    • Goyal, U.K.1    Kapoor, A.2    Kalra, P.3
  • 11
    • 85133343575 scopus 로고    scopus 로고
    • Speech Intelligibility Derived From Asynchrounous Processing of Auditory-Visual Information
    • Grant, K.W., Greenberg, S.: Speech Intelligibility Derived From Asynchrounous Processing of Auditory-Visual Information. In: Workshop on Audio-Visual Speech Processing, pp. 132-137 (2001)
    • (2001) Workshop on Audio-Visual Speech Processing , pp. 132-137
    • Grant, K.W.1    Greenberg, S.2
  • 14
    • 84929634396 scopus 로고    scopus 로고
    • Unit Selection Synthesis Using Long NonUniform Units and Phoneme Identity Matching
    • Latacz, L., Kong, Y., Verhelst, W.: Unit Selection Synthesis Using Long NonUniform Units and Phoneme Identity Matching. In: 6th ISCA Workshop on Speech Synthesis, pp. 270-275 (2007)
    • (2007) 6th ISCA Workshop on Speech Synthesis , pp. 270-275
    • Latacz, L.1    Kong, Y.2    Verhelst, W.3
  • 16
    • 0017199877 scopus 로고
    • Hearing lips and seeing voices
    • McGurk, H., MacDonald, J.: Hearing lips and seeing voices. Nature 264, 746-748 (1976)
    • (1976) Nature , vol.264 , pp. 746-748
    • McGurk, H.1    MacDonald, J.2
  • 17
    • 0025543906 scopus 로고
    • Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
    • Moulines, E., Charpentier, F.: Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones. Speech Communication 9, 453-467 (1990)
    • (1990) Speech Communication , vol.9 , pp. 453-467
    • Moulines, E.1    Charpentier, F.2
  • 18
    • 0033336969 scopus 로고    scopus 로고
    • Users Evaluation: Synthetic talking faces for interactive services
    • Pandzic, I., Ostermann, J., Milien, D.: Users Evaluation: Synthetic talking faces for interactive services. The Visual Computer 15, 2330-2340 (1999)
    • (1999) The Visual Computer , vol.15 , pp. 2330-2340
    • Pandzic, I.1    Ostermann, J.2    Milien, D.3
  • 19
    • 10444256499 scopus 로고    scopus 로고
    • Near-videorealistic synthetic talking faces: Implementation and evaluation
    • Theobald, B.J., Bangham, J.A., Matthews, I.A., Cawley, G.C.: Near-videorealistic synthetic talking faces: implementation and evaluation. Speech Communication 44, 127-140 (2004)
    • (2004) Speech Communication , vol.44 , pp. 127-140
    • Theobald, B.J.1    Bangham, J.A.2    Matthews, I.A.3    Cawley, G.C.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.