SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 5237 LNCS, Issue , 2008, Pages 125-136

Multimodal unit selection for 2D audiovisual text-to-speech synthesis

(4) Mattheyses, Wesley a Latacz, Lukas a Verhelst, Werner a Sahli, Hichem a

a VRIJE UNIVERSITEIT BRUSSEL (Belgium)

Author keywords

[No Author keywords available]

Indexed keywords

AUDIOVISUAL; INTERACTIVE COMPUTER SYSTEMS; MACHINE LEARNING; SPEECH SYNTHESIS;

AUDIO-VISUAL CORRELATIONS; AUDIO-VISUAL DATABASE; AUDIO-VISUAL SPEECH; MULTIMODAL OUTPUT; PHOTO-REALISTIC; SYNTHESIS TECHNIQUES; SYNTHETIC SPEECH; TEXT-TO-SPEECH SYSTEM;

SPEECH COMMUNICATION;

EID: 57949116211 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-540-85853-9_12 Document Type: Conference Paper

Times cited : (10)

References (20)

1
- 0142216141
- Audiovisual speech synthesis
- Bailly, G., Brar, M., Elisei, F., Odisio, M.: Audiovisual speech synthesis. International Journal of Speech Technology 6, 331-346 (2003)
- (2003) International Journal of Speech Technology , vol.6 , pp. 331-346
- Bailly, G.¹ Brar, M.² Elisei, F.³ Odisio, M.⁴

2
- 0030366485
- An Investigation into the Generation of Mouth Shapes for a Talking Head
- Breen. A.P., Bowers. E., Welsh, W.: An Investigation into the Generation of Mouth Shapes for a Talking Head. In: International Conference on Spoken Language Processing, vol. 4, pp. 2159-2162 (1996)
- (1996) International Conference on Spoken Language Processing , vol.4 , pp. 2159-2162
- Breen, A.P.¹ Bowers, E.² Welsh, W.³

3
- 0030677313
- Video Rewrite: Driving Visual Speech with Audio
- Bregler, C., Covell, M., Slaney, M.: Video Rewrite: Driving Visual Speech with Audio. In: Association for Computing Machinery's Special Interest Group on Graphics and Interactive Techniques, pp. 353-360 (1997)
- (1997) Association for Computing Machinery's Special Interest Group on Graphics and Interactive Techniques , pp. 353-360
- Bregler, C.¹ Covell, M.² Slaney, M.³

4
- 84872004031
- Sample-Based Synthesis of Photo-Realistic Talking Heads
- Cosatto, E., Graf, H.P.: Sample-Based Synthesis of Photo-Realistic Talking Heads. Computer Animation, 103-110 (1998)
- (1998) Computer Animation , vol.103-110
- Cosatto, E.¹ Graf, H.P.²

5
- 0034271782
- Photo-realistic talking-heads from image samples
- Cosatto, E., Graf, H.P.: Photo-realistic talking-heads from image samples. IEEE Transactions on multimedia 2, 152-163 (2000)
- (2000) IEEE Transactions on multimedia , vol.2 , pp. 152-163
- Cosatto, E.¹ Graf, H.P.²

6
- 0034517331
- Audio-Visual Unit Selection for the Synthesis of Photo-Realistic Talking-Heads
- Cosatto, E., Potamianos, G., Graf, H.P.: Audio-Visual Unit Selection for the Synthesis of Photo-Realistic Talking-Heads. International Conference on Multimedia, and Expo, pp. 619-622 (2000)
- (2000) International Conference on Multimedia, and Expo , pp. 619-622
- Cosatto, E.¹ Potamianos, G.² Graf, H.P.³

7
- 57949097441
- Visual Speech Synthesis by Morphing Visemes (MikeTalk). MIT AI Lab
- Ezzat, T., Poggio, T.: Visual Speech Synthesis by Morphing Visemes (MikeTalk). MIT AI Lab, A.I Memo 1658 (1999)
- (1999) A.I Memo , vol.1658
- Ezzat, T.¹ Poggio, T.²

8
- 0036989560
- Association for Computing Machinery's Special Interest Group on Graphics and Interactive Techniques
- Ezzat, T., Geiger, G., Poggio, T.: Trainable videorealistic speech animation. Association for Computing Machinery's Special Interest Group on Graphics and Interactive Techniques 21, 388-398 (2002)
- (2002) Trainable videorealistic speech animation , vol.21 , pp. 388-398
- Ezzat, T.¹ Geiger, G.² Poggio, T.³

9
- 79951787154
- Joint Audio-Visual Units Selection - The Javus Speech Synthesizer
- Fagel, S.: Joint Audio-Visual Units Selection - The Javus Speech Synthesizer. In: International Conference on Speech and Computer (2006)
- (2006) International Conference on Speech and Computer
- Fagel, S.¹

10
- 57949092690
- Text-to-Audio Visual Speech Synthesizer
- Goyal, U.K., Kapoor, A., Kalra, P.: Text-to-Audio Visual Speech Synthesizer. Virtual Worlds, 256-269 (2000)
- (2000) Virtual Worlds , pp. 256-269
- Goyal, U.K.¹ Kapoor, A.² Kalra, P.³

11
- 85133343575
- Speech Intelligibility Derived From Asynchrounous Processing of Auditory-Visual Information
- Grant, K.W., Greenberg, S.: Speech Intelligibility Derived From Asynchrounous Processing of Auditory-Visual Information. In: Workshop on Audio-Visual Speech Processing, pp. 132-137 (2001)
- (2001) Workshop on Audio-Visual Speech Processing , pp. 132-137
- Grant, K.W.¹ Greenberg, S.²

12
- 0029765811
- Unit selection in a concatenative speech synthesis system using a large speech database
- Hunt, A., Black, A.: Unit selection in a concatenative speech synthesis system using a large speech database. In: International Conference on Acoustics, Speech and Signal Processing, pp. 373-376 (1996)
- (1996) International Conference on Acoustics, Speech and Signal Processing , pp. 373-376
- Hunt, A.¹ Black, A.²

13
- 67650591682
- NeXTeNS: A New Open Source Text-to-speech System for Dutch
- Kerkhoff, J., Marsi, E.: NeXTeNS: a New Open Source Text-to-speech System for Dutch. In: 13th meeting of Computational Linguistics in the Netherlands (2002)
- (2002) 13th meeting of Computational Linguistics in the Netherlands
- Kerkhoff, J.¹ Marsi, E.²

14
- 84929634396
- Unit Selection Synthesis Using Long NonUniform Units and Phoneme Identity Matching
- Latacz, L., Kong, Y., Verhelst, W.: Unit Selection Synthesis Using Long NonUniform Units and Phoneme Identity Matching. In: 6th ISCA Workshop on Speech Synthesis, pp. 270-275 (2007)
- (2007) 6th ISCA Workshop on Speech Synthesis , pp. 270-275
- Latacz, L.¹ Kong, Y.² Verhelst, W.³

15
- 57949109158
- Flemish Voice for the Nextens Text-To-Speech System
- Mattheyses, W., Latacz, L., Kong, Y.O., Verhelst, W.: Flemish Voice for the Nextens Text-To-Speech System. In: Fifth Slovenian and First International Language Technologies Conference (2006)
- (2006) Fifth Slovenian and First International Language Technologies Conference
- Mattheyses, W.¹ Latacz, L.² Kong, Y.O.³ Verhelst, W.⁴

16
- 0017199877
- Hearing lips and seeing voices
- McGurk, H., MacDonald, J.: Hearing lips and seeing voices. Nature 264, 746-748 (1976)
- (1976) Nature , vol.264 , pp. 746-748
- McGurk, H.¹ MacDonald, J.²

17
- 0025543906
- Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
- Moulines, E., Charpentier, F.: Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones. Speech Communication 9, 453-467 (1990)
- (1990) Speech Communication , vol.9 , pp. 453-467
- Moulines, E.¹ Charpentier, F.²

18
- 0033336969
- Users Evaluation: Synthetic talking faces for interactive services
- Pandzic, I., Ostermann, J., Milien, D.: Users Evaluation: Synthetic talking faces for interactive services. The Visual Computer 15, 2330-2340 (1999)
- (1999) The Visual Computer , vol.15 , pp. 2330-2340
- Pandzic, I.¹ Ostermann, J.² Milien, D.³

19
- 10444256499
- Near-videorealistic synthetic talking faces: Implementation and evaluation
- Theobald, B.J., Bangham, J.A., Matthews, I.A., Cawley, G.C.: Near-videorealistic synthetic talking faces: implementation and evaluation. Speech Communication 44, 127-140 (2004)
- (2004) Speech Communication , vol.44 , pp. 127-140
- Theobald, B.J.¹ Bangham, J.A.² Matthews, I.A.³ Cawley, G.C.⁴

20
- 0003361474
- Digital image warping
- Los Alamitos
- Wolberg, G.: Digital image warping. IEEE Computer Society Press, Los Alamitos (1990)
- (1990) IEEE Computer Society Press
- Wolberg, G.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.