SCOPUS 정보 검색 플랫폼

Computer Graphics Forum

Volumn 34, Issue 2, 2015, Pages 193-204

VDub: Modifying Face Video of Actors for Plausible Visual Alignment to a Dubbed Audio Track

(7) Garrido, P a Valgaerts, L a Sarmadi, H a Steiner, I b Varanasi, K c Perez P c Theobalt, C a

a MAX PLANCK INSTITUTE FOR INFORMATICS (Germany)

b SAARLAND UNIVERSITY (Germany)

c Technicolor (Germany)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL LINGUISTICS; TRANSLATION (LANGUAGES);

3D SHAPE MODEL; AUDIO ANALYSIS; COMPLEX PROCESSES; HIGH QUALITY; RETRIEVAL METHODS; TV PRODUCTION; VISUAL DISCOMFORT; VISUAL QUALITIES;

VISUAL LANGUAGES;

EID: 84932116100 PISSN: 01677055 EISSN: 14678659 Source Type: Journal
DOI: 10.1111/cgf.12552 Document Type: Conference Paper

Times cited : (160)

References (50)

1
- 84887344163
- Expressive visual text-to-speech using active appearance models
- 3, 10
- Anderson R., Stenger B., Wan V., Cipolla R.,: Expressive visual text-to-speech using active appearance models. In Proc. CVPR (2013), pp. 3382-3389. 3, 10
- (2013) Proc. CVPR , pp. 3382-3389
- Anderson, R.¹ Stenger, B.² Wan, V.³ Cipolla, R.⁴

2
- 0141615907
- Reanimating faces in images and video
- 3
- Blanz V., Basso C., Vetter T., Poggio T.,: Reanimating faces in images and video. CGF (Proc. EUROGRAPHICS) 22, 3 (2003), 641-650. 3
- (2003) CGF (Proc. EUROGRAPHICS) , vol.22 , Issue.3 , pp. 641-650
- Blanz, V.¹ Basso, C.² Vetter, T.³ Poggio, T.⁴

3
- 0030677313
- Video Rewrite: Driving visual speech with audio
- 3, 10, 11
- Bregler C., Covell M., Slaney M.,: Video Rewrite: Driving visual speech with audio. In ACM TOG (Proc. SIGGRAPH) (1997), pp. 353-360. 3, 10, 11
- (1997) ACM TOG (Proc. SIGGRAPH) , pp. 353-360
- Bregler, C.¹ Covell, M.² Slaney, M.³

4
- 84888031698
- Hyper-heuristics: A survey of the state of the art
- 6
- Burke E.K., Gendreau M., Hyde M., Kendall G., Ochoa G., Ozcan E., QU R.,: Hyper-heuristics: A survey of the state of the art. Journal of the Operational Research Society 64, 12 (2013), 1695-1724. 6
- (2013) Journal of the Operational Research Society , vol.64 , Issue.12 , pp. 1695-1724
- Burke, E.K.¹ Gendreau, M.² Hyde, M.³ Kendall, G.⁴ Ochoa, G.⁵ Ozcan, E.⁶ Qu, R.⁷

5
- 80051892421
- High-quality passive facial performance capture using anchor frames
- 3
- Beeler T., Hahn F., Bradley D., Bickel B., Beardsley P., Gotsman C., Sumner R.W., Gross M.,: High-quality passive facial performance capture using anchor frames. In ACM TOG (Proc. SIGGRAPH) (2011), vol. 30, pp. 75:1-75:10. 3
- (2011) ACM TOG (Proc. SIGGRAPH) , vol.30 , pp. 751-7510
- Beeler, T.¹ Hahn, F.² Bradley, D.³ Bickel, B.⁴ Beardsley, P.⁵ Gotsman, C.⁶ Sumner, R.W.⁷ Gross, M.⁸

6
- 78149247678
- Kluwer, 6
- Burke E., Hyde M., Kendall G., Ochoa G., OZCAN E., WOODWARD J.R.,: A Classification of Hyper-heuristic Approaches. Kluwer, 2010. 6
- (2010) A Classification of Hyper-heuristic Approaches
- Burke, E.¹ Hyde, M.² Kendall, G.³ Ochoa, G.⁴ Ozcan, E.⁵ Woodward, J.R.⁶

7
- 84937437186
- Voice puppetry
- 2, 11
- Brand M.,: Voice puppetry. In ACM TOG (Proc. SIGGRAPH) (1999), pp. 21-28. 2, 11
- (1999) ACM TOG (Proc. SIGGRAPH) , pp. 21-28
- Brand, M.¹

8
- 4444257069
- Praat, a system for doing phonetics by computer
- 7
- Boersma P., Weenink D.,: Praat, a system for doing phonetics by computer. Glot International 5, 9/10 (2001), 341-345. 7
- (2001) Glot International , vol.5 , Issue.910 , pp. 341-345
- Boersma, P.¹ Weenink, D.²

9
- 84880833516
- Online modeling for realtime facial animation
- 3
- Bouaziz S., Wang Y., Pauly M.,: Online modeling for realtime facial animation. In ACM TOG (Proc. SIGGRAPH) (2013), vol. 32, pp. 40:1-40:10. 3
- (2013) ACM TOG (Proc. SIGGRAPH) , vol.32 , pp. 401-4010
- Bouaziz, S.¹ Wang, Y.² Pauly, M.³

10
- 35349009809
- Transferable video-realistic speech animation
- 3
- Chang Y., Ezzat T.,: Transferable video-realistic speech animation. In Proc. SCA (2005), pp. 29-31. 3
- (2005) Proc. SCA , pp. 29-31
- Chang, Y.¹ Ezzat, T.²

11
- 85008058913
- Real-time speech motion synthesis from recorded motions
- 3
- Cao Y., Faloutsos P., Kohler E., Pighin F.,: Real-time speech motion synthesis from recorded motions. In Proc. SCA (2004), pp. 347-355. 3
- (2004) Proc. SCA , pp. 347-355
- Cao, Y.¹ Faloutsos, P.² Kohler, E.³ Pighin, F.⁴

12
- 84905743974
- Displaced dynamic expression regression for real-time facial tracking and animation
- 3
- Cao C., Hou Q., Zhou K.,: Displaced dynamic expression regression for real-time facial tracking and animation. In ACM TOG (Proc. SIGGRAPH) (2014), vol. 33, p. 43. 3
- (2014) ACM TOG (Proc. SIGGRAPH) , vol.33 , pp. 43
- Cao, C.¹ Hou, Q.² Zhou, K.³

13
- 84981334718
- EFASE: Expressive facial animation synthesis and editing with phoneme-isomap controls
- 3
- Deng Z., NEumann U.,: eFASE: expressive facial animation synthesis and editing with phoneme-isomap controls. In Proc. SCA (2006), pp. 251-260. 3
- (2006) Proc. SCA , pp. 251-260
- Deng, Z.¹ Neumann, U.²

14
- 82455171679
- Video face replacement
- 1130:10. 3, 10
- Dale K., Sunkavalli K., Johnson M.K., Vlasic D., Matusik W., Pfister H.,: Video face replacement. In ACM TOG (Proc. SIGGRAPH Asia) (2011), vol. 30, pp. 130:1130:10. 3, 10t
- (2011) ACM TOG (Proc. SIGGRAPH Asia) , vol.30 , pp. 130
- Dale, K.¹ Sunkavalli, K.² Johnson, M.K.³ Vlasic, D.⁴ Matusik, W.⁵ Pfister, H.⁶

15
- 0036989560
- Trainable video-realistic speech animation
- 3
- Ezzat T., Geiger G., Poggio T.,: Trainable video-realistic speech animation. In ACM TOG (Proc. SIGGRAPH) (2002), pp. 388-398. 3
- (2002) ACM TOG (Proc. SIGGRAPH) , pp. 388-398
- Ezzat, T.¹ Geiger, G.² Poggio, T.³

16
- 84863934238
- A flexible and adaptive hyper-heuristic approach for (dynamic) capacitated vehicle routing problems
- 6
- Garrido P., Castro C.,: A flexible and adaptive hyper-heuristic approach for (dynamic) capacitated vehicle routing problems. Fundamenta Informaticae 119, 1 (2012), 29-60. 6
- (2012) Fundamenta Informaticae , vol.119 , Issue.1 , pp. 29-60
- Garrido, P.¹ Castro, C.²

17
- 84911366471
- Automatic face reenactment
- 3, 10
- Garrido P., Valgaerts L., Rehmsen O., Thormaehlen T., Perez P., Theobalt C.,: Automatic face reenactment. In Proc. CVPR (2014). 3, 10
- (2014) Proc. CVPR
- Garrido, P.¹ Valgaerts, L.² Rehmsen, O.³ Thormaehlen, T.⁴ Perez, P.⁵ Theobalt, C.⁶

18
- 84887853831
- Reconstructing detailed dynamic face geometry from monocular video
- 3, 4, 5, 7
- Garrido P., Valgaerts L., WU C., Theobalt C.,: Reconstructing detailed dynamic face geometry from monocular video. In ACM TOG (Proc. SIGGRAPH Asia) (2013), vol. 32, pp. 158:1-158:10. 3, 4, 5, 7
- (2013) ACM TOG (Proc. SIGGRAPH Asia) , vol.32 , pp. 1581-15810
- Garrido, P.¹ Valgaerts, L.² Wu, C.³ Theobalt, C.⁴

19
- 84965489665
- Speak my language: Current attitudes to television subtitling and dubbing
- 2
- Kilborn R.,: Speak my language: Current attitudes to television subtitling and dubbing. Media Culture Society 15, 4 (1993), 641-660. 2
- (1993) Media Culture Society , vol.15 , Issue.4 , pp. 641-660
- Kilborn, R.¹

20
- 84898663109
- Data-driven speech animation synthesis focusing on realistic inside of the mouth
- 3
- Kawai M., Iwao T., Mima D., Maejima A., Morishima S.,: Data-driven speech animation synthesis focusing on realistic inside of the mouth. Journal of Information Processing 22, 2 (2014), 401-409. 3
- (2014) Journal of Information Processing , vol.22 , Issue.2 , pp. 401-409
- Kawai, M.¹ Iwao, T.² Mima, D.³ Maejima, A.⁴ Morishima, S.⁵

21
- 0141504400
- Visyl-lable based speech animation
- 3
- Kshirsagar S., Magnenat-Thalmann N.,: Visyl-lable based speech animation. In CGF (Proc. EUROGRAPHICS) (2003), vol. 22, pp. 632-640. 3
- (2003) CGF (Proc. EUROGRAPHICS) , vol.22 , pp. 632-640
- Kshirsagar, S.¹ Magnenat-Thalmann, N.²

22
- 78149315731
- Being John Malkovich
- 3
- Kemelmacher-Shlizerman I., Sankar A., Shechtman E., Seitz S.M.,: Being John Malkovich. In Proc. ECCV (2010), pp. 341-353. 3
- (2010) Proc. ECCV , pp. 341-353
- Kemelmacher-Shlizerman, I.¹ Sankar, A.² Shechtman, E.³ Seitz, S.M.⁴

23
- 84971340084
- Practice and theory of blendshape facial models
- 7
- Lewis J.P., Anjyo K., Rhee T., Zhang M., Pighin F., Deng Z.,: Practice and theory of blendshape facial models. In EUROGRAPHICS STAR report (2014), pp. 199-218. 7
- (2014) EUROGRAPHICS STAR report , pp. 199-218
- Lewis, J.P.¹ Anjyo, K.² Rhee, T.³ Zhang, M.⁴ Pighin, F.⁵ Deng, Z.⁶

24
- 85095220056
- Real-time analysis-synthesis and intelligibility of talking faces
- 2
- LE Goff B., Guiard-Marigny T., Cohen M., Benoit C.,: Real-time analysis-synthesis and intelligibility of talking faces. In ESCA/IEEE Workshop on Speech Synthesis (1994), pp. 53-56. 2
- (1994) ESCA/IEEE Workshop on Speech Synthesis , pp. 53-56
- Le Goff, B.¹ Guiard-Marigny, T.² Cohen, M.³ Benoit, C.⁴

25
- 84925972209
- Visual vowel and diphthong perception across speakers
- 2
- Lesner S.A., Kricos P.B.,: Visual vowel and diphthong perception across speakers. Journal of the Academy of Rehabilitative Audiology 14 (1981), 252-258. 2
- (1981) Journal of the Academy of Rehabilitative Audiology , vol.14 , pp. 252-258
- Lesner, S.A.¹ Kricos, P.B.²

26
- 80155129710
- Realistic facial expression synthesis for an image-based talking head
- 3, 10
- Liu K., Ostermann J.,: Realistic facial expression synthesis for an image-based talking head. In Proc. ICME (July 2011), pp. 1-6. 3, 10
- (2011) Proc. ICME (July) , pp. 1-6
- Liu, K.¹ Ostermann, J.²

27
- 77749295421
- Real-time prosody-driven synthesis of body language
- 2
- Levine S., Theobalt C., Koltun V.,: Real-time prosody-driven synthesis of body language. In ACM TOG (Proc. SIGGRAPH Asia) (2009), vol. 28, pp. 172:1-172:10. 2
- (2009) ACM TOG (Proc. SIGGRAPH Asia) , vol.28 , pp. 1721-17210
- Levine, S.¹ Theobalt, C.² Koltun, V.³

28
- 84866661849
- A data-driven approach for facial expression synthesis in video
- 3
- Li K., XU F., Wang J., Dai Q., Liu Y.,: A data-driven approach for facial expression synthesis in video. In Proc. CVPR (2012), pp. 57-64. 3
- (2012) Proc. CVPR , pp. 57-64
- Li, K.¹ Xu, F.² Wang, J.³ Dai, Q.⁴ Liu, Y.⁵

29
- 31344439475
- Accurate visible speech synthesis based on concatenating variable length motion capture data
- 3
- MA J., Cole R.A., Pellom B.L., Ward W., Wise B.,: Accurate visible speech synthesis based on concatenating variable length motion capture data. IEEE TVCG 12, 2 (2006), 266-276. 3
- (2006) IEEE TVCG , vol.12 , Issue.2 , pp. 266-276
- Ma, J.¹ Cole, R.A.² Pellom, B.L.³ Ward, W.⁴ Wise, B.⁵

30
- 0017199877
- Hearing lips and seeing voices
- 2
- MCGurk H., MacDonald J.,: Hearing lips and seeing voices. Nature 264, 5588 (1976), 746-748. 2
- (1976) Nature , vol.264 , Issue.5588 , pp. 746-748
- Mcgurk, H.¹ Macdonald, J.²

31
- 0035151733
- Expression cloning
- 3
- Noh J., Neumann U.,: Expression cloning. In ACM TOG (Proc. SIGGRAPH) (2001), pp. 277-288. 3
- (2001) ACM TOG (Proc. SIGGRAPH) , pp. 277-288
- Noh, J.¹ Neumann, U.²

32
- 0021864128
- Visemes observed by the hearing-impaired and normal-hearing adult viewers
- 2
- Owens E., Blazek B.,: Visemes observed by the hearing-impaired and normal-hearing adult viewers. Journal of Speech, Language, and Hearing Research 28 (1986), 381-393. 2
- (1986) Journal of Speech, Language, and Hearing Research , vol.28 , pp. 381-393
- Owens, E.¹ Blazek, B.²

33
- 0033285078
- Resynthesizing facial animation through 3D model-based tracking
- 3
- Pighin F., Szeliski R., Salesin D.,: Resynthesizing facial animation through 3D model-based tracking. In Proc. CVPR (1999), pp. 143-150. 3
- (1999) Proc. CVPR , pp. 143-150
- Pighin, F.¹ Szeliski, R.² Salesin, D.³

34
- 2642557514
- FaceSync: A linear operator for measuring synchronization of video facial images and audio tracks
- 2
- Slaney M., Covell M.,: FaceSync: A linear operator for measuring synchronization of video facial images and audio tracks. In Proc. NIPS (2000), pp. 814-820. 2
- (2000) Proc. NIPS , pp. 814-820
- Slaney, M.¹ Covell, M.²

35
- 56749093125
- A generic framework for efficient 2-D and 3-D facial expression analogy
- 3
- Song M., Dong Z., Theobalt C., Wang H., Liu Z., Seidel H.P.,: A generic framework for efficient 2-D and 3-D facial expression analogy. IEEE Trans. Multimedia 9, 7 (2007), 1384-1395. 3
- (2007) IEEE Trans. Multimedia , vol.9 , Issue.7 , pp. 1384-1395
- Song, M.¹ Dong, Z.² Theobalt, C.³ Wang, H.⁴ Liu, Z.⁵ Seidel, H.P.⁶

36
- 79851513710
- Deformable model fitting by regularized landmark mean-shift
- 3
- Saragih J.M., Lucey S., Cohn J.F.,: Deformable model fitting by regularized landmark mean-shift. IJCV 91, 2 (2011), 200-215. 3
- (2011) IJCV , vol.91 , Issue.2 , pp. 200-215
- Saragih, J.M.¹ Lucey, S.² Cohn, J.F.³

37
- 85121160602
- Spacetime expression cloning for blendshapes
- 11
- Seol Y., Lewis J.P., Seo J., Choi B., Anjyo K., Noh J.,: Spacetime expression cloning for blendshapes. ACM TOG 31, 2 (2012), 14. 11
- (2012) ACM TOG , vol.31 , Issue.2 , pp. 14
- Seol, Y.¹ Lewis, J.P.² Seo, J.³ Choi, B.⁴ Anjyo, K.⁵ Noh, J.⁶

38
- 0001048664
- Visual contribution to speech intelligibility in noise
- 2
- Sumby W., Pollack I.,: Visual contribution to speech intelligibility in noise. Journal of the Acoustical Society of America 26, 2 (1954), 212-215. 2
- (1954) Journal of the Acoustical Society of America , vol.26 , Issue.2 , pp. 212-215
- Sumby, W.¹ Pollack, I.²

39
- 84981340969
- Simulating speech with a physics-based facial muscle model
- 3
- Sifakis E., Selle A., Robinson-Mosher A.L., Fedkiw R.,: Simulating speech with a physics-based facial muscle model. In Proc. SCA (2006), pp. 261-270. 3
- (2006) Proc. SCA , pp. 261-270
- Sifakis, E.¹ Selle, A.² Robinson-Mosher, A.L.³ Fedkiw, R.⁴

40
- 0034449452
- Video textures
- 3
- Schödl A., Szeliski R., Salesin D., Essa I.A.,: Video textures. In ACM TOG (Proc. SIGGRAPH) (2000), pp. 489-498. 3
- (2000) ACM TOG (Proc. SIGGRAPH) , pp. 489-498
- Schödl, A.¹ Szeliski, R.² Salesin, D.³ Essa, I.A.⁴

41
- 0027128576
- Lipreading and audio-visual speech perception
- 2
- Summerfield Q.,: Lipreading and audio-visual speech perception. Philosophical Transactions of the Royal Society Series B: Biological Sciences 335, 1273 (1992), 71-78. 2
- (1992) Philosophical Transactions of the Royal Society Series B: Biological Sciences , vol.335 , Issue.1273 , pp. 71-78
- Summerfield, Q.¹

42
- 57649221616
- Real-time expression cloning using appearance models
- 3
- Theobald B.-J., Matthews I.A., Cohn J.F., Boker S.M.,: Real-time expression cloning using appearance models. In Proc. ICMI (2007), pp. 134-139. 3
- (2007) Proc. ICMI , pp. 134-139
- Theobald, B.-J.¹ Matthews, I.A.² Cohn, J.F.³ Boker, S.M.⁴

43
- 84988955843
- Dynamic units of visual speech
- 2, 3
- Taylor S.L., Mahler M., Theobald B.-J., Matthews I.,: Dynamic units of visual speech. In Proc. SCA (2012), pp. 275-284. 2, 3
- (2012) Proc. SCA , pp. 275-284
- Taylor, S.L.¹ Mahler, M.² Theobald, B.-J.³ Matthews, I.⁴

44
- 33646016842
- Face transfer with multilinear models
- 3
- Vlasic D., Brand M., Pfister H., Popovic J.,: Face transfer with multilinear models. In ACM TOG (Proc. SIGGRAPH) (2005), vol. 24, pp. 426-433. 3
- (2005) ACM TOG (Proc. SIGGRAPH) , vol.24 , pp. 426-433
- Vlasic, D.¹ Brand, M.² Pfister, H.³ Popovic, J.⁴

45
- 80051884182
- Realtime performance-based facial animation
- 3
- Weise T., BOUAZIZ S., Li H., Pauly M.,: Realtime performance-based facial animation. In ACM TOG (Proc. SIGGRAPH) (2011), vol. 30, pp. 77:1-77:10. 3
- (2011) ACM TOG (Proc. SIGGRAPH) , vol.30 , pp. 771-7710
- Weise, T.¹ Bouaziz, S.² Li, H.³ Pauly, M.⁴

46
- 0025474465
- Performance driven facial animation
- 3
- Williams L.,: Performance driven facial animation. In ACM TOG (Proc. SIGGRAPH) (1990), vol. 24, pp. 235-242. 3
- (1990) ACM TOG (Proc. SIGGRAPH) , vol.24 , pp. 235-242
- Williams, L.¹

47
- 84944130807
- Vision based control of 3D facial animation
- 3
- Xiang-Chai J., Xiao J., Hodgins J.,: Vision based control of 3D facial animation. In Proc. SCA (2003), pp. 193-206. 3
- (2003) Proc. SCA , pp. 193-206
- Xiang-Chai, J.¹ Xiao, J.² Hodgins, J.³

48
- 80051867136
- Video-based characters: Creating new human performances from a multiview video database
- 3
- XU F., Liu Y., Stoll C., Tompkin J., Bharaj G., Dai Q., Seidel H.-P., Kautz J., Theobalt C.,: Video-based characters: Creating new human performances from a multiview video database. In ACM TOG (Proc. SIGGRAPH) (2011), vol. 30, pp. 32:1-32:10. 3
- (2011) ACM TOG (Proc. SIGGRAPH) , vol.30 , pp. 321-3210
- Xu, F.¹ Liu, Y.² Stoll, C.³ Tompkin, J.⁴ Bharaj, G.⁵ Dai, Q.⁶ Seidel, H.-P.⁷ Kautz, J.⁸ Theobalt, C.⁹

49
- 0003822743
- Cambridge University Engineering Department, 7
- Young S., Evermann G., Gales M., Hain T., Kershaw D., Liu X.A., Moore G., Odell J., Ollason D., Povey D., Valtchev V., Woodland P.,: The HTK Book. Cambridge University Engineering Department, 2006. 7
- (2006) The HTK Book
- Young, S.¹ Evermann, G.² Gales, M.³ Hain, T.⁴ Kershaw, D.⁵ Liu, X.A.⁶ Moore, G.⁷ Odell, J.⁸ Ollason, D.⁹ Povey, D.¹⁰ Valtchev, V.¹¹ Woodland, P.¹²

50
- 0032178592
- Quantitative association of vocal-tract and facial behavior
- 2, 3
- Yehia H., Rubin P., Vatikiotis-Bateson E.,: Quantitative association of vocal-tract and facial behavior. Speech Communication 26, 1-2 (1998), 23-43. 2, 3
- (1998) Speech Communication , vol.26 , Issue.12 , pp. 23-43
- Yehia, H.¹ Rubin, P.² Vatikiotis-Bateson, E.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.