Volume 34, Issue 2, 2015, Pages 193-204

VDub: Modifying Face Video of Actors for Plausible Visual Alignment to a Dubbed Audio Track

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL LINGUISTICS; TRANSLATION (LANGUAGES);

EID: 84932116100     PISSN: 01677055     EISSN: 14678659     Source Type: Journal    
DOI: 10.1111/cgf.12552     Document Type: Conference Paper
Times cited: 160

References (50)
  • 1
    • 84887344163
    • Expressive visual text-to-speech using active appearance models
    • 3, 10
    • Anderson R., Stenger B., Wan V., Cipolla R.: Expressive visual text-to-speech using active appearance models. In Proc. CVPR (2013), pp. 3382-3389. 3, 10
    • (2013) Proc. CVPR , pp. 3382-3389
    • Anderson, R.1    Stenger, B.2    Wan, V.3    Cipolla, R.4
  • 3
    • 0030677313
    • Video Rewrite: Driving visual speech with audio
    • 3, 10, 11
    • Bregler C., Covell M., Slaney M.: Video Rewrite: Driving visual speech with audio. In ACM TOG (Proc. SIGGRAPH) (1997), pp. 353-360. 3, 10, 11
    • (1997) ACM TOG (Proc. SIGGRAPH) , pp. 353-360
    • Bregler, C.1    Covell, M.2    Slaney, M.3
  • 8
    • 4444257069
    • Praat, a system for doing phonetics by computer
    • 7
    • Boersma P., Weenink D.: Praat, a system for doing phonetics by computer. Glot International 5, 9/10 (2001), 341-345. 7
    • (2001) Glot International , vol.5 , Issue.9/10 , pp. 341-345
    • Boersma, P.1    Weenink, D.2
  • 9
    • 84880833516
    • Online modeling for realtime facial animation
    • 3
    • Bouaziz S., Wang Y., Pauly M.: Online modeling for realtime facial animation. In ACM TOG (Proc. SIGGRAPH) (2013), vol. 32, pp. 40:1-40:10. 3
    • (2013) ACM TOG (Proc. SIGGRAPH) , vol.32 , pp. 40:1-40:10
    • Bouaziz, S.1    Wang, Y.2    Pauly, M.3
  • 10
    • 35349009809
    • Transferable video-realistic speech animation
    • 3
    • Chang Y., Ezzat T.: Transferable video-realistic speech animation. In Proc. SCA (2005), pp. 29-31. 3
    • (2005) Proc. SCA , pp. 29-31
    • Chang, Y.1    Ezzat, T.2
  • 11
    • 85008058913
    • Real-time speech motion synthesis from recorded motions
    • 3
    • Cao Y., Faloutsos P., Kohler E., Pighin F.: Real-time speech motion synthesis from recorded motions. In Proc. SCA (2004), pp. 347-355. 3
    • (2004) Proc. SCA , pp. 347-355
    • Cao, Y.1    Faloutsos, P.2    Kohler, E.3    Pighin, F.4
  • 12
    • 84905743974
    • Displaced dynamic expression regression for real-time facial tracking and animation
    • 3
    • Cao C., Hou Q., Zhou K.: Displaced dynamic expression regression for real-time facial tracking and animation. In ACM TOG (Proc. SIGGRAPH) (2014), vol. 33, p. 43. 3
    • (2014) ACM TOG (Proc. SIGGRAPH) , vol.33 , pp. 43
    • Cao, C.1    Hou, Q.2    Zhou, K.3
  • 13
    • 84981334718
    • eFASE: Expressive facial animation synthesis and editing with phoneme-isomap controls
    • 3
    • Deng Z., Neumann U.: eFASE: expressive facial animation synthesis and editing with phoneme-isomap controls. In Proc. SCA (2006), pp. 251-260. 3
    • (2006) Proc. SCA , pp. 251-260
    • Deng, Z.1    Neumann, U.2
  • 16
    • 84863934238
    • A flexible and adaptive hyper-heuristic approach for (dynamic) capacitated vehicle routing problems
    • 6
    • Garrido P., Castro C.: A flexible and adaptive hyper-heuristic approach for (dynamic) capacitated vehicle routing problems. Fundamenta Informaticae 119, 1 (2012), 29-60. 6
    • (2012) Fundamenta Informaticae , vol.119 , Issue.1 , pp. 29-60
    • Garrido, P.1    Castro, C.2
  • 18
    • 84887853831
    • Reconstructing detailed dynamic face geometry from monocular video
    • 3, 4, 5, 7
    • Garrido P., Valgaerts L., Wu C., Theobalt C.: Reconstructing detailed dynamic face geometry from monocular video. In ACM TOG (Proc. SIGGRAPH Asia) (2013), vol. 32, pp. 158:1-158:10. 3, 4, 5, 7
    • (2013) ACM TOG (Proc. SIGGRAPH Asia) , vol.32 , pp. 158:1-158:10
    • Garrido, P.1    Valgaerts, L.2    Wu, C.3    Theobalt, C.4
  • 19
    • 84965489665
    • Speak my language: Current attitudes to television subtitling and dubbing
    • 2
    • Kilborn R.: Speak my language: Current attitudes to television subtitling and dubbing. Media Culture Society 15, 4 (1993), 641-660. 2
    • (1993) Media Culture Society , vol.15 , Issue.4 , pp. 641-660
    • Kilborn, R.1
  • 20
    • 84898663109
    • Data-driven speech animation synthesis focusing on realistic inside of the mouth
    • 3
    • Kawai M., Iwao T., Mima D., Maejima A., Morishima S.: Data-driven speech animation synthesis focusing on realistic inside of the mouth. Journal of Information Processing 22, 2 (2014), 401-409. 3
    • (2014) Journal of Information Processing , vol.22 , Issue.2 , pp. 401-409
    • Kawai, M.1    Iwao, T.2    Mima, D.3    Maejima, A.4    Morishima, S.5
  • 26
    • 80155129710
    • Realistic facial expression synthesis for an image-based talking head
    • 3, 10
    • Liu K., Ostermann J.: Realistic facial expression synthesis for an image-based talking head. In Proc. ICME (July 2011), pp. 1-6. 3, 10
    • (2011) Proc. ICME (July) , pp. 1-6
    • Liu, K.1    Ostermann, J.2
  • 27
    • 77749295421
    • Real-time prosody-driven synthesis of body language
    • 2
    • Levine S., Theobalt C., Koltun V.: Real-time prosody-driven synthesis of body language. In ACM TOG (Proc. SIGGRAPH Asia) (2009), vol. 28, pp. 172:1-172:10. 2
    • (2009) ACM TOG (Proc. SIGGRAPH Asia) , vol.28 , pp. 172:1-172:10
    • Levine, S.1    Theobalt, C.2    Koltun, V.3
  • 28
    • 84866661849
    • A data-driven approach for facial expression synthesis in video
    • 3
    • Li K., Xu F., Wang J., Dai Q., Liu Y.: A data-driven approach for facial expression synthesis in video. In Proc. CVPR (2012), pp. 57-64. 3
    • (2012) Proc. CVPR , pp. 57-64
    • Li, K.1    Xu, F.2    Wang, J.3    Dai, Q.4    Liu, Y.5
  • 29
    • 31344439475
    • Accurate visible speech synthesis based on concatenating variable length motion capture data
    • 3
    • Ma J., Cole R.A., Pellom B.L., Ward W., Wise B.: Accurate visible speech synthesis based on concatenating variable length motion capture data. IEEE TVCG 12, 2 (2006), 266-276. 3
    • (2006) IEEE TVCG , vol.12 , Issue.2 , pp. 266-276
    • Ma, J.1    Cole, R.A.2    Pellom, B.L.3    Ward, W.4    Wise, B.5
  • 30
    • 0017199877
    • Hearing lips and seeing voices
    • 2
    • McGurk H., MacDonald J.: Hearing lips and seeing voices. Nature 264, 5588 (1976), 746-748. 2
    • (1976) Nature , vol.264 , Issue.5588 , pp. 746-748
    • McGurk, H.1    MacDonald, J.2
  • 32
    • 0021864128
    • Visemes observed by the hearing-impaired and normal-hearing adult viewers
    • 2
    • Owens E., Blazek B.: Visemes observed by the hearing-impaired and normal-hearing adult viewers. Journal of Speech, Language, and Hearing Research 28 (1986), 381-393. 2
    • (1986) Journal of Speech, Language, and Hearing Research , vol.28 , pp. 381-393
    • Owens, E.1    Blazek, B.2
  • 33
    • 0033285078
    • Resynthesizing facial animation through 3D model-based tracking
    • 3
    • Pighin F., Szeliski R., Salesin D.: Resynthesizing facial animation through 3D model-based tracking. In Proc. CVPR (1999), pp. 143-150. 3
    • (1999) Proc. CVPR , pp. 143-150
    • Pighin, F.1    Szeliski, R.2    Salesin, D.3
  • 34
    • 2642557514
    • FaceSync: A linear operator for measuring synchronization of video facial images and audio tracks
    • 2
    • Slaney M., Covell M.: FaceSync: A linear operator for measuring synchronization of video facial images and audio tracks. In Proc. NIPS (2000), pp. 814-820. 2
    • (2000) Proc. NIPS , pp. 814-820
    • Slaney, M.1    Covell, M.2
  • 35
    • 56749093125
    • A generic framework for efficient 2-D and 3-D facial expression analogy
    • 3
    • Song M., Dong Z., Theobalt C., Wang H., Liu Z., Seidel H.P.: A generic framework for efficient 2-D and 3-D facial expression analogy. IEEE Trans. Multimedia 9, 7 (2007), 1384-1395. 3
    • (2007) IEEE Trans. Multimedia , vol.9 , Issue.7 , pp. 1384-1395
    • Song, M.1    Dong, Z.2    Theobalt, C.3    Wang, H.4    Liu, Z.5    Seidel, H.P.6
  • 36
    • 79851513710
    • Deformable model fitting by regularized landmark mean-shift
    • 3
    • Saragih J.M., Lucey S., Cohn J.F.: Deformable model fitting by regularized landmark mean-shift. IJCV 91, 2 (2011), 200-215. 3
    • (2011) IJCV , vol.91 , Issue.2 , pp. 200-215
    • Saragih, J.M.1    Lucey, S.2    Cohn, J.F.3
  • 37
    • 85121160602
    • Spacetime expression cloning for blendshapes
    • 11
    • Seol Y., Lewis J.P., Seo J., Choi B., Anjyo K., Noh J.: Spacetime expression cloning for blendshapes. ACM TOG 31, 2 (2012), 14. 11
    • (2012) ACM TOG , vol.31 , Issue.2 , pp. 14
    • Seol, Y.1    Lewis, J.P.2    Seo, J.3    Choi, B.4    Anjyo, K.5    Noh, J.6
  • 38
  • 39
    • 84981340969
    • Simulating speech with a physics-based facial muscle model
    • 3
    • Sifakis E., Selle A., Robinson-Mosher A.L., Fedkiw R.: Simulating speech with a physics-based facial muscle model. In Proc. SCA (2006), pp. 261-270. 3
    • (2006) Proc. SCA , pp. 261-270
    • Sifakis, E.1    Selle, A.2    Robinson-Mosher, A.L.3    Fedkiw, R.4
  • 45
    • 80051884182
    • Realtime performance-based facial animation
    • 3
    • Weise T., Bouaziz S., Li H., Pauly M.: Realtime performance-based facial animation. In ACM TOG (Proc. SIGGRAPH) (2011), vol. 30, pp. 77:1-77:10. 3
    • (2011) ACM TOG (Proc. SIGGRAPH) , vol.30 , pp. 77:1-77:10
    • Weise, T.1    Bouaziz, S.2    Li, H.3    Pauly, M.4
  • 46
    • 0025474465
    • Performance driven facial animation
    • 3
    • Williams L.: Performance driven facial animation. In ACM TOG (Proc. SIGGRAPH) (1990), vol. 24, pp. 235-242. 3
    • (1990) ACM TOG (Proc. SIGGRAPH) , vol.24 , pp. 235-242
    • Williams, L.1
  • 47
    • 84944130807
    • Vision based control of 3D facial animation
    • 3
    • Xiang-Chai J., Xiao J., Hodgins J.: Vision based control of 3D facial animation. In Proc. SCA (2003), pp. 193-206. 3
    • (2003) Proc. SCA , pp. 193-206
    • Xiang-Chai, J.1    Xiao, J.2    Hodgins, J.3
  • 50
    • 0032178592
    • Quantitative association of vocal-tract and facial behavior
    • 2, 3
    • Yehia H., Rubin P., Vatikiotis-Bateson E.: Quantitative association of vocal-tract and facial behavior. Speech Communication 26, 1-2 (1998), 23-43. 2, 3
    • (1998) Speech Communication , vol.26 , Issue.1-2 , pp. 23-43
    • Yehia, H.1    Rubin, P.2    Vatikiotis-Bateson, E.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.