-
1
-
-
84971577321
-
-
others arXiv preprint arXiv: 1603.04467 (2016)
-
Martin Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Greg S Corrado, Andy Davis, Jeffrey Dean, Matthieu Devin, and others. 2016. Tensorflow: Large-scale machine learning on heterogeneous distributed systems. arXiv preprint arXiv: 1603.04467 (2016).
-
(2016)
Tensorflow: Large-scale Machine Learning on Heterogeneous Distributed Systems
-
-
Abadi, M.1
Agarwal, A.2
Barham, P.3
Brevdo, E.4
Chen, Z.5
Citro, C.6
Corrado, G.S.7
Davis, A.8
Dean, J.9
Devin, M.10
-
4
-
-
85030783704
-
-
others (2012)
-
Fabrice Bellard, M Niedermayer, and others. 2012. FFmpeg. Available from: http://ffmpeg.org (2012).
-
(2012)
-
-
Bellard, F.1
Niedermayer, M.2
-
5
-
-
84872221378
-
Tools for placing cuts and transitions in interview video
-
(2012)
-
Floraine Berthouzoz, Wilmot Li, and Maneesh Agrawala. 2012. Tools for placing cuts and transitions in interview video. ACM Trans. Graph. 31, 4 (2012), 67:1.
-
(2012)
ACM Trans. Graph.
, vol.31
, Issue.4
, pp. 61-67
-
-
Berthouzoz, F.1
Li, W.2
Agrawala, M.3
-
9
-
-
79551559765
-
A multiresolution spline with application to image mosaics
-
(1983)
-
Peter J Burt and Edward H Adelson. 1983. A multiresolution spline with application to image mosaics. ACM Transactions on Graphics (TOG) 2, 4 (1983), 217-236.
-
(1983)
ACM Transactions on Graphics (TOG)
, vol.2
, Issue.4
, pp. 217-236
-
-
Burt, P.J.1
Adelson, E.H.2
-
10
-
-
84980047577
-
Real-time facial animation with image-based dynamic avatars
-
(2016)
-
Chen Cao, Hongzhi Wu, Yanlin Weng, Tianjia Shao, and Kun Zhou. 2016. Real-time facial animation with image-based dynamic avatars. ACM Transactions on Graphics (TOG) 35, 4 (2016), 126.
-
(2016)
ACM Transactions on Graphics (TOG)
, vol.35
, Issue.4
, pp. 126
-
-
Cao, C.1
Wu, H.2
Weng, Y.3
Shao, T.4
Zhou, K.5
-
11
-
-
33645777234
-
Expressive speech-driven facial animation
-
(2005)
-
Yong Cao, Wen C Tien, Petros Faloutsos, and Frédéric Pighin. 2005. Expressive speech-driven facial animation. ACM Transactions on Graphics (TOG) 24, 4 (2005), 1283-1302.
-
(2005)
ACM Transactions on Graphics (TOG)
, vol.24
, Issue.4
, pp. 1283-1302
-
-
Cao, Y.1
Tien, W.C.2
Faloutsos, P.3
Pighin, F.4
-
12
-
-
0035363218
-
Active appearance models
-
others (2001)
-
Timothy F Cootes, Gareth J Edwards, Christopher J Taylor, and others. 2001. Active appearance models. IEEE Transactions on pattern analysis and machine intelligence 23, 6 (2001), 681-685.
-
(2001)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.23
, Issue.6
, pp. 681-685
-
-
Cootes, T.F.1
Edwards, G.J.2
Taylor, C.J.3
-
13
-
-
82455171679
-
Video face replacement
-
(2011)
-
Kevin Dale, Kalyan Sunkavalli, Micah K Johnson, Daniel Vlasic, Wojciech Matusik, and Hanspeter Pfister. 2011. Video face replacement. ACM Transactions on Graphics (TOG) 30, 6 (2011), 130.
-
(2011)
ACM Transactions on Graphics (TOG)
, vol.30
, Issue.6
, pp. 130
-
-
Dale, K.1
Sunkavalli, K.2
Johnson, M.K.3
Vlasic, D.4
Matusik, W.5
Pfister, H.6
-
15
-
-
84946029513
-
Photo-real talking head with deep bidirectional LSTM
-
IEEE
-
Bo Fan, Lijuan Wang, Frank K Soong, and Lei Xie. 2015a. Photo-real talking head with deep bidirectional LSTM. In 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 4884-4888.
-
(2015)
2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 4884-4888
-
-
Fan, B.1
Wang, L.2
Soong, F.K.3
Xie, L.4
-
16
-
-
84994256183
-
A deep bidirectional LSTM approach for video-realistic talking head
-
(2015)
-
Bo Fan, Lei Xie, Shan Yang, Lijuan Wang, and Frank K Soong. 2015b. A deep bidirectional LSTM approach for video-realistic talking head. Multimedia Tools and Applications (2015), 1-23.
-
(2015)
Multimedia Tools and Applications
, pp. 1-23
-
-
Fan, B.1
Xie, L.2
Yang, S.3
Wang, L.4
Soong, F.K.5
-
17
-
-
16244385915
-
Audio/visual mapping with cross-modal hidden Markov models
-
(2005)
-
Shengli Fu, Ricardo Gutierrez-Osuna, Anna Esposito, Praveen K Kakumanu, and Oscar N Garcia. 2005. Audio/visual mapping with cross-modal hidden Markov models. IEEE Transactions on Multimedia 7, 2 (2005), 243-252.
-
(2005)
IEEE Transactions on Multimedia
, vol.7
, Issue.2
, pp. 243-252
-
-
Fu, S.1
Gutierrez-Osuna, R.2
Esposito, A.3
Kakumanu, P.K.4
Garcia, O.N.5
-
19
-
-
84911366471
-
Automatic face reenactment
-
Pablo Garrido, Levi Valgaerts, Ole Rehmsen, Thorsten Thormahlen, Patrick Perez, and Christian Theobalt. 2014. Automatic face reenactment. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 4217-4224.
-
(2014)
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
, pp. 4217-4224
-
-
Garrido, P.1
Valgaerts, L.2
Rehmsen, O.3
Thormahlen, T.4
Perez, P.5
Theobalt, C.6
-
20
-
-
84932116100
-
Vdub: Modifying face video of actors for plausible visual alignment to a dubbed audio track
-
Wiley Online Library
-
Pablo Garrido, Levi Valgaerts, Hamid Sarmadi, Ingmar Steiner, Kiran Varanasi, Patrick Perez, and Christian Theobalt. 2015. Vdub: Modifying face video of actors for plausible visual alignment to a dubbed audio track. In Computer Graphics Forum, Vol. 34. Wiley Online Library, 193-204.
-
(2015)
Computer Graphics Forum
, vol.34
, pp. 193-204
-
-
Garrido, P.1
Valgaerts, L.2
Sarmadi, H.3
Steiner, I.4
Varanasi, K.5
Perez, P.6
Theobalt, C.7
-
23
-
-
27744588611
-
Framewise phoneme classification with bidirectional LSTM and other neural network architectures
-
(2005)
-
Alex Graves and Jürgen Schmidhuber. 2005. Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Networks 18, 5 (2005), 602-610.
-
(2005)
Neural Networks
, vol.18
, Issue.5
, pp. 602-610
-
-
Graves, A.1
Schmidhuber, J.2
-
24
-
-
0031573117
-
Long short-term memory
-
(1997)
-
Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation 9, 8 (1997), 1735-1780.
-
(1997)
Neural Computation
, vol.9
, Issue.8
, pp. 1735-1780
-
-
Hochreiter, S.1
Schmidhuber, J.2
-
25
-
-
84898663109
-
Data-driven speech animation synthesis focusing on realistic inside of the mouth
-
(2014)
-
Masahide Kawai, Tomoyori Iwao, Daisuke Mima, Akinobu Maejima, and Shigeo Morishima. 2014. Data-driven speech animation synthesis focusing on realistic inside of the mouth. Journal of information processing 22, 2 (2014), 401-409.
-
(2014)
Journal of Information Processing
, vol.22
, Issue.2
, pp. 401-409
-
-
Kawai, M.1
Iwao, T.2
Mima, D.3
Maejima, A.4
Morishima, S.5
-
28
-
-
70349425850
-
Dlib-ml: A machine learning toolkit
-
(2009)
-
Davis E. King. 2009. Dlib-ml: A Machine Learning Toolkit. Journal of Machine Learning Research 10 (2009), 1755-1758.
-
(2009)
Journal of Machine Learning Research
, vol.10
, pp. 1755-1758
-
-
King, D.E.1
-
30
-
-
84866661849
-
A data-driven approach for facial expression synthesis in video
-
IEEE
-
Kai Li, Feng Xu, Jue Wang, Qionghai Dai, and Yebin Liu. 2012. A data-driven approach for facial expression synthesis in video. In Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on. IEEE, 57-64.
-
(2012)
Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference On
, pp. 57-64
-
-
Li, K.1
Xu, F.2
Wang, J.3
Dai, Q.4
Liu, Y.5
-
32
-
-
51949118316
-
Human-assisted motion annotation
-
CVPR 2008. IEEE Conference on. IEEE
-
Ce Liu, William T Freeman, Edward H Adelson, and Yair Weiss. 2008. Human-assisted motion annotation. In Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on. IEEE, 1-8.
-
(2008)
Computer Vision and Pattern Recognition, 2008
, pp. 1-8
-
-
Liu, C.1
Freeman, W.T.2
Adelson, E.H.3
Weiss, Y.4
-
33
-
-
84879068811
-
Comprehensive many-to-many phoneme-to-viseme mapping and its application for concatenative visual speech synthesis
-
(2013)
-
Wesley Mattheyses, Lukas Latacz, and Werner Verhelst. 2013. Comprehensive many-to-many phoneme-to-viseme mapping and its application for concatenative visual speech synthesis. Speech Communication 55, 7 (2013), 857-876.
-
(2013)
Speech Communication
, vol.55
, Issue.7
, pp. 857-876
-
-
Mattheyses, W.1
Latacz, L.2
Verhelst, W.3
-
34
-
-
84912553696
-
Audiovisual speech synthesis: An overview of the state-of-the-art
-
(2015)
-
Wesley Mattheyses and Werner Verhelst. 2015. Audiovisual speech synthesis: An overview of the state-of-the-art. Speech Communication 66 (2015), 182-217.
-
(2015)
Speech Communication
, vol.66
, pp. 182-217
-
-
Mattheyses, W.1
Verhelst, W.2
-
35
-
-
33749242231
-
Hybrid images
-
(July 2006)
-
Aude Oliva, Antonio Torralba, and Philippe G. Schyns. 2006. Hybrid Images. ACM Trans. Graph. 25, 3 (July 2006), 527-532. DOI: https://doi.org/10.1145/1141911.1141919
-
(2006)
ACM Trans. Graph.
, vol.25
, Issue.3
, pp. 527-532
-
-
Oliva, A.1
Torralba, A.2
Schyns, P.G.3
-
36
-
-
85030788856
-
-
(2016)
-
Werner Robitza. 2016. ffmpeg-normalize. https://github.com/slhck/ffmpeg-normalize. (2016).
-
(2016)
Ffmpeg-Normalize
-
-
Robitza, W.1
-
38
-
-
0010069372
-
HMM-based text-to-audio-visual speech synthesis
-
Shinji Sako, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, and Tadashi Kitamura. 2000. HMM-based text-to-audio-visual speech synthesis. In INTERSPEECH. 25-28.
-
(2000)
INTERSPEECH
, pp. 25-28
-
-
Sako, S.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
45
-
-
24644514008
-
An image inpainting technique based on the fast marching method
-
(2004)
-
Alexandru Telea. 2004. An image inpainting technique based on the fast marching method. Journal of graphics tools 9, 1 (2004), 23-34.
-
(2004)
Journal of Graphics Tools
, vol.9
, Issue.1
, pp. 23-34
-
-
Telea, A.1
-
46
-
-
84995921764
-
Real-time expression transfer for facial reenactment
-
(2015)
-
Justus Thies, Michael Zollhöfer, Matthias Nießner, Levi Valgaerts, Marc Stamminger, and Christian Theobalt. 2015. Real-time expression transfer for facial reenactment. ACM Transactions on Graphics (TOG) 34, 6 (2015), 183.
-
(2015)
ACM Transactions on Graphics (TOG)
, vol.34
, Issue.6
, pp. 183
-
-
Thies, J.1
Zollhöfer, M.2
Nießner, M.3
Valgaerts, L.4
Stamminger, M.5
Theobalt, C.6
-
47
-
-
84986308411
-
Face2face: Real-time face capture and reenactment of rgb videos
-
IEEE (2016)
-
Justus Thies, Michael Zollhöfer, Marc Stamminger, Christian Theobalt, and Matthias Nießner. 2016. Face2face: Real-time face capture and reenactment of rgb videos. Proc. Computer Vision and Pattern Recognition (CVPR), IEEE 1 (2016).
-
(2016)
Proc. Computer Vision and Pattern Recognition (CVPR)
, vol.1
-
-
Thies, J.1
Zollhöfer, M.2
Stamminger, M.3
Theobalt, C.4
Nießner, M.5
-
48
-
-
85011070895
-
-
arXiv preprint arXiv:1609.03499 (2016)
-
Aaron van den Oord, Sander Dieleman, Heiga Zen, Karen Simonyan, Oriol Vinyals, Alex Graves, Nal Kalchbrenner, Andrew Senior, and Koray Kavukcuoglu. 2016. Wavenet: A generative model for raw audio. arXiv preprint arXiv:1609.03499 (2016).
-
(2016)
Wavenet: A Generative Model for Raw Audio
-
-
Van Den Oord, A.1
Dieleman, S.2
Zen, H.3
Simonyan, K.4
Vinyals, O.5
Graves, A.6
Kalchbrenner, N.7
Senior, A.8
Kavukcuoglu, K.9
-
49
-
-
33646016842
-
Face transfer with multilinear models
-
ACM
-
Daniel Vlasic, Matthew Brand, Hanspeter Pfister, and Jovan Popovic. 2005. Face transfer with multilinear models. In ACM Transactions on Graphics (TOG), Vol. 24. ACM, 426-433.
-
(2005)
ACM Transactions on Graphics (TOG)
, vol.24
, pp. 426-433
-
-
Vlasic, D.1
Brand, M.2
Pfister, H.3
Popovic, J.4
-
51
-
-
79959854294
-
Synthesizing photo-real talking head via trajectory-guided sample selection
-
Lijuan Wang, Xiaojun Qian, Wei Han, and Frank K Soong. 2010. Synthesizing photo-real talking head via trajectory-guided sample selection. In INTERSPEECH, Vol. 10. 446-449.
-
(2010)
INTERSPEECH
, vol.10
, pp. 446-449
-
-
Wang, L.1
Qian, X.2
Han, W.3
Soong, F.K.4
-
52
-
-
34147186624
-
A coupled HMM approach to video-realistic speech animation
-
(2007)
-
Lei Xie and Zhi-Qiang Liu. 2007a. A coupled HMM approach to video-realistic speech animation. Pattern Recognition 40, 8 (2007), 2325-2340.
-
(2007)
Pattern Recognition
, vol.40
, Issue.8
, pp. 2325-2340
-
-
Xie, L.1
Liu, Z.-Q.2
-
53
-
-
33947583073
-
Realistic mouth-synching for speech-driven talking face using articulatory modelling
-
(2007)
-
Lei Xie and Zhi-Qiang Liu. 2007b. Realistic mouth-synching for speech-driven talking face using articulatory modelling. IEEE Transactions on Multimedia 9, 3 (2007), 500-510.
-
(2007)
IEEE Transactions on Multimedia
, vol.9
, Issue.3
, pp. 500-510
-
-
Xie, L.1
Liu, Z.-Q.2
-
56
-
-
84906253471
-
A new language independent, photo-realistic talking head driven by voice only
-
Xinjian Zhang, Lijuan Wang, Gang Li, Frank Seide, and Frank K Soong. 2013. A new language independent, photo-realistic talking head driven by voice only. In INTERSPEECH. 2743-2747.
-
(2013)
INTERSPEECH
, pp. 2743-2747
-
-
Zhang, X.1
Wang, L.2
Li, G.3
Seide, F.4
Soong, F.K.5