SCOPUS 정보 검색 플랫폼

NAACL HLT 2015 - 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference

Volumn , Issue , 2015, Pages 1494-1504

Translating videos to natural language using deep recurrent neural networks

(6) Venugopalan, Subhashini a Xu, Huijuan b Donahue, Jeff c Rohrbach, Marcus c Mooney, Raymond a Saenko, Kate b

a UNIVERSITY OF TEXAS AT AUSTIN (United States)

b UNIVERSITY OF MASSACHUSETTS LOWELL (United States)

c INTERNATIONAL COMPUTER SCIENCE INSTITUTE (United States)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL LINGUISTICS; RECURRENT NEURAL NETWORKS; TRANSLATION (LANGUAGES);

HUMAN EVALUATION; LANGUAGE GENERATION; LARGE VOCABULARY; NATURAL LANGUAGES; PREDICTION ACCURACY; STATIC IMAGES; SYMBOL GROUNDING PROBLEM; VIDEO DATASETS;

DEEP NEURAL NETWORKS;

EID: 84959876769 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.3115/v1/n15-1173 Document Type: Conference Paper

Times cited : (526)

References (48)

1
- 80052886947
- Generating image descriptions using dependency relational patterns
- Ahmet Aker and Robert Gaizauskas. 2010. Generating image descriptions using dependency relational patterns. In Association for Computational Linguistics (ACL).
- (2010) Association for Computational Linguistics (ACL)
- Aker, A.¹ Gaizauskas, R.²

2
- 77951155435
- Video2text: Learning to annotate video content
- H. Aradhye, G. Toderici, and J. Yagnik. 2009. Video2text: Learning to annotate video content. In IEEE International Conference on Data Mining Workshops (ICDMW).
- (2009) IEEE International Conference on Data Mining Workshops (ICDMW)
- Aradhye, H.¹ Toderici, G.² Yagnik, J.³

3
- 85116156579
- METEOR: An automatic metric for MT evaluation with improved correlation with human judgments
- Satanjeev Banerjee and Alon Lavie. 2005. METEOR: An automatic metric for MT evaluation with improved correlation with human judgments. In Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization.
- (2005) Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation And/or Summarization
- Banerjee, S.¹ Lavie, A.²

4
- 84885996388
- Video in sentences out
- Andrei Barbu, Alexander Bridge, Zachary Burchill, Dan Coroian, Sven Dickinson, Sanja Fidler, Aaron Michaux, Sam Mussman, Siddharth Narayanaswamy, Dhaval Salvi, Lara Schmidt, Jiangnan Shangguan, Jeffrey Mark Siskind, Jarrell Waggoner, Song Wang, Jinlian Wei, Yifan Yin, and Zhiqi Zhang. 2012. Video in sentences out. In Association for Uncertainty in Artificial Intelligence (UAI).
- (2012) Association for Uncertainty in Artificial Intelligence (UAI)
- Barbu, A.¹ Bridge, A.² Burchill, Z.³ Coroian, D.⁴ Dickinson, S.⁵ Fidler, S.⁶ Michaux, A.⁷ Mussman, S.⁸ Narayanaswamy, S.⁹ Salvi, D.¹⁰ Schmidt, L.¹¹ Shangguan, J.¹² Mark Siskind, J.¹³ Waggoner, J.¹⁴ Wang, S.¹⁵ Wei, J.¹⁶ Yin, Y.¹⁷ Zhang, Z.¹⁸

5
- 84859089502
- Collecting highly parallel data for paraphrase evaluation
- David L. Chen and William B. Dolan. 2011. Collecting highly parallel data for paraphrase evaluation. In Association for Computational Linguistics (ACL).
- (2011) Association for Computational Linguistics (ACL)
- Chen, D.L.¹ Dolan, W.B.²

6
- 84943799837
- arXiv preprint arXiv:1409.1259
- Kyunghyun Cho, Bart van Merriënboer, Dzmitry Bahdanau, and Yoshua Bengio. 2014. On the properties of neural machine translation: Encoder-decoder approaches. arXiv preprint arXiv:1409.1259.
- (2014) On the Properties of Neural Machine Translation: Encoder-decoder Approaches
- Cho, K.¹ Van Merriënboer, B.² Bahdanau, D.³ Bengio, Y.⁴

7
- 84874280480
- Translating related words to videos and back through latent topics
- P. Das, R. K. Srihari, and J. J. Corso. 2013a. Translating related words to videos and back through latent topics. In Proceedings of Sixth ACM International Conference on Web Search and Data Mining (WSDM).
- (2013) Proceedings of Sixth ACM International Conference on Web Search and Data Mining (WSDM)
- Das, P.¹ Srihari, R.K.² Corso, J.J.³

8
- 84887345951
- A thousand frames in just a few words: Lingual description of videos through latent topics and sparse object stitching
- P. Das, C. Xu, R. F. Doell, and J. J. Corso. 2013b. A thousand frames in just a few words: Lingual description of videos through latent topics and sparse object stitching. In Conference on Computer Vision and Pattern Recognition (CVPR).
- (2013) Conference on Computer Vision and Pattern Recognition (CVPR)
- Das, P.¹ Xu, C.² Doell, R.F.³ Corso, J.J.⁴

9
- 84864139941
- Beyond audio and video retrieval: Towards multimedia summarization
- ACM
- D. Ding, F. Metze, S. Rawat, P.F. Schulam, S. Burger, E. Younessian, L. Bao, M.G. Christel, and A. Hauptmann. 2012. Beyond audio and video retrieval: towards multimedia summarization. In Proceedings of the 2nd ACM International Conference on Multimedia Retrieval (ICMR). ACM.
- (2012) Proceedings of the 2nd ACM International Conference on Multimedia Retrieval (ICMR)
- Ding, D.¹ Metze, F.² Rawat, S.³ Schulam, P.F.⁴ Burger, S.⁵ Younessian, E.⁶ Bao, L.⁷ Christel, M.G.⁸ Hauptmann, A.⁹

10
- 84904482223
- arXiv preprint arXiv:1310.1531
- Jeff Donahue, Yangqing Jia, Oriol Vinyals, Judy Hoffman, Ning Zhang, Eric Tzeng, and Trevor Darrell. 2013. Decaf: A deep convolutional activation feature for generic visual recognition. arXiv preprint arXiv:1310.1531.
- (2013) Decaf: A Deep Convolutional Activation Feature for Generic Visual Recognition
- Donahue, J.¹ Jia, Y.² Vinyals, O.³ Hoffman, J.⁴ Zhang, N.⁵ Tzeng, E.⁶ Darrell, T.⁷

11
- 84946802546
- Long-term recurrent convolutional networks for visual recognition and description
- abs/1411.4389
- Jeff Donahue, Lisa Anne Hendricks, Sergio Guadarrama, Marcus Rohrbach, Subhashini Venugopalan, Kate Saenko, and Trevor Darrell. 2014. Long-term recurrent convolutional networks for visual recognition and description. CoRR, abs/1411.4389.
- (2014) CoRR
- Donahue, J.¹ Anne Hendricks, L.² Guadarrama, S.³ Rohrbach, M.⁴ Venugopalan, S.⁵ Saenko, K.⁶ Darrell, T.⁷

12
- 84906928552
- Comparing automatic evaluation measures for image description
- Desmond Elliott and Frank Keller. 2014. Comparing automatic evaluation measures for image description. In Association for Computational Linguistics (ACL).
- (2014) Association for Computational Linguistics (ACL)
- Elliott, D.¹ Keller, F.²

13
- 84946802531
- From captions to visual concepts and back
- abs/1411.4952
- Hao Fang, Saurabh Gupta, Forrest N. Iandola, Rupesh Srivastava, Li Deng, Piotr Dollár, Jianfeng Gao, Xiaodong He, Margaret Mitchell, John C. Platt, C. Lawrence Zitnick, and Geoffrey Zweig. 2014. From captions to visual concepts and back. CoRR, abs/1411.4952.
- (2014) CoRR
- Fang, H.¹ Gupta, S.² Iandola, F.N.³ Srivastava, R.⁴ Deng, L.⁵ Dollár, P.⁶ Gao, J.⁷ He, X.⁸ Mitchell, M.⁹ Platt, J.C.¹⁰ Zitnick, C.L.¹¹ Zweig, G.¹²

14
- 80052017343
- Every picture tells a story: Generating sentences from images
- A. Farhadi, M. Hejrati, M. Sadeghi, P. Young, C. Rashtchian, J. Hockenmaier, and D. Forsyth. 2010. Every picture tells a story: Generating sentences from images. European Conference on Computer Vision (ECCV).
- (2010) European Conference on Computer Vision (ECCV)
- Farhadi, A.¹ Hejrati, M.² Sadeghi, M.³ Young, P.⁴ Rashtchian, C.⁵ Hockenmaier, J.⁶ Forsyth, D.⁷

15
- 84919832465
- Towards end-to-end speech recognition with recurrent neural networks
- Alex Graves and Navdeep Jaitly. 2014. Towards end-to-end speech recognition with recurrent neural networks. In Proceedings of the 31st International Conference on Machine Learning (ICML-14).
- (2014) Proceedings of the 31st International Conference on Machine Learning (ICML-14)
- Graves, A.¹ Jaitly, N.²

16
- 84898773262
- Youtube2text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition
- December
- Sergio Guadarrama, Niveda Krishnamoorthy, Girish Malkarnenkar, Subhashini Venugopalan, Raymond Mooney, Trevor Darrell, and Kate Saenko. 2013. Youtube2text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition. In IEEE International Conference on Computer Vision (ICCV), December.
- (2013) IEEE International Conference on Computer Vision (ICCV)
- Guadarrama, S.¹ Krishnamoorthy, N.² Malkarnenkar, G.³ Venugopalan, S.⁴ Mooney, R.⁵ Darrell, T.⁶ Saenko, K.⁷

17
- 0031573117
- Long short-term memory
- Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation, 9(8).
- (1997) Neural Computation , vol.9 , Issue.8
- Hochreiter, S.¹ Schmidhuber, J.²

18
- 0041914606
- Sepp Hochreiter, Yoshua Bengio, Paolo Frasconi, and Jürgen Schmidhuber. 2001. Gradient flow in recurrent nets: the difficulty of learning long-term dependencies.
- (2001) Gradient Flow in Recurrent Nets: The Difficulty of Learning Long-term Dependencies
- Hochreiter, S.¹ Bengio, Y.² Frasconi, P.³ Schmidhuber, J.⁴

19
- 84906494296
- From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions
- Peter Young Alice Lai Micah Hodosh and Julia Hockenmaier. 2014. From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions. Transactions of the Association for Computational Linguistics (TACL).
- (2014) Transactions of the Association for Computational Linguistics (TACL)
- Hodosh, P.Y.A.L.M.¹ Hockenmaier, J.²

20
- 84880804795
- A multi-modal clustering method for web videos
- Springer
- Haiqi Huang, Yueming Lu, Fangwei Zhang, and Songlin Sun. 2013. A multi-modal clustering method for web videos. In Trustworthy Computing and Services. Springer.
- (2013) Trustworthy Computing and Services
- Huang, H.¹ Lu, Y.² Zhang, F.³ Sun, S.⁴

21
- 84913555165
- arXiv preprint arXiv:1408.5093
- Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross Girshick, Sergio Guadarrama, and Trevor Darrell. 2014. Caffe: Convolutional architecture for fast feature embedding. arXiv preprint arXiv:1408.5093.
- (2014) Caffe: Convolutional Architecture for Fast Feature Embedding
- Jia, Y.¹ Shelhamer, E.² Donahue, J.³ Karayev, S.⁴ Long, J.⁵ Girshick, R.⁶ Guadarrama, S.⁷ Darrell, T.⁸

22
- 84937843643
- Deep fragment embeddings for bidirectional image sentence mapping
- Andrej Karpathy, Armand Joulin, and Li Fei-Fei. 2014. Deep fragment embeddings for bidirectional image sentence mapping. Advances in Neural Information Processing Systems (NIPS).
- (2014) Advances in Neural Information Processing Systems (NIPS)
- Karpathy, A.¹ Joulin, A.² Fei-Fei, L.³

23
- 84898785322
- Describing video contents in natural language
- Muhammad Usman Ghani Khan and Yoshihiko Gotoh. 2012. Describing video contents in natural language. Proceedings of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data.
- (2012) Proceedings of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data
- Khan, M.U.G.¹ Gotoh, Y.²

24
- 84944113729
- arXiv preprint arXiv:1411.2539
- Ryan Kiros, Ruslan Salakhuditnov, and Richard. S Zemel. 2014. Unifying visual-semantic embeddings with multimodal neural language models. arXiv preprint arXiv:1411.2539.
- (2014) Unifying Visual-semantic Embeddings with Multimodal Neural Language Models
- Kiros, R.¹ Salakhuditnov, R.² Zemel, R.S.³

25
- 49449108990
- Cambridge University Press
- Philipp Koehn. 2010. Statistical Machine Translation. Cambridge University Press.
- (2010) Statistical Machine Translation
- Koehn, P.¹

26
- 0036843382
- Natural language description of human activities from video images based on concept hierarchy of actions
- A. Kojima, T. Tamura, and K. Fukunaga. 2002. Natural language description of human activities from video images based on concept hierarchy of actions. International Journal of Computer Vision (IJCV), 50(2).
- (2002) International Journal of Computer Vision (IJCV) , vol.50 , Issue.2
- Kojima, A.¹ Tamura, T.² Fukunaga, K.³

27
- 84893398951
- Generating natural-language video descriptions using text-mined knowledge
- Niveda Krishnamoorthy, Girish Malkarnenkar, Raymond J. Mooney, Kate Saenko, and Sergio Guadarrama. 2013. Generating natural-language video descriptions using text-mined knowledge. In AAAI Conference on Artificial Intelligence (AAAI).
- (2013) AAAI Conference on Artificial Intelligence (AAAI)
- Krishnamoorthy, N.¹ Malkarnenkar, G.² Mooney, R.J.³ Saenko, K.⁴ Guadarrama, S.⁵

28
- 84876231242
- ImageNet classification with deep convolutional neural networks
- Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. ImageNet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems (NIPS).
- (2012) Advances in Neural Information Processing Systems (NIPS)
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.E.³

29
- 80052901011
- Baby talk: Understanding and generating simple image descriptions
- Girish Kulkarni, Visruth Premraj, Sagnik Dhar, Siming Li, Yejin Choi, Alexander C Berg, and Tamara L Berg. 2011. Baby talk: Understanding and generating simple image descriptions. In Conference on Computer Vision and Pattern Recognition (CVPR).
- (2011) Conference on Computer Vision and Pattern Recognition (CVPR)
- Kulkarni, G.¹ Premraj, V.² Dhar, S.³ Li, S.⁴ Choi, Y.⁵ Berg, A.C.⁶ Berg, T.L.⁷

30
- 84934873221
- Treetalk: Composition and compression of trees for image descriptions
- Polina Kuznetsova, Vicente Ordonez, Tamara L Berg, UNC Chapel Hill, and Yejin Choi. 2014. Treetalk: Composition and compression of trees for image descriptions. Transactions of the Association for Computational Linguistics, 2(10).
- (2014) Transactions of the Association for Computational Linguistics , vol.2 , Issue.10
- Kuznetsova, P.¹ Ordonez, V.² Berg, T.L.³ Choi, Y.⁴

31
- 51849094354
- Save: A framework for semantic annotation of visual events
- M.W. Lee, A. Hakeem, N. Haering, and S.C. Zhu. 2008. Save: A framework for semantic annotation of visual events. In Conference on Computer Vision and Pattern Recognition (CVPR).
- (2008) Conference on Computer Vision and Pattern Recognition (CVPR)
- Lee, M.W.¹ Hakeem, A.² Haering, N.³ Zhu, S.C.⁴

32
- 84906505935
- arXiv preprint arXiv:1405.0312
- Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C Lawrence Zitnick. 2014. Microsoft COCO: Common objects in context. arXiv preprint arXiv:1405.0312.
- (2014) Microsoft COCO: Common Objects in Context
- Lin, T.-Y.¹ Maire, M.² Belongie, S.³ Hays, J.⁴ Perona, P.⁵ Ramanan, D.⁶ Dollár, P.⁷ Zitnick, C.L.⁸

33
- 84951072975
- arXiv preprint arXiv:1410.1090
- Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, and Alan L Yuille. 2014. Explain images with multimodal recurrent neural networks. arXiv preprint arXiv:1410.1090.
- (2014) Explain Images with Multimodal Recurrent Neural Networks
- Mao, J.¹ Xu, W.² Yang, Y.³ Wang, J.⁴ Yuille, A.L.⁵

34
- 84959182849
- Improving video activity recognition using object recognition and text mining
- Tanvi S. Motwani and Raymond J. Mooney. 2012. Improving video activity recognition using object recognition and text mining. In Proceedings of the 20th European Conference on Artificial Intelligence (ECAI).
- (2012) Proceedings of the 20th European Conference on Artificial Intelligence (ECAI)
- Motwani, T.S.¹ Mooney, R.J.²

35
- 84905274625
- TRECVID 2012 - An overview of the goals, tasks, data, evaluation mechanisms and metrics
- Paul Over, George Awad, Martial Michel, Jonathan Fiscus, Greg Sanders, B Shaw, Alan F. Smeaton, and Georges Quéenot. 2012. TRECVID 2012 - an overview of the goals, tasks, data, evaluation mechanisms and metrics. In Proceedings of TRECVID 2012.
- (2012) Proceedings of TRECVID 2012
- Over, P.¹ Awad, G.² Michel, M.³ Fiscus, J.⁴ Sanders, G.⁵ Shaw, B.⁶ Smeaton, A.F.⁷ Quéenot, G.⁸

36
- 85133336275
- BLEU: A method for automatic evaluation of machine translation
- Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. BLEU: a method for automatic evaluation of machine translation. In Association for Computational Linguistics (ACL).
- (2002) Association for Computational Linguistics (ACL)
- Papineni, K.¹ Roukos, S.² Ward, T.³ Zhu, W.-J.⁴

37
- 84898775239
- Translating video content to natural language descriptions
- Marcus Rohrbach, Wei Qiu, Ivan Titov, Stefan Thater, Manfred Pinkal, and Bernt Schiele. 2013. Translating video content to natural language descriptions. In IEEE International Conference on Computer Vision (ICCV).
- (2013) IEEE International Conference on Computer Vision (ICCV)
- Rohrbach, M.¹ Qiu, W.² Titov, I.³ Thater, S.⁴ Pinkal, M.⁵ Schiele, B.⁶

38
- 84960170289
- Coherent multi-sentence video description with variable level of detail
- September
- Anna Rohrbach, Marcus Rohrbach, Wei Qiu, Annemarie Friedrich, Manfred Pinkal, and Bernt Schiele. 2014. Coherent multi-sentence video description with variable level of detail. In German Conference on Pattern Recognition (GCPR), September.
- (2014) German Conference on Pattern Recognition (GCPR)
- Rohrbach, A.¹ Rohrbach, M.² Qiu, W.³ Friedrich, A.⁴ Pinkal, M.⁵ Schiele, B.⁶

39
- 84909978410
- Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander C. Berg, and Li Fei-Fei. 2014. ImageNet Large Scale Visual Recognition Challenge.
- (2014) ImageNet Large Scale Visual Recognition Challenge
- Russakovsky, O.¹ Deng, J.² Su, H.³ Krause, J.⁴ Satheesh, S.⁵ Ma, S.⁶ Huang, Z.⁷ Karpathy, A.⁸ Khosla, A.⁹ Bernstein, M.¹⁰ Berg, A.C.¹¹ Fei-Fei, L.¹²

40
- 84928547704
- Sequence to sequence learning with neural networks
- Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. 2014. Sequence to sequence learning with neural networks. In Advances in Neural Information Processing Systems (NIPS).
- (2014) Advances in Neural Information Processing Systems (NIPS)
- Sutskever, I.¹ Vinyals, O.² Le, Q.V.³

41
- 84959932469
- Integrating language and vision to generate natural language descriptions of videos in the wild
- August
- J. Thomason, S. Venugopalan, S. Guadarrama, K. Saenko, and R.J. Mooney. 2014. Integrating language and vision to generate natural language descriptions of videos in the wild. In Proceedings of the 25th International Conference on Computational Linguistics (COLING), August.
- (2014) Proceedings of the 25th International Conference on Computational Linguistics (COLING)
- Thomason, J.¹ Venugopalan, S.² Guadarrama, S.³ Saenko, K.⁴ Mooney, R.J.⁵

42
- 84951910303
- Show and tell: A neural image caption generator
- abs/1411.4555
- Oriol Vinyals, Alexander Toshev, Samy Bengio, and Dumitru Erhan. 2014. Show and tell: A neural image caption generator. CoRR, abs/1411.4555.
- (2014) CoRR
- Vinyals, O.¹ Toshev, A.² Bengio, S.³ Erhan, D.⁴

43
- 77954177620
- Multimodal fusion for video search reranking
- Shikui Wei, Yao Zhao, Zhenfeng Zhu, and Nan Liu. 2010. Multimodal fusion for video search reranking. IEEE Transactions on Knowledge and Data Engineering, 22(8).
- (2010) IEEE Transactions on Knowledge and Data Engineering , vol.22 , Issue.8
- Wei, S.¹ Zhao, Y.² Zhu, Z.³ Liu, N.⁴

44
- 84940762015
- Jointly modeling deep video and compositional text to bridge vision and language in a unified framework
- R. Xu, C. Xiong,W. Chen, and J. J. Corso. 2015. Jointly modeling deep video and compositional text to bridge vision and language in a unified framework. In AAAI Conference on Artificial Intelligence (AAAI).
- (2015) AAAI Conference on Artificial Intelligence (AAAI)
- Xu, R.¹ Xiongw. Chen, C.² Corso, J.J.³

45
- 77954862144
- I2t: Image parsing to text description
- B.Z. Yao, X. Yang, L. Lin, M.W. Lee, and S.C. Zhu. 2010. I2t: Image parsing to text description. Proceedings of the IEEE, 98(8).
- (2010) Proceedings of the IEEE , vol.98 , Issue.8
- Yao, B.Z.¹ Yang, X.² Lin, L.³ Lee, M.W.⁴ Zhu, S.C.⁵

46
- 84897743886
- Grounded language learning from videos described with sentences
- Haonan Yu and Jeffrey Mark Siskind. 2013. Grounded language learning from videos described with sentences. In Association for Computational Linguistics (ACL).
- (2013) Association for Computational Linguistics (ACL)
- Yu, H.¹ Mark Siskind, J.²

47
- 84958234084
- arXiv preprint arXiv:1410.4615
- Wojciech Zaremba and Ilya Sutskever. 2014. Learning to execute. arXiv preprint arXiv:1410.4615.
- (2014) Learning to Execute
- Zaremba, W.¹ Sutskever, I.²

48
- 84921476116
- Visualizing and understanding convolutional networks
- Springer
- Matthew D Zeiler and Rob Fergus. 2014. Visualizing and understanding convolutional networks. In European Conference on Computer Vision (ECCV). Springer.
- (2014) European Conference on Computer Vision (ECCV)
- Zeiler, M.D.¹ Fergus, R.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.