SCOPUS Information Retrieval Platform

4th International Conference on Learning Representations, ICLR 2016 - Conference Track Proceedings

Volume , Issue , 2016, Pages

Delving deeper into convolutional networks for learning video representations

(4) Ballas, Nicolas (a); Yao, Li (a); Pal, Chris (b); Courville, Aaron (a)

a UNIVERSITÉ DE MONTRÉAL (Canada)

b ÉCOLE POLYTECHNIQUE DE MONTRÉAL (Canada)

Author keywords

[No Author keywords available]

Indexed keywords

IMAGE RESOLUTION; LARGE DATASET; RECURRENT NEURAL NETWORKS; VIDEO RECORDING;

CONVOLUTIONAL NETWORKS; HIGH DIMENSIONALITY; HUMAN-ACTION RECOGNITION; RECURRENT NETWORKS; SPATIAL RESOLUTION; SPATIO TEMPORAL FEATURES; VIDEO REPRESENTATIONS; VISUAL REPRESENTATIONS;

CONVOLUTION;

EID: 85083954507; PISSN: None; EISSN: None; Source Type: Conference Proceeding
DOI: None; Document Type: Conference Paper

Times cited : (329)

References (40)

1
- 84922389693
- arXiv preprint
- Bahdanau, D., Cho, K., and Bengio, Y. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473, 2014.
- (2014) Neural Machine Translation by Jointly Learning to Align and Translate
- Bahdanau, D.¹ Cho, K.² Bengio, Y.³

2
- 84897544737
- Theano: New features and speed improvements
- Bastien, Frédéric, Lamblin, Pascal, Pascanu, Razvan, Bergstra, James, Goodfellow, Ian J., Bergeron, Arnaud, Bouchard, Nicolas, and Bengio, Yoshua. Theano: new features and speed improvements. Deep Learning and Unsupervised Feature Learning NIPS 2012 Workshop, 2012.
- (2012) Deep Learning and Unsupervised Feature Learning NIPS 2012 Workshop
- Bastien, F.¹ Lamblin, P.² Pascanu, R.³ Bergstra, J.⁴ Goodfellow, I.J.⁵ Bergeron, A.⁶ Bouchard, N.⁷ Bengio, Y.⁸

3
- 0028392483
- Learning long-term dependencies with gradient descent is difficult
- Bengio, Y., Simard, P., and Frasconi, P. Learning long-term dependencies with gradient descent is difficult. Neural Networks, IEEE Transactions on, 1994.
- (1994) Neural Networks, IEEE Transactions on
- Bengio, Y.¹ Simard, P.² Frasconi, P.³

4
- 84857855190
- Random search for hyper-parameter optimization
- Bergstra, James and Bengio, Yoshua. Random search for hyper-parameter optimization. JMLR, 2012.
- (2012) JMLR
- Bergstra, J.¹ Bengio, Y.²

5
- 84857819132
- Theano: A CPU and GPU math expression compiler
- Bergstra, James, Breuleux, Olivier, Bastien, Frédéric, Lamblin, Pascal, Pascanu, Razvan, Desjardins, Guillaume, Turian, Joseph, Warde-Farley, David, and Bengio, Yoshua. Theano: a CPU and GPU math expression compiler. In Proceedings of the Python for Scientific Computing Conference (SciPy), 2010.
- (2010) Proceedings of the Python for Scientific Computing Conference (SciPy)
- Bergstra, J.¹ Breuleux, O.² Bastien, F.³ Lamblin, P.⁴ Pascanu, R.⁵ Desjardins, G.⁶ Turian, J.⁷ Warde-Farley, D.⁸ Bengio, Y.⁹

6
- 79551562584
- Large displacement optical flow: Descriptor matching in variational motion estimation
- Brox, T. and Malik, J. Large displacement optical flow: descriptor matching in variational motion estimation. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 2011.
- (2011) Pattern Analysis and Machine Intelligence, IEEE Transactions on
- Brox, T.¹ Malik, J.²

7
- 84859089502
- Collecting highly parallel data for paraphrase evaluation
- Association for Computational Linguistics
- Chen, David L and Dolan, William B. Collecting highly parallel data for paraphrase evaluation. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1, pp. 190–200. Association for Computational Linguistics, 2011.
- (2011) Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies , vol.1 , pp. 190-200
- Chen, D.L.¹ Dolan, W.B.²

8
- 84952349295
- Chen, Xinlei, Fang, Hao, Lin, Tsung-Yi, Vedantam, Ramakrishna, Gupta, Saurabh, Dollar, Piotr, and Zitnick, C Lawrence. Microsoft coco captions: Data collection and evaluation server. arXiv 1504.00325, 2015.
- (2015) Microsoft Coco Captions: Data Collection and Evaluation Server
- Chen, X.¹ Fang, H.² Lin, T.-Y.³ Vedantam, R.⁴ Gupta, S.⁵ Dollar, P.⁶ Zitnick, C.L.⁷

9
- 84961291190
- arXiv preprint
- Cho, K., Van Merriënboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., and Bengio, Y. Learning phrase representations using rnn encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078, 2014.
- (2014) Learning Phrase Representations Using Rnn Encoder-Decoder for Statistical Machine Translation
- Cho, K.¹ Van Merriënboer, B.² Gulcehre, C.³ Bahdanau, D.⁴ Bougares, F.⁵ Schwenk, H.⁶ Bengio, Y.⁷

10
- 84939821078
- arXiv preprint
- Chung, Junyoung, Gulcehre, Caglar, Cho, KyungHyun, and Bengio, Yoshua. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:1412.3555, 2014.
- (2014) Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
- Chung, J.¹ Gulcehre, C.² Cho, K.³ Bengio, Y.⁴

11
- 84926007060
- Meteor universal: Language specific translation evaluation for any target language
- Denkowski, Michael and Lavie, Alon. Meteor universal: Language specific translation evaluation for any target language. In EACL Workshop, 2014.
- (2014) EACL Workshop
- Denkowski, M.¹ Lavie, A.²

12
- 84944046597
- arXiv preprint
- Donahue, J., Hendricks, L., Guadarrama, S., Rohrbach, M., Venugopalan, S., Saenko, K., and Darrell, T. Long-term recurrent convolutional networks for visual recognition and description. arXiv preprint arXiv:1411.4389, 2014.
- (2014) Long-Term Recurrent Convolutional Networks for Visual Recognition and Description
- Donahue, J.¹ Hendricks, L.² Guadarrama, S.³ Rohrbach, M.⁴ Venugopalan, S.⁵ Saenko, K.⁶ Darrell, T.⁷

13
- 84919832465
- Towards end-to-end speech recognition with recurrent neural networks
- Graves, A. and Jaitly, N. Towards end-to-end speech recognition with recurrent neural networks. In ICML, 2014.
- (2014) ICML
- Graves, A.¹ Jaitly, N.²

14
- 0031573117
- Long short-term memory
- Hochreiter, Sepp and Schmidhuber, Jürgen. Long short-term memory. Neural computation, 1997.
- (1997) Neural Computation
- Hochreiter, S.¹ Schmidhuber, J.²

15
- 84905052261
- Thumos challenge: Action recognition with a large number of classes
- Jiang, YG, Liu, J, Roshan Zamir, A, Toderici, G, Laptev, I, Shah, M, and Sukthankar, R. Thumos challenge: Action recognition with a large number of classes. Technical Report, 2014.
- (2014) Technical Report
- Jiang, Y.G.¹ Liu, J.² Roshan Zamir, A.³ Toderici, G.⁴ Laptev, I.⁵ Shah, M.⁶ Sukthankar, R.⁷

16
- 84911364368
- Large-scale video classification with convolutional neural networks
- Karpathy, Andrej, Toderici, George, Shetty, Sachin, Leung, Tommy, Sukthankar, Rahul, and Fei-Fei, Li. Large-scale video classification with convolutional neural networks. In CVPR. IEEE, 2014.
- (2014) CVPR
- Karpathy, A.¹ Toderici, G.² Shetty, S.³ Leung, T.⁴ Sukthankar, R.⁵ Fei-Fei, L.⁶

17
- 84941620184
- arXiv preprint
- Kingma, Diederik and Ba, Jimmy. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
- (2014) Adam: A Method for Stochastic Optimization
- Kingma, D.¹ Ba, J.²

18
- 84969541681
- arXiv preprint
- Lan, Zhenzhong, Lin, Ming, Li, Xuanchong, Hauptmann, Alexander G, and Raj, Bhiksha. Beyond gaussian pyramid: Multi-skip feature stacking for action recognition. arXiv preprint arXiv:1411.6660, 2014.
- (2014) Beyond Gaussian Pyramid: Multi-Skip Feature Stacking for Action Recognition
- Lan, Z.¹ Lin, M.² Li, X.³ Hauptmann, A.G.⁴ Raj, B.⁵

19
- 0032203257
- Gradient-based learning applied to document recognition
- LeCun, Y., Bottou, L., Bengio, Y., and Haffner, P. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 1998.
- (1998) Proceedings of the IEEE
- LeCun, Y.¹ Bottou, L.² Bengio, Y.³ Haffner, P.⁴

20
- 84971668900
- arXiv preprint
- Ng, Joe Yue-Hei, Hausknecht, Matthew, Vijayanarasimhan, Sudheendra, Vinyals, Oriol, Monga, Rajat, and Toderici, George. Beyond short snippets: Deep networks for video classification. arXiv preprint arXiv:1503.08909, 2015.
- (2015) Beyond Short Snippets: Deep Networks for Video Classification
- Ng, J.Y.-H.¹ Hausknecht, M.² Vijayanarasimhan, S.³ Vinyals, O.⁴ Monga, R.⁵ Toderici, G.⁶

21
- 84990842753
- arXiv preprint
- Pan, Pingbo, Xu, Zhongwen, Yang, Yi, Wu, Fei, and Zhuang, Yueting. Hierarchical recurrent neural encoder for video representation with application to captioning. arXiv preprint arXiv:1511.03476, 2015.
- (2015) Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning
- Pan, P.¹ Xu, Z.² Yang, Y.³ Wu, F.⁴ Zhuang, Y.⁵

22
- 85133336275
- BLEU: A method for automatic evaluation of machine translation
- Papineni, Kishore, Roukos, Salim, Ward, Todd, and Zhu, Wei-Jing. Bleu: a method for automatic evaluation of machine translation. In ACL, 2002.
- (2002) ACL
- Papineni, K.¹ Roukos, S.² Ward, T.³ Zhu, W.-J.⁴

23
- 84866718894
- Action bank: A high-level representation of activity in video
- Sadanand, S. and Corso, J. Action bank: A high-level representation of activity in video. In CVPR. IEEE, 2012.
- (2012) CVPR
- Sadanand, S.¹ Corso, J.²

24
- 84978958015
- arXiv preprint
- Shi, Xingjian, Chen, Zhourong, Wang, Hao, Yeung, Dit-Yan, Wong, Wai-Kin, and Woo, Wang-chun. Convolutional lstm network: A machine learning approach for precipitation nowcasting. arXiv preprint arXiv:1506.04214, 2015.
- (2015) Convolutional Lstm Network: A Machine Learning Approach for Precipitation Nowcasting
- Shi, X.¹ Chen, Z.² Wang, H.³ Yeung, D.-Y.⁴ Wong, W.-K.⁵ Woo, W.-C.⁶

25
- 84937862424
- Two-stream convolutional networks for action recognition in videos
- Simonyan, Karen and Zisserman, Andrew. Two-stream convolutional networks for action recognition in videos. In Advances in Neural Information Processing Systems, pp. 568–576, 2014a.
- (2014) Advances in Neural Information Processing Systems , pp. 568-576
- Simonyan, K.¹ Zisserman, A.²

26
- 84925410541
- arXiv preprint
- Simonyan, Karen and Zisserman, Andrew. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556, 2014b.
- (2014) Very Deep Convolutional Networks for Large-Scale Image Recognition
- Simonyan, K.¹ Zisserman, A.²

27
- 84884955228
- arXiv preprint
- Soomro, Khurram, Zamir, Amir Roshan, and Shah, Mubarak. Ucf101: A dataset of 101 human actions classes from videos in the wild. arXiv preprint arXiv:1212.0402, 2012.
- (2012) Ucf101: A Dataset of 101 Human Actions Classes from Videos in the Wild
- Soomro, K.¹ Zamir, A.R.² Shah, M.³

28
- 84969544782
- Unsupervised learning of video representations using lstms
- Srivastava, N., Mansimov, E., and Salakhutdinov, R. Unsupervised learning of video representations using lstms. In ICML, 2015.
- (2015) ICML
- Srivastava, N.¹ Mansimov, E.² Salakhutdinov, R.³

29
- 84964983441
- arXiv preprint
- Szegedy, Christian, Liu, Wei, Jia, Yangqing, Sermanet, Pierre, Reed, Scott, Anguelov, Dragomir, Erhan, Dumitru, Vanhoucke, Vincent, and Rabinovich, Andrew. Going deeper with convolutions. arXiv preprint arXiv:1409.4842, 2014.
- (2014) Going Deeper with Convolutions
- Szegedy, C.¹ Liu, W.² Jia, Y.³ Sermanet, P.⁴ Reed, S.⁵ Anguelov, D.⁶ Erhan, D.⁷ Vanhoucke, V.⁸ Rabinovich, A.⁹

30
- 84959932469
- Integrating language and vision to generate natural language descriptions of videos in the wild
- Thomason, Jesse, Venugopalan, Subhashini, Guadarrama, Sergio, Saenko, Kate, and Mooney, Raymond. Integrating language and vision to generate natural language descriptions of videos in the wild. In COLING, 2014.
- (2014) COLING
- Thomason, J.¹ Venugopalan, S.² Guadarrama, S.³ Saenko, K.⁴ Mooney, R.⁵

31
- 84969504307
- arXiv preprint
- Tran, D., Bourdev, L., Fergus, R., Torresani, L., and Paluri, M. C3d: generic features for video analysis. arXiv preprint arXiv:1412.0767, 2014.
- (2014) C3d: Generic Features for Video Analysis
- Tran, D.¹ Bourdev, L.² Fergus, R.³ Torresani, L.⁴ Paluri, M.⁵

32
- 84959197551
- Vedantam, Ramakrishna, Zitnick, C Lawrence, and Parikh, Devi. CIDEr: Consensus-based image description evaluation. arXiv:1411.5726, 2014.
- (2014) CIDEr: Consensus-Based Image Description Evaluation
- Vedantam, R.¹ Zitnick, C.L.² Parikh, D.³

33
- 84959876769
- Translating videos to natural language using deep recurrent neural networks
- Venugopalan, Subhashini, Xu, Huijuan, Donahue, Jeff, Rohrbach, Marcus, Mooney, Raymond, and Saenko, Kate. Translating videos to natural language using deep recurrent neural networks. NAACL, 2015.
- (2015) NAACL
- Venugopalan, S.¹ Xu, H.² Donahue, J.³ Rohrbach, M.⁴ Mooney, R.⁵ Saenko, K.⁶

34
- 80052877143
- Action recognition by dense trajectories
- Wang, H., Kläser, A., Schmid, C., and Liu, C. Action recognition by dense trajectories. In CVPR. IEEE, 2011.
- (2011) CVPR
- Wang, H.¹ Kläser, A.² Schmid, C.³ Liu, C.⁴

35
- 84950265630
- arXiv preprint
- Wang, Limin, Qiao, Yu, and Tang, Xiaoou. Action recognition with trajectory-pooled deep-convolutional descriptors. arXiv preprint arXiv:1505.04868, 2015a.
- (2015) Action Recognition with Trajectory-Pooled Deep-Convolutional Descriptors
- Wang, L.¹ Qiao, Y.² Tang, X.³

36
- 84955300999
- arXiv preprint
- Wang, Limin, Xiong, Yuanjun, Wang, Zhe, and Qiao, Yu. Towards good practices for very deep two-stream convnets. arXiv preprint arXiv:1507.02159, 2015b.
- (2015) Towards Good Practices for Very Deep Two-Stream Convnets
- Wang, L.¹ Xiong, Y.² Wang, Z.³ Qiao, Y.⁴

37
- 85030455323
- arXiv preprint
- Yao, Li, Ballas, Nicolas, Cho, Kyunghyun, Smith, John R., and Bengio, Yoshua. Trainable performance upper bounds for image and video captioning. arXiv preprint arXiv:1511.0459, 2015a.
- (2015) Trainable Performance Upper Bounds for Image and Video Captioning
- Yao, L.¹ Ballas, N.² Cho, K.³ Smith, J.R.⁴ Bengio, Y.⁵

38
- 84973884896
- Describing videos by exploiting temporal structure
- Yao, Li, Torabi, Atousa, Cho, Kyunghyun, Ballas, Nicolas, Pal, Christopher, Larochelle, Hugo, and Courville, Aaron. Describing videos by exploiting temporal structure. In Computer Vision (ICCV), 2015 IEEE International Conference on. IEEE, 2015b.
- (2015) Computer Vision (ICCV), 2015 IEEE International Conference on
- Yao, L.¹ Torabi, A.² Cho, K.³ Ballas, N.⁴ Pal, C.⁵ Larochelle, H.⁶ Courville, A.⁷

39
- 84990820289
- Yu, Haonan, Wang, Jiang, Huang, Zhiheng, Yang, Yi, and Xu, Wei. Video paragraph captioning using hierarchical recurrent neural networks. arXiv 1510.07712, 2015.
- (2015) Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks
- Yu, H.¹ Wang, J.² Huang, Z.³ Yang, Y.⁴ Xu, W.⁵

40
- 84969736572
- Technical report
- Zeiler, Matthew D. ADADELTA: an adaptive learning rate method. Technical report, 2012.
- (2012) ADADELTA: An Adaptive Learning Rate Method
- Zeiler, M.D.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.