-
2
-
-
84897544737
-
Theano: New features and speed improvements
-
Bastien, Frédéric, Lamblin, Pascal, Pascanu, Razvan, Bergstra, James, Goodfellow, Ian J., Bergeron, Arnaud, Bouchard, Nicolas, and Bengio, Yoshua. Theano: new features and speed improvements. Deep Learning and Unsupervised Feature Learning NIPS 2012 Workshop, 2012.
-
(2012)
Deep Learning and Unsupervised Feature Learning NIPS 2012 Workshop
-
-
Bastien, F.1
Lamblin, P.2
Pascanu, R.3
Bergstra, J.4
Goodfellow, I.J.5
Bergeron, A.6
Bouchard, N.7
Bengio, Y.8
-
3
-
-
0028392483
-
Learning long-term dependencies with gradient descent is difficult
-
Bengio, Y., Simard, P., and Frasconi, P. Learning long-term dependencies with gradient descent is difficult. Neural Networks, IEEE Transactions on, 1994.
-
(1994)
Neural Networks, IEEE Transactions on
-
-
Bengio, Y.1
Simard, P.2
Frasconi, P.3
-
4
-
-
84857855190
-
Random search for hyper-parameter optimization
-
Bergstra, James and Bengio, Yoshua. Random search for hyper-parameter optimization. JMLR, 2012.
-
(2012)
JMLR
-
-
Bergstra, J.1
Bengio, Y.2
-
5
-
-
84857819132
-
Theano: A CPU and GPU math expression compiler
-
Bergstra, James, Breuleux, Olivier, Bastien, Frédéric, Lamblin, Pascal, Pascanu, Razvan, Des-jardins, Guillaume, Turian, Joseph, Warde-Farley, David, and Bengio, Yoshua. Theano: a CPU and GPU math expression compiler. In Proceedings of the Python for Scientific Computing Conference (SciPy), 2010.
-
(2010)
Proceedings of the Python for Scientific Computing Conference (SciPy)
-
-
Bergstra, J.1
Breuleux, O.2
Bastien, F.3
Lamblin, P.4
Pascanu, R.5
Desjardins, G.6
Turian, J.7
Warde-Farley, D.8
Bengio, Y.9
-
6
-
-
79551562584
-
Large displacement optical flow: Descriptor matching in variational motion estimation. Pattern analysis and machine Intelligence
-
Brox, T. and Malik, J. Large displacement optical flow: descriptor matching in variational motion estimation. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 2011.
-
(2011)
IEEE Transactions on
-
-
Brox, T.1
Malik, J.2
-
8
-
-
84952349295
-
-
Chen, Xinlei, Fang, Hao, Lin, Tsung-Yi, Vedantam, Ramakrishna, Gupta, Saurabh, Dollar, Piotr, and Zitnick, C Lawrence. Microsoft coco captions: Data collection and evaluation server. arXiv 1504.00325, 2015.
-
(2015)
Microsoft Coco Captions: Data Collection and Evaluation Server
-
-
Chen, X.1
Fang, H.2
Lin, T.-Y.3
Vedantam, R.4
Gupta, S.5
Dollar, P.6
Zitnick, C.L.7
-
9
-
-
84961291190
-
-
arXiv preprint
-
Cho, K., Van Merriënboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., and Bengio, Y. Learning phrase representations using rnn encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078, 2014.
-
(2014)
Learning Phrase Representations Using Rnn Encoder-Decoder for Statistical Machine Translation
-
-
Cho, K.1
Van Merriënboer, B.2
Gulcehre, C.3
Bahdanau, D.4
Bougares, F.5
Schwenk, H.6
Bengio, Y.7
-
10
-
-
84939821078
-
-
arXiv preprint
-
Chung, Junyoung, Gulcehre, Caglar, Cho, KyungHyun, and Bengio, Yoshua. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:1412.3555, 2014.
-
(2014)
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
-
-
Chung, J.1
Gulcehre, C.2
Cho, K.3
Bengio, Y.4
-
11
-
-
84926007060
-
Meteor universal: Language specific translation evaluation for any target language
-
Denkowski, Michael and Lavie, Alon. Meteor universal: Language specific translation evaluation for any target language. In EACL Workshop, 2014.
-
(2014)
EACL Workshop
-
-
Denkowski, M.1
Lavie, A.2
-
12
-
-
84944046597
-
-
arXiv preprint
-
Donahue, J., Hendricks, L., Guadarrama, S., Rohrbach, M., Venugopalan, S., Saenko, K., and Darrell, T. Long-term recurrent convolutional networks for visual recognition and description. arXiv preprint arXiv:1411.4389, 2014.
-
(2014)
Long-Term Recurrent Convolutional Networks for Visual Recognition and Description
-
-
Donahue, J.1
Hendricks, L.2
Guadarrama, S.3
Rohrbach, M.4
Venugopalan, S.5
Saenko, K.6
Darrell, T.7
-
13
-
-
84919832465
-
Towards end-to-end speech recognition with recurrent neural networks
-
Graves, A. and Jaitly, N. Towards end-to-end speech recognition with recurrent neural networks. In ICML, 2014.
-
(2014)
ICML
-
-
Graves, A.1
Jaitly, N.2
-
15
-
-
84905052261
-
Thumos challenge: Action recognition with a large number of classes
-
Jiang, YG, Liu, J, Roshan Zamir, A, Toderici, G, Laptev, I, Shah, M, and Sukthankar, R. Thumos challenge: Action recognition with a large number of classes. Technical Report, 2014.
-
(2014)
Technical Report
-
-
Jiang, Y.G.1
Liu, J.2
Roshan Zamir, A.3
Toderici, G.4
Laptev, I.5
Shah, M.6
Sukthankar, R.7
-
16
-
-
84911364368
-
Large-scale video classification with convolutional neural networks
-
Karpathy, Andrej, Toderici, George, Shetty, Sachin, Leung, Tommy, Sukthankar, Rahul, and Fei-Fei, Li. Large-scale video classification with convolutional neural networks. In CVPR. IEEE, 2014.
-
(2014)
CVPR
-
-
Karpathy, A.1
Toderici, G.2
Shetty, S.3
Leung, T.4
Sukthankar, R.5
Fei-Fei, L.6
-
18
-
-
84969541681
-
-
arXiv preprint
-
Lan, Zhenzhong, Lin, Ming, Li, Xuanchong, Hauptmann, Alexander G, and Raj, Bhiksha. Beyond gaussian pyramid: Multi-skip feature stacking for action recognition. arXiv preprint arXiv:1411.6660, 2014.
-
(2014)
Beyond Gaussian Pyramid: Multi-Skip Feature Stacking for Action Recognition
-
-
Lan, Z.1
Lin, M.2
Li, X.3
Hauptmann, A.G.4
Raj, B.5
-
19
-
-
0032203257
-
Gradient-based learning applied to document recognition
-
LeCun, Y., Bottou, L., Bengio, Y., and Haffner, P. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 1998.
-
(1998)
Proceedings of the IEEE
-
-
LeCun, Y.1
Bottou, L.2
Bengio, Y.3
Haffner, P.4
-
20
-
-
84971668900
-
-
arXiv preprint
-
Ng, Joe Yue-Hei, Hausknecht, Matthew, Vijayanarasimhan, Sudheendra, Vinyals, Oriol, Monga, Rajat, and Toderici, George. Beyond short snippets: Deep networks for video classification. arXiv preprint arXiv:1503.08909, 2015.
-
(2015)
Beyond Short Snippets: Deep Networks for Video Classification
-
-
Ng, J.Y.-H.1
Hausknecht, M.2
Vijayanarasimhan, S.3
Vinyals, O.4
Monga, R.5
Toderici, G.6
-
21
-
-
84990842753
-
-
arXiv preprint
-
Pan, Pingbo, Xu, Zhongwen, Yang, Yi, Wu, Fei, and Zhuang, Yueting. Hierarchical recurrent neural encoder for video representation with application to captioning. arXiv preprint arXiv:1511.03476, 2015.
-
(2015)
Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning
-
-
Pan, P.1
Xu, Z.2
Yang, Y.3
Wu, F.4
Zhuang, Y.5
-
22
-
-
85133336275
-
BLEU: A method for automatic evaluation of machine translation
-
Papineni, Kishore, Roukos, Salim, Ward, Todd, and Zhu, Wei-Jing. Bleu: a method for automatic evaluation of machine translation. In ACL, 2002.
-
(2002)
ACL
-
-
Papineni, K.1
Roukos, S.2
Ward, T.3
Zhu, W.-J.4
-
23
-
-
84866718894
-
Action bank: A high-level representation of activity in video
-
Sadanand, S. and Corso, J. Action bank: A high-level representation of activity in video. In CVPR. IEEE, 2012.
-
(2012)
CVPR
-
-
Sadanand, S.1
Corso, J.2
-
24
-
-
84978958015
-
-
arXiv preprint
-
Shi, Xingjian, Chen, Zhourong, Wang, Hao, Yeung, Dit-Yan, Wong, Wai-Kin, and Woo, Wang-chun. Convolutional lstm network: A machine learning approach for precipitation nowcasting. arXiv preprint arXiv:1506.04214, 2015.
-
(2015)
Convolutional Lstm Network: A Machine Learning Approach for Precipitation Nowcasting
-
-
Shi, X.1
Chen, Z.2
Wang, H.3
Yeung, D.-Y.4
Wong, W.-K.5
Woo, W.-C.6
-
28
-
-
84969544782
-
Unsupervised learning of video representations using lstms
-
Srivastava, N., Mansimov, E., and Salakhutdinov, R. Unsupervised learning of video representations using lstms. In ICML, 2015.
-
(2015)
ICML
-
-
Srivastava, N.1
Mansimov, E.2
Salakhutdinov, R.3
-
29
-
-
84964983441
-
-
arXiv preprint
-
Szegedy, Christian, Liu, Wei, Jia, Yangqing, Sermanet, Pierre, Reed, Scott, Anguelov, Dragomir, Erhan, Dumitru, Vanhoucke, Vincent, and Rabinovich, Andrew. Going deeper with convolutions. arXiv preprint arXiv:1409.4842, 2014.
-
(2014)
Going Deeper with Convolutions
-
-
Szegedy, C.1
Liu, W.2
Jia, Y.3
Sermanet, P.4
Reed, S.5
Anguelov, D.6
Erhan, D.7
Vanhoucke, V.8
Rabinovich, A.9
-
30
-
-
84959932469
-
Integrating language and vision to generate natural language descriptions of videos in the wild
-
Thomason, Jesse, Venugopalan, Subhashini, Guadarrama, Sergio, Saenko, Kate, and Mooney, Raymond. Integrating language and vision to generate natural language descriptions of videos in the wild. In COLING, 2014.
-
(2014)
COLING
-
-
Thomason, J.1
Venugopalan, S.2
Guadarrama, S.3
Saenko, K.4
Mooney, R.5
-
31
-
-
84969504307
-
-
arXiv preprint
-
Tran, D., Bourdev, L., Fergus, R., Torresani, L., and Paluri, M. C3d: generic features for video analysis. arXiv preprint arXiv:1412.0767, 2014.
-
(2014)
C3d: Generic Features for Video Analysis
-
-
Tran, D.1
Bourdev, L.2
Fergus, R.3
Torresani, L.4
Paluri, M.5
-
33
-
-
84959876769
-
Translating videos to natural language using deep recurrent neural networks
-
Venugopalan, Subhashini, Xu, Huijuan, Donahue, Jeff, Rohrbach, Marcus, Mooney, Raymond, and Saenko, Kate. Translating videos to natural language using deep recurrent neural networks. NAACL, 2015.
-
(2015)
NAACL
-
-
Venugopalan, S.1
Xu, H.2
Donahue, J.3
Rohrbach, M.4
Mooney, R.5
Saenko, K.6
-
34
-
-
80052877143
-
Action recognition by dense trajectories
-
Wang, H., Kläser, A., Schmid, C., and Liu, C. Action recognition by dense trajectories. In CVPR. IEEE, 2011.
-
(2011)
CVPR
-
-
Wang, H.1
Kläser, A.2
Schmid, C.3
Liu, C.4
-
36
-
-
84955300999
-
-
arXiv preprint
-
Wang, Limin, Xiong, Yuanjun, Wang, Zhe, and Qiao, Yu. Towards good practices for very deep two-stream convnets. arXiv preprint arXiv:1507.02159, 2015b.
-
(2015)
Towards Good Practices for Very Deep Two-Stream Convnets
-
-
Wang, L.1
Xiong, Y.2
Wang, Z.3
Qiao, Y.4
-
37
-
-
85030455323
-
-
arXiv preprint
-
Yao, Li, Ballas, Nicolas, Cho, Kyunghyun, Smith, John R., and Bengio, Yoshua. Trainable performance upper bounds for image and video captioning. arXiv preprint arXiv:1511.0459, 2015a.
-
(2015)
Trainable Performance Upper Bounds for Image and Video Captioning
-
-
Yao, L.1
Ballas, N.2
Cho, K.3
Smith, J.R.4
Bengio, Y.5
-
38
-
-
84973884896
-
Describing videos by exploiting temporal structure
-
Yao, Li, Torabi, Atousa, Cho, Kyunghyun, Ballas, Nicolas, Pal, Christopher, Larochelle, Hugo, and Courville, Aaron. Describing videos by exploiting temporal structure. In Computer Vision (ICCV), 2015 IEEE International Conference on. IEEE, 2015b.
-
(2015)
Computer Vision (ICCV), 2015 IEEE International Conference on
-
-
Yao, L.1
Torabi, A.2
Cho, K.3
Ballas, N.4
Pal, C.5
Larochelle, H.6
Courville, A.7
-
39
-
-
84990820289
-
-
Yu, Haonan, Wang, Jiang, Huang, Zhiheng, Yang, Yi, and Xu, Wei. Video paragraph captioning using hierarchical recurrent neural networks. arXiv 1510.07712, 2015.
-
(2015)
Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks
-
-
Yu, H.1
Wang, J.2
Huang, Z.3
Yang, Y.4
Xu, W.5
|