-
1
-
-
0142166851
-
A neural probabilistic language model
-
Mar
-
Y. Bengio, R. Ducharme, P. Vincent, and C. Janvin. A neural probabilistic language model. J. Mach. Learn. Res., 3:1137-1155, Mar. 2003
-
(2003)
J. Mach. Learn. Res
, vol.3
, pp. 1137-1155
-
-
Bengio, Y.1
Ducharme, R.2
Vincent, P.3
Janvin, C.4
-
5
-
-
84944046597
-
-
arXiv preprint arXiv:1411.4389v2
-
J. Donahue, L. A. Hendricks, S. Guadarrama, M. Rohrbach, S. Venugopalan, K. Saenko, and T. Darrell. Long-term recurrent convolutional networks for visual recognition and description. arXiv preprint arXiv:1411.4389v2, 2014
-
(2014)
Long-term Recurrent Convolutional Networks for Visual Recognition and Description
-
-
Donahue, J.1
Hendricks, L.A.2
Guadarrama, S.3
Rohrbach, M.4
Venugopalan, S.5
Saenko, K.6
Darrell, T.7
-
6
-
-
33645837850
-
On the multivariate Laplace distribution
-
T. Eltoft, T. Kim, and T.-W. Lee. On the multivariate Laplace distribution. Signal Processing Letters, IEEE, 13(5):300-303, 2006
-
(2006)
Signal Processing Letters, IEEE
, vol.13
, Issue.5
, pp. 300-303
-
-
Eltoft, T.1
Kim, T.2
Lee, T.-W.3
-
7
-
-
78149311145
-
Every picture tells a story: Generating sentences from images
-
Springer
-
A. Farhadi, M. Hejrati, M. A. Sadeghi, P. Young, C. Rashtchian, J. Hockenmaier, and D. Forsyth. Every picture tells a story: Generating sentences from images. In Computer Vision-ECCV 2010, pages 15-29. Springer, 2010
-
(2010)
Computer Vision-ECCV 2010
, pp. 15-29
-
-
Farhadi, A.1
Hejrati, M.2
Sadeghi, M.A.3
Young, P.4
Rashtchian, C.5
Hockenmaier, J.6
Forsyth, D.7
-
9
-
-
84890466217
-
Improving neural networks by preventing co-adaptation of feature detectors
-
abs/1207.0580
-
G. E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov. Improving neural networks by preventing co-adaptation of feature detectors. CoRR, abs/1207.0580, 2012
-
(2012)
CoRR
-
-
Hinton, G.E.1
Srivastava, N.2
Krizhevsky, A.3
Sutskever, I.4
Salakhutdinov, R.5
-
10
-
-
84883394520
-
Framing image description as a ranking task: Data, models and evaluation metrics
-
M. Hodosh, P. Young, and J. Hockenmaier. Framing image description as a ranking task: Data, models and evaluation metrics. J. Artif. Intell. Res. (JAIR), 47:853-899, 2013
-
(2013)
J. Artif. Intell. Res. (JAIR)
, vol.47
, pp. 853-899
-
-
Hodosh, M.1
Young, P.2
Hockenmaier, J.3
-
12
-
-
0000107975
-
Relations between two sets of variates
-
H. Hotelling. Relations between two sets of variates. Biometrika, pages 321-377, 1936
-
(1936)
Biometrika
, pp. 321-377
-
-
Hotelling, H.1
-
14
-
-
84942676733
-
Deep visual-semantic alignments for generating image descriptions
-
Computer Science Department, Stanford University
-
A. Karpathy and L. Fei-Fei. Deep visual-semantic alignments for generating image descriptions. Technical report, Computer Science Department, Stanford University, 2014
-
(2014)
Technical Report
-
-
Karpathy, A.1
Fei-Fei, L.2
-
18
-
-
80052901011
-
Baby talk: Understanding and generating simple image descriptions
-
IEEE
-
G. Kulkarni, V. Premraj, S. Dhar, S. Li, Y. Choi, A. C. Berg, and T. L. Berg. Baby talk: Understanding and generating simple image descriptions. In Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on, pages 1601-1608. IEEE, 2011
-
(2011)
Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on
, pp. 1601-1608
-
-
Kulkarni, G.1
Premraj, V.2
Dhar, S.3
Li, S.4
Choi, Y.5
Berg, A.C.6
Berg, T.L.7
-
19
-
-
84878189119
-
Collective generation of natural image descriptions
-
Association for Computational Linguistics
-
P. Kuznetsova, V. Ordonez, A. C. Berg, T. L. Berg, and Y. Choi. Collective generation of natural image descriptions. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers-Volume 1, pages 359-368. Association for Computational Linguistics, 2012
-
(2012)
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers-Volume 1
, pp. 359-368
-
-
Kuznetsova, P.1
Ordonez, V.2
Berg, A.C.3
Berg, T.L.4
Choi, Y.5
-
20
-
-
84919829999
-
Distributed representations of sentences and documents
-
JMLR.org
-
Q. V. Le and T. Mikolov. Distributed representations of sentences and documents. In Proceedings of the 31st International Conference on Machine Learning, ICML 2014, Beijing, China, 21-26 June 2014, volume 32 of JMLR Proceedings, pages 1188-1196. JMLR.org, 2014
-
(2014)
Proceedings of the 31st International Conference on Machine Learning, ICML 2014, Beijing, China, 21-26 June 2014, Volume 32 of JMLR Proceedings
, pp. 1188-1196
-
-
Le, Q.V.1
Mikolov, T.2
-
21
-
-
0032203257
-
Gradient-based learning applied to document recognition
-
Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278-2324, 1998
-
(1998)
Proceedings of the IEEE
, vol.86
, Issue.11
, pp. 2278-2324
-
-
LeCun, Y.1
Bottou, L.2
Bengio, Y.3
Haffner, P.4
-
22
-
-
84863895135
-
Independent component analysis: Theory and applications [book review]
-
T.-W. Lee. Independent component analysis: theory and applications [book review]. IEEE Transactions on Neural Networks, 10(4):982-982, 1999
-
(1999)
IEEE Transactions on Neural Networks
, vol.10
, Issue.4
, pp. 982
-
-
Lee, T.-W.1
-
23
-
-
84862279067
-
Composing simple image descriptions using web-scale ngrams
-
Association for Computational Linguistics
-
S. Li, G. Kulkarni, T. L. Berg, A. C. Berg, and Y. Choi. Composing simple image descriptions using web-scale ngrams. In Proceedings of the Fifteenth Conference on Computational Natural Language Learning, pages 220-228. Association for Computational Linguistics, 2011
-
(2011)
Proceedings of the Fifteenth Conference on Computational Natural Language Learning
, pp. 220-228
-
-
Li, S.1
Kulkarni, G.2
Berg, T.L.3
Berg, A.C.4
Choi, Y.5
-
24
-
-
84906493406
-
Microsoft COCO: Common objects in context
-
D. Fleet, T. Pajdla, B. Schiele, and T. Tuytelaars, editors, Springer International Publishing
-
T.-Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollár, and C. Zitnick. Microsoft COCO: Common objects in context. In D. Fleet, T. Pajdla, B. Schiele, and T. Tuytelaars, editors, Computer Vision-ECCV 2014, volume 8693 of Lecture Notes in Computer Science, pages 740-755. Springer International Publishing, 2014
-
(2014)
Computer Vision-ECCV 2014, Volume 8693 of Lecture Notes in Computer Science
, pp. 740-755
-
-
Lin, T.-Y.1
Maire, M.2
Belongie, S.3
Hays, J.4
Perona, P.5
Ramanan, D.6
Dollár, P.7
Zitnick, C.8
-
26
-
-
84951072975
-
-
arXiv preprint arXiv:1410.1090
-
J. Mao, W. Xu, Y. Yang, J. Wang, and A. L. Yuille. Explain images with multimodal recurrent neural networks. arXiv preprint arXiv:1410.1090, 2014
-
(2014)
Explain Images with Multimodal Recurrent Neural Networks
-
-
Mao, J.1
Xu, W.2
Yang, Y.3
Wang, J.4
Yuille, A.L.5
-
27
-
-
85083951332
-
Efficient estimation of word representations in vector space
-
abs/1301.3781
-
T. Mikolov, K. Chen, G. Corrado, and J. Dean. Efficient estimation of word representations in vector space. CoRR, abs/1301.3781, 2013
-
(2013)
CoRR
-
-
Mikolov, T.1
Chen, K.2
Corrado, G.3
Dean, J.4
-
28
-
-
84898956512
-
Distributed representations of words and phrases and their compositionality
-
T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. In Advances in Neural Information Processing Systems, pages 3111-3119, 2013
-
(2013)
Advances in Neural Information Processing Systems
, pp. 3111-3119
-
-
Mikolov, T.1
Sutskever, I.2
Chen, K.3
Corrado, G.S.4
Dean, J.5
-
29
-
-
85034832841
-
Midge: Generating image descriptions from computer vision detections
-
Association for Computational Linguistics
-
M. Mitchell, X. Han, J. Dodge, A. Mensch, A. Goyal, A. Berg, K. Yamaguchi, T. Berg, K. Stratos, and H. Daumé III. Midge: Generating image descriptions from computer vision detections. In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, pages 747-756. Association for Computational Linguistics, 2012
-
(2012)
Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
, pp. 747-756
-
-
Mitchell, M.1
Han, X.2
Dodge, J.3
Mensch, A.4
Goyal, A.5
Berg, A.6
Yamaguchi, K.7
Berg, T.8
Stratos, K.9
Daumé, H.10
-
30
-
-
34547970628
-
Three new graphical models for statistical language modelling
-
New York, NY, USA, ACM
-
A. Mnih and G. Hinton. Three new graphical models for statistical language modelling. In Proceedings of the 24th International Conference on Machine Learning, ICML '07, pages 641-648, New York, NY, USA, 2007. ACM
-
(2007)
Proceedings of the 24th International Conference on Machine Learning, ICML '07
, pp. 641-648
-
-
Mnih, A.1
Hinton, G.2
-
31
-
-
0000273048
-
Annealed importance sampling
-
R. Neal. Annealed importance sampling. Statistics and Computing, 11(2):125-139, 2001
-
(2001)
Statistics and Computing
, vol.11
, Issue.2
, pp. 125-139
-
-
Neal, R.1
-
32
-
-
84906510060
-
Action recognition with stacked Fisher vectors
-
Springer
-
X. Peng, C. Zou, Y. Qiao, and Q. Peng. Action recognition with stacked Fisher vectors. In Computer Vision-ECCV 2014, pages 581-595. Springer, 2014
-
(2014)
Computer Vision-ECCV 2014
, pp. 581-595
-
-
Peng, X.1
Zou, C.2
Qiao, Y.3
Peng, Q.4
-
34
-
-
77955992063
-
Large-scale image retrieval with compressed Fisher vectors
-
IEEE
-
F. Perronnin, Y. Liu, J. Sánchez, and H. Poirier. Large-scale image retrieval with compressed Fisher vectors. In Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on, pages 3384-3391. IEEE, 2010
-
(2010)
Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on
, pp. 3384-3391
-
-
Perronnin, F.1
Liu, Y.2
Sánchez, J.3
Poirier, H.4
-
35
-
-
78149348137
-
Improving the Fisher kernel for large-scale image classification
-
Springer
-
F. Perronnin, J. Sánchez, and T. Mensink. Improving the Fisher kernel for large-scale image classification. In Computer Vision-ECCV 2010, pages 143-156. Springer, 2010
-
(2010)
Computer Vision-ECCV 2010
, pp. 143-156
-
-
Perronnin, F.1
Sánchez, J.2
Mensink, T.3
-
36
-
-
85090348677
-
Collecting image annotations using Amazon's Mechanical Turk
-
Association for Computational Linguistics
-
C. Rashtchian, P. Young, M. Hodosh, and J. Hockenmaier. Collecting image annotations using Amazon's Mechanical Turk. In Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, pages 139-147. Association for Computational Linguistics, 2010
-
(2010)
Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk
, pp. 139-147
-
-
Rashtchian, C.1
Young, P.2
Hodosh, M.3
Hockenmaier, J.4
-
37
-
-
84883487458
-
Image classification with the Fisher vector: Theory and practice
-
J. Sánchez, F. Perronnin, T. Mensink, and J. Verbeek. Image classification with the Fisher vector: Theory and practice. International Journal of Computer Vision, 105(3):222-245, 2013
-
(2013)
International Journal of Computer Vision
, vol.105
, Issue.3
, pp. 222-245
-
-
Sánchez, J.1
Perronnin, F.2
Mensink, T.3
Verbeek, J.4
-
38
-
-
84898428370
-
Fisher vector faces in the wild
-
K. Simonyan, O. M. Parkhi, A. Vedaldi, and A. Zisserman. Fisher vector faces in the wild. In Proc. BMVC, volume 1, page 7, 2013
-
(2013)
Proc. BMVC
, vol.1
, pp. 7
-
-
Simonyan, K.1
Parkhi, O.M.2
Vedaldi, A.3
Zisserman, A.4
-
40
-
-
84933585162
-
Very deep convolutional networks for large-scale image recognition
-
abs/1409.1556
-
K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. CoRR, abs/1409.1556, 2014
-
(2014)
CoRR
-
-
Simonyan, K.1
Zisserman, A.2
-
41
-
-
33745938597
-
Discovering objects and their location in images
-
IEEE
-
J. Sivic, B. C. Russell, A. A. Efros, A. Zisserman, and W. T. Freeman. Discovering objects and their location in images. In Computer Vision, 2005. ICCV 2005. Tenth IEEE International Conference on, volume 1, pages 370-377. IEEE, 2005
-
(2005)
Computer Vision, 2005. ICCV 2005. Tenth IEEE International Conference on
, vol.1
, pp. 370-377
-
-
Sivic, J.1
Russell, B.C.2
Efros, A.A.3
Zisserman, A.4
Freeman, W.T.5
-
42
-
-
77955998009
-
Connecting modalities: Semi-supervised segmentation and annotation of images using unaligned text corpora
-
IEEE
-
R. Socher and L. Fei-Fei. Connecting modalities: Semi-supervised segmentation and annotation of images using unaligned text corpora. In Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on, pages 966-973. IEEE, 2010
-
(2010)
Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on
, pp. 966-973
-
-
Socher, R.1
Fei-Fei, L.2
-
43
-
-
84928030723
-
Grounded compositional semantics for finding and describing images with sentences
-
R. Socher, Q. Le, C. Manning, and A. Ng. Grounded compositional semantics for finding and describing images with sentences. In NIPS Deep Learning Workshop, 2013
-
(2013)
NIPS Deep Learning Workshop
-
-
Socher, R.1
Le, Q.2
Manning, C.3
Ng, A.4
-
46
-
-
0001318292
-
Canonical ridge and econometrics of joint production
-
H. Vinod. Canonical ridge and econometrics of joint production. Journal of Econometrics, 4(2):147-166, 1976
-
(1976)
Journal of Econometrics
, vol.4
, Issue.2
, pp. 147-166
-
-
Vinod, H.1
-
48
-
-
84898772194
-
Learning the visual interpretation of sentences
-
IEEE
-
C. L. Zitnick, D. Parikh, and L. Vanderwende. Learning the visual interpretation of sentences. In Computer Vision (ICCV), 2013 IEEE International Conference on, pages 1681-1688. IEEE, 2013
-
(2013)
Computer Vision (ICCV), 2013 IEEE International Conference on
, pp. 1681-1688
-
-
Zitnick, C.L.1
Parikh, D.2
Vanderwende, L.3