-
1
-
-
85040308578
-
Bottom-up and top-down attention for image captioning and vqa
-
Anderson, P.; He, X.; Buehler, C.; Teney, D.; Johnson, M.; Gould, S.; and Zhang, L. 2017. Bottom-up and top-down attention for image captioning and vqa. In VQA Workshop at CVPR.
-
(2017)
VQA Workshop at CVPR
-
-
Anderson, P.1
He, X.2
Buehler, C.3
Teney, D.4
Johnson, M.5
Gould, S.6
Zhang, L.7
-
2
-
-
84993660571
-
Learning to compose neural networks for question answering
-
Andreas, J.; Marcus, R.; Darrell, T.; and Klein, D. 2016a. Learning to compose neural networks for question answering. In NAACL.
-
(2016)
NAACL
-
-
Andreas, J.1
Marcus, R.2
Darrell, T.3
Klein, D.4
-
4
-
-
84973890960
-
VQA: Visual question answering
-
Antol, S.; Agrawal, A.; Lu, J.; Mitchell, M.; Batra, D.; Zitnick, C. L.; and Parikh, D. 2015. VQA: Visual Question Answering. In ICCV.
-
(2015)
ICCV
-
-
Antol, S.1
Agrawal, A.2
Lu, J.3
Mitchell, M.4
Batra, D.5
Zitnick, C.L.6
Parikh, D.7
-
5
-
-
84899013802
-
Translating embeddings for modeling multi-relational data
-
Burges, C. J. C.; Bottou, L.; Welling, M.; Ghahramani, Z.; and Weinberger, K. Q., eds, Curran Associates, Inc
-
Bordes, A.; Usunier, N.; Garcia-Duran, A.; Weston, J.; and Yakhnenko, O. 2013. Translating embeddings for modeling multi-relational data. In Burges, C. J. C.; Bottou, L.; Welling, M.; Ghahramani, Z.; and Weinberger, K. Q., eds., NIPS. Curran Associates, Inc. 2787-2795.
-
(2013)
NIPS
, pp. 2787-2795
-
-
Bordes, A.1
Usunier, N.2
Garcia-Duran, A.3
Weston, J.4
Yakhnenko, O.5
-
7
-
-
85043992858
-
Modulating early visual processing by language
-
de Vries, H.; Strub, F.; Mary, J.; Larochelle, H.; Pietquin, O.; and Courville, A. C. 2017. Modulating early visual processing by language. In NIPS.
-
(2017)
NIPS
-
-
De Vries, H.1
Strub, F.2
Mary, J.3
Larochelle, H.4
Pietquin, O.5
Courville, A.C.6
-
8
-
-
85088228106
-
A learned representation for artistic style
-
Dumoulin, V.; Shlens, J.; and Kudlur, M. 2017. A learned representation for artistic style. In ICLR.
-
(2017)
ICLR
-
-
Dumoulin, V.1
Shlens, J.2
Kudlur, M.3
-
9
-
-
85083952626
-
Learning factored representations in a deep mixture of experts
-
Eigen, D.; Ranzato, M.; and Sutskever, I. 2014. Learning factored representations in a deep mixture of experts. In ICLR Workshops.
-
(2014)
ICLR Workshops
-
-
Eigen, D.1
Ranzato, M.2
Sutskever, I.3
-
10
-
-
85046994169
-
Convolutional sequence to sequence learning
-
Gehring, J.; Auli, M.; Grangier, D.; Yarats, D.; and Dauphin, Y. N. 2017. Convolutional sequence to sequence learning. In ICML.
-
(2017)
ICML
-
-
Gehring, J.1
Auli, M.2
Grangier, D.3
Yarats, D.4
Dauphin, Y.N.5
-
11
-
-
84925422907
-
-
National Acad Sciences
-
Geman, D.; Geman, S.; Hallonquist, N.; and Younes, L. 2015. Visual turing test for computer vision systems. volume 112, 3618-3623. National Acad Sciences.
-
(2015)
Visual Turing Test for Computer Vision Systems
, vol.112
, pp. 3618-3623
-
-
Geman, D.1
Geman, S.2
Hallonquist, N.3
Younes, L.4
-
12
-
-
85029539334
-
-
CoRR abs/1705.06830
-
Ghiasi, G.; Lee, H.; Kudlur, M.; Dumoulin, V.; and Shlens, J. 2017. Exploring the structure of a real-time, arbitrary neural artistic stylization network. CoRR abs/1705.06830.
-
(2017)
Exploring the Structure of a Real-Time, Arbitrary Neural Artistic Stylization Network
-
-
Ghiasi, G.1
Lee, H.2
Kudlur, M.3
Dumoulin, V.4
Shlens, J.5
-
13
-
-
85041900002
-
Making the V in VQA matter: Elevating the role of image understanding in Visual Question Answering
-
Goyal, Y.; Khot, T.; Summers-Stay, D.; Batra, D.; and Parikh, D. 2017. Making the V in VQA matter: Elevating the role of image understanding in Visual Question Answering. In CVPR.
-
(2017)
CVPR
-
-
Goyal, Y.1
Khot, T.2
Summers-Stay, D.3
Batra, D.4
Parikh, D.5
-
14
-
-
84959890267
-
Traversing knowledge graphs in vector space
-
Guu, K.; Miller, J.; and Liang, P. 2015. Traversing knowledge graphs in vector space. In EMNLP.
-
(2015)
EMNLP
-
-
Guu, K.1
Miller, J.2
Liang, P.3
-
16
-
-
84986274465
-
Deep residual learning for image recognition
-
He, K.; Zhang, X.; Ren, S.; and Sun, J. 2016. Deep residual learning for image recognition. In CVPR.
-
(2016)
CVPR
-
-
He, K.1
Zhang, X.2
Ren, S.3
Sun, J.4
-
18
-
-
85041904328
-
Learning to reason: End-to-end module networks for visual question answering
-
Hu, R.; Andreas, J.; Rohrbach, M.; Darrell, T.; and Saenko, K. 2017. Learning to reason: End-to-end module networks for visual question answering. In ICCV.
-
(2017)
ICCV
-
-
Hu, R.1
Andreas, J.2
Rohrbach, M.3
Darrell, T.4
Saenko, K.5
-
20
-
-
85041925505
-
Arbitrary style transfer in real-time with adaptive instance normalization
-
Huang, X., and Belongie, S. 2017. Arbitrary style transfer in real-time with adaptive instance normalization. In ICCV.
-
(2017)
ICCV
-
-
Huang, X.1
Belongie, S.2
-
21
-
-
84969584486
-
Batch normalization: Accelerating deep network training by reducing internal covariate shift
-
Ioffe, S., and Szegedy, C. 2015. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In ICML.
-
(2015)
ICML
-
-
Ioffe, S.1
Szegedy, C.2
-
22
-
-
85041904911
-
CLEVR: A diagnostic dataset for compositional language and elementary visual reasoning
-
Johnson, J.; Hariharan, B.; van der Maaten, L.; Fei-Fei, L.; Zitnick, C. L.; and Girshick, R. B. 2017a. CLEVR: A diagnostic dataset for compositional language and elementary visual reasoning. In CVPR.
-
(2017)
CVPR
-
-
Johnson, J.1
Hariharan, B.2
Van Der Maaten, L.3
Fei-Fei, L.4
Zitnick, C.L.5
Girshick, R.B.6
-
23
-
-
85041924656
-
Inferring and executing programs for visual reasoning
-
Johnson, J.; Hariharan, B.; van der Maaten, L.; Hoffman, J.; Li, F.; Zitnick, C. L.; and Girshick, R. B. 2017b. Inferring and executing programs for visual reasoning. In ICCV.
-
(2017)
ICCV
-
-
Johnson, J.1
Hariharan, B.2
Van Der Maaten, L.3
Hoffman, J.4
Li, F.5
Zitnick, C.L.6
Girshick, R.B.7
-
24
-
-
0000262562
-
Hierarchical mixtures of experts and the em algorithm
-
Jordan, M. I., and Jacobs, R. A. 1994. Hierarchical mixtures of experts and the em algorithm. Neural Comput. 6(2):181-214.
-
(1994)
Neural Comput
, vol.6
, Issue.2
, pp. 181-214
-
-
Jordan, M.I.1
Jacobs, R.A.2
-
25
-
-
85039151782
-
Dynamic layer normalization for adaptive neural acoustic modeling in speech recognition
-
Kim, T.; Song, I.; and Bengio, Y. 2017. Dynamic layer normalization for adaptive neural acoustic modeling in speech recognition. In InterSpeech.
-
(2017)
InterSpeech
-
-
Kim, T.1
Song, I.2
Bengio, Y.3
-
26
-
-
85083951076
-
Adam: A method for stochastic optimization
-
Kingma, D. P., and Ba, J. 2015. Adam: A method for stochastic optimization. In ICLR.
-
(2015)
ICLR
-
-
Kingma, D.P.1
Ba, J.2
-
27
-
-
85016395012
-
Overcoming catastrophic forgetting in neural networks
-
Kirkpatrick, J.; Pascanu, R.; Rabinowitz, N.; Veness, J.; Des-jardins, G.; Rusu, A. A.; Milan, K.; Quan, J.; Ramalho, T.; Grabska-Barwinska, A.; Hassabis, D.; Clopath, C.; Kumaran, D.; and Hadsell, R. 2017. Overcoming catastrophic forgetting in neural networks. National Academy of Sciences 114(13):3521-3526.
-
(2017)
National Academy of Sciences
, vol.114
, Issue.13
, pp. 3521-3526
-
-
Kirkpatrick, J.1
Pascanu, R.2
Rabinowitz, N.3
Veness, J.4
Des-Jardins, G.5
Rusu, A.A.6
Milan, K.7
Quan, J.8
Ramalho, T.9
Grabska-Barwinska, A.10
Hassabis, D.11
Clopath, C.12
Kumaran, D.13
Hadsell, R.14
-
28
-
-
85018917850
-
Hierarchical question-image co-attention for visual question answering
-
Lu, J.; Yang, J.; Batra, D.; and Parikh, D. 2016. Hierarchical question-image co-attention for visual question answering. In NIPS.
-
(2016)
NIPS
-
-
Lu, J.1
Yang, J.2
Batra, D.3
Parikh, D.4
-
29
-
-
84937822746
-
A multi-world approach to question answering about real-world scenes based on uncertain input
-
Malinowski, M., and Fritz, M. 2014. A multi-world approach to question answering about real-world scenes based on uncertain input. In NIPS.
-
(2014)
NIPS
-
-
Malinowski, M.1
Fritz, M.2
-
30
-
-
84973896625
-
Ask your neurons: A neural-based approach to answering questions about images
-
Malinowski, M.; Rohrbach, M.; and Fritz, M. 2015. Ask your neurons: A neural-based approach to answering questions about images. In ICCV.
-
(2015)
ICCV
-
-
Malinowski, M.1
Rohrbach, M.2
Fritz, M.3
-
31
-
-
84898956512
-
Distributed representations of words and phrases and their compositionality
-
Mikolov, T.; Sutskever, I.; Chen, K.; Corrado, G. S.; and Dean, J. 2013. Distributed representations of words and phrases and their compositionality. In NIPS.
-
(2013)
NIPS
-
-
Mikolov, T.1
Sutskever, I.2
Chen, K.3
Corrado, G.S.4
Dean, J.5
-
32
-
-
85057290984
-
Zero-shot task generalization with multi-task deep reinforcement learning
-
Oh, J.; Singh, S.; Lee, H.; and Kholi, P. 2017. Zero-shot task generalization with multi-task deep reinforcement learning. In ICML.
-
(2017)
ICML
-
-
Oh, J.1
Singh, S.2
Lee, H.3
Kholi, P.4
-
33
-
-
85055556180
-
Learning visual reasoning without strong priors
-
Perez, E.; de Vries, H.; Strub, F.; Dumoulin, V.; and Courville, A. C. 2017. Learning visual reasoning without strong priors. In MLSLP Workshop at ICML.
-
(2017)
MLSLP Workshop at ICML
-
-
Perez, E.1
De Vries, H.2
Strub, F.3
Dumoulin, V.4
Courville, A.C.5
-
34
-
-
85083950271
-
Unsupervised representation learning with deep convolutional generative adversarial networks
-
Radford, A.; Metz, L.; and Chintala, S. 2016. Unsupervised representation learning with deep convolutional generative adversarial networks. In ICLR.
-
(2016)
ICLR
-
-
Radford, A.1
Metz, L.2
Chintala, S.3
-
35
-
-
84947041871
-
Imagenet large scale visual recognition challenge
-
Russakovsky, O.; Deng, J.; Su, H.; Krause, J.; Satheesh, S.; Ma, S.; Huang, Z.; Karpathy, A.; Khosla, A.; Bernstein, M. S.; Berg, A. C.; and Li, F. 2015. Imagenet large scale visual recognition challenge. IJCV 115(3):211-252.
-
(2015)
IJCV
, vol.115
, Issue.3
, pp. 211-252
-
-
Russakovsky, O.1
Deng, J.2
Su, H.3
Krause, J.4
Satheesh, S.5
Ma, S.6
Huang, Z.7
Karpathy, A.8
Khosla, A.9
Bernstein, M.S.10
Berg, A.C.11
Li, F.12
-
36
-
-
85032218685
-
-
CoRR abs/1706.01427
-
Santoro, A.; Raposo, D.; Barrett, D. G.; Malinowski, M.; Pascanu, R.; Battaglia, P.; and Lillicrap, T. 2017. A simple neural network module for relational reasoning. CoRR abs/1706.01427.
-
(2017)
A Simple Neural Network Module for Relational Reasoning
-
-
Santoro, A.1
Raposo, D.2
Barrett, D.G.3
Malinowski, M.4
Pascanu, R.5
Battaglia, P.6
Lillicrap, T.7
-
37
-
-
85088226307
-
Outrageously large neural networks: The sparsely-gated mixture-of-experts layer
-
Shazeer, N.; Mirhoseini, A.; Maziarz, K.; Davis, A.; Le, Q.; Hinton, G.; and Dean, J. 2017. Outrageously large neural networks: The sparsely-gated mixture-of-experts layer. In ICLR.
-
(2017)
ICLR
-
-
Shazeer, N.1
Mirhoseini, A.2
Maziarz, K.3
Davis, A.4
Le, Q.5
Hinton, G.6
Dean, J.7
-
38
-
-
85011070895
-
-
CoRR abs/1609.03499
-
van den Oord, A.; Dieleman, S.; Zen, H.; Simonyan, K.; Vinyals, O.; Graves, A.; Kalchbrenner, N.; Senior, A.; and Kavukcuoglu, K. 2016a. Wavenet: A generative model for raw audio. CoRR abs/1609.03499.
-
(2016)
Wavenet: A Generative Model for Raw Audio
-
-
Van Den Oord, A.1
Dieleman, S.2
Zen, H.3
Simonyan, K.4
Vinyals, O.5
Graves, A.6
Kalchbrenner, N.7
Senior, A.8
Kavukcuoglu, K.9
-
39
-
-
85018873682
-
Conditional image generation with pixelcnn decoders
-
van den Oord, A.; Kalchbrenner, N.; Espeholt, L.; Vinyals, O.; Graves, A.; and Kavukcuoglu, K. 2016b. Conditional image generation with pixelcnn decoders. In NIPS.
-
(2016)
NIPS
-
-
Van Den Oord, A.1
Kalchbrenner, N.2
Espeholt, L.3
Vinyals, O.4
Graves, A.5
Kavukcuoglu, K.6
-
40
-
-
57249084011
-
Visualizing data using t-sne
-
Nov
-
van der Maaten, L., and Hinton, G. 2008. Visualizing data using t-sne. JMLR 9(Nov):2579-2605.
-
(2008)
JMLR
, vol.9
, pp. 2579-2605
-
-
Van Der Maaten, L.1
Hinton, G.2
-
41
-
-
85056814846
-
-
CoRR abs/1706.01433
-
Watters, N.; Tacchetti, A.; Weber, T.; Pascanu, R.; Battaglia, P.; and Zoran, D. 2017. Visual interaction networks. CoRR abs/1706.01433.
-
(2017)
Visual Interaction Networks
-
-
Watters, N.1
Tacchetti, A.2
Weber, T.3
Pascanu, R.4
Battaglia, P.5
Zoran, D.6
-
42
-
-
84986334021
-
Stacked attention networks for image question answering
-
Yang, Z.; He, X.; Gao, J.; Deng, L.; and Smola, A. J. 2016. Stacked attention networks for image question answering. In CVPR.
-
(2016)
CVPR
-
-
Yang, Z.1
He, X.2
Gao, J.3
Deng, L.4
Smola, A.J.5
|