SCOPUS 정보 검색 플랫폼

32nd AAAI Conference on Artificial Intelligence, AAAI 2018

Volumn , Issue , 2018, Pages 3942-3951

FiLM: Visual reasoning with a general conditioning layer

(5) Perez, Ethan a,b Strub, Florian d De Vries, Harm a Dumoulin, Vincent a Courville, Aaron a,c

a UNIVERSITÉ DE MONTRÉAL (Canada)

b RICE UNIVERSITY (United States)

c CIFAR Fellow (United States)

d UNIV LILLE (France)

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; DEEP LEARNING;

AFFINE TRANSFORMATIONS; ARCHITECTURAL MODIFICATION; LEARNING METHODS; LINEAR MODULATIONS; MODEL REASONINGS; NETWORK COMPUTATIONS; STATE OF THE ART; VISUAL REASONING;

NETWORK LAYERS;

EID: 85055416465 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (1888)

References (42)

1
- 85040308578
- Bottom-up and top-down attention for image captioning and vqa
- Anderson, P.; He, X.; Buehler, C.; Teney, D.; Johnson, M.; Gould, S.; and Zhang, L. 2017. Bottom-up and top-down attention for image captioning and vqa. In VQA Workshop at CVPR.
- (2017) VQA Workshop at CVPR
- Anderson, P.¹ He, X.² Buehler, C.³ Teney, D.⁴ Johnson, M.⁵ Gould, S.⁶ Zhang, L.⁷

2
- 84993660571
- Learning to compose neural networks for question answering
- Andreas, J.; Marcus, R.; Darrell, T.; and Klein, D. 2016a. Learning to compose neural networks for question answering. In NAACL.
- (2016) NAACL
- Andreas, J.¹ Marcus, R.² Darrell, T.³ Klein, D.⁴

3
- 84986272553
- Neural module networks
- Andreas, J.; Rohrbach, M.; Darrell, T.; and Klein, D. 2016b. Neural module networks. In CVPR.
- (2016) CVPR
- Andreas, J.¹ Rohrbach, M.² Darrell, T.³ Klein, D.⁴

4
- 84973890960
- VQA: Visual question answering
- Antol, S.; Agrawal, A.; Lu, J.; Mitchell, M.; Batra, D.; Zitnick, C. L.; and Parikh, D. 2015. VQA: Visual Question Answering. In ICCV.
- (2015) ICCV
- Antol, S.¹ Agrawal, A.² Lu, J.³ Mitchell, M.⁴ Batra, D.⁵ Zitnick, C.L.⁶ Parikh, D.⁷

5
- 84899013802
- Translating embeddings for modeling multi-relational data
- Burges, C. J. C.; Bottou, L.; Welling, M.; Ghahramani, Z.; and Weinberger, K. Q., eds, Curran Associates, Inc
- Bordes, A.; Usunier, N.; Garcia-Duran, A.; Weston, J.; and Yakhnenko, O. 2013. Translating embeddings for modeling multi-relational data. In Burges, C. J. C.; Bottou, L.; Welling, M.; Ghahramani, Z.; and Weinberger, K. Q., eds., NIPS. Curran Associates, Inc. 2787-2795.
- (2013) NIPS , pp. 2787-2795
- Bordes, A.¹ Usunier, N.² Garcia-Duran, A.³ Weston, J.⁴ Yakhnenko, O.⁵

6
- 84939821078
- Empirical evaluation of gated recurrent neural networks on sequence modeling
- Chung, J.; Gülçehre, Ç.; Cho, K.; and Bengio, Y. 2014. Empirical evaluation of gated recurrent neural networks on sequence modeling. In Deep Learning Workshop at NIPS.
- (2014) Deep Learning Workshop at NIPS
- Chung, J.¹ Gülçehre, Ç.² Cho, K.³ Bengio, Y.⁴

7
- 85043992858
- Modulating early visual processing by language
- de Vries, H.; Strub, F.; Mary, J.; Larochelle, H.; Pietquin, O.; and Courville, A. C. 2017. Modulating early visual processing by language. In NIPS.
- (2017) NIPS
- De Vries, H.¹ Strub, F.² Mary, J.³ Larochelle, H.⁴ Pietquin, O.⁵ Courville, A.C.⁶

8
- 85088228106
- A learned representation for artistic style
- Dumoulin, V.; Shlens, J.; and Kudlur, M. 2017. A learned representation for artistic style. In ICLR.
- (2017) ICLR
- Dumoulin, V.¹ Shlens, J.² Kudlur, M.³

9
- 85083952626
- Learning factored representations in a deep mixture of experts
- Eigen, D.; Ranzato, M.; and Sutskever, I. 2014. Learning factored representations in a deep mixture of experts. In ICLR Workshops.
- (2014) ICLR Workshops
- Eigen, D.¹ Ranzato, M.² Sutskever, I.³

10
- 85046994169
- Convolutional sequence to sequence learning
- Gehring, J.; Auli, M.; Grangier, D.; Yarats, D.; and Dauphin, Y. N. 2017. Convolutional sequence to sequence learning. In ICML.
- (2017) ICML
- Gehring, J.¹ Auli, M.² Grangier, D.³ Yarats, D.⁴ Dauphin, Y.N.⁵

11
- 84925422907
- National Acad Sciences
- Geman, D.; Geman, S.; Hallonquist, N.; and Younes, L. 2015. Visual turing test for computer vision systems. volume 112, 3618-3623. National Acad Sciences.
- (2015) Visual Turing Test for Computer Vision Systems , vol.112 , pp. 3618-3623
- Geman, D.¹ Geman, S.² Hallonquist, N.³ Younes, L.⁴

12
- 85029539334
- CoRR abs/1705.06830
- Ghiasi, G.; Lee, H.; Kudlur, M.; Dumoulin, V.; and Shlens, J. 2017. Exploring the structure of a real-time, arbitrary neural artistic stylization network. CoRR abs/1705.06830.
- (2017) Exploring the Structure of a Real-Time, Arbitrary Neural Artistic Stylization Network
- Ghiasi, G.¹ Lee, H.² Kudlur, M.³ Dumoulin, V.⁴ Shlens, J.⁵

13
- 85041900002
- Making the V in VQA matter: Elevating the role of image understanding in Visual Question Answering
- Goyal, Y.; Khot, T.; Summers-Stay, D.; Batra, D.; and Parikh, D. 2017. Making the V in VQA matter: Elevating the role of image understanding in Visual Question Answering. In CVPR.
- (2017) CVPR
- Goyal, Y.¹ Khot, T.² Summers-Stay, D.³ Batra, D.⁴ Parikh, D.⁵

14
- 84959890267
- Traversing knowledge graphs in vector space
- Guu, K.; Miller, J.; and Liang, P. 2015. Traversing knowledge graphs in vector space. In EMNLP.
- (2015) EMNLP
- Guu, K.¹ Miller, J.² Liang, P.³

15
- 85060488748
- Hypernetworks
- Ha, D.; Dai, A.; and Le, Q. 2016. Hypernetworks. In ICLR.
- (2016) ICLR
- Ha, D.¹ Dai, A.² Le, Q.³

16
- 84986274465
- Deep residual learning for image recognition
- He, K.; Zhang, X.; Ren, S.; and Sun, J. 2016. Deep residual learning for image recognition. In CVPR.
- (2016) CVPR
- He, K.¹ Zhang, X.² Ren, S.³ Sun, J.⁴

17
- 0031573117
- Long short-term memory
- Hochreiter, S., and Schmidhuber, J. 1997. Long short-term memory. Neural Comput. 9(8):1735-1780.
- (1997) Neural Comput , vol.9 , Issue.8 , pp. 1735-1780
- Hochreiter, S.¹ Schmidhuber, J.²

18
- 85041904328
- Learning to reason: End-to-end module networks for visual question answering
- Hu, R.; Andreas, J.; Rohrbach, M.; Darrell, T.; and Saenko, K. 2017. Learning to reason: End-to-end module networks for visual question answering. In ICCV.
- (2017) ICCV
- Hu, R.¹ Andreas, J.² Rohrbach, M.³ Darrell, T.⁴ Saenko, K.⁵

19
- 85040697657
- Squeeze-and-excitation networks
- Hu, J.; Shen, L.; and Sun, G. 2017. Squeeze-and-Excitation Networks. In ILSVRC 2017 Workshop at CVPR.
- (2017) ILSVRC 2017 Workshop at CVPR
- Hu, J.¹ Shen, L.² Sun, G.³

20
- 85041925505
- Arbitrary style transfer in real-time with adaptive instance normalization
- Huang, X., and Belongie, S. 2017. Arbitrary style transfer in real-time with adaptive instance normalization. In ICCV.
- (2017) ICCV
- Huang, X.¹ Belongie, S.²

21
- 84969584486
- Batch normalization: Accelerating deep network training by reducing internal covariate shift
- Ioffe, S., and Szegedy, C. 2015. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In ICML.
- (2015) ICML
- Ioffe, S.¹ Szegedy, C.²

22
- 85041904911
- CLEVR: A diagnostic dataset for compositional language and elementary visual reasoning
- Johnson, J.; Hariharan, B.; van der Maaten, L.; Fei-Fei, L.; Zitnick, C. L.; and Girshick, R. B. 2017a. CLEVR: A diagnostic dataset for compositional language and elementary visual reasoning. In CVPR.
- (2017) CVPR
- Johnson, J.¹ Hariharan, B.² Van Der Maaten, L.³ Fei-Fei, L.⁴ Zitnick, C.L.⁵ Girshick, R.B.⁶

23
- 85041924656
- Inferring and executing programs for visual reasoning
- Johnson, J.; Hariharan, B.; van der Maaten, L.; Hoffman, J.; Li, F.; Zitnick, C. L.; and Girshick, R. B. 2017b. Inferring and executing programs for visual reasoning. In ICCV.
- (2017) ICCV
- Johnson, J.¹ Hariharan, B.² Van Der Maaten, L.³ Hoffman, J.⁴ Li, F.⁵ Zitnick, C.L.⁶ Girshick, R.B.⁷

24
- 0000262562
- Hierarchical mixtures of experts and the em algorithm
- Jordan, M. I., and Jacobs, R. A. 1994. Hierarchical mixtures of experts and the em algorithm. Neural Comput. 6(2):181-214.
- (1994) Neural Comput , vol.6 , Issue.2 , pp. 181-214
- Jordan, M.I.¹ Jacobs, R.A.²

25
- 85039151782
- Dynamic layer normalization for adaptive neural acoustic modeling in speech recognition
- Kim, T.; Song, I.; and Bengio, Y. 2017. Dynamic layer normalization for adaptive neural acoustic modeling in speech recognition. In InterSpeech.
- (2017) InterSpeech
- Kim, T.¹ Song, I.² Bengio, Y.³

26
- 85083951076
- Adam: A method for stochastic optimization
- Kingma, D. P., and Ba, J. 2015. Adam: A method for stochastic optimization. In ICLR.
- (2015) ICLR
- Kingma, D.P.¹ Ba, J.²

27
- 85016395012
- Overcoming catastrophic forgetting in neural networks
- Kirkpatrick, J.; Pascanu, R.; Rabinowitz, N.; Veness, J.; Des-jardins, G.; Rusu, A. A.; Milan, K.; Quan, J.; Ramalho, T.; Grabska-Barwinska, A.; Hassabis, D.; Clopath, C.; Kumaran, D.; and Hadsell, R. 2017. Overcoming catastrophic forgetting in neural networks. National Academy of Sciences 114(13):3521-3526.
- (2017) National Academy of Sciences , vol.114 , Issue.13 , pp. 3521-3526
- Kirkpatrick, J.¹ Pascanu, R.² Rabinowitz, N.³ Veness, J.⁴ Des-Jardins, G.⁵ Rusu, A.A.⁶ Milan, K.⁷ Quan, J.⁸ Ramalho, T.⁹ Grabska-Barwinska, A.¹⁰ Hassabis, D.¹¹ Clopath, C.¹² Kumaran, D.¹³ Hadsell, R.¹⁴

28
- 85018917850
- Hierarchical question-image co-attention for visual question answering
- Lu, J.; Yang, J.; Batra, D.; and Parikh, D. 2016. Hierarchical question-image co-attention for visual question answering. In NIPS.
- (2016) NIPS
- Lu, J.¹ Yang, J.² Batra, D.³ Parikh, D.⁴

29
- 84937822746
- A multi-world approach to question answering about real-world scenes based on uncertain input
- Malinowski, M., and Fritz, M. 2014. A multi-world approach to question answering about real-world scenes based on uncertain input. In NIPS.
- (2014) NIPS
- Malinowski, M.¹ Fritz, M.²

30
- 84973896625
- Ask your neurons: A neural-based approach to answering questions about images
- Malinowski, M.; Rohrbach, M.; and Fritz, M. 2015. Ask your neurons: A neural-based approach to answering questions about images. In ICCV.
- (2015) ICCV
- Malinowski, M.¹ Rohrbach, M.² Fritz, M.³

31
- 84898956512
- Distributed representations of words and phrases and their compositionality
- Mikolov, T.; Sutskever, I.; Chen, K.; Corrado, G. S.; and Dean, J. 2013. Distributed representations of words and phrases and their compositionality. In NIPS.
- (2013) NIPS
- Mikolov, T.¹ Sutskever, I.² Chen, K.³ Corrado, G.S.⁴ Dean, J.⁵

32
- 85057290984
- Zero-shot task generalization with multi-task deep reinforcement learning
- Oh, J.; Singh, S.; Lee, H.; and Kholi, P. 2017. Zero-shot task generalization with multi-task deep reinforcement learning. In ICML.
- (2017) ICML
- Oh, J.¹ Singh, S.² Lee, H.³ Kholi, P.⁴

33
- 85055556180
- Learning visual reasoning without strong priors
- Perez, E.; de Vries, H.; Strub, F.; Dumoulin, V.; and Courville, A. C. 2017. Learning visual reasoning without strong priors. In MLSLP Workshop at ICML.
- (2017) MLSLP Workshop at ICML
- Perez, E.¹ De Vries, H.² Strub, F.³ Dumoulin, V.⁴ Courville, A.C.⁵

34
- 85083950271
- Unsupervised representation learning with deep convolutional generative adversarial networks
- Radford, A.; Metz, L.; and Chintala, S. 2016. Unsupervised representation learning with deep convolutional generative adversarial networks. In ICLR.
- (2016) ICLR
- Radford, A.¹ Metz, L.² Chintala, S.³

35
- 84947041871
- Imagenet large scale visual recognition challenge
- Russakovsky, O.; Deng, J.; Su, H.; Krause, J.; Satheesh, S.; Ma, S.; Huang, Z.; Karpathy, A.; Khosla, A.; Bernstein, M. S.; Berg, A. C.; and Li, F. 2015. Imagenet large scale visual recognition challenge. IJCV 115(3):211-252.
- (2015) IJCV , vol.115 , Issue.3 , pp. 211-252
- Russakovsky, O.¹ Deng, J.² Su, H.³ Krause, J.⁴ Satheesh, S.⁵ Ma, S.⁶ Huang, Z.⁷ Karpathy, A.⁸ Khosla, A.⁹ Bernstein, M.S.¹⁰ Berg, A.C.¹¹ Li, F.¹²

36
- 85032218685
- CoRR abs/1706.01427
- Santoro, A.; Raposo, D.; Barrett, D. G.; Malinowski, M.; Pascanu, R.; Battaglia, P.; and Lillicrap, T. 2017. A simple neural network module for relational reasoning. CoRR abs/1706.01427.
- (2017) A Simple Neural Network Module for Relational Reasoning
- Santoro, A.¹ Raposo, D.² Barrett, D.G.³ Malinowski, M.⁴ Pascanu, R.⁵ Battaglia, P.⁶ Lillicrap, T.⁷

37
- 85088226307
- Outrageously large neural networks: The sparsely-gated mixture-of-experts layer
- Shazeer, N.; Mirhoseini, A.; Maziarz, K.; Davis, A.; Le, Q.; Hinton, G.; and Dean, J. 2017. Outrageously large neural networks: The sparsely-gated mixture-of-experts layer. In ICLR.
- (2017) ICLR
- Shazeer, N.¹ Mirhoseini, A.² Maziarz, K.³ Davis, A.⁴ Le, Q.⁵ Hinton, G.⁶ Dean, J.⁷

38
- 85011070895
- CoRR abs/1609.03499
- van den Oord, A.; Dieleman, S.; Zen, H.; Simonyan, K.; Vinyals, O.; Graves, A.; Kalchbrenner, N.; Senior, A.; and Kavukcuoglu, K. 2016a. Wavenet: A generative model for raw audio. CoRR abs/1609.03499.
- (2016) Wavenet: A Generative Model for Raw Audio
- Van Den Oord, A.¹ Dieleman, S.² Zen, H.³ Simonyan, K.⁴ Vinyals, O.⁵ Graves, A.⁶ Kalchbrenner, N.⁷ Senior, A.⁸ Kavukcuoglu, K.⁹

39
- 85018873682
- Conditional image generation with pixelcnn decoders
- van den Oord, A.; Kalchbrenner, N.; Espeholt, L.; Vinyals, O.; Graves, A.; and Kavukcuoglu, K. 2016b. Conditional image generation with pixelcnn decoders. In NIPS.
- (2016) NIPS
- Van Den Oord, A.¹ Kalchbrenner, N.² Espeholt, L.³ Vinyals, O.⁴ Graves, A.⁵ Kavukcuoglu, K.⁶

40
- 57249084011
- Visualizing data using t-sne
- Nov
- van der Maaten, L., and Hinton, G. 2008. Visualizing data using t-sne. JMLR 9(Nov):2579-2605.
- (2008) JMLR , vol.9 , pp. 2579-2605
- Van Der Maaten, L.¹ Hinton, G.²

41
- 85056814846
- CoRR abs/1706.01433
- Watters, N.; Tacchetti, A.; Weber, T.; Pascanu, R.; Battaglia, P.; and Zoran, D. 2017. Visual interaction networks. CoRR abs/1706.01433.
- (2017) Visual Interaction Networks
- Watters, N.¹ Tacchetti, A.² Weber, T.³ Pascanu, R.⁴ Battaglia, P.⁵ Zoran, D.⁶

42
- 84986334021
- Stacked attention networks for image question answering
- Yang, Z.; He, X.; Gao, J.; Deng, L.; and Smola, A. J. 2016. Stacked attention networks for image question answering. In CVPR.
- (2016) CVPR
- Yang, Z.¹ He, X.² Gao, J.³ Deng, L.⁴ Smola, A.J.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.