SCOPUS Information Retrieval Platform

Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

Volume 2016-December, 2016, Pages 30-38

Image question answering using convolutional neural network with dynamic parameter prediction

(3) Noh, Hyeonwoo a; Seo, Paul Hongsuck a; Han, Bohyung a

a Pohang University of Science and Technology (POSTECH) (South Korea)

Author keywords

[No Author keywords available]

Indexed keywords

BACKPROPAGATION; BENCHMARKING; COMPUTER VISION; CONVOLUTION; FORECASTING; HASH FUNCTIONS; NEURAL NETWORKS; PATTERN RECOGNITION;

ADAPTIVE PARAMETERS; CONVOLUTIONAL NEURAL NETWORK; DYNAMIC PARAMETERS; HASHING TECHNIQUES; JOINT NETWORK; PARAMETER PREDICTION; QUESTION ANSWERING; STATE-OF-THE-ART PERFORMANCE;

COMPLEX NETWORKS;

EID: 84986261711 PISSN: 10636919 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/CVPR.2016.11 Document Type: Conference Paper

Times cited : (359)

References (31)

1
- 84973890960
- VQA: Visual question answering
- 1, 2, 5, 6, 7, 8
- S. Antol, A. Agrawal, J. Lu, M. Mitchell, D. Batra, C. L. Zitnick, and D. Parikh. VQA: visual question answering. In ICCV, 2015.
- (2015) ICCV
- Antol, S.¹ Agrawal, A.² Lu, J.³ Mitchell, M.⁴ Batra, D.⁵ Zitnick, C.L.⁶ Parikh, D.⁷

2
- 84973882857
- Predicting deep zero-shot convolutional neural networks using textual descriptions
- 2
- J. Ba, K. Swersky, S. Fidler, and R. Salakhutdinov. Predicting deep zero-shot convolutional neural networks using textual descriptions. In ICCV, 2015.
- (2015) ICCV
- Ba, J.¹ Swersky, K.² Fidler, S.³ Salakhutdinov, R.⁴

3
- 84969930652
- Compressing neural networks with the hashing trick
- 2, 4, 5
- W. Chen, J. T. Wilson, S. Tyree, K. Q. Weinberger, and Y. Chen. Compressing neural networks with the hashing trick. In ICML, 2015.
- (2015) ICML
- Chen, W.¹ Wilson, J.T.² Tyree, S.³ Weinberger, K.Q.⁴ Chen, Y.⁵

4
- 84939821078
- Empirical evaluation of gated recurrent neural networks on sequence modeling
- 4, 5, 7
- J. Chung, C. Gulcehre, K. Cho, and Y. Bengio. Empirical evaluation of gated recurrent neural networks on sequence modeling. In NIPS Deep Learning Workshop, 2014.
- (2014) NIPS Deep Learning Workshop
- Chung, J.¹ Gulcehre, C.² Cho, K.³ Bengio, Y.⁴

5
- 84911453074
- Describing textures in the wild
- 1
- M. Cimpoi, S. Maji, I. Kokkinos, S. Mohamed, and A. Vedaldi. Describing textures in the wild. In CVPR, 2014.
- (2014) CVPR
- Cimpoi, M.¹ Maji, S.² Kokkinos, I.³ Mohamed, S.⁴ Vedaldi, A.⁵

6
- 85198028989
- Imagenet: A large-scale hierarchical image database
- 3
- J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. Imagenet: A large-scale hierarchical image database. In CVPR, 2009.
- (2009) CVPR
- Deng, J.¹ Dong, W.² Socher, R.³ Li, L.-J.⁴ Li, K.⁵ Fei-Fei, L.⁶

7
- 84898971588
- Predicting parameters in deep learning
- 5
- M. Denil, B. Shakibi, L. Dinh, N. de Freitas, et al. Predicting parameters in deep learning. In NIPS, 2013.
- (2013) NIPS
- Denil, M.¹ Shakibi, B.² Dinh, L.³ De Freitas, N.⁴

8
- 84919881041
- DeCAF: A deep convolutional activation feature for generic visual recognition
- 1
- J. Donahue, Y. Jia, O. Vinyals, J. Hoffman, N. Zhang, E. Tzeng, and T. Darrell. DeCAF: A deep convolutional activation feature for generic visual recognition. In ICML, 2014.
- (2014) ICML
- Donahue, J.¹ Jia, Y.² Vinyals, O.³ Hoffman, J.⁴ Zhang, N.⁵ Tzeng, E.⁶ Darrell, T.⁷

9
- 0004289791
- WordNet: An electronic lexical database
- 6
- C. Fellbaum. WordNet: An electronic lexical database, 1998.
- (1998) WordNet: An Electronic Lexical Database
- Fellbaum, C.¹

10
- 84965148420
- Are you talking to a machine? Dataset and methods for multilingual image question answering
- 1, 2
- H. Gao, J. Mao, J. Zhou, Z. Huang, L. Wang, and W. Xu. Are you talking to a machine? Dataset and methods for multilingual image question answering. In NIPS, 2015.
- (2015) NIPS
- Gao, H.¹ Mao, J.² Zhou, J.³ Huang, Z.⁴ Wang, L.⁵ Xu, W.⁶

11
- 0031573117
- Long short-term memory
- 5
- S. Hochreiter and J. Schmidhuber. Long short-term memory. Neural computation, 9 (8): 1735-1780, 1997.
- (1997) Neural Computation , vol.9 , Issue.8 , pp. 1735-1780
- Hochreiter, S.¹ Schmidhuber, J.²

12
- 84969584486
- Batch normalization: Accelerating deep network training by reducing internal covariate shift
- 6
- S. Ioffe and C. Szegedy. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In ICML, 2015.
- (2015) ICML
- Ioffe, S.¹ Szegedy, C.²

13
- 85083951076
- Adam: A method for stochastic optimization
- 6
- D. Kingma and J. Ba. Adam: A method for stochastic optimization. In ICLR, 2015.
- (2015) ICLR
- Kingma, D.¹ Ba, J.²

14
- 84965153327
- Skip-thought vectors
- 2, 4, 5
- R. Kiros, Y. Zhu, R. Salakhutdinov, R. S. Zemel, A. Torralba, R. Urtasun, and S. Fidler. Skip-thought vectors. In NIPS, 2015.
- (2015) NIPS
- Kiros, R.¹ Zhu, Y.² Salakhutdinov, R.³ Zemel, R.S.⁴ Torralba, A.⁵ Urtasun, R.⁶ Fidler, S.⁷

15
- 85009931853
- Microsoft COCO: Common objects in context
- 6
- T.-Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollár, and C. L. Zitnick. Microsoft COCO: common objects in context. In ECCV, 2014.
- (2014) ECCV
- Lin, T.-Y.¹ Maire, M.² Belongie, S.³ Hays, J.⁴ Perona, P.⁵ Ramanan, D.⁶ Dollár, P.⁷ Zitnick, C.L.⁸

16
- 85007153677
- Learning to answer questions from image using convolutional neural network
- 1, 2, 3, 7
- L. Ma, Z. Lu, and H. Li. Learning to answer questions from image using convolutional neural network. In AAAI, 2016.
- (2016) AAAI
- Ma, L.¹ Lu, Z.² Li, H.³

17
- 84937822746
- A multi-world approach to question answering about real-world scenes based on uncertain input
- 1, 2, 6, 7
- M. Malinowski and M. Fritz. A multi-world approach to question answering about real-world scenes based on uncertain input. In NIPS, 2014.
- (2014) NIPS
- Malinowski, M.¹ Fritz, M.²

18
- 84973896625
- Ask your neurons: A neural-based approach to answering questions about images
- 1, 2, 7
- M. Malinowski, M. Rohrbach, and M. Fritz. Ask your neurons: A neural-based approach to answering questions about images. In ICCV, 2015.
- (2015) ICCV
- Malinowski, M.¹ Rohrbach, M.² Fritz, M.³

19
- 79959829092
- Recurrent neural network based language model
- 5
- T. Mikolov, M. Karafiát, L. Burget, J. Černocký, and S. Khudanpur. Recurrent neural network based language model. In INTERSPEECH, 2010.
- (2010) INTERSPEECH
- Mikolov, T.¹ Karafiát, M.² Burget, L.³ Černocký, J.⁴ Khudanpur, S.⁵

20
- 84886073305
- Indoor segmentation and support inference from RGBD images
- 6
- N. Silberman, D. Hoiem, P. Kohli, and R. Fergus. Indoor segmentation and support inference from RGBD images. In ECCV, 2012.
- (2012) ECCV
- Silberman, N.¹ Hoiem, D.² Kohli, P.³ Fergus, R.⁴

21
- 84911449395
- Learning and transferring mid-level image representations using convolutional neural networks
- 1
- M. Oquab, L. Bottou, I. Laptev, and J. Sivic. Learning and transferring mid-level image representations using convolutional neural networks. In CVPR, 2014.
- (2014) CVPR
- Oquab, M.¹ Bottou, L.² Laptev, I.³ Sivic, J.⁴

22
- 84897497795
- On the difficulty of training recurrent neural networks
- 6
- R. Pascanu, T. Mikolov, and Y. Bengio. On the difficulty of training recurrent neural networks. In ICML, 2013.
- (2013) ICML
- Pascanu, R.¹ Mikolov, T.² Bengio, Y.³

23
- 84965170394
- Exploring models and data for image question answering
- 1, 2, 3, 5, 6, 7
- M. Ren, R. Kiros, and R. S. Zemel. Exploring models and data for image question answering. In NIPS, 2015.
- (2015) NIPS
- Ren, M.¹ Kiros, R.² Zemel, R.S.³

24
- 85083953063
- Very deep convolutional networks for large-scale image recognition
- 1, 3
- K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. In ICLR, 2015.
- (2015) ICLR
- Simonyan, K.¹ Zisserman, A.²

25
- 84928547704
- Sequence to sequence learning with neural networks
- 5
- I. Sutskever, O. Vinyals, and Q. V. Le. Sequence to sequence learning with neural networks. In NIPS, 2014.
- (2014) NIPS
- Sutskever, I.¹ Vinyals, O.² Le, Q.V.³

26
- 84937522268
- Going deeper with convolutions
- 1
- C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich. Going deeper with convolutions. In CVPR, 2015.
- (2015) CVPR
- Szegedy, C.¹ Liu, W.² Jia, Y.³ Sermanet, P.⁴ Reed, S.⁵ Anguelov, D.⁶ Erhan, D.⁷ Vanhoucke, V.⁸ Rabinovich, A.⁹

27
- 84911198048
- Deepface: Closing the gap to human-level performance in face verification
- 1
- Y. Taigman, M. Yang, M. Ranzato, and L. Wolf. Deepface: Closing the gap to human-level performance in face verification. In CVPR, 2014.
- (2014) CVPR
- Taigman, Y.¹ Yang, M.² Ranzato, M.³ Wolf, L.⁴

28
- 85146676791
- Verb semantics and lexical selection
- 6
- Z. Wu and M. Palmer. Verb semantics and lexical selection. In ACL, 1994.
- (1994) ACL
- Wu, Z.¹ Palmer, M.²

29
- 84970002232
- Show, attend and tell: Neural image caption generation with visual attention
- 6, 8
- K. Xu, J. Ba, R. Kiros, A. Courville, R. Salakhutdinov, R. Zemel, and Y. Bengio. Show, attend and tell: Neural image caption generation with visual attention. In ICML, 2015.
- (2015) ICML
- Xu, K.¹ Ba, J.² Kiros, R.³ Courville, A.⁴ Salakhutdinov, R.⁵ Zemel, R.⁶ Bengio, Y.⁷

30
- 84866687133
- Describing the scene as a whole: Joint object detection, scene classification and semantic segmentation
- 1
- J. Yao, S. Fidler, and R. Urtasun. Describing the scene as a whole: Joint object detection, scene classification and semantic segmentation. In CVPR, 2012.
- (2012) CVPR
- Yao, J.¹ Fidler, S.² Urtasun, R.³

31
- 84937964578
- Learning deep features for scene recognition using places database
- 1
- B. Zhou, A. Lapedriza, J. Xiao, A. Torralba, and A. Oliva. Learning deep features for scene recognition using places database. In NIPS, 2014.
- (2014) NIPS
- Zhou, B.¹ Lapedriza, A.² Xiao, J.³ Torralba, A.⁴ Oliva, A.⁵

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.