SCOPUS 정보 검색 플랫폼

33rd International Conference on Machine Learning, ICML 2016

Volumn 3, Issue , 2016, Pages 1681-1690

Generative adversarial text to image synthesis

(6) Reed, Scott a Akata, Zeynep b Yan, Xinchen a Logeswaran, Lajanugen a Schiele, Bernt b Lee, Honglak a

a UNIVERSITY OF MICHIGAN (United States)

b MAX PLANCK INSTITUTE FOR INFORMATICS (Germany)

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; LEARNING SYSTEMS; NETWORK ARCHITECTURE; NEURAL NETWORKS; RECURRENT NEURAL NETWORKS;

ADVERSARIAL NETWORKS; AUTOMATIC SYNTHESIS; DEEP ARCHITECTURES; IMAGE MODELING; IMAGE SYNTHESIS; REALISTIC IMAGES; TEXT FEATURE; VISUAL CONCEPT;

IMAGE PROCESSING;

EID: 84998636515 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (1429)

References (38)

1
- 84959243017
- Evaluation of Output Embeddings for Fine-Grained Image Classification
- Akata, Z., Reed, S., Walter, D., Lee, H., and Schiele, B. Evaluation of Output Embeddings for Fine-Grained Image Classification. In CVPR, 2015.
- (2015) CVPR
- Akata, Z.¹ Reed, S.² Walter, D.³ Lee, H.⁴ Schiele, B.⁵

2
- 85083951076
- Adam: A method for stochastic optimization
- Ba, J. and Kingma, D. Adam: A method for stochastic optimization. In ICLR, 2015.
- (2015) ICLR
- Ba, J.¹ Kingma, D.²

3
- 84882266451
- Better mixing via deep representations
- Bengio, Y, Mesnil, G., Dauphin, Y, and Rifai, S. Better mixing via deep representations. In ICML, 2013.
- (2013) ICML
- Bengio, Y.¹ Mesnil, G.² Dauphin, Y.³ Rifai, S.⁴

4
- 84965143571
- Deep generative image models using a laplacian pyramid of adversarial networks
- Denton, E. L., Chintala, S., Fergus, R., et al. Deep generative image models using a laplacian pyramid of adversarial networks. In NIPS, 2015.
- (2015) NIPS
- Denton, E.L.¹ Chintala, S.² Fergus, R.³

5
- 84959236502
- Longterm recurrent convolutional networks for visual recognition and description
- Donahue, J., Hendricks, L. A., Guadarrama, S., Rohrbach, M., Venugopalan, S., Saenko, K., and Darrell, T. Longterm recurrent convolutional networks for visual recognition and description. In CVPR, 2015.
- (2015) CVPR
- Donahue, J.¹ Hendricks, L.A.² Guadarrama, S.³ Rohrbach, M.⁴ Venugopalan, S.⁵ Saenko, K.⁶ Darrell, T.⁷

6
- 84959184995
- Learning to generate chairs with convolutional neural networks
- Dosovitskiy, A., Tobias Springenberg, J., and Brox, T. Learning to generate chairs with convolutional neural networks. In CVPR, 2015.
- (2015) CVPR
- Dosovitskiy, A.¹ Tobias Springenberg, J.² Brox, T.³

7
- 70450207704
- Describing objects by their attributes
- Farhadi, A., Endres, I., Hoiem, D., and Forsyth, D. Describing objects by their attributes. In CVPR, 2009.
- (2009) CVPR
- Farhadi, A.¹ Endres, I.² Hoiem, D.³ Forsyth, D.⁴

8
- 84906482165
- Transductive multi-view embedding for zero-shot recognition and annotation
- Fu, Y, Hospedales, T. M., Xiang, T., Fu, Z., and Gong, S. Transductive multi-view embedding for zero-shot recognition and annotation. In ECCV, 2014.
- (2014) ECCV
- Fu, Y.¹ Hospedales, T.M.² Xiang, T.³ Fu, Z.⁴ Gong, S.⁵

9
- 84986326086
- Technical report
- Gauthier, J. Conditional generative adversarial nets for convolutional face generation. Technical report, 2015.
- (2015) Conditional Generative Adversarial Nets for Convolutional Face Generation
- Gauthier, J.¹

10
- 84937849144
- Generative adversarial nets
- Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., and Bengio, Y. Generative adversarial nets. In NIPS, 2014.
- (2014) NIPS
- Goodfellow, I.¹ Pouget-Abadie, J.² Mirza, M.³ Xu, B.⁴ Warde-Farley, D.⁵ Ozair, S.⁶ Courville, A.⁷ Bengio, Y.⁸

11
- 84983208884
- Draw: A recurrent neural network for image generation
- Gregor, K., Danihelka, I., Graves, A., Rezende, D., and Wierstra, D. Draw: A recurrent neural network for image generation. In ICML, 2015.
- (2015) ICML
- Gregor, K.¹ Danihelka, I.² Graves, A.³ Rezende, D.⁴ Wierstra, D.⁵

12
- 0031573117
- Long short-term memory
- Hochreiter, S. and Schmidhuber, J. Long short-term memory. Neural computation, 9(8):1735-1780, 1997.
- (1997) Neural Computation , vol.9 , Issue.8 , pp. 1735-1780
- Hochreiter, S.¹ Schmidhuber, J.²

13
- 84969584486
- Batch normalization: Accelerating deep network training by reducing internal covariate shift
- Ioffe, S. and Szegedy, C. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In ICML, 2015.
- (2015) ICML
- Ioffe, S.¹ Szegedy, C.²

14
- 84946734827
- Deep visual-semantic alignments for generating image descriptions
- Karpathy, A. and Li, F. Deep visual-semantic alignments for generating image descriptions. In CVPR, 2015.
- (2015) CVPR
- Karpathy, A.¹ Li, F.²

15
- 84952349298
- Unifying visual-semantic embeddings with multimodal neural language models
- Kiros, R., Salakhutdinov, R., and Zemel, R. S. Unifying visual-semantic embeddings with multimodal neural language models. In ACL, 2014.
- (2014) ACL
- Kiros, R.¹ Salakhutdinov, R.² Zemel, R.S.³

16
- 77953185711
- Attribute and simile classifiers for face verification
- Kumar, N., Berg, A. C., Belhumeur, P. N., and Nayar, S. K. Attribute and simile classifiers for face verification. In ICCV, 2009.
- (2009) ICCV
- Kumar, N.¹ Berg, A.C.² Belhumeur, P.N.³ Nayar, S.K.⁴

17
- 84894522762
- Attributebased classification for zero-shot visual object categorization
- Lampert, C. H., Nickisch, H., and Harmeling, S. Attributebased classification for zero-shot visual object categorization. TPAMI, 36(3):453-465, 2014.
- (2014) TPAMI , vol.36 , Issue.3 , pp. 453-465
- Lampert, C.H.¹ Nickisch, H.² Harmeling, S.³

18
- 84937834115
- Microsoft coco: Common objects in context
- Lin, T.-Y, Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Dollár, P., and Zitnick, C. L. Microsoft coco: Common objects in context. In ECCV. 2014.
- (2014) ECCV
- Lin, T.-Y.¹ Maire, M.² Belongie, S.³ Hays, J.⁴ Perona, P.⁵ Ramanan, D.⁶ Dollár, P.⁷ Zitnick, C.L.⁸

19
- 85083950885
- Generating images from captions with attention
- Mansimov, E., Parisotto, E., Ba, J. L., and Salakhutdinov, R. Generating images from captions with attention. ICLR, 2016.
- (2016) ICLR
- Mansimov, E.¹ Parisotto, E.² Ba, J.L.³ Salakhutdinov, R.⁴

20
- 85083950512
- Deep captioning with multimodal recurrent neural networks (m-rnn)
- Mao, J., Xu, W., Yang, Y, Wang, J., and Yuille, A. Deep captioning with multimodal recurrent neural networks (m-rnn). ICLR, 2015.
- (2015) ICLR
- Mao, J.¹ Xu, W.² Yang, Y.³ Wang, J.⁴ Yuille, A.⁵

21
- 84987947153
- arXiv preprint arXiv:1411.1784
- Mirza, M. and Osindero, S. Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784, 2014.
- (2014) Conditional Generative Adversarial Nets
- Mirza, M.¹ Osindero, S.²

22
- 80053437179
- Multimodal deep learning
- Ngiam, J., Khosla, A., Kim, M., Nam, J., Lee, H., and Ng, A. Y Multimodal deep learning. In ICML, 2011.
- (2011) ICML
- Ngiam, J.¹ Khosla, A.² Kim, M.³ Nam, J.⁴ Lee, H.⁵ Ng, A.Y.⁶

23
- 84856670612
- Relative attributes
- Parikh, D. and Grauman, K. Relative attributes. In ICCV, 2011.
- (2011) ICCV
- Parikh, D.¹ Grauman, K.²

24
- 85083950271
- Radford, A., Metz, L., and Chintala, S. Unsupervised representation learning with deep convolutional generative adversarial networks. 2016.
- (2016) Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks
- Radford, A.¹ Metz, L.² Chintala, S.³

25
- 84919832734
- Learning to disentangle factors of variation with manifold interaction
- Reed, S., Sohn, K., Zhang, Y, and Lee, H. Learning to disentangle factors of variation with manifold interaction. In ICML, 2014.
- (2014) ICML
- Reed, S.¹ Sohn, K.² Zhang, Y.³ Lee, H.⁴

26
- 84965113821
- Deep visual analogy-making
- Reed, S., Zhang, Y, Zhang, Y, and Lee, H. Deep visual analogy-making. In NIPS, 2015.
- (2015) NIPS
- Reed, S.¹ Zhang, Y.² Zhang, Y.³ Lee, H.⁴

27
- 84986250442
- Learning deep representations for fine-grained visual descriptions
- Reed, S., Akata, Z., Lee, H., and Schiele, B. Learning deep representations for fine-grained visual descriptions. In CVPR, 2016.
- (2016) CVPR
- Reed, S.¹ Akata, Z.² Lee, H.³ Schiele, B.⁴

28
- 84965170394
- Exploring models and data for image question answering
- Ren, M., Kiros, R., and Zemel, R. Exploring models and data for image question answering. In NIPS, 2015.
- (2015) NIPS
- Ren, M.¹ Kiros, R.² Zemel, R.³

29
- 84937873395
- Improved multimodal deep learning with variation of information
- Sohn, K., Shang, W., and Lee, H. Improved multimodal deep learning with variation of information. In NIPS, 2014.
- (2014) NIPS
- Sohn, K.¹ Shang, W.² Lee, H.³

30
- 84877724347
- Multimodal learning with deep boltzmann machines
- Srivastava, N. and Salakhutdinov, R. R. Multimodal learning with deep boltzmann machines. In NIPS, 2012.
- (2012) NIPS
- Srivastava, N.¹ Salakhutdinov, R.R.²

31
- 84937522268
- Going deeper with convolutions
- Szegedy, C, Liu, W., Jia, Y, Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. Going deeper with convolutions. In CVPR, 2015.
- (2015) CVPR
- Szegedy, C.¹ Liu, W.² Jia, Y.³ Sermanet, P.⁴ Reed, S.⁵ Anguelov, D.⁶ Erhan, D.⁷ Vanhoucke, V.⁸ Rabinovich, A.⁹

32
- 84946747440
- Show and tell: A neural image caption generator
- Vinyals, O., Toshev, A., Bengio, S., and Erhan, D. Show and tell: A neural image caption generator. In CVPR, 2015.
- (2015) CVPR
- Vinyals, O.¹ Toshev, A.² Bengio, S.³ Erhan, D.⁴

33
- 84878084353
- Wah, C, Branson, S., Welinder, P., Perona, P., and Belongie, S. The caltech-ucsd birds-200-2011 dataset. 2011.
- (2011) The Caltech-ucsd Birds-200-2011 Dataset
- Wah, C.¹ Branson, S.² Welinder, P.³ Perona, P.⁴ Belongie, S.⁵

34
- 84998721476
- arXiv preprint arXiv: 1511.02570
- Wang, P., Wu, Q., Shen, C, Hengel, A. v. d., and Dick, A. Explicit knowledge-based reasoning for visual question answering. arXiv preprint arXiv: 1511.02570, 2015.
- (2015) Explicit Knowledge-based Reasoning for Visual Question Answering
- Wang, P.¹ Wu, Q.² Shen, C.³ Hengel, A.V.D.⁴ Dick, A.⁵

35
- 84970002232
- Show, attend and tell: Neural image caption generation with visual attention
- Xu, K., Ba, J., Kiros, R., Courville, A., Salakhutdinov, R., Zemel, R., and Bengio, Y. Show, attend and tell: Neural image caption generation with visual attention. In ICML, 2015.
- (2015) ICML
- Xu, K.¹ Ba, J.² Kiros, R.³ Courville, A.⁴ Salakhutdinov, R.⁵ Zemel, R.⁶ Bengio, Y.⁷

36
- 84988339664
- arXiv preprint arXiv: 1512.00570
- Yan, X., Yang, J., Sohn, K., and Lee, H. Attribute2image: Conditional image generation from visual attributes. arXiv preprint arXiv: 1512.00570, 2015.
- (2015) Attribute2image: Conditional Image Generation from Visual Attributes
- Yan, X.¹ Yang, J.² Sohn, K.³ Lee, H.⁴

37
- 84965161391
- Weaklysupervised disentangling with recurrent transformations for 3d view synthesis
- Yang, J., Reed, S., Yang, M.-H., and Lee, H. Weaklysupervised disentangling with recurrent transformations for 3d view synthesis. In NIPS, 2015.
- (2015) NIPS
- Yang, J.¹ Reed, S.² Yang, M.-H.³ Lee, H.⁴

38
- 84973911532
- Aligning books and movies: Towards story-like visual explanations by watching movies and reading books
- Zhu, Y, Kiros, R., Zemel, R., Salakhutdinov, R., Urtasun, R., Torralba, A., and Fidler, S. Aligning books and movies: Towards story-like visual explanations by watching movies and reading books. In ICCV, 2015.
- (2015) ICCV
- Zhu, Y.¹ Kiros, R.² Zemel, R.³ Salakhutdinov, R.⁴ Urtasun, R.⁵ Torralba, A.⁶ Fidler, S.⁷

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.