SCOPUS 정보 검색 플랫폼

Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

Volumn 2016-December, Issue , 2016, Pages 59-68

Multi-cue zero-shot learning with strong supervision

(4) Akata, Zeynep a Malinowski, Mateusz a Fritz, Mario a Schiele, Bernt a

a MAX PLANCK INSTITUTE FOR INFORMATICS (Germany)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER VISION; SEMANTICS;

AUXILIARY INFORMATION; CATEGORY RECOGNITION; COMMON SPACES; LEARNING APPROACH; SEMANTIC PARTS; STATE OF THE ART; TRAINING DATA; UNSTRUCTURED TEXTS;

PATTERN RECOGNITION;

EID: 84986309960 PISSN: 10636919 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/CVPR.2016.14 Document Type: Conference Paper

Times cited : (159)

References (55)

1
- 84986259594
- Labelembedding for image classification
- 2, 3
- Z. Akata, F. Perronnin, Z. Harchaoui, and C. Schmid. Labelembedding for image classification. TPAMI, 2015.
- (2015) TPAMI
- Akata, Z.¹ Perronnin, F.² Harchaoui, Z.³ Schmid, C.⁴

2
- 84959243017
- Evaluation of output embeddings for fine-grained image classification
- 2, 3, 4, 5, 6
- Z. Akata, S. Reed, D. Walter, H. Lee, and B. Schiele. Evaluation of Output Embeddings for Fine-Grained Image Classification. In CVPR, 2015.
- (2015) CVPR
- Akata, Z.¹ Reed, S.² Walter, D.³ Lee, H.⁴ Schiele, B.⁵

3
- 85141266799
- Support vector machines for multiple-instance learning
- 3
- S. Andrews, I. Tsochantaridis, and T. Hofmann. Support vector machines for multiple-instance learning. In NIPS, 2002.
- (2002) NIPS
- Andrews, S.¹ Tsochantaridis, I.² Hofmann, T.³

4
- 84973882857
- Predicting deep zero-shot convolutional neural networks using textual descriptions
- 2, 6
- J. Ba, K. Swersky, S. Fidler, and R. Salakhutdinov. Predicting deep zero-shot convolutional neural networks using textual descriptions. In ICCV, 2015.
- (2015) ICCV
- Ba, J.¹ Swersky, K.² Fidler, S.³ Salakhutdinov, R.⁴

5
- 85162050606
- Label embedding trees for large multi-class tasks
- 2
- S. Bengio, J. Weston, and D. Grangier. Label embedding trees for large multi-class tasks. In NIPS, 2010.
- (2010) NIPS
- Bengio, S.¹ Weston, J.² Grangier, D.³

6
- 84937873698
- Articulated pose estimation by a graphical model with image dependent pairwise relations
- 3
- X. Chen and A. Yuille. Articulated pose estimation by a graphical model with image dependent pairwise relations. In NIPS, 2014.
- (2014) NIPS
- Chen, X.¹ Yuille, A.²

7
- 84973879622
- P-cnn: Pose-based cnn features for action recognition
- 3
- G. Cheron, I. Laptev, and C. Schmid. P-cnn: Pose-based cnn features for action recognition. In ICCV, 2015.
- (2015) ICCV
- Cheron, G.¹ Laptev, I.² Schmid, C.³

8
- 85037338954
- Generating typed dependency parses from phrase structure parses
- 2
- M.-C. De Marneffe, B. MacCartney, and C. Manning. Generating typed dependency parses from phrase structure parses. In LREC, 2006.
- (2006) LREC
- De Marneffe, M.-C.¹ MacCartney, B.² Manning, C.³

9
- 84883475136
- 3
- C. Desai and D. Ramanan. Detecting actions, poses, and objects with relational phraselets. 2012.
- (2012) Detecting Actions, Poses, and Objects with Relational Phraselets
- Desai, C.¹ Ramanan, D.²

10
- 84959236502
- Long-term recurrent convolutional networks for visual recognition and description
- 2
- J. Donahue, L. A. Hendricks, S. Guadarrama, M. Rohrbach, S. Venugopalan, K. Saenko, and T. Darrell. Long-term recurrent convolutional networks for visual recognition and description. In CVPR, 2015.
- (2015) CVPR
- Donahue, J.¹ Hendricks, L.A.² Guadarrama, S.³ Rohrbach, M.⁴ Venugopalan, S.⁵ Saenko, K.⁶ Darrell, T.⁷

11
- 84866719272
- Discovering localized attributes for fine-grained recognition
- 2
- K. Duan, D. Parikh, D. J. Crandall, and K. Grauman. Discovering localized attributes for fine-grained recognition. In CVPR, 2012.
- (2012) CVPR
- Duan, K.¹ Parikh, D.² Crandall, D.J.³ Grauman, K.⁴

12
- 77956006784
- Attribute-centric recognition for cross-category generalization
- 2
- A. Farhadi, I. Endres, and D. Hoiem. Attribute-centric recognition for cross-category generalization. In CVPR, 2010.
- (2010) CVPR
- Farhadi, A.¹ Endres, I.² Hoiem, D.³

13
- 77955422240
- Object detection with discriminatively trained partbased models
- 3
- P. F. Felzenszwalb, R. B. Girshick, D. McAllester, and D. Ramanan. Object detection with discriminatively trained partbased models. TPAMI, 2010.
- (2010) TPAMI
- Felzenszwalb, P.F.¹ Girshick, R.B.² McAllester, D.³ Ramanan, D.⁴

14
- 33750397657
- Weakly supervised scale-invariant learning of models for visual recognition
- 3
- R. Fergus, P. Perona, and A. Zisserman. Weakly supervised scale-invariant learning of models for visual recognition. IJCV, 71, 2007.
- (2007) IJCV , vol.71
- Fergus, R.¹ Perona, P.² Zisserman, A.³

15
- 70450219358
- Learning visual attributes
- 2
- V. Ferrari and A. Zisserman. Learning visual attributes. In NIPS, 2007.
- (2007) NIPS
- Ferrari, V.¹ Zisserman, A.²

16
- 84898958665
- Devise: A deep visual-semantic embedding model
- 2, 3
- A. Frome, G. S. Corrado, J. Shlens, S. Bengio, J. Dean, and T. Mikolov. Devise: A deep visual-semantic embedding model. In NIPS, 2013.
- (2013) NIPS
- Frome, A.¹ Corrado, G.S.² Shlens, J.³ Bengio, S.⁴ Dean, J.⁵ Mikolov, T.⁶

17
- 84906482165
- Transductive multi-view embedding for zero-shot recognition and annotation
- 2
- Y. Fu, T. M. Hospedales, T. Xiang, Z. Fu, and S. Gong. Transductive multi-view embedding for zero-shot recognition and annotation. In ECCV, 2014.
- (2014) ECCV
- Fu, Y.¹ Hospedales, T.M.² Xiang, T.³ Fu, Z.⁴ Gong, S.⁵

18
- 84965148420
- Are you talking to a machine dataset and methods for multilingual image question answering
- 2
- H. Gao, J. Mao, J. Zhou, Z. Huang, L. Wang, and W. Xu. Are you talking to a machine dataset and methods for multilingual image question answering. NIPS, 2015.
- (2015) NIPS
- Gao, H.¹ Mao, J.² Zhou, J.³ Huang, Z.⁴ Wang, L.⁵ Xu, W.⁶

19
- 0000679216
- Distributional structure
- 2
- Z. Harris. Distributional structure. Word, 10 (23), 1954.
- (1954) Word , vol.10 , Issue.23
- Harris, Z.¹

20
- 0031573117
- Long short-term memory
- 2
- S. Hochreiter and J. Schmidhuber. Long short-term memory. Neural Computation, 1997.
- (1997) Neural Computation
- Hochreiter, S.¹ Schmidhuber, J.²

21
- 84866713663
- Online incremental attribute-based zero-shot learning
- 2
- P. Kankuekul, A. Kawewong, S. Tangruamsub, and O. Hasegawa. Online incremental attribute-based zero-shot learning. In CVPR, 2012.
- (2012) CVPR
- Kankuekul, P.¹ Kawewong, A.² Tangruamsub, S.³ Hasegawa, O.⁴

22
- 84937843643
- Deep fragment embeddings for bidirectional image sentence mapping
- 2
- A. Karpathy, A. Joulin, and F. Li. Deep fragment embeddings for bidirectional image sentence mapping. In NIPS, 2014.
- (2014) NIPS
- Karpathy, A.¹ Joulin, A.² Li, F.³

23
- 84946734827
- Deep visual-semantic alignments for generating image descriptions
- 2, 3
- A. Karpathy and F. Li. Deep visual-semantic alignments for generating image descriptions. In CVPR, 2015.
- (2015) CVPR
- Karpathy, A.¹ Li, F.²

24
- 84965125568
- Fisher vectors derived from hybrid Gaussian-laplacian mixture models for image annotation
- 2
- B. Klein, G. Lev, G. Sadeh, and L. Wolf. Fisher vectors derived from hybrid Gaussian-laplacian mixture models for image annotation. CVPR, 2015.
- (2015) CVPR
- Klein, B.¹ Lev, G.² Sadeh, G.³ Wolf, L.⁴

25
- 77953185711
- Attribute and simile classifiers for face verification
- 3
- N. Kumar, A. C. Berg, P. N. Belhumeur, and S. K. Nayar. Attribute and simile classifiers for face verification. In ICCV, 2009.
- (2009) ICCV
- Kumar, N.¹ Berg, A.C.² Belhumeur, P.N.³ Nayar, S.K.⁴

26
- 84925402963
- Attribute-based classification for zero-shot visual object categorization
- 2, 3, 4
- C. Lampert, H. Nickisch, and S. Harmeling. Attribute-based classification for zero-shot visual object categorization. In TPAMI, 2013.
- (2013) TPAMI
- Lampert, C.¹ Nickisch, H.² Harmeling, S.³

27
- 39749124915
- Robust object detection with interleaved categorization and segmentation
- 3
- B. Leibe, A. Leonardis, and B. Schiele. Robust object detection with interleaved categorization and segmentation. IJCV, 77, 2008.
- (2008) IJCV , vol.77
- Leibe, B.¹ Leonardis, A.² Schiele, B.³

28
- 84943788934
- Linguistic regularities in sparse and explicit word representations
- 4
- O. Levy and Y. Goldberg. Linguistic regularities in sparse and explicit word representations. In CONLL, 2014.
- (2014) CONLL
- Levy, O.¹ Goldberg, Y.²

29
- 84973896625
- Ask your neurons: A neural-based approach to answering questions about images
- 2
- M. Malinowski, M. Rohrbach, and M. Fritz. Ask your neurons: A neural-based approach to answering questions about images. ICCV, 2015.
- (2015) ICCV
- Malinowski, M.¹ Rohrbach, M.² Fritz, M.³

30
- 85083950512
- Deep captioning with multimodal recurrent neural networks (m-rnn)
- 2
- J. Mao, W. Xu, Y. Yang, J. Wang, Z. Huang, and A. L. Yuille. Deep captioning with multimodal recurrent neural networks (m-rnn). In ICLR, 2015.
- (2015) ICLR
- Mao, J.¹ Xu, W.² Yang, Y.³ Wang, J.⁴ Huang, Z.⁵ Yuille, A.L.⁶

31
- 85083951332
- arXiv: 1301. 3781, 4
- T. Mikolov, K. Chen, G. Corrado, and J. Dean. Efficient estimation of word representations in vector space. ArXiv: 1301. 3781, 2013.
- (2013) Efficient Estimation of Word Representations in Vector Space
- Mikolov, T.¹ Chen, K.² Corrado, G.³ Dean, J.⁴

32
- 84898956512
- Distributed representations of words and phrases and their compositionality
- 2, 4, 5
- T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. In NIPS, 2013.
- (2013) NIPS
- Mikolov, T.¹ Sutskever, I.² Chen, K.³ Corrado, G.S.⁴ Dean, J.⁵

33
- 84926179397
- Linguistic regularities in continuous space word representations
- 4
- T. Mikolov, W.-t. Yih, and G. Zweig. Linguistic regularities in continuous space word representations. In Proceedings of NAACL-HLT, 2013.
- (2013) Proceedings of NAACL-HLT
- Mikolov, T.¹ Yih, W.-T.² Zweig, G.³

34
- 84898979068
- arXiv: 1312. 5650, 2
- M. Norouzi, T. Mikolov, S. Bengio, Y. Singer, J. Shlens, A. Frome, G. Corrado, and J. Dean. Zero-shot learning by convex combination of semantic embeddings. ArXiv: 1312. 5650, 2013.
- (2013) Zero-shot Learning by Convex Combination of Semantic Embeddings
- Norouzi, M.¹ Mikolov, T.² Bengio, S.³ Singer, Y.⁴ Shlens, J.⁵ Frome, A.⁶ Corrado, G.⁷ Dean, J.⁸

35
- 84973896919
- Person recognition in personal photo collections
- 2
- S. Oh, R. Benenson, M. Fritz, and B. Shiele. Person recognition in personal photo collections. In ICCV, 2015.
- (2015) ICCV
- Oh, S.¹ Benenson, R.² Fritz, M.³ Shiele, B.⁴

36
- 84856670612
- Relative attributes
- 2
- D. Parikh and K. Grauman. Relative attributes. In ICCV, 2011.
- (2011) ICCV
- Parikh, D.¹ Grauman, K.²

37
- 84973871157
- Fine-grained activity recognition with holistic and pose based features
- 3
- L. Pishchulin, M. Andriluka, and B. Schiele. Fine-grained activity recognition with holistic and pose based features. In GCPR, 2014.
- (2014) GCPR
- Pishchulin, L.¹ Andriluka, M.² Schiele, B.³

38
- 84962816362
- Image question answering: A visual semantic embedding model and a new dataset
- 2
- M. Ren, R. Kiros, and R. Zemel. Image question answering: A visual semantic embedding model and a new dataset. NIPS, 2015.
- (2015) NIPS
- Ren, M.¹ Kiros, R.² Zemel, R.³

39
- 80052892795
- Evaluating knowledge transfer and zero-shot learning in a large-scale setting
- 2, 5
- M. Rohrbach, M. Stark, and B. Schiele. Evaluating knowledge transfer and zero-shot learning in a large-scale setting. In CVPR, 2011.
- (2011) CVPR
- Rohrbach, M.¹ Stark, M.² Schiele, B.³

40
- 0036152936
- Learning words from sights and sounds: A computational model
- 1
- D. K. Roy and A. P. Pentland. Learning words from sights and sounds: A computational model. Cognitive science, 26 (1): 113-146, 2002.
- (2002) Cognitive Science , vol.26 , Issue.1 , pp. 113-146
- Roy, D.K.¹ Pentland, A.P.²

41
- 84937504995
- arXiv: 1412. 4564, 5, 6
- A. Vedaldi and K. Lenc. Matconvnet-convolutional neural networks for matlab. ArXiv: 1412. 4564, 2014.
- (2014) Matconvnet-convolutional Neural Networks for Matlab
- Vedaldi, A.¹ Lenc, K.²

42
- 84946747440
- Show and tell: A neural image caption generator
- 2
- O. Vinyals, A. Toshev, S. Bengio, and D. Erhan. Show and tell: A neural image caption generator. In CVPR, 2015.
- (2015) CVPR
- Vinyals, O.¹ Toshev, A.² Bengio, S.³ Erhan, D.⁴

43
- 77953177673
- Joint learning of visual attributes, object classes and visual saliency
- 3
- G. Wang and D. Forsyth. Joint learning of visual attributes, object classes and visual saliency. In ICCV, 2009.
- (2009) ICCV
- Wang, G.¹ Forsyth, D.²

44
- 80052913382
- A discriminative latent model of object classes and attributes
- 3
- Y. Wang and G. Mori. A discriminative latent model of object classes and attributes. In ECCV, 2010.
- (2010) ECCV
- Wang, Y.¹ Mori, G.²

45
- 80052891795
- Technical Report CNS-TR-2010-001, Caltech, 1, 2, 4, 5
- P. Welinder, S. Branson, T. Mita, C. Wah, F. Schroff, S. Belongie, and P. Perona. Caltech-UCSD Birds 200. Technical Report CNS-TR-2010-001, Caltech, 2010.
- (2010) Caltech-UCSD Birds 200
- Welinder, P.¹ Branson, S.² Mita, T.³ Wah, C.⁴ Schroff, F.⁵ Belongie, S.⁶ Perona, P.⁷

46
- 77955654853
- Large scale image annotation: Learning to rank with joint word-image embeddings
- 2
- J. Weston, S. Bengio, and N. Usunier. Large scale image annotation: Learning to rank with joint word-image embeddings. ECML, 2010.
- (2010) ECML
- Weston, J.¹ Bengio, S.² Usunier, N.³

47
- 84867117593
- Wsabie: Scaling up to large vocabulary image annotation
- 2
- J. Weston, S. Bengio, and N. Usunier. Wsabie: Scaling up to large vocabulary image annotation. In IJCAI, 2011.
- (2011) IJCAI
- Weston, J.¹ Bengio, S.² Usunier, N.³

48
- 84904687911
- Beyond pascal: A benchmark for 3D object detection in the wild
- 2
- Y. Xiang, R. Mottaghi, and S. Savarese. Beyond pascal: A benchmark for 3D object detection in the wild. In WACV, 2014.
- (2014) WACV
- Xiang, Y.¹ Mottaghi, R.² Savarese, S.³

49
- 84887598018
- Articulated human detection with flexible mixtures of parts
- 3
- Y. Yang and D. Ramanan. Articulated human detection with flexible mixtures of parts. TPAMI, 35, 2013.
- (2013) TPAMI , vol.35
- Yang, Y.¹ Ramanan, D.²

50
- 84855413670
- Attribute-based transfer learning for object categorization with zero or one training example
- 2
- X. Yu and Y. Aloimonos. Attribute-based transfer learning for object categorization with zero or one training example. In ECCV, 2010.
- (2010) ECCV
- Yu, X.¹ Aloimonos, Y.²

51
- 84956617559
- Partbased R-CNNs for fine-grained category detection
- 2, 3, 6
- N. Zhang, J. Donahue, R. Girshick, and T. Darrell. Partbased R-CNNs for fine-grained category detection. In ECCV, 2014.
- (2014) ECCV
- Zhang, N.¹ Donahue, J.² Girshick, R.³ Darrell, T.⁴

52
- 84911443783
- Panda: Pose aligned networks for deep attribute modeling
- 3
- N. Zhang, M. Paluri, M. Ranzato, T. Darrell, and L. Bourdev. Panda: Pose aligned networks for deep attribute modeling. In CVPR, 2014.
- (2014) CVPR
- Zhang, N.¹ Paluri, M.² Ranzato, M.³ Darrell, T.⁴ Bourdev, L.⁵

53
- 85083952996
- Object detectors emerge in deep scene cnns
- 3
- B. Zhou, A. Khosla, À. Lapedriza, A. Oliva, and A. Torralba. Object detectors emerge in deep scene cnns. In ICLR, 2015.
- (2015) ICLR
- Zhou, B.¹ Khosla, A.² Lapedriza, À.³ Oliva, A.⁴ Torralba, A.⁵

54
- 84937964578
- Learning deep features for scene recognition using places database
- 3
- B. Zhou, A. Lapedriza, J. Xiao, A. Torralba, and A. Oliva. Learning deep features for scene recognition using places database. In NIPS. 2014.
- (2014) NIPS.
- Zhou, B.¹ Lapedriza, A.² Xiao, J.³ Torralba, A.⁴ Oliva, A.⁵

55
- 84866667680
- Face detection, pose estimation, and landmark localization in the wild
- 3
- X. Zhu and D. Ramanan. Face detection, pose estimation, and landmark localization in the wild. In CVPR, 2012.
- (2012) CVPR
- Zhu, X.¹ Ramanan, D.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.