



Volume 8692 LNCS, Issue PART 4, 2014, Pages 529-545

Improving image-sentence embeddings using large weakly annotated photo collections

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER SCIENCE; COMPUTERS; ARTIFICIAL INTELLIGENCE; BIOINFORMATICS

EID: 84906484732     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-319-10593-2_35     Document Type: Conference Paper
Times cited: 212

References (41)
  • 1
    • 78149311145
    • Every picture tells a story: Generating sentences from images
    • Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. Springer, Heidelberg
    • Farhadi, A., Hejrati, M., Sadeghi, M.A., Young, P., Rashtchian, C., Hockenmaier, J., Forsyth, D.: Every picture tells a story: Generating sentences from images. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. LNCS, vol. 6314, pp. 15-29. Springer, Heidelberg (2010)
    • (2010) LNCS , vol.6314 , pp. 15-29
    • Farhadi, A.1    Hejrati, M.2    Sadeghi, M.A.3    Young, P.4    Rashtchian, C.5    Hockenmaier, J.6    Forsyth, D.7
  • 3
    • 84862279067
    • Composing simple image descriptions using web-scale n-grams
    • Li, S., Kulkarni, G., Berg, T.L., Berg, A.C., Choi, Y.: Composing simple image descriptions using web-scale n-grams. In: CoNLL (2011)
    • (2011) CoNLL
    • Li, S.1    Kulkarni, G.2    Berg, T.L.3    Berg, A.C.4    Choi, Y.5
  • 5
    • 84887365305
    • A sentence is worth a thousand pixels
    • Fidler, S., Sharma, A., Urtasun, R.: A sentence is worth a thousand pixels. In: CVPR (2013)
    • (2013) CVPR
    • Fidler, S.1    Sharma, A.2    Urtasun, R.3
  • 8
    • 85162522202
    • Im2Text: Describing images using 1 million captioned photographs
    • Ordonez, V., Kulkarni, G., Berg, T.L.: Im2Text: Describing images using 1 million captioned photographs. In: NIPS (2011)
    • (2011) NIPS
    • Ordonez, V.1    Kulkarni, G.2    Berg, T.L.3
  • 9
    • 84906925854
    • Grounded compositional semantics for finding and describing images with sentences
    • Socher, R., Le, Q.V., Manning, C.D., Ng, A.Y.: Grounded compositional semantics for finding and describing images with sentences. In: ACL (2013)
    • (2013) ACL
    • Socher, R.1    Le, Q.V.2    Manning, C.D.3    Ng, A.Y.4
  • 11
    • 85133336275
    • Bleu: A method for automatic evaluation of machine translation
    • Papineni, K., Roukos, S., Ward, T., Zhu, W.J.: Bleu: a method for automatic evaluation of machine translation. In: ACL, pp. 311-318 (2002)
    • (2002) ACL , pp. 311-318
    • Papineni, K.1    Roukos, S.2    Ward, T.3    Zhu, W.J.4
  • 12
    • 10044285992
    • Canonical correlation analysis: An overview with application to learning methods
    • Hardoon, D., Szedmak, S., Shawe-Taylor, J.: Canonical correlation analysis: An overview with application to learning methods. Neural Computation 16 (2004)
    • (2004) Neural Computation , vol.16
    • Hardoon, D.1    Szedmak, S.2    Shawe-Taylor, J.3
  • 13
    • 84906498766
    • A multi-view embedding space for modeling internet images, tags, and their semantics
    • Gong, Y., Ke, Q., Isard, M., Lazebnik, S.: A multi-view embedding space for modeling internet images, tags, and their semantics. IJCV (2013)
    • (2013) IJCV
    • Gong, Y.1    Ke, Q.2    Isard, M.3    Lazebnik, S.4
  • 14
    • 84897476317
    • Connecting the dots with landmarks: Discriminatively learning domain-invariant features for unsupervised domain adaptation
    • Gong, B., Grauman, K., Sha, F.: Connecting the dots with landmarks: Discriminatively learning domain-invariant features for unsupervised domain adaptation. In: ICML, pp. 222-230 (2013)
    • (2013) ICML , pp. 222-230
    • Gong, B.1    Grauman, K.2    Sha, F.3
  • 15
    • 78149318752
    • Adapting visual category models to new domains
    • Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. Springer, Heidelberg
    • Saenko, K., Kulis, B., Fritz, M., Darrell, T.: Adapting visual category models to new domains. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. LNCS, vol. 6314, pp. 213-226. Springer, Heidelberg (2010)
    • (2010) LNCS , vol.6314 , pp. 213-226
    • Saenko, K.1    Kulis, B.2    Fritz, M.3    Darrell, T.4
  • 18
    • 84866661767
    • Large-scale knowledge transfer for object localization in ImageNet
    • Guillaumin, M., Ferrari, V.: Large-scale knowledge transfer for object localization in ImageNet. In: CVPR, pp. 3202-3209 (2012)
    • (2012) CVPR , pp. 3202-3209
    • Guillaumin, M.1    Ferrari, V.2
  • 19
    • 77956006653
    • Multimodal semi-supervised learning for image classification
    • Guillaumin, M., Verbeek, J., Schmid, C.: Multimodal semi-supervised learning for image classification. In: CVPR, pp. 902-909 (2010)
    • (2010) CVPR , pp. 902-909
    • Guillaumin, M.1    Verbeek, J.2    Schmid, C.3
  • 20
    • 35148862171
    • Learning visual representations using images with captions
    • Quattoni, A., Collins, M., Darrell, T.: Learning visual representations using images with captions. In: CVPR (2007)
    • (2007) CVPR
    • Quattoni, A.1    Collins, M.2    Darrell, T.3
  • 21
    • 70450207253
    • Building text features for object image classification
    • Wang, G., Hoiem, D., Forsyth, D.: Building text features for object image classification. In: CVPR (2009)
    • (2009) CVPR
    • Wang, G.1    Hoiem, D.2    Forsyth, D.3
  • 22
    • 84906494296
    • From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions
    • Young, P., Lai, A., Hodosh, M., Hockenmaier, J.: From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions. In: TACL (2014)
    • (2014) TACL
    • Young, P.1    Lai, A.2    Hodosh, M.3    Hockenmaier, J.4
  • 23
    • 0035328421
    • Modeling the shape of the scene: A holistic representation of the spatial envelope
    • Oliva, A., Torralba, A.: Modeling the shape of the scene: a holistic representation of the spatial envelope. IJCV (2001)
    • (2001) IJCV
    • Oliva, A.1    Torralba, A.2
  • 24
    • 77955426203
    • Evaluating color descriptors for object and scene recognition
    • van de Sande, K.E.A., Gevers, T., Snoek, C.G.M.: Evaluating color descriptors for object and scene recognition. PAMI 32(9), 1582-1596 (2010)
    • (2010) PAMI , vol.32 , Issue.9 , pp. 1582-1596
    • Van De Sande, K.E.A.1    Gevers, T.2    Snoek, C.G.M.3
  • 25
    • 33645146449
    • Histograms of oriented gradients for human detection
    • Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR (2005)
    • (2005) CVPR
    • Dalal, N.1    Triggs, B.2
  • 26
    • 77956004473
    • Aggregating local descriptors into a compact image representation
    • Jégou, H., Douze, M., Schmid, C., Perez, P.: Aggregating local descriptors into a compact image representation. In: CVPR (2010)
    • (2010) CVPR
    • Jégou, H.1    Douze, M.2    Schmid, C.3    Perez, P.4
  • 27
    • 84876231242
    • ImageNet classification with deep convolutional neural networks
    • Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: NIPS (2012)
    • (2012) NIPS
    • Krizhevsky, A.1    Sutskever, I.2    Hinton, G.E.3
  • 28
    • 84906504048
    • DeCAF: A deep convolutional activation feature for generic visual recognition
    • abs/1310.1531
    • Donahue, J., Jia, Y., Vinyals, O., Hoffman, J., Zhang, N., Tzeng, E., Darrell, T.: DeCAF: A deep convolutional activation feature for generic visual recognition. CoRR abs/1310.1531 (2013)
    • (2013) CoRR
    • Donahue, J.1    Jia, Y.2    Vinyals, O.3    Hoffman, J.4    Zhang, N.5    Tzeng, E.6    Darrell, T.7
  • 31
    • 84867117593
    • Wsabie: Scaling up to large vocabulary image annotation
    • Weston, J., Bengio, S., Usunier, N.: Wsabie: Scaling up to large vocabulary image annotation. In: IJCAI (2011)
    • (2011) IJCAI
    • Weston, J.1    Bengio, S.2    Usunier, N.3
  • 32
    • 80052250414
    • Adaptive subgradient methods for online learning and stochastic optimization
    • Duchi, J., Hazan, E., Singer, Y.: Adaptive subgradient methods for online learning and stochastic optimization. JMLR (2011)
    • (2011) JMLR
    • Duchi, J.1    Hazan, E.2    Singer, Y.3
  • 35
    • 0000107975
    • Relations between two sets of variables
    • Hotelling, H.: Relations between two sets of variables. Biometrika 28, 312-377 (1936)
    • (1936) Biometrika , vol.28 , pp. 312-377
    • Hotelling, H.1
  • 36
  • 37
    • 84863396387
    • Domain adaptation for object recognition: An unsupervised approach
    • Gopalan, R., Li, R., Chellappa, R.: Domain adaptation for object recognition: An unsupervised approach. In: ICCV (2011)
    • (2011) ICCV
    • Gopalan, R.1    Li, R.2    Chellappa, R.3
  • 38
    • 84906513179
    • From sBoW to dCoT: Marginalized encoders for text representation
    • Xu, Z., Chen, M., Weinberger, K.Q., Sha, F.: From sBoW to dCoT: Marginalized encoders for text representation. In: CIKM (2011)
    • (2011) CIKM
    • Xu, Z.1    Chen, M.2    Weinberger, K.Q.3    Sha, F.4
  • 39
    • 77953218689
    • Random features for large-scale kernel machines
    • Rahimi, A., Recht, B.: Random features for large-scale kernel machines. In: NIPS (2007)
    • (2007) NIPS
    • Rahimi, A.1    Recht, B.2
  • 40
    • 56449089103
    • Extracting and composing robust features with denoising autoencoders
    • Vincent, P., Larochelle, H., Bengio, Y., Manzagol, P.A.: Extracting and composing robust features with denoising autoencoders. In: ICML, pp. 1096-1103 (2008)
    • (2008) ICML , pp. 1096-1103
    • Vincent, P.1    Larochelle, H.2    Bengio, Y.3    Manzagol, P.A.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.