메뉴 건너뛰기




Volumn 55, Issue , 2016, Pages 409-442

Automatic description generation from images: A survey of models, datasets, and evaluation measures

Author keywords

[No Author keywords available]

Indexed keywords

BENCHMARKING; IMAGE ANALYSIS; NATURAL LANGUAGE PROCESSING SYSTEMS; SURVEYS;

EID: 84960130911     PISSN: 10769757     EISSN: None     Source Type: Journal    
DOI: 10.1613/jair.4900     Document Type: Article
Times cited : (358)

References (105)
  • 31
    • 10044285992 scopus 로고    scopus 로고
    • Canonical correlation analysis: An overview with application to learning methods
    • Hardoon, D. R., Szedmak, S., & Shawe-Taylor, J. (2004). Canonical correlation analysis: An overview with application to learning methods. Neural Computation, 16 (12), 2639-2664.
    • (2004) Neural Computation , vol.16 , Issue.12 , pp. 2639-2664
    • Hardoon, D.R.1    Szedmak, S.2    Shawe-Taylor, J.3
  • 33
    • 84883394520 scopus 로고    scopus 로고
    • Framing image description as a ranking task: Data, models and evaluation metrics
    • Hodosh, M., Young, P., & Hockenmaier, J. (2013). Framing Image Description as a Ranking Task: Data, Models and Evaluation Metrics. Journal of Artificial Intelligence Research, 47, 853-899.
    • (2013) Journal of Artificial Intelligence Research , vol.47 , pp. 853-899
    • Hodosh, M.1    Young, P.2    Hockenmaier, J.3
  • 34
    • 0000107975 scopus 로고
    • Relations between two sets of variates
    • Hotelling, H. (1936). Relations between two sets of variates. Biometrika, 0, 321-377.
    • (1936) Biometrika , pp. 321-377
    • Hotelling, H.1
  • 35
    • 0033909136 scopus 로고    scopus 로고
    • A conceptual framework for indexing visual information at multiple levels
    • Jaimes, A., & Chang, S.-F. (2000). A conceptual framework for indexing visual information at multiple levels. In IST SPIE Internet Imaging.
    • (2000) IST SPIE Internet Imaging
    • Jaimes, A.1    Chang, S.-F.2
  • 51
    • 84877085938 scopus 로고    scopus 로고
    • Learning dependency-based compositional semantics
    • Liang, P., Jordan, M. I., & Klein, D. (2012). Learning dependency-based compositional semantics. Computational Linguistics, 39 (2), 389-446.
    • (2012) Computational Linguistics , vol.39 , Issue.2 , pp. 389-446
    • Liang, P.1    Jordan, M.I.2    Klein, D.3
  • 55
    • 3042535216 scopus 로고    scopus 로고
    • Distinctive image features from scale-invariant keypoints
    • Lowe, D. (2004). Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60 (4), 91-110.
    • (2004) International Journal of Computer Vision , vol.60 , Issue.4 , pp. 91-110
    • Lowe, D.1
  • 56
    • 85007153677 scopus 로고    scopus 로고
    • Learning to answer questions from image using convolutional neural network
    • Ma, L., Lu, Z., & Li, H. (2016). Learning to answer questions from image using convolutional neural network. In AAAI Conference on Artificial Intelligence.
    • (2016) AAAI Conference on Artificial Intelligence
    • Ma, L.1    Lu, Z.2    Li, H.3
  • 57
    • 84937822746 scopus 로고    scopus 로고
    • A multi-world approach to question answering about real-world scenes based on uncertain input
    • Malinowski, M., & Fritz, M. (2014a). A multi-world approach to question answering about real-world scenes based on uncertain input. In Advances in Neural Information Processing Systems.
    • (2014) Advances in Neural Information Processing Systems
    • Malinowski, M.1    Fritz, M.2
  • 66
    • 0035328421 scopus 로고    scopus 로고
    • Modeling the shape of the scene: A holistic representation of the spatial envelope
    • Oliva, A., & Torralba, A. (2001). Modeling the shape of the scene: A holistic representation of the spatial envelope. International Journal of Computer Vision, 42 (3), 145-175.
    • (2001) International Journal of Computer Vision , vol.42 , Issue.3 , pp. 145-175
    • Oliva, A.1    Torralba, A.2
  • 73
    • 84900870389 scopus 로고    scopus 로고
    • The SUN attribute database: Beyond categories for deeper scene understanding
    • Patterson, G., Xu, C., Su, H., & Hays, J. (2014). The SUN Attribute Database: Beyond Categories for Deeper Scene Understanding. International Journal of Computer Vision, 108 (1-2), 59-81.
    • (2014) International Journal of Computer Vision , vol.108 , Issue.1-2 , pp. 59-81
    • Patterson, G.1    Xu, C.2    Su, H.3    Hays, J.4
  • 77
    • 71749094730 scopus 로고    scopus 로고
    • An investigation into the validity of some metrics for automatically evaluating natural language generation systems
    • Reiter, E., & Belz, A. (2009). An investigation into the validity of some metrics for automatically evaluating natural language generation systems. Computational Linguistics, 35 (4), 529-588.
    • (2009) Computational Linguistics , vol.35 , Issue.4 , pp. 529-588
    • Reiter, E.1    Belz, A.2
  • 84
    • 84952235015 scopus 로고
    • Analyzing the subject of a picture: A theoretical approach
    • Shatford, S. (1986). Analyzing the subject of a picture: A theoretical approach. Cataloging & Classification Quarterly, 6, 39-62.
    • (1986) Cataloging & Classification Quarterly , vol.6 , pp. 39-62
    • Shatford, S.1
  • 86
    • 77955998009 scopus 로고    scopus 로고
    • Connecting modalities: Semi-supervised segmentation and annotation of images using unaligned text corpora
    • Socher, R., & Fei-Fei, L. (2010). Connecting modalities: Semi-supervised segmentation and annotation of images using unaligned text corpora. In IEEE Conference on Computer Vision and Pattern Recognition.
    • (2010) IEEE Conference on Computer Vision and Pattern Recognition
    • Socher, R.1    Fei-Fei, L.2
  • 93
    • 85088059797 scopus 로고    scopus 로고
    • Im2Text and Text2Im: Associating images and texts for cross-modal retrieval
    • Verma, Y., & Jawahar, C. V. (2014). Im2Text and Text2Im: Associating Images and Texts for Cross-Modal Retrieval. In British Machine Vision Conference.
    • (2014) British Machine Vision Conference
    • Verma, Y.1    Jawahar, C.V.2
  • 101


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.