메뉴 건너뛰기




Volumn 2015 International Conference on Computer Vision, ICCV 2015, Issue , 2015, Pages 2623-2631

Multimodal convolutional neural networks for matching image and sentence

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER VISION; CONVOLUTION; NEURAL NETWORKS; SEMANTICS;

EID: 84973864182     PISSN: 15505499     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICCV.2015.301     Document Type: Conference Paper
Times cited : (378)

References (37)
  • 1
    • 84867605836 scopus 로고    scopus 로고
    • Applying convolutional neural networks concepts to hybrid nn-hmm model for speech recognition
    • 3
    • O. Abdel Hamid, A. R. Mohamed, H. Jiang, and G. Penn. Applying convolutional neural networks concepts to hybrid nn-hmm model for speech recognition. ICASSP, 2012. 3
    • (2012) ICASSP
    • Abdel Hamid, O.1    Mohamed, A.R.2    Jiang, H.3    Penn, G.4
  • 2
    • 0000782329 scopus 로고    scopus 로고
    • Overfitting in neural nets: Backpropagation, conjugate gradient, and early stopping
    • 6
    • R. Caruana, S. Lawrence, and C. L. Giles. Overfitting in neural nets: Backpropagation, conjugate gradient, and early stopping. NIPS, 2000. 6
    • (2000) NIPS
    • Caruana, R.1    Lawrence, S.2    Giles, C.L.3
  • 4
    • 84890527827 scopus 로고    scopus 로고
    • Improving deep neural networks for lvcsr using rectified linear units and dropout
    • 2
    • G. E. Dahl, T. N. Sainath, and G. E. Hinton. Improving deep neural networks for lvcsr using rectified linear units and dropout. ICASSP, 2013. 2
    • (2013) ICASSP
    • Dahl, G.E.1    Sainath, T.N.2    Hinton, G.E.3
  • 7
    • 84911400494 scopus 로고    scopus 로고
    • Rich feature hierachies for accurate object detection and semantic segmentation
    • 8
    • R. Girshick, J. Donahue, T. Darrell, and J. Malik. Rich feature hierachies for accurate object detection and semantic segmentation. CVPR, 2014. 8
    • (2014) CVPR
    • Girshick, R.1    Donahue, J.2    Darrell, T.3    Malik, J.4
  • 8
    • 84959243872 scopus 로고    scopus 로고
    • Improving image-sentence embeddings using large weakly annotated photo collections
    • 1
    • Y. Gong, L. Wang, M. Hodosh, J. Hockenmaier, and S. lazebnik. Improving image-sentence embeddings using large weakly annotated photo collections. ECCV, 2014. 1
    • (2014) ECCV
    • Gong, Y.1    Wang, L.2    Hodosh, M.3    Hockenmaier, J.4    Lazebnik, S.5
  • 9
    • 34447620428 scopus 로고    scopus 로고
    • A neural network to retrieve images from text queries
    • 2
    • D. Grangier and S. Bengio. A neural network to retrieve images from text queries. ICANN, 2006. 2
    • (2006) ICANN
    • Grangier, D.1    Bengio, S.2
  • 10
    • 84928278589 scopus 로고    scopus 로고
    • Spatial pyramid pooling in deep convolutional networks for visual recognition
    • 1
    • K. He, X. Zhang, S. Ren, and J. Sun. Spatial pyramid pooling in deep convolutional networks for visual recognition. ECCV, 2014. 1
    • (2014) ECCV
    • He, K.1    Zhang, X.2    Ren, S.3    Sun, J.4
  • 13
    • 84883394520 scopus 로고    scopus 로고
    • Framing image description as a ranking task: Data, models and evaluation metrics
    • 1, 2, 6
    • M. Hodosh, P. Young, and J. Hockenmaier. Framing image description as a ranking task: Data, models and evaluation metrics. Journal of Artificial Intelligence Research, 47: 853-899, 2013. 1, 2, 6
    • (2013) Journal of Artificial Intelligence Research , vol.47 , pp. 853-899
    • Hodosh, M.1    Young, P.2    Hockenmaier, J.3
  • 14
    • 84937936034 scopus 로고    scopus 로고
    • Convolutional neural network architectures for matching natural language sentences
    • 1, 3
    • B. Hu, Z. Lu, H. Li, and Q. Chen. Convolutional neural network architectures for matching natural language sentences. NIPS, 2014. 1, 3
    • (2014) NIPS
    • Hu, B.1    Lu, Z.2    Li, H.3    Chen, Q.4
  • 15
    • 84906922163 scopus 로고    scopus 로고
    • A convolutional neural network for modelling sentences
    • 1
    • N. Kalchbrenner, E. Grefenstette, and P. Blunsom. A convolutional neural network for modelling sentences. ACL, 2014. 1
    • (2014) ACL
    • Kalchbrenner, N.1    Grefenstette, E.2    Blunsom, P.3
  • 16
    • 84937843643 scopus 로고    scopus 로고
    • Deep fragment embeddings for bidirectional image sentence mapping
    • 1, 2, 6, 7, 8
    • A. Karpathy, A. Joulin, and F.-F. Li. Deep fragment embeddings for bidirectional image sentence mapping. NIPS, 2014. 1, 2, 6, 7, 8
    • (2014) NIPS
    • Karpathy, A.1    Joulin, A.2    Li, F.-F.3
  • 18
    • 84961376850 scopus 로고    scopus 로고
    • Convolutional neural network for sentence classification
    • 1
    • Y. Kim. Convolutional neural network for sentence classification. EMNLP, 2014. 1
    • (2014) EMNLP
    • Kim, Y.1
  • 21
    • 84877777478 scopus 로고    scopus 로고
    • Deep representations and codes for image auto-annotation
    • 1
    • R. Kiros and C. Szepesvári. Deep representations and codes for image auto-annotation. NIPS, 2012. 1
    • (2012) NIPS
    • Kiros, R.1    Szepesvári, C.2
  • 26
    • 85162522202 scopus 로고    scopus 로고
    • Im2txt: Describing images using 1 million captioned photogrphs
    • 1
    • V. Ordonez, G. Kulkarni, and T. L. Berg. Im2txt: Describing images using 1 million captioned photogrphs. NIPS, 2011. 1
    • (2011) NIPS
    • Ordonez, V.1    Kulkarni, G.2    Berg, T.L.3
  • 27
    • 80052889458 scopus 로고    scopus 로고
    • Recognition using visual phrases
    • 1, 2
    • M. A. Sadeghi and A. Farhadi. Recognition using visual phrases. CVPR, 2011. 1, 2
    • (2011) CVPR
    • Sadeghi, M.A.1    Farhadi, A.2
  • 32
    • 84877724347 scopus 로고    scopus 로고
    • Multimodal learning with deep boltzmann machines
    • 1, 2
    • N. Srivastava and R. Salakhutdinov. Multimodal learning with deep boltzmann machines. NIPS, 2012. 1, 2
    • (2012) NIPS
    • Srivastava, N.1    Salakhutdinov, R.2
  • 35
    • 84867117593 scopus 로고    scopus 로고
    • Wsabie: Scaling up to large vocabulary image annotation
    • 2
    • J. Weston, S. Bengio, and N. Usunier. Wsabie: Scaling up to large vocabulary image annotation. IJCAI, 2011. 2
    • (2011) IJCAI
    • Weston, J.1    Bengio, S.2    Usunier, N.3
  • 37
    • 84898772194 scopus 로고    scopus 로고
    • Learning the visual interpretation of sentences
    • 1, 2
    • C. L. Zitnick, D. Parikh, and L. Vanderwende. Learning the visual interpretation of sentences. ICCV, 2013. 1, 2
    • (2013) ICCV
    • Zitnick, C.L.1    Parikh, D.2    Vanderwende, L.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.