메뉴 건너뛰기




Volumn 2015 International Conference on Computer Vision, ICCV 2015, Issue , 2015, Pages 4462-4470

Weakly-supervised alignment of video with text

Author keywords

[No Author keywords available]

Indexed keywords

COMBINATORIAL OPTIMIZATION; INTEGER PROGRAMMING; QUADRATIC PROGRAMMING; RELAXATION PROCESSES; VISUAL LANGUAGES;

EID: 84973883674     PISSN: 15505499     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICCV.2015.507     Document Type: Conference Paper
Times cited : (157)

References (47)
  • 1
    • 84898797429 scopus 로고    scopus 로고
    • Monte carlo tree search for scheduling activity recognition
    • M. R. Amer, S. Todorovic, A. Fern, and S.-C. Zhu. Monte carlo tree search for scheduling activity recognition. In ICCV, 2013.
    • (2013) ICCV
    • Amer, M.R.1    Todorovic, S.2    Fern, A.3    Zhu, S.-C.4
  • 2
    • 84900675076 scopus 로고    scopus 로고
    • Diffrac: A discriminative and flexible framework for clustering
    • F. Bach and Z. Harchaoui. Diffrac: A discriminative and flexible framework for clustering. In NIPS, 2007.
    • (2007) NIPS
    • Bach, F.1    Harchaoui, Z.2
  • 9
    • 0038401728 scopus 로고    scopus 로고
    • Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary
    • P. Duygulu, K. Barnard, J. F. G. d. Freitas, and D. A. Forsyth. Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary. In ECCV, 2002.
    • (2002) ECCV
    • Duygulu, P.1    Barnard, K.2    Freitas, J.F.G.D.3    Forsyth, D.A.4
  • 13
    • 80052915321 scopus 로고    scopus 로고
    • Actom sequence models for efficient action detection
    • A. Gaidon, Z. Harchaoui, and C. Schmid. Actom sequence models for efficient action detection. In CVPR, 2011.
    • (2011) CVPR
    • Gaidon, A.1    Harchaoui, Z.2    Schmid, C.3
  • 14
    • 84894905366 scopus 로고    scopus 로고
    • A multi-view embedding space for modeling internet images, tags, and their semantics
    • Y. Gong, Q. Ke, M. Isard, and S. Lazebnik. A multi-view embedding space for modeling internet images, tags, and their semantics. IJCV, 2014.
    • (2014) IJCV
    • Gong, Y.1    Ke, Q.2    Isard, M.3    Lazebnik, S.4
  • 15
    • 84959394156 scopus 로고    scopus 로고
    • A markovian approach to distributional semantics with application to semantic compositionality
    • E. Grave, G. Obozinski, and F. Bach. A markovian approach to distributional semantics with application to semantic compositionality. In COLING, 2014.
    • (2014) COLING
    • Grave, E.1    Obozinski, G.2    Bach, F.3
  • 16
    • 84898930423 scopus 로고    scopus 로고
    • Convex relaxations of latent variable training
    • Y. Guo and D. Schuurmans. Convex relaxations of latent variable training. In NIPS, 2007.
    • (2007) NIPS
    • Guo, Y.1    Schuurmans, D.2
  • 17
    • 10044285992 scopus 로고    scopus 로고
    • Canonical correlation analysis: An overview with application to learning methods
    • D. Hardoon, S. Szedmak, and J. Shawe-Taylor. Canonical correlation analysis: An overview with application to learning methods. Neural computation, 16(12):2639-2664, 2004.
    • (2004) Neural Computation , vol.16 , Issue.12 , pp. 2639-2664
    • Hardoon, D.1    Szedmak, S.2    Shawe-Taylor, J.3
  • 18
    • 84883394520 scopus 로고    scopus 로고
    • Framing image description as a ranking task: Data, models and evaluation metrics
    • M. Hodosh, P. Young, and J. Hockenmaier. Framing image description as a ranking task: Data, models and evaluation metrics. JAIR, pages 853-899, 2013.
    • (2013) JAIR , pp. 853-899
    • Hodosh, M.1    Young, P.2    Hockenmaier, J.3
  • 19
    • 0000107975 scopus 로고
    • Relations between two sets of variates
    • H. Hotelling. Relations between two sets of variates. Biometrika, 3:321-377, 1936.
    • (1936) Biometrika , vol.3 , pp. 321-377
    • Hotelling, H.1
  • 20
    • 77955990943 scopus 로고    scopus 로고
    • Discriminative clustering for image co-segmentation
    • A. Joulin, F. Bach, and J. Ponce. Discriminative clustering for image co-segmentation. In CVPR, 2010.
    • (2010) CVPR
    • Joulin, A.1    Bach, F.2    Ponce, J.3
  • 21
  • 22
    • 84943738421 scopus 로고    scopus 로고
    • Efficient image and video co-localization with frank-wolfe algorithm
    • A. Joulin, K. Tang, and L. Fei-Fei. Efficient image and video co-localization with frank-wolfe algorithm. In ECCV, 2014.
    • (2014) ECCV
    • Joulin, A.1    Tang, K.2    Fei-Fei, L.3
  • 23
    • 84937843643 scopus 로고    scopus 로고
    • Deep fragment embeddings for bidirectional image sentence mapping
    • A. Karpathy, A. Joulin, and F. F. F. Li. Deep fragment embeddings for bidirectional image sentence mapping. In NIPS, 2014.
    • (2014) NIPS
    • Karpathy, A.1    Joulin, A.2    Li, F.F.F.3
  • 24
    • 84915757230 scopus 로고    scopus 로고
    • Combining perframe and per-track cues for multi-person action recognition
    • S. Khamis, V. I. Morariu, and L. S. Davis. Combining perframe and per-track cues for multi-person action recognition. In ECCV, 2012.
    • (2012) ECCV
    • Khamis, S.1    Morariu, V.I.2    Davis, L.S.3
  • 25
    • 80052882471 scopus 로고    scopus 로고
    • Scenario-based video event recognition by constraint flow
    • S. Kwak, B. Han, and J. H. Han. Scenario-based video event recognition by constraint flow. In CVPR, 2011.
    • (2011) CVPR
    • Kwak, S.1    Han, B.2    Han, J.H.3
  • 27
    • 34948883502 scopus 로고    scopus 로고
    • Leveraging temporal, contextual and ordering constraints for recognizing complex activities in video
    • B. Laxton, J. Lim, and D. J. Kriegman. Leveraging temporal, contextual and ordering constraints for recognizing complex activities in video. In CVPR, 2007.
    • (2007) CVPR
    • Laxton, B.1    Lim, J.2    Kriegman, D.J.3
  • 32
    • 84898956512 scopus 로고    scopus 로고
    • Distributed representations of words and phrases and their compositionality
    • T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. In NIPS, 2013.
    • (2013) NIPS
    • Mikolov, T.1    Sutskever, I.2    Chen, K.3    Corrado, G.S.4    Dean, J.5
  • 33
    • 85162522202 scopus 로고    scopus 로고
    • Im2text: Describing images using 1 million captioned photographs
    • V. Ordonez, G. Kulkarni, and T. L. Berg. Im2text: Describing images using 1 million captioned photographs. In NIPS, 2011.
    • (2011) NIPS
    • Ordonez, V.1    Kulkarni, G.2    Berg, T.L.3
  • 36
    • 84943782750 scopus 로고    scopus 로고
    • Linking people with "their" names using coreference resolution
    • V. Ramanathan, A. Joulin, P. Liang, and L. Fei-Fei. Linking people with "their" names using coreference resolution. In ECCV, 2014.
    • (2014) ECCV
    • Ramanathan, V.1    Joulin, A.2    Liang, P.3    Fei-Fei, L.4
  • 39
    • 33845588233 scopus 로고    scopus 로고
    • Recognition of composite human activities through context-free grammar based representation
    • M. S. Ryoo and J. K. Aggarwal. Recognition of composite human activities through context-free grammar based representation. In CVPR, 2006.
    • (2006) CVPR
    • Ryoo, M.S.1    Aggarwal, J.K.2
  • 40
  • 41
    • 80052901415 scopus 로고    scopus 로고
    • Modeling the temporal extent of actions
    • S. Satkin and M. Hebert. Modeling the temporal extent of actions. In ECCV, 2010.
    • (2010) ECCV
    • Satkin, S.1    Hebert, M.2
  • 42
    • 77955998009 scopus 로고    scopus 로고
    • Connecting modalities: Semisupervised segmentation and annotation of images using unaligned text corpora
    • R. Socher and L. Fei-Fei. Connecting modalities: Semisupervised segmentation and annotation of images using unaligned text corpora. In CVPR, 2010.
    • (2010) CVPR
    • Socher, R.1    Fei-Fei, L.2
  • 43
    • 84964474107 scopus 로고    scopus 로고
    • Grounded compositional semantics for finding and describing images with sentences
    • R. Socher, A. Karpathy, Q. V. Le, C. D. Manning, and A. Y. Ng. Grounded compositional semantics for finding and describing images with sentences. TACL, 2014.
    • (2014) TACL
    • Socher, R.1    Karpathy, A.2    Le, Q.V.3    Manning, C.D.4    Ng, A.Y.5
  • 44
    • 84866659479 scopus 로고    scopus 로고
    • Knock! knock! who is it?" probabilistic person identification in tv-series
    • M. Tapaswi, M. Bäuml, and R. Stiefelhagen. "knock! knock! who is it?" probabilistic person identification in tv-series. In CVPR, 2012.
    • (2012) CVPR
    • Tapaswi, M.1    Bäuml, M.2    Stiefelhagen, R.3
  • 45
    • 84959255361 scopus 로고    scopus 로고
    • Book2movie: Aligning video scenes with book chapters
    • M. Tapaswi, M. Bäuml, and R. Stiefelhagen. Book2movie: Aligning video scenes with book chapters. In CVPR, 2015.
    • (2015) CVPR
    • Tapaswi, M.1    Bäuml, M.2    Stiefelhagen, R.3
  • 46
    • 84898805910 scopus 로고    scopus 로고
    • Action recognition with improved trajectories
    • H. Wang and C. Schmid. Action recognition with improved trajectories. In ICCV, 2013.
    • (2013) ICCV
    • Wang, H.1    Schmid, C.2
  • 47
    • 78149328370 scopus 로고    scopus 로고
    • Canonical time warping for alignment of human behavior
    • F. Zhou and F. De La Torre. Canonical time warping for alignment of human behavior. NIPS, 2009.
    • (2009) NIPS
    • Zhou, F.1    De La Torre, F.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.