메뉴 건너뛰기




Volumn , Issue , 2013, Pages 2634-2641

A thousand frames in just a few words: Lingual description of videos through latent topics and sparse object stitching

Author keywords

multimodal topic model; natural language; video to text; video understanding

Indexed keywords

BOTTOM-UP AND TOP-DOWN; IMAGE DESCRIPTIONS; NATURAL LANGUAGES; NEAREST NEIGHBOR TECHNIQUE; TOPIC MODELING; VIDEO TO TEXT; VIDEO UNDERSTANDING; VISION COMMUNITIES;

EID: 84887345951     PISSN: 10636919     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/CVPR.2013.340     Document Type: Conference Paper
Times cited : (318)

References (34)
  • 2
    • 84893365704 scopus 로고    scopus 로고
    • Comparing automatic and human evaluation of nlg systems
    • A. Belz and E. Reiter. Comparing automatic and human evaluation of nlg systems. In EACL, 2006.
    • (2006) EACL
    • Belz, A.1    Reiter, E.2
  • 3
    • 84864999445 scopus 로고    scopus 로고
    • Evaluation of local descriptors for action recognition in videos
    • P. Bilinski and F. Bremond. Evaluation of local descriptors for action recognition in videos. In ICCV, 2011.
    • (2011) ICCV
    • Bilinski, P.1    Bremond, F.2
  • 4
    • 1542287501 scopus 로고    scopus 로고
    • Modeling annotated data
    • D. M. Blei and M. I. Jordan. Modeling annotated data. In SIGIR, 2003.
    • (2003) SIGIR
    • Blei, D.M.1    Jordan, M.I.2
  • 6
    • 50649087214 scopus 로고    scopus 로고
    • Spatially coherent latent topic model for concurrent segmentation and classification of objects and scenes
    • L. Cao and L. Fei-Fei. Spatially coherent latent topic model for concurrent segmentation and classification of objects and scenes. In ICCV, 2007.
    • (2007) ICCV
    • Cao, L.1    Fei-Fei, L.2
  • 7
    • 84887329824 scopus 로고    scopus 로고
    • Translating related words to videos and back through latent topics
    • P. Das, R. K. Srihari, and J. J. Corso. Translating related words to videos and back through latent topics. In ACM WSDM, 2013.
    • (2013) ACM WSDM
    • Das, P.1    Srihari, R.K.2    Corso, J.J.3
  • 11
    • 80053231413 scopus 로고    scopus 로고
    • Topic models for image annotation and text illustration
    • Y. Feng and M. Lapata. Topic models for image annotation and text illustration. In NAACL HLT, 2010.
    • (2010) NAACL HLT
    • Feng, Y.1    Lapata, M.2
  • 12
    • 77953202699 scopus 로고    scopus 로고
    • Tagprop: Discriminative metric learning in nearest neighbor models for image auto-annotation
    • M. Guillaumin, T. Mensink, J. Verbeek, and C. Schmid. Tagprop: Discriminative metric learning in nearest neighbor models for image auto-annotation. In ICCV, 2009.
    • (2009) ICCV
    • Guillaumin, M.1    Mensink, T.2    Verbeek, J.3    Schmid, C.4
  • 13
    • 85085175172 scopus 로고    scopus 로고
    • A markov clustering topic model for mining behaviour in video
    • T. M. Hospedales, S. Gong, and T. Xiang. A markov clustering topic model for mining behaviour in video. In ICCV, 2009.
    • (2009) ICCV
    • Hospedales, T.M.1    Gong, S.2    Xiang, T.3
  • 14
    • 84863075153 scopus 로고    scopus 로고
    • Towards coherent natural language description of video streams
    • M. U. G. Khan, L. Zhang, and Y. Gotoh. Towards coherent natural language description of video streams. In ICCVW, 2011.
    • (2011) ICCVW
    • Khan, M.U.G.1    Zhang, L.2    Gotoh, Y.3
  • 15
    • 84898426452 scopus 로고    scopus 로고
    • A spatio-temporal descriptor based on 3d-gradients
    • A. Klaser, M. Marszalek, and C. Schmid. A spatio-temporal descriptor based on 3d-gradients. In BMVC, 2008.
    • (2008) BMVC
    • Klaser, A.1    Marszalek, M.2    Schmid, C.3
  • 17
    • 29344465396 scopus 로고    scopus 로고
    • Automatic evaluation of summaries using n-gram co-occurrence statistics
    • C.-Y. Lin and E. Hovy. Automatic evaluation of summaries using n-gram co-occurrence statistics. In NAACL HLT, 2003.
    • (2003) NAACL HLT
    • Lin, C.-Y.1    Hovy, E.2
  • 18
    • 70449580491 scopus 로고    scopus 로고
    • A new baseline for image annotation
    • A. Makadia, V. Pavlovic, and S. Kumar. A new baseline for image annotation. In ECCV, 2008.
    • (2008) ECCV
    • Makadia, A.1    Pavlovic, V.2    Kumar, S.3
  • 21
    • 85133336275 scopus 로고    scopus 로고
    • Bleu: A method for automatic evaluation of machine translation
    • K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu. Bleu: a method for automatic evaluation of machine translation. In ACL, 2002.
    • (2002) ACL
    • Papineni, K.1    Roukos, S.2    Ward, T.3    Zhu, W.-J.4
  • 22
    • 77955999239 scopus 로고    scopus 로고
    • Topic regression multi-modal latent dirichlet allocation for image annotation
    • D. Putthividhya, H. T. Attias, and S. S. Nagarajan. Topic regression multi-modal latent dirichlet allocation for image annotation. In CVPR, 2010.
    • (2010) CVPR
    • Putthividhya, D.1    Attias, H.T.2    Nagarajan, S.S.3
  • 25
    • 84866718894 scopus 로고    scopus 로고
    • Action bank: A high-level representation of activity in video
    • S. Sadanand and J. J. Corso. Action bank: A high-level representation of activity in video. In CVPR, 2012.
    • (2012) CVPR
    • Sadanand, S.1    Corso, J.J.2
  • 26
    • 80052889458 scopus 로고    scopus 로고
    • Recognition using visual phrases
    • M. A. Sadeghi and A. Farhadi. Recognition using visual phrases. In CVPR, 2011.
    • (2011) CVPR
    • Sadeghi, M.A.1    Farhadi, A.2
  • 27
    • 80052908300 scopus 로고    scopus 로고
    • Unbiased look at dataset bias
    • A. Torralba and A. A. Efros. Unbiased look at dataset bias. In CVPR, 2011.
    • (2011) CVPR
    • Torralba, A.1    Efros, A.A.2
  • 28
    • 77955426203 scopus 로고    scopus 로고
    • Evaluating color descriptors for object and scene recognition
    • K. E. A. van de Sande, T. Gevers, and C. G. M. Snoek. Evaluating color descriptors for object and scene recognition. TPAMI, 2010.
    • (2010) IntPAMI
    • Sande De Van A., K.E.1    Gevers, T.2    Snoek, C.G.M.3
  • 29
    • 84877615350 scopus 로고    scopus 로고
    • Efficiently scaling up crowdsourced video annotation
    • C. Vondrick, D. Patterson, and D. Ramanan. Efficiently scaling up crowdsourced video annotation. IJCV.
    • IJCV
    • Vondrick, C.1    Patterson, D.2    Ramanan, D.3
  • 32
    • 70450178502 scopus 로고    scopus 로고
    • Simultaneous image classification and annotation
    • C. Wang, D. M. Blei, and F.-F. Li. Simultaneous image classification and annotation. In CVPR, 2009.
    • (2009) CVPR
    • Wang, C.1    Blei, D.M.2    Li, F.-F.3
  • 33
    • 77952406197 scopus 로고    scopus 로고
    • Topic models for semantics-preserving video compression
    • J. Wanke, A. Ulges, C. H. Lampert, and T. M. Breuel. Topic models for semantics-preserving video compression. In MIR, 2010.
    • (2010) MIR
    • Wanke, J.1    Ulges, A.2    Lampert, C.H.3    Breuel, T.M.4
  • 34


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.