-
1
-
-
84885996388
-
Video in sentences out
-
A. Barbu, A. Bridge, Z. Burchill, D. Coroian, S. J. Dickinson, S. Fidler, A. Michaux, S. Mussman, S. Narayanaswamy, D. Salvi, L. Schmidt, J. Shangguan, J. M. Siskind, J. W. Waggoner, S. Wang, J. Wei, Y. Yin, and Z. Zhang. Video in sentences out. In UAI, 2012.
-
(2012)
UAI
-
-
Barbu, A.1
Bridge, A.2
Burchill, Z.3
Coroian, D.4
Dickinson, S.J.5
Fidler, S.6
Michaux, A.7
Mussman, S.8
Narayanaswamy, S.9
Salvi, D.10
Schmidt, L.11
Shangguan, J.12
Siskind, J.M.13
Waggoner, J.W.14
Wang, S.15
Wei, J.16
Yin, Y.17
Zhang, Z.18
-
2
-
-
84893365704
-
Comparing automatic and human evaluation of NLG systems
-
A. Belz and E. Reiter. Comparing automatic and human evaluation of NLG systems. In EACL, 2006.
-
(2006)
EACL
-
-
Belz, A.1
Reiter, E.2
-
3
-
-
84864999445
-
Evaluation of local descriptors for action recognition in videos
-
P. Bilinski and F. Bremond. Evaluation of local descriptors for action recognition in videos. In ICCV, 2011.
-
(2011)
ICCV
-
-
Bilinski, P.1
Bremond, F.2
-
6
-
-
50649087214
-
Spatially coherent latent topic model for concurrent segmentation and classification of objects and scenes
-
L. Cao and L. Fei-Fei. Spatially coherent latent topic model for concurrent segmentation and classification of objects and scenes. In ICCV, 2007.
-
(2007)
ICCV
-
-
Cao, L.1
Fei-Fei, L.2
-
7
-
-
84887329824
-
Translating related words to videos and back through latent topics
-
P. Das, R. K. Srihari, and J. J. Corso. Translating related words to videos and back through latent topics. In ACM WSDM, 2013.
-
(2013)
ACM WSDM
-
-
Das, P.1
Srihari, R.K.2
Corso, J.J.3
-
8
-
-
77951298115
-
The Pascal Visual Object Classes (VOC) Challenge
-
M. Everingham, L. Van Gool, C. K. I. Williams, J. Winn, and A. Zisserman. The Pascal Visual Object Classes (VOC) Challenge. IJCV, 2010.
-
(2010)
IJCV
-
-
Everingham, M.1
Van Gool, L.2
Williams, C.K.I.3
Winn, J.4
Zisserman, A.5
-
9
-
-
80052017343
-
Every picture tells a story: Generating sentences from images
-
A. Farhadi, M. Hejrati, M. A. Sadeghi, P. Young, C. Rashtchian, J. Hockenmaier, and D. Forsyth. Every picture tells a story: generating sentences from images. In ECCV, 2010.
-
(2010)
ECCV
-
-
Farhadi, A.1
Hejrati, M.2
Sadeghi, M.A.3
Young, P.4
Rashtchian, C.5
Hockenmaier, J.6
Forsyth, D.7
-
11
-
-
80053231413
-
Topic models for image annotation and text illustration
-
Y. Feng and M. Lapata. Topic models for image annotation and text illustration. In NAACL HLT, 2010.
-
(2010)
NAACL HLT
-
-
Feng, Y.1
Lapata, M.2
-
12
-
-
77953202699
-
Tagprop: Discriminative metric learning in nearest neighbor models for image auto-annotation
-
M. Guillaumin, T. Mensink, J. Verbeek, and C. Schmid. Tagprop: Discriminative metric learning in nearest neighbor models for image auto-annotation. In ICCV, 2009.
-
(2009)
ICCV
-
-
Guillaumin, M.1
Mensink, T.2
Verbeek, J.3
Schmid, C.4
-
13
-
-
85085175172
-
A markov clustering topic model for mining behaviour in video
-
T. M. Hospedales, S. Gong, and T. Xiang. A markov clustering topic model for mining behaviour in video. In ICCV, 2009.
-
(2009)
ICCV
-
-
Hospedales, T.M.1
Gong, S.2
Xiang, T.3
-
14
-
-
84863075153
-
Towards coherent natural language description of video streams
-
M. U. G. Khan, L. Zhang, and Y. Gotoh. Towards coherent natural language description of video streams. In ICCVW, 2011.
-
(2011)
ICCVW
-
-
Khan, M.U.G.1
Zhang, L.2
Gotoh, Y.3
-
15
-
-
84898426452
-
A spatio-temporal descriptor based on 3d-gradients
-
A. Klaser, M. Marszalek, and C. Schmid. A spatio-temporal descriptor based on 3d-gradients. In BMVC, 2008.
-
(2008)
BMVC
-
-
Klaser, A.1
Marszalek, M.2
Schmid, C.3
-
16
-
-
80052901011
-
Baby talk: Understanding and generating simple image descriptions
-
G. Kulkarni, V. Premraj, S. Dhar, S. Li, Y. Choi, A. C. Berg, and T. L. Berg. Baby talk: Understanding and generating simple image descriptions. In CVPR, 2011.
-
(2011)
CVPR
-
-
Kulkarni, G.1
Premraj, V.2
Dhar, S.3
Li, S.4
Choi, Y.5
Berg, A.C.6
Berg, T.L.7
-
17
-
-
29344465396
-
Automatic evaluation of summaries using n-gram co-occurrence statistics
-
C.-Y. Lin and E. Hovy. Automatic evaluation of summaries using n-gram co-occurrence statistics. In NAACL HLT, 2003.
-
(2003)
NAACL HLT
-
-
Lin, C.-Y.1
Hovy, E.2
-
19
-
-
84893398951
-
Generating natural-language video descriptions using text-mined knowledge
-
G. Malkarnenkar, N. Krishnamoorthy, S. Guadarrama, K. Saenko, and R. Mooney. Generating natural-language video descriptions using text-mined knowledge. In AAAI, 2013.
-
(2013)
AAAI
-
-
Malkarnenkar, G.1
Krishnamoorthy, N.2
Guadarrama, S.3
Saenko, K.4
Mooney, R.5
-
20
-
-
84905274625
-
Trecvid 2012-an overview of the goals, tasks, data, evaluation mechanisms and metrics
-
P. Over, G. Awad, M. Michel, J. Fiscus, G. Sanders, B. Shaw, W. Kraaij, A. F. Smeaton, and G. Quénot. Trecvid 2012-an overview of the goals, tasks, data, evaluation mechanisms and metrics. In TRECVID 2012, 2012.
-
(2012)
TRECVID 2012
-
-
Over, P.1
Awad, G.2
Michel, M.3
Fiscus, J.4
Sanders, G.5
Shaw, B.6
Kraaij, W.7
Smeaton, A.F.8
Quénot, G.9
-
21
-
-
85133336275
-
Bleu: A method for automatic evaluation of machine translation
-
K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu. Bleu: a method for automatic evaluation of machine translation. In ACL, 2002.
-
(2002)
ACL
-
-
Papineni, K.1
Roukos, S.2
Ward, T.3
Zhu, W.-J.4
-
22
-
-
77955999239
-
Topic regression multi-modal latent dirichlet allocation for image annotation
-
D. Putthividhya, H. T. Attias, and S. S. Nagarajan. Topic regression multi-modal latent dirichlet allocation for image annotation. In CVPR, 2010.
-
(2010)
CVPR
-
-
Putthividhya, D.1
Attias, H.T.2
Nagarajan, S.S.3
-
24
-
-
84887351648
-
Script data for attribute-based recognition of composite activities
-
M. Rohrbach, M. Regneri, M. Andriluka, S. Amin, M. Pinkal, and B. Schiele. Script data for attribute-based recognition of composite activities. In ECCV, 2012.
-
(2012)
ECCV
-
-
Rohrbach, M.1
Regneri, M.2
Andriluka, M.3
Amin, S.4
Pinkal, M.5
Schiele, B.6
-
25
-
-
84866718894
-
Action bank: A high-level representation of activity in video
-
S. Sadanand and J. J. Corso. Action bank: A high-level representation of activity in video. In CVPR, 2012.
-
(2012)
CVPR
-
-
Sadanand, S.1
Corso, J.J.2
-
26
-
-
80052889458
-
Recognition using visual phrases
-
M. A. Sadeghi and A. Farhadi. Recognition using visual phrases. In CVPR, 2011.
-
(2011)
CVPR
-
-
Sadeghi, M.A.1
Farhadi, A.2
-
27
-
-
80052908300
-
Unbiased look at dataset bias
-
A. Torralba and A. A. Efros. Unbiased look at dataset bias. In CVPR, 2011.
-
(2011)
CVPR
-
-
Torralba, A.1
Efros, A.A.2
-
32
-
-
70450178502
-
Simultaneous image classification and annotation
-
C. Wang, D. M. Blei, and F.-F. Li. Simultaneous image classification and annotation. In CVPR, 2009.
-
(2009)
CVPR
-
-
Wang, C.1
Blei, D.M.2
Li, F.-F.3
-
33
-
-
77952406197
-
Topic models for semantics-preserving video compression
-
J. Wanke, A. Ulges, C. H. Lampert, and T. M. Breuel. Topic models for semantics-preserving video compression. In MIR, 2010.
-
(2010)
MIR
-
-
Wanke, J.1
Ulges, A.2
Lampert, C.H.3
Breuel, T.M.4