-
1
-
-
51949084160
-
Utilizing semantic word similarity measures for video retrieval
-
Y. Aytar, M. Shah, and J. Luo. Utilizing semantic word similarity measures for video retrieval. In CVPR, 2008.
-
(2008)
CVPR
-
-
Aytar, Y.1
Shah, M.2
Luo, J.3
-
2
-
-
84885996388
-
Video-in-sentences out
-
A. Barbu, A. Bridge, Z. Burchill, D. Coroian, S. Dickinson, S. Fidler, A. Michaux, S. Mussman, S. Narayanaswamy, D. Salvi, L. Schmidt, J. Shangguan, J. Siskind, J. Waggoner, S. Wang, J. Wei, Y. Yin, and Z. Zhang. Video-in-sentences out. In UAI, 2012.
-
(2012)
UAI
-
-
Barbu, A.1
Bridge, A.2
Burchill, Z.3
Coroian, D.4
Dickinson, S.5
Fidler, S.6
Michaux, A.7
Mussman, S.8
Narayanaswamy, S.9
Salvi, D.10
Schmidt, L.11
Shangguan, J.12
Siskind, J.13
Waggoner, J.14
Wang, S.15
Wei, J.16
Yin, Y.17
Zhang, Z.18
-
3
-
-
0041876117
-
Matching words and pictures
-
K. Barnard, P. Duygulu, D. Forsyth, N. de Freitas, D. Blei, and M. Jordan. Matching words and pictures. In JMLR, 2003.
-
(2003)
JMLR
-
-
Barnard, K.1
Duygulu, P.2
Forsyth, D.3
De Freitas, N.4
Blei, D.5
Jordan, M.6
-
4
-
-
0036538619
-
Shape matching and object recognition using shape contexts
-
S. Belongie, J. Malik, and J. Puzicha. Shape matching and object recognition using shape contexts. IEEE Transaction on PAMI, 24(24), 2002.
-
(2002)
IEEE Transaction on PAMI
, vol.24
, Issue.24
-
-
Belongie, S.1
Malik, J.2
Puzicha, J.3
-
5
-
-
84889607930
-
Zero-shot video retrieval using content and concepts
-
J. Dalton, J. Allan, and P. Mirajkar. Zero-shot video retrieval using content and concepts. In CIKM, 2013.
-
(2013)
CIKM
-
-
Dalton, J.1
Allan, J.2
Mirajkar, P.3
-
6
-
-
80051961229
-
Every picture tells a story: Generating sentences for images
-
A. Farhadi, M. Hejrati, M. Sadeghi, P. Young, C. Rashtchian, J. Hockenmaier, and D. Forsyth. Every picture tells a story: Generating sentences for images. In ECCV, 2010.
-
(2010)
ECCV
-
-
Farhadi, A.1
Hejrati, M.2
Sadeghi, M.3
Young, P.4
Rashtchian, C.5
Hockenmaier, J.6
Forsyth, D.7
-
7
-
-
77955422240
-
Object detection with discriminatively trained part based models
-
P. Felzenszwalb, R. Girshick, D. McAllester, and D. Ra-manan. Object detection with discriminatively trained part based models. PAMI, 32(9), 2010.
-
(2010)
PAMI
, vol.32
, Issue.9
-
-
Felzenszwalb, P.1
Girshick, R.2
McAllester, D.3
Ra-Manan, D.4
-
8
-
-
84887365305
-
A sentence is worth a thousand pixels
-
S. Fidler, A. Sharma, and R. Urtasun. A sentence is worth a thousand pixels. In CVPR, 2013.
-
(2013)
CVPR
-
-
Fidler, S.1
Sharma, A.2
Urtasun, R.3
-
9
-
-
84866704163
-
Are we ready for autonomous driving? the kitti vision benchmark suite
-
A. Geiger, P. Lenz, and R. Urtasun. Are we ready for autonomous driving? the kitti vision benchmark suite. In CVPR, 2012.
-
(2012)
CVPR
-
-
Geiger, A.1
Lenz, P.2
Urtasun, R.3
-
10
-
-
84911393175
-
Stereoscan: Dense 3d reconstruction in real-time
-
A. Geiger, J. Ziegler, and C. Stiller. Stereoscan: Dense 3d reconstruction in real-time. In IVS (IV), 2011.
-
(2011)
IVS (IV)
-
-
Geiger, A.1
Ziegler, J.2
Stiller, C.3
-
11
-
-
84883075039
-
Joint visual-text modeling for automatic retrieval of multimedia documents
-
G. Iyengar, P. Duygulu, S. Feng, P. Ircing, S. Khudanpur, D. Klakow, M. Krause, R. Manmatha, and H. Nock. Joint visual-text modeling for automatic retrieval of multimedia documents. In Proc. of ACM Multimedia, 2005.
-
(2005)
Proc. of ACM Multimedia
-
-
Iyengar, G.1
Duygulu, P.2
Feng, S.3
Ircing, P.4
Khudanpur, S.5
Klakow, D.6
Krause, M.7
Manmatha, R.8
Nock, H.9
-
12
-
-
84911370987
-
What are you talking about? Text-to-image coreference
-
C. Kong, D. Lin, M. Bansal, R. Urtasun, and S. Fidler. What are you talking about? text-to-image coreference. In CVPR, 2014.
-
(2014)
CVPR
-
-
Kong, C.1
Lin, D.2
Bansal, M.3
Urtasun, R.4
Fidler, S.5
-
13
-
-
33745130042
-
Content-based multimedia information retrieval
-
M. S. Lew, N. Sebe, C. Djeraba, and R. Jain. Content-based multimedia information retrieval. ACM Trans. on Multimedia Computing, Comm., and Applications, 2006.
-
(2006)
ACM Trans. on Multimedia Computing, Comm., and Applications
-
-
Lew, M.S.1
Sebe, N.2
Djeraba, C.3
Jain, R.4
-
14
-
-
70450219021
-
Towards total scene un-derstanding:classification, annotation and segmentation in an automatic framework
-
L. Li, R. Socher, and L. Fei-Fei. Towards total scene un-derstanding:classification, annotation and segmentation in an automatic framework. In CVPR, 2009.
-
(2009)
CVPR
-
-
Li, L.1
Socher, R.2
Fei-Fei, L.3
-
15
-
-
84867118595
-
A joint model of language and perception for grounded attribute learning
-
C. Matuszek, N. FitzGerald, L. Zettlemoyer, L. Bo, and D. Fox. A joint model of language and perception for grounded attribute learning. In ICML, 2013.
-
(2013)
ICML
-
-
Matuszek, C.1
Fitzgerald, N.2
Zettlemoyer, L.3
Bo, L.4
Fox, D.5
-
16
-
-
80052904076
-
Globally-optimal greedy algorithms for tracking a variable number of objects
-
H. Pirsiavash, D. Ramanan, and C. Fowlkes. Globally-optimal greedy algorithms for tracking a variable number of objects. In CVPR, 2011.
-
(2011)
CVPR
-
-
Pirsiavash, H.1
Ramanan, D.2
Fowlkes, C.3
-
17
-
-
84898775239
-
Translating video content to natural language descriptions
-
M. Rohrbach, W. Qiu, I. Titov, S. Thater, M. Pinkal, and B. Schiele. Translating video content to natural language descriptions. In ICCV, 2013.
-
(2013)
ICCV
-
-
Rohrbach, M.1
Qiu, W.2
Titov, I.3
Thater, S.4
Pinkal, M.5
Schiele, B.6
-
18
-
-
84881536861
-
Indoor segmentation and support inference from rgbd images
-
N. Silberman, D. Hoiem, P. Kohli, and R. Fergus. Indoor segmentation and support inference from rgbd images. In ECCV, 2012.
-
(2012)
ECCV
-
-
Silberman, N.1
Hoiem, D.2
Kohli, P.3
Fergus, R.4
-
19
-
-
0345414182
-
Video google: A text retrieval approach to object matching in videos
-
J. Sivic and A. Zisserman. Video google: A text retrieval approach to object matching in videos. In ICCV, 2003.
-
(2003)
ICCV
-
-
Sivic, J.1
Zisserman, A.2
-
20
-
-
34547455218
-
Adding semantics to detectors for video retrieval
-
C. G. M. Snoek, B. Huurnink, L. Hollink, M. de Rijke, G. Schreiber, and M. Worring. Adding semantics to detectors for video retrieval. IEEE Transaction of Multimedia, 9(5):975-986, 2007.
-
(2007)
IEEE Transaction of Multimedia
, vol.9
, Issue.5
, pp. 975-986
-
-
Snoek, C.G.M.1
Huurnink, B.2
Hollink, L.3
De Rijke, M.4
Schreiber, G.5
Worring, M.6
-
24
-
-
24944537843
-
Large margin methods for structured and interdependent output variables
-
I. Tsochantaridis, T. Joachims, T. Hofmann, and Y. Altun. Large margin methods for structured and interdependent output variables. JMLR, 6:1453-1484, 2005.
-
(2005)
JMLR
, vol.6
, pp. 1453-1484
-
-
Tsochantaridis, I.1
Joachims, T.2
Hofmann, T.3
Altun, Y.4
-
25
-
-
37848999897
-
The importance of query-concept-mapping for automatic video retrieval
-
D. Wang, X. Li, J. Li, and B. Zhang. The importance of query-concept-mapping for automatic video retrieval. In Proc. of ACM Multimedia, 2007.
-
(2007)
Proc. of ACM Multimedia
-
-
Wang, D.1
Li, X.2
Li, J.3
Zhang, B.4
-
28
-
-
51949088494
-
Global data association for multi-object tracking using network flows
-
L. Zhang, Y. Li, and R. Nevatia. Global data association for multi-object tracking using network flows. In CVPR'08.
-
CVPR'08
-
-
Zhang, L.1
Li, Y.2
Nevatia, R.3
|