-
1
-
-
80052890189
-
Novelty detection from an ego-centric perspective
-
O. Aghazadeh, J. Sullivan, and S. Carlsson. Novelty detection from an ego-centric perspective. In CVPR, 2011.
-
(2011)
CVPR
-
-
Aghazadeh, O.1
Sullivan, J.2
Carlsson, S.3
-
3
-
-
85072028231
-
Return of the devil in the details: Delving deep into convolutional nets
-
K. Chatfield, K. Simonyan, A. Vedaldi, and A. Zisserman. Return of the devil in the details: Delving deep into convolutional nets. In BMVC, 2014.
-
(2014)
BMVC
-
-
Chatfield, K.1
Simonyan, K.2
Vedaldi, A.3
Zisserman, A.4
-
4
-
-
84898784577
-
Space-time tradeoffs in photo sequencing
-
T. Dekel, Y. Moses, and S. Avidan. Space-time tradeoffs in photo sequencing. In ICCV, 2013.
-
(2013)
ICCV
-
-
Dekel, T.1
Moses, Y.2
Avidan, S.3
-
6
-
-
85198028989
-
ImageNet: A Large-Scale Hierarchical Image Database
-
J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. ImageNet: A Large-Scale Hierarchical Image Database. In CVPR, 2009.
-
(2009)
CVPR
-
-
Deng, J.1
Dong, W.2
Socher, R.3
Li, L.-J.4
Li, K.5
Fei-Fei, L.6
-
8
-
-
84906504048
-
Decaf: A deep convolutional activation feature for generic visual recognition
-
J. Donahue, Y. Jia, O. Vinyals, J. Hoffman, N. Zhang, E. Tzeng, and T. Darrell. Decaf: A deep convolutional activation feature for generic visual recognition. CoRR, 2013.
-
(2013)
CoRR
-
-
Donahue, J.1
Jia, Y.2
Vinyals, O.3
Hoffman, J.4
Zhang, N.5
Tzeng, E.6
Darrell, T.7
-
9
-
-
84856655308
-
Understanding egocentric activities
-
A. Fathi, A. Farhadi, and J. M. Rehg. Understanding egocentric activities. In ICCV, 2011.
-
(2011)
ICCV
-
-
Fathi, A.1
Farhadi, A.2
Rehg, J.M.3
-
10
-
-
84866649288
-
Social interactions: A first-person perspective
-
A. Fathi, J. Hodgins, and J. Rehg. Social interactions: A first-person perspective. In CVPR, 2012.
-
(2012)
CVPR
-
-
Fathi, A.1
Hodgins, J.2
Rehg, J.3
-
11
-
-
84881506730
-
Learning to recognize daily actions using gaze
-
A. Fathi, Y. Li, and J. Rehg. Learning to recognize daily actions using gaze. In ECCV. 2012.
-
(2012)
ECCV
-
-
Fathi, A.1
Li, Y.2
Rehg, J.3
-
12
-
-
80052894345
-
Learning to recognize objects in egocentric activities
-
A. Fathi, X. Ren, and J. Rehg. Learning to recognize objects in egocentric activities. In CVPR, 2011.
-
(2011)
CVPR
-
-
Fathi, A.1
Ren, X.2
Rehg, J.3
-
13
-
-
84959223985
-
Modeling video evolution for action recognition
-
B. Fernando, E. Gavves, J. Oramas, A. Ghodrati, and T. Tuytelaars. Modeling video evolution for action recognition. In CVPR, 2015.
-
(2015)
CVPR
-
-
Fernando, B.1
Gavves, E.2
Oramas, J.3
Ghodrati, A.4
Tuytelaars, T.5
-
14
-
-
84911400494
-
Rich feature hierarchies for accurate object detection and semantic segmentation
-
R. Girshick, J. Donahue, T. Darrell, and J. Malik. Rich feature hierarchies for accurate object detection and semantic segmentation. In CVPR, 2014.
-
(2014)
CVPR
-
-
Girshick, R.1
Donahue, J.2
Darrell, T.3
Malik, J.4
-
15
-
-
84959255777
-
Matchnet: Unifying feature and metric learning for patchbased matching
-
X. Han, T. Leun, Y. Jia, R. Sukthankar, and A. C. Berg. Matchnet: Unifying feature and metric learning for patchbased matching. In CVPR, 2015.
-
(2015)
CVPR
-
-
Han, X.1
Leun, T.2
Jia, Y.3
Sukthankar, R.4
Berg, A.C.5
-
16
-
-
84866664428
-
Max-margin early event detectors
-
M. Hoai and F. De la Torre. Max-margin early event detectors. In CVPR, 2012.
-
(2012)
CVPR
-
-
Hoai, M.1
De La Torre, F.2
-
17
-
-
84937225746
-
Caffe: Convolutional architecture for fast feature embedding
-
Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. B. Girshick, S. Guadarrama, and T. Darrell. Caffe: Convolutional architecture for fast feature embedding. CoRR, 2014.
-
(2014)
CoRR
-
-
Jia, Y.1
Shelhamer, E.2
Donahue, J.3
Karayev, S.4
Long, J.5
Girshick, R.B.6
Guadarrama, S.7
Darrell, T.8
-
18
-
-
80052870292
-
Fast unsupervised ego-action learning for first-person sports videos
-
K. Kitani, T. Okabe, Y. Sato, and A. Sugimoto. Fast unsupervised ego-action learning for first-person sports videos. In CVPR, 2011.
-
(2011)
CVPR
-
-
Kitani, K.1
Okabe, T.2
Sato, Y.3
Sugimoto, A.4
-
20
-
-
84898426452
-
A spatio-temporal descriptor based on 3d-gradients
-
A. Kläser, M. Marsza?ek, and C. Schmid. A spatio-temporal descriptor based on 3d-gradients. In BMVC, 2008.
-
(2008)
BMVC
-
-
Kläser, A.1
Marszaek, M.2
Schmid, C.3
-
21
-
-
84876231242
-
Imagenet classification with deep convolutional neural networks
-
A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In NIPS. 2012.
-
(2012)
NIPS
-
-
Krizhevsky, A.1
Sutskever, I.2
Hinton, G.E.3
-
24
-
-
84866723224
-
Discovering important people and objects for egocentric video summarization
-
Y. J. Lee, J. Ghosh, and K. Grauman. Discovering important people and objects for egocentric video summarization. In CVPR, 2012.
-
(2012)
CVPR
-
-
Lee, Y.J.1
Ghosh, J.2
Grauman, K.3
-
25
-
-
84898812374
-
Learning to predict gaze in egocentric video
-
Y. Li, A. Fathi, and J. Rehg. Learning to predict gaze in egocentric video. In ICCV, 2013.
-
(2013)
ICCV
-
-
Li, Y.1
Fathi, A.2
Rehg, J.3
-
26
-
-
84887342438
-
Story-driven summarization for egocentric video
-
Z. Lu and K. Grauman. Story-driven summarization for egocentric video. In CVPR, 2013.
-
(2013)
CVPR
-
-
Lu, Z.1
Grauman, K.2
-
27
-
-
84911449395
-
Learning and transferring mid-level image representations using convolutional neural networks
-
M. Oquab, L. Bottou, I. Laptev, and J. Sivic. Learning and transferring mid-level image representations using convolutional neural networks. In CVPR, 2014.
-
(2014)
CVPR
-
-
Oquab, M.1
Bottou, L.2
Laptev, I.3
Sivic, J.4
-
28
-
-
84856646751
-
Parsing video events with goal inference and intent prediction
-
M. Pei, Y. Jia, and S.-C. Zhu. Parsing video events with goal inference and intent prediction. In ICCV, 2011.
-
(2011)
ICCV
-
-
Pei, M.1
Jia, Y.2
Zhu, S.-C.3
-
29
-
-
84911385613
-
Seeing the arrow of time
-
L. C. Pickup, Z. Pan, D. Wei, Y. Shih, C. Zhang, A. Zisserman, B. Schölkopf, and W. T. Freeman. Seeing the arrow of time. In CVPR, 2014.
-
(2014)
CVPR
-
-
Pickup, L.C.1
Pan, Z.2
Wei, D.3
Shih, Y.4
Zhang, C.5
Zisserman, A.6
Schölkopf, B.7
Freeman, W.T.8
-
30
-
-
84866652986
-
Detecting activities of daily living in first-person camera views
-
H. Pirsiavash and D. Ramanan. Detecting activities of daily living in first-person camera views. In CVPR, 2012.
-
(2012)
CVPR
-
-
Pirsiavash, H.1
Ramanan, D.2
-
31
-
-
77955991434
-
Figure-ground segmentation improves handled object recognition in egocentric video
-
X. Ren and C. Gu. Figure-ground segmentation improves handled object recognition in egocentric video. In CVPR, 2010.
-
(2010)
CVPR
-
-
Ren, X.1
Gu, C.2
-
32
-
-
84856688144
-
Human activity prediction: Early recognition of ongoing activities from streaming videos
-
M. Ryoo. Human activity prediction: Early recognition of ongoing activities from streaming videos. In ICCV, 2011.
-
(2011)
ICCV
-
-
Ryoo, M.1
-
33
-
-
84887376594
-
First-person activity recognition: What are they doing to me? in
-
M. Ryoo and L. Matthies. First-person activity recognition: What are they doing to me? In CVPR, 2013.
-
(2013)
CVPR
-
-
Ryoo, M.1
Matthies, L.2
-
34
-
-
0017930815
-
Dynamic programming algorithm optimization for spoken word recognition
-
H. Sakoe and S. Chiba. Dynamic programming algorithm optimization for spoken word recognition. ICASSP, 1978.
-
(1978)
ICASSP
-
-
Sakoe, H.1
Chiba, S.2
-
35
-
-
84906510379
-
Two-stream convolutional networks for action recognition in videos
-
K. Simonyan and A. Zisserman. Two-stream convolutional networks for action recognition in videos. CoRR, 2014.
-
(2014)
CoRR
-
-
Simonyan, K.1
Zisserman, A.2
-
36
-
-
84911380009
-
Patch to the future: Unsupervised visual prediction
-
J. Walker, A. Gupta, and M. Hebert. Patch to the future: Unsupervised visual prediction. In CVPR, 2014.
-
(2014)
CVPR
-
-
Walker, J.1
Gupta, A.2
Hebert, M.3
-
38
-
-
84898890371
-
Evaluation of local spatio-temporal features for action recognition
-
H. Wang, M. M. Ullah, A. Klaser, I. Laptev, and C. Schmid. Evaluation of local spatio-temporal features for action recognition. In BMVC, 2009.
-
(2009)
BMVC
-
-
Wang, H.1
Ullah, M.M.2
Klaser, A.3
Laptev, I.4
Schmid, C.5
-
40
-
-
84886833674
-
A data-driven approach for event prediction
-
J. Yuen and A. Torralba. A data-driven approach for event prediction. In ECCV, 2010.
-
(2010)
ECCV
-
-
Yuen, J.1
Torralba, A.2
-
41
-
-
84911443783
-
Panda: Pose aligned networks for deep attribute modeling
-
N. Zhang, M. Paluri, M. Ranzato, T. Darrell, and L. Bourdev. Panda: Pose aligned networks for deep attribute modeling. In CVPR, 2014.
-
(2014)
CVPR
-
-
Zhang, N.1
Paluri, M.2
Ranzato, M.3
Darrell, T.4
Bourdev, L.5
-
42
-
-
84937964578
-
Learning Deep Features for Scene Recognition using Places Database
-
B. Zhou, A. Lapedriza, J. Xiao, A. Torralba, and A. Oliva. Learning Deep Features for Scene Recognition using Places Database. NIPS, 2014.
-
(2014)
NIPS
-
-
Zhou, B.1
Lapedriza, A.2
Xiao, J.3
Torralba, A.4
Oliva, A.5
|