메뉴 건너뛰기




Volumn 105, Issue 3, 2013, Pages 222-245

Image classification with the fisher vector: Theory and practice

Author keywords

Bag of Visual words; Fisher kernel; Fisher vector; Image classification; Large scale classification; Product quantization

Indexed keywords

BAG-OF-VISUAL WORDS; ENCODING TECHNIQUES; FISHER KERNELS; FISHER VECTORS; GAUSSIAN MIXTURE MODEL; LINEAR CLASSIFIERS; PRODUCT QUANTIZATIONS; THEORY AND PRACTICE;

EID: 84883487458     PISSN: 09205691     EISSN: 15731405     Source Type: Journal    
DOI: 10.1007/s11263-013-0636-x     Document Type: Article
Times cited : (1485)

References (79)
  • 3
    • 84866705398 scopus 로고    scopus 로고
    • Meta-class features for large-scale object categorization on a budget
    • Bergamo, A., & Torresani, L. (2012). Meta-class features for large-scale object categorization on a budget. In CVPR.
    • (2012) CVPR
    • Bergamo, A.1    Torresani, L.2
  • 4
    • 0001740650 scopus 로고
    • Training with noise is equivalent to tikhonov regularization
    • Bishop, C. (1995). Training with noise is equivalent to tikhonov regularization. In Neural computation (Vol 7).
    • (1995) Neural Computation , vol.7
    • Bishop, C.1
  • 5
    • 78149331496 scopus 로고    scopus 로고
    • Efficient match kernels between sets of features for visual recognition
    • Bo, L., & Sminchisescu, C. (2009). Efficient match kernels between sets of features for visual recognition. In NIPS.
    • (2009) NIPS
    • Bo, L.1    Sminchisescu, C.2
  • 6
    • 84883487070 scopus 로고    scopus 로고
    • Multipath sparse coding using hierarchical matching pursuit
    • Bo, L., Ren, X., & Fox, D. (2012). Multipath sparse coding using hierarchical matching pursuit. In NIPS workshop on deep learning.
    • (2012) NIPS Workshop on Deep Learning
    • Bo, L.1    Ren, X.2    Fox, D.3
  • 7
    • 51949090223 scopus 로고    scopus 로고
    • In defense of nearest-neighbor based image classification
    • Boiman, O., Shechtman, E., & Irani, M. (2008). In defense of nearest-neighbor based image classification. In CVPR.
    • (2008) CVPR
    • Boiman, O.1    Shechtman, E.2    Irani, M.3
  • 9
    • 48849085774 scopus 로고    scopus 로고
    • The tradeoffs of large scale learning
    • Bottou, L., & Bousquet, O. (2007). The tradeoffs of large scale learning. In NIPS.
    • (2007) NIPS
    • Bottou, L.1    Bousquet, O.2
  • 10
    • 77955993281 scopus 로고    scopus 로고
    • Learning mid-level features for recognition
    • Boureau, Y. L., Bach, F., LeCun, Y., & Ponce, J. (2010). Learning mid-level features for recognition. In CVPR.
    • (2010) CVPR
    • Boureau, Y.L.1    Bach, F.2    Lecun, Y.3    Ponce, J.4
  • 11
    • 84856649187 scopus 로고    scopus 로고
    • Ask the locals: Multi-way local pooling for image recognition
    • Boureau, Y. L., LeRoux, N., Bach, F., Ponce, J., & LeCun, Y. (2011). Ask the locals: Multi-way local pooling for image recognition. In ICCV.
    • (2011) ICCV
    • Boureau, Y.L.1    Leroux, N.2    Bach, F.3    Ponce, J.4    Lecun, Y.5
  • 12
    • 0025754995 scopus 로고
    • A norm selection criterion for the generalized delta rule
    • 10.1109/72.80298
    • Burrascano, P. (1991). A norm selection criterion for the generalized delta rule. IEEE Transactions on Neural Networks, 2(1), 125-30.
    • (1991) IEEE Transactions on Neural Networks , vol.2 , Issue.1 , pp. 125-130
    • Burrascano, P.1
  • 13
    • 84898420173 scopus 로고    scopus 로고
    • The devil is in the details: An evaluation of recent feature encoding methods
    • Chatfield, K., Lempitsky, V., Vedaldi, A., & Zisserman, A. (2011). The devil is in the details: An evaluation of recent feature encoding methods. In BMVC.
    • (2011) BMVC
    • Chatfield, K.1    Lempitsky, V.2    Vedaldi, A.3    Zisserman, A.4
  • 14
    • 84866669546 scopus 로고    scopus 로고
    • Image categorization using Fisher kernels of non-iid image models
    • Cinbis, G., Verbeek, J., & Schmid, C. (2012). Image categorization using Fisher kernels of non-iid image models. In CVPR.
    • (2012) CVPR
    • Cinbis, G.1    Verbeek, J.2    Schmid, C.3
  • 18
    • 80052876786 scopus 로고    scopus 로고
    • What does classifying more than 10,000 image categories tell us?
    • Deng, J., Berg, A., Li, K., & Fei-Fei, L. (2010). What does classifying more than 10,000 image categories tell us?. In ECCV.
    • (2010) ECCV
    • Deng, J.1    Berg, A.2    Li, K.3    Fei-Fei, L.4
  • 24
    • 85067032737 scopus 로고    scopus 로고
    • On feature combination for multiclass object classification
    • Gehler, P., & Nowozin, S. (2009). On feature combination for multiclass object classification. In ICCV.
    • (2009) ICCV
    • Gehler, P.1    Nowozin, S.2
  • 27
    • 77956006653 scopus 로고    scopus 로고
    • Multimodal semi-supervised learning for image classification
    • Guillaumin, M., Verbeek, J., & Schmid, C. (2010). Multimodal semi-supervised learning for image classification. In CVPR.
    • (2010) CVPR
    • Guillaumin, M.1    Verbeek, J.2    Schmid, C.3
  • 28
    • 77953202990 scopus 로고    scopus 로고
    • Combining efficient object localization and image classification
    • Harzallah, H., Jurie, F., & Schmid, C. (2009). Combining efficient object localization and image classification. In ICCV.
    • (2009) ICCV
    • Harzallah, H.1    Jurie, F.2    Schmid, C.3
  • 30
    • 0002853450 scopus 로고    scopus 로고
    • Exploiting generative models in discriminative classifiers
    • Jaakkola, T., & Haussler, D. (1998). Exploiting generative models in discriminative classifiers. In NIPS.
    • (1998) NIPS
    • Jaakkola, T.1    Haussler, D.2
  • 31
    • 70450183957 scopus 로고    scopus 로고
    • On the burstiness of visual elements
    • Jégou, H., Douze, M., & Schmid, C. (2009). On the burstiness of visual elements. In CVPR.
    • (2009) CVPR
    • Jégou, H.1
  • 32
    • 77956004473 scopus 로고    scopus 로고
    • Aggregating local descriptors into a compact image representation
    • Jégou, H., Douze, M., Schmid, C., & Pérez, P. (2010). Aggregating local descriptors into a compact image representation. In CVPR.
    • (2010) CVPR
    • Jégou, H.1
  • 33
    • 84875881757 scopus 로고    scopus 로고
    • Product quantization for nearest neighbor search
    • Jégou, H., Douze, M., & Schmid, C. (2011). Product quantization for nearest neighbor search. In IEEE PAMI.
    • (2011) IEEE PAMI
    • Jégou, H.1
  • 35
    • 84856626270 scopus 로고    scopus 로고
    • Modeling spatial layout with fisher vectors for image categorization
    • Krapac, J., Verbeek, J., & Jurie, F. (2011). Modeling spatial layout with fisher vectors for image categorization. In ICCV.
    • (2011) ICCV
    • Krapac, J.1    Verbeek, J.2    Jurie, F.3
  • 36
    • 84876231242 scopus 로고    scopus 로고
    • Image classification with deep convolutional neural networks
    • Krizhevsky, A., Sutskever, I., & Hinton, G. (2012). Image classification with deep convolutional neural networks. In NIPS.
    • (2012) NIPS
    • Krizhevsky, A.1    Sutskever, I.2    Hinton, G.3
  • 37
    • 80052875476 scopus 로고    scopus 로고
    • Discriminative affine sparse codes for image classification
    • Kulkarni, N., & Li, B. (2011). Discriminative affine sparse codes for image classification. In CVPR.
    • (2011) CVPR
    • Kulkarni, N.1    Li, B.2
  • 38
    • 33845572523 scopus 로고    scopus 로고
    • Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories
    • Lazebnik, S., Schmid, C., & Ponce, J. (2006). Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In CVPR.
    • (2006) CVPR
    • Lazebnik, S.1    Schmid, C.2    Ponce, J.3
  • 39
    • 84867135575 scopus 로고    scopus 로고
    • Building high-level features using large scale unsupervised learning
    • Le, Q., Ranzato, M., Monga, R., Devin, M., Chen, K., Corrado, G., et al. (2012). Building high-level features using large scale unsupervised learning. In ICML.
    • (2012) ICML
    • Le, Q.1    Ranzato, M.2    Monga, R.3    Devin, M.4    Chen, K.5    Corrado, G.6
  • 40
    • 80052870284 scopus 로고    scopus 로고
    • Large-scale image classification: Fast feature extraction and svm training
    • Lin, Y., Lv, F., Zhu, S., Yu, K., Yang, M., & Cour, T. (2011). Large-scale image classification: Fast feature extraction and svm training. In CVPR.
    • (2011) CVPR
    • Lin, Y.1    Lv, F.2    Zhu, S.3    Yu, K.4    Yang, M.5    Cour, T.6
  • 41
    • 51949106482 scopus 로고    scopus 로고
    • A similarity measure between unordered vector sets with application to image categorization
    • Liu, Y., & Perronnin, F. (2008). A similarity measure between unordered vector sets with application to image categorization. In CVPR.
    • (2008) CVPR
    • Liu, Y.1    Perronnin, F.2
  • 42
    • 3042535216 scopus 로고    scopus 로고
    • Distinctive image features from scale-invariant keypoints
    • 10.1023/B:VISI.0000029664.99615.94
    • Lowe, D. (2004). Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60(2), 91-110.
    • (2004) International Journal of Computer Vision , vol.60 , Issue.2 , pp. 91-110
    • Lowe, D.1
  • 43
    • 24644504191 scopus 로고    scopus 로고
    • Mercer kernels for object recognition with local features
    • Lyu, S. (2005). Mercer kernels for object recognition with local features. In CVPR.
    • (2005) CVPR
    • Lyu, S.1
  • 44
    • 77953184603 scopus 로고    scopus 로고
    • Max-margin additive classifiers for detection
    • Maji, S., & Berg, A. (2009). Max-margin additive classifiers for detection. In ICCV.
    • (2009) ICCV
    • Maji, S.1    Berg, A.2
  • 45
    • 51949098112 scopus 로고    scopus 로고
    • Classification using intersection kernel support vector machines is efficient
    • Maji, S., Berg, A., & Malik, J. (2008). Classification using intersection kernel support vector machines is efficient. In CVPR.
    • (2008) CVPR
    • Maji, S.1    Berg, A.2    Malik, J.3
  • 46
    • 84883488616 scopus 로고    scopus 로고
    • Metric learning for large scale image classification: Generalizing to new classes at near-zero cost
    • Mensink, T., Verbeek, J., Csurka, G., & Perronnin, F. (2012). Metric learning for large scale image classification: Generalizing to new classes at near-zero cost. In ECCV.
    • (2012) ECCV
    • Mensink, T.1    Verbeek, J.2    Csurka, G.3    Perronnin, F.4
  • 47
    • 34948815101 scopus 로고    scopus 로고
    • Fisher kernels on visual vocabularies for image categorization
    • Perronnin, F., & Dance, C. (2007). Fisher kernels on visual vocabularies for image categorization. In CVPR.
    • (2007) CVPR
    • Perronnin, F.1    Dance, C.2
  • 48
    • 34948822288 scopus 로고    scopus 로고
    • Adapted vocabularies for generic visual categorization
    • Perronnin, F., Dance, C., Csurka, G., & Bressan, M. (2006). Adapted vocabularies for generic visual categorization. In ECCV.
    • (2006) ECCV
    • Perronnin, F.1    Dance, C.2    Csurka, G.3    Bressan, M.4
  • 49
    • 77955992063 scopus 로고    scopus 로고
    • Large-scale image retrieval with compressed Fisher vectors
    • Perronnin, F., Liu, Y., Sánchez, J., & Poirier, H. (2010a). Large-scale image retrieval with compressed Fisher vectors. In CVPR.
    • (2010) CVPR
    • Perronnin, F.1
  • 50
    • 77956008923 scopus 로고    scopus 로고
    • Large-scale image categorization with explicit data embedding
    • Perronnin, F., Sánchez, J., & Liu, Y. (2010b). Large-scale image categorization with explicit data embedding. In CVPR.
    • (2010) CVPR
    • Perronnin, F.1
  • 51
    • 79959771606 scopus 로고    scopus 로고
    • Improving the Fisher kernel for large-scale image classification
    • Perronnin, F., Sánchez, J., & Mensink, T. (2010c). Improving the Fisher kernel for large-scale image classification. In ECCV.
    • (2010) ECCV
    • Perronnin, F.1
  • 52
    • 84866652997 scopus 로고    scopus 로고
    • Towards good practice in large-scale learning for image classification
    • Perronnin, F., Akata, Z., Harchaoui, Z., & Schmid, C. (2012). Towards good practice in large-scale learning for image classification. In CVPR.
    • (2012) CVPR
    • Perronnin, F.1    Akata, Z.2    Harchaoui, Z.3    Schmid, C.4
  • 53
  • 54
    • 80052885179 scopus 로고    scopus 로고
    • High-dimensional signature compression for large-scale image classification
    • Sánchez, J., & Perronnin, F. (2011). High-dimensional signature compression for large-scale image classification. In CVPR.
    • (2011) CVPR
    • Sánchez, J.1
  • 55
    • 84866552328 scopus 로고    scopus 로고
    • Modeling the spatial layout of images beyond spatial pyramids
    • 10.1016/j.patrec.2012.07.019
    • Sánchez, J., Perronnin, F., & de Campos, T. (2012). Modeling the spatial layout of images beyond spatial pyramids. Pattern Recognition Letters, 33(16), 2216-2223.
    • (2012) Pattern Recognition Letters , vol.33 , Issue.16 , pp. 2216-2223
    • Sánchez, J.1    Perronnin, F.2    De Campos, T.3
  • 56
    • 48849117633 scopus 로고    scopus 로고
    • Pegasos: Primal estimate sub-gradient solver for SVM
    • Shalev-Shwartz, S., Singer, Y., & Srebro, N. (2007). Pegasos: Primal estimate sub-gradient solver for SVM. In ICML.
    • (2007) ICML
    • Shalev-Shwartz, S.1    Singer, Y.2    Srebro, N.3
  • 57
    • 0345414182 scopus 로고    scopus 로고
    • Video Google: A text retrieval approach to object matching in videos
    • Sivic, J., & Zisserman, A. (2003). Video Google: A text retrieval approach to object matching in videos. In ICCV.
    • (2003) ICCV
    • Sivic, J.1    Zisserman, A.2
  • 58
    • 0009588481 scopus 로고    scopus 로고
    • Speech recognition using SVMs
    • Smith, N., & Gales, M. (2001). Speech recognition using SVMs. In NIPS.
    • (2001) NIPS
    • Smith, N.1    Gales, M.2
  • 60
    • 34548040492 scopus 로고    scopus 로고
    • Asymptotic distribution of coordinates on high dimensional spheres
    • Spruill, M. (2007). Asymptotic distribution of coordinates on high dimensional spheres. In Electronic communications in probability (Vol. 12).
    • (2007) Electronic Communications in Probability , vol.12
    • Spruill, M.1
  • 63
    • 80052908300 scopus 로고    scopus 로고
    • Unbiased look at dataset bias
    • Torralba, A., & Efros, A. A. (2011). Unbiased look at dataset bias. In CVPR.
    • (2011) CVPR
    • Torralba, A.1    Efros, A.A.2
  • 64
    • 70450175500 scopus 로고    scopus 로고
    • What is the spatial extent of an object?
    • Uijlings, J., Smeulders, A., & Scha, R. (2009). What is the spatial extent of an object? In CVPR.
    • (2009) CVPR
    • Uijlings, J.1    Smeulders, A.2    Scha, R.3
  • 65
    • 77955426203 scopus 로고    scopus 로고
    • Evaluating color descriptors for object and scene recognition
    • 10.1109/TPAMI.2009.154
    • van de Sande, K., Gevers, T., & Snoek, C. (2010). Evaluating color descriptors for object and scene recognition. IEEE PAMI, 32(9), 1582-1596.
    • (2010) IEEE PAMI , vol.32 , Issue.9 , pp. 1582-1596
    • Van De Sande, K.1    Gevers, T.2    Snoek, C.3
  • 67
    • 77955989063 scopus 로고    scopus 로고
    • Efficient additive kernels via explicit feature maps
    • Vedaldi, A., & Zisserman, A. (2010). Efficient additive kernels via explicit feature maps. In CVPR.
    • (2010) CVPR
    • Vedaldi, A.1    Zisserman, A.2
  • 68
    • 84866644207 scopus 로고    scopus 로고
    • Sparse kernel approximations for efficient classification and detection
    • Vedaldi, A., & Zisserman, A. (2012). Sparse kernel approximations for efficient classification and detection. In CVPR.
    • (2012) CVPR
    • Vedaldi, A.1    Zisserman, A.2
  • 69
    • 0345414121 scopus 로고    scopus 로고
    • Recognition with local features: The kernel recipe
    • Wallraven, C., Caputo, B., & Graf, A. (2003). Recognition with local features: the kernel recipe. In ICCV.
    • (2003) ICCV
    • Wallraven, C.1    Caputo, B.2    Graf, A.3
  • 70
    • 77953194802 scopus 로고    scopus 로고
    • Learning image similarity from flickr groups using stochastic intersection kernel machines
    • Wang, G., Hoiem, D., & Forsyth, D. (2009). Learning image similarity from flickr groups using stochastic intersection kernel machines. In ICCV.
    • (2009) ICCV
    • Wang, G.1    Hoiem, D.2    Forsyth, D.3
  • 71
    • 77955996870 scopus 로고    scopus 로고
    • Locality-constrained linear coding for image classification
    • Wang, J., Yang, J., Yu, K., Lv, F., Huang, T., & Gong, Y. (2010). Locality-constrained linear coding for image classification. In CVPR.
    • (2010) CVPR
    • Wang, J.1    Yang, J.2    Yu, K.3    Lv, F.4    Huang, T.5    Gong, Y.6
  • 72
    • 50649119191 scopus 로고    scopus 로고
    • Object categorization by learned visual dictionary
    • Winn, J., Criminisi, A., & Minka, T. (2005). Object categorization by learned visual dictionary. In ICCV.
    • (2005) ICCV
    • Winn, J.1    Criminisi, A.2    Minka, T.3
  • 73
    • 77955988947 scopus 로고    scopus 로고
    • SUN database: Large-scale scene recognition from abbey to zoo
    • Xiao, J., Hays, J., Ehinger, K., Oliva, A., & Torralba, A. (2010). SUN database: Large-scale scene recognition from abbey to zoo. In CVPR.
    • (2010) CVPR
    • Xiao, J.1    Hays, J.2    Ehinger, K.3    Oliva, A.4    Torralba, A.5
  • 75
    • 77952494909 scopus 로고    scopus 로고
    • Group sensitive multiple kernel learning for object categorization
    • Yang, J., Li, Y., Tian, Y., Duan, L., & Gao, W. (2009). Group sensitive multiple kernel learning for object categorization. In ICCV.
    • (2009) ICCV
    • Yang, J.1    Li, Y.2    Tian, Y.3    Duan, L.4    Gao, W.5
  • 76
    • 70450209196 scopus 로고    scopus 로고
    • Linear spatial pyramid matching using sparse coding for image classification
    • Yang, J., Yu, K., Gong, Y., & Huang, T. (2009b). Linear spatial pyramid matching using sparse coding for image classification. In CVPR.
    • (2009) CVPR
    • Yang, J.1    Yu, K.2    Gong, Y.3    Huang, T.4
  • 78
    • 33846580425 scopus 로고    scopus 로고
    • Local features and kernels for classification of texture and object categories: A comprehensive study
    • 10.1007/s11263-006-9794-4
    • Zhang, J., Marszalek, M., Lazebnik, S., & Schmid, C. (2007). Local features and kernels for classification of texture and object categories: A comprehensive study. International Journal of Computer Vision, 73(2), 123-138.
    • (2007) International Journal of Computer Vision , vol.73 , Issue.2 , pp. 123-138
    • Zhang, J.1    Marszalek, M.2    Lazebnik, S.3    Schmid, C.4
  • 79
    • 80052886214 scopus 로고    scopus 로고
    • Image classification using super-vector coding of local image descriptors
    • Zhou, Z., Yu, K., Zhang, T., & Huang, T. (2010). Image classification using super-vector coding of local image descriptors. In ECCV.
    • (2010) ECCV
    • Zhou, Z.1    Yu, K.2    Zhang, T.3    Huang, T.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.