메뉴 건너뛰기




Volumn 31, Issue 12, 2010, Pages 1535-1542

A learning approach to hierarchical feature selection and aggregation for audio classification

Author keywords

Audio classification; Feature aggregation; Feature selection; Temporal modeling

Indexed keywords

AUDIO CLASSIFICATION; FEATURE AGGREGATION; FEATURE SELECTION; HIERARCHICAL FEATURES; LEARNING APPROACH; LOW-LEVEL FEATURES; MACHINE LEARNING METHODS; PERFORMANCE GAIN; TEMPORAL AGGREGATION; TEMPORAL MODELING;

EID: 77955560086     PISSN: 01678655     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.patrec.2009.12.036     Document Type: Article
Times cited : (21)

References (42)
  • 1
    • 77955556919 scopus 로고    scopus 로고
    • Self-optimized spectral correlation method for background music identification
    • Abe, M.; Nishiguchi, M.; 2002. Self-optimized spectral correlation method for background music identification. In: Proc. IEEE ICME'02, Lausanne, pp. 333-336.
    • (2002) Proc. IEEE ICME'02, Lausanne , pp. 333-336
    • Abe, M.1    Nishiguchi, M.2
  • 2
    • 2942747947 scopus 로고    scopus 로고
    • Representing musical genre: A state of the art
    • J. Aucouturier, and F. Pachet Representing musical genre: A state of the art J. New Music Res. 32 1 2003 1 12
    • (2003) J. New Music Res. , vol.32 , Issue.1 , pp. 1-12
    • Aucouturier, J.1    Pachet, F.2
  • 4
  • 8
    • 84923878812 scopus 로고    scopus 로고
    • Geometry in sound: A speech/music audio classifier inspired by an image classifier
    • Barcelona, Spain
    • Casagrande, N.; Eck, D.; Kégl, B.; 2005b. Geometry in sound: A speech/music audio classifier inspired by an image classifier. In: ICMC 2005, Barcelona, Spain.
    • (2005) ICMC 2005
    • Casagrande, N.1    Eck, D.2    Kégl, B.3
  • 10
    • 0030649155 scopus 로고    scopus 로고
    • Psychoacoustical roughness: Implementation of an optimized model
    • P. Daniel, and R. Weber Psychoacoustical roughness: Implementation of an optimized model Acustica 83 1997 113 123
    • (1997) Acustica , vol.83 , pp. 113-123
    • Daniel, P.1    Weber, R.2
  • 16
    • 0034164230 scopus 로고    scopus 로고
    • Additive logistic regression: A statistical view of boosting
    • J. Friedman, T. Hastie, and R. Tibshirani Additive logistic regression: A statistical view of boosting Ann. Statist. 28 2 2000 337 374
    • (2000) Ann. Statist. , vol.28 , Issue.2 , pp. 337-374
    • Friedman, J.1    Hastie, T.2    Tibshirani, R.3
  • 17
    • 0025110885 scopus 로고
    • Derivation of auditory filter shapes from notched-noise data
    • B.R. Glasberg, and B.C.J. Moore Derivation of auditory filter shapes from notched-noise data Hearing Res. 47 1990 103 138
    • (1990) Hearing Res. , vol.47 , pp. 103-138
    • Glasberg, B.R.1    Moore, B.C.J.2
  • 20
    • 34547940048 scopus 로고    scopus 로고
    • Primitives-based evaluation and estimation of emotions in speech
    • M. Grimm, K. Kroschel, E. Mower, and S. Narayanan Primitives-based evaluation and estimation of emotions in speech Speech Comm. 49 2007 787 800
    • (2007) Speech Comm. , vol.49 , pp. 787-800
    • Grimm, M.1    Kroschel, K.2    Mower, E.3    Narayanan, S.4
  • 21
    • 0004215702 scopus 로고    scopus 로고
    • American Institute of Physics Press Woodbury, New York
    • W.M. Hartmann Signals, Sound, and Sensation 1997 American Institute of Physics Press Woodbury, New York
    • (1997) Signals, Sound, and Sensation
    • Hartmann, W.M.1
  • 27
    • 2942720260 scopus 로고    scopus 로고
    • Features for audio and music classification
    • Music Information Retrieval, Baltimore, MD, USA
    • McKinney, M.F.; Breebaart, J.; 2003. Features for audio and music classification. In: ISMIR 2003, 4th Internat. Conf. on Music Information Retrieval, Baltimore, MD, USA.
    • (2003) ISMIR 2003, 4th Internat. Conf.
    • McKinney . M, F.1    Breebaart, J.2
  • 29
    • 52049113229 scopus 로고    scopus 로고
    • Automatic recognition of urban soundscenes
    • S. Ntalampiras, I. Potamitis, and N. Fakotakis Automatic recognition of urban soundscenes G.A. Tsihrintzis, M. Virvou, R.J. Howlett, L.C. Jain, New Directions in Intelligent Interactive Multimedia Studies in Computational Intelligence vol. 142 2008 Springer 147 153
    • (2008) Studies in Computational Intelligence , vol.142 , pp. 147-153
    • Ntalampiras, S.1    Potamitis, I.2    Fakotakis, N.3
  • 30
    • 46749150988 scopus 로고    scopus 로고
    • Exploring billions of audio features
    • Eurasip (Ed.)
    • Pachet, F.; Roy, P.; 2007. Exploring billions of audio features. In: Eurasip (Ed.), Proceedings of CBMI 07.
    • (2007) Proceedings of CBMI 07
    • Pachet, F.1    Roy, P.2
  • 32
    • 51649122549 scopus 로고    scopus 로고
    • Auditory mood detection for social and educational robots
    • Ruvolo, P.; Fasel, I.R.; Movellan, J.R.; 2008. Auditory mood detection for social and educational robots. In: ICRA, pp. 3551-3556.
    • (2008) ICRA , pp. 3551-3556
    • Ruvolo, P.1    Fasel . I, R.2    Movellan, J.R.3
  • 33
    • 67650269716 scopus 로고    scopus 로고
    • Automatic cry detection in early childhood education settings
    • Ruvolo, P.; Movellan, J.R.; 2008. Automatic cry detection in early childhood education settings. In: Proc. ICDL, pp. 204-208.
    • (2008) Proc. ICDL , pp. 204-208
    • Ruvolo, P.1    Movellan, J.R.2
  • 34
    • 0031972902 scopus 로고    scopus 로고
    • Tempo and beat analysis of acoustic musical signals
    • E. Scheirer Tempo and beat analysis of acoustic musical signals J. Acoust. Soc. Amer. 103 1 1998 588 601
    • (1998) J. Acoust. Soc. Amer. , vol.103 , Issue.1 , pp. 588-601
    • Scheirer, E.1
  • 35
    • 0030648077 scopus 로고    scopus 로고
    • Construction and evaluation of a robust multifeature speech/music disciminator
    • Scheirer, E.; Slaney, M.; 1997. Construction and evaluation of a robust multifeature speech/music disciminator. In: Proc. ICASSP.
    • (1997) Proc. ICASSP
    • Scheirer, E.1    Slaney, M.2
  • 36
    • 0022246799 scopus 로고
    • Fast approximate realization of linear filters by translating cascading sum-box technique
    • Shen, J.; Castan, S.; 1985. Fast approximate realization of linear filters by translating cascading sum-box technique. In: Proc. CVPR, pp. 678-680.
    • (1985) Proc. CVPR , pp. 678-680
    • Shen, J.1    Castan, S.2
  • 37
    • 33745194491 scopus 로고    scopus 로고
    • On multi-scale fourier transform analysis of speech signals
    • Tyagi, V.; Bourlard, H.; 2003. On multi-scale fourier transform analysis of speech signals. IDIAP Research Report 03-32.
    • (2003) IDIAP Research Report 03-32
    • Tyagi, V.1    Bourlard, H.2
  • 40
    • 2142812371 scopus 로고    scopus 로고
    • Robust real-time object detection
    • P. Viola, and M. Jones Robust real-time object detection Internat. J. Comput. Vision. 57 2 2004 137 154
    • (2004) Internat. J. Comput. Vision. , vol.57 , Issue.2 , pp. 137-154
    • Viola, P.1    Jones, M.2
  • 41
    • 0016036633 scopus 로고
    • Sharpness as an attribute of the timbre of steady sounds
    • G. von Bismarck Sharpness as an attribute of the timbre of steady sounds Acustica 30 1974 159 172
    • (1974) Acustica , vol.30 , pp. 159-172
    • Von Bismarck, G.1
  • 42
    • 0030242072 scopus 로고    scopus 로고
    • Content-based classification, search, and retrieval of audio
    • E. Wold, T. Blum, D. Keislar, and J. Wheaton Content-based classification, search, and retrieval of audio IEEE Multimedia 3 2 1996
    • (1996) IEEE Multimedia , vol.3 , Issue.2
    • Wold, E.1    Blum, T.2    Keislar, D.3    Wheaton, J.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.