SCOPUS 정보 검색 플랫폼

Pattern Recognition Letters

Volumn 31, Issue 12, 2010, Pages 1535-1542

A learning approach to hierarchical feature selection and aggregation for audio classification

(3) Ruvolo, Paul a Fasel, Ian b Movellan, Javier R a

a UNIVERSITY OF CALIFORNIA (United States)

b University of Arizona (United States)

Author keywords

Audio classification; Feature aggregation; Feature selection; Temporal modeling

Indexed keywords

AUDIO CLASSIFICATION; FEATURE AGGREGATION; FEATURE SELECTION; HIERARCHICAL FEATURES; LEARNING APPROACH; LOW-LEVEL FEATURES; MACHINE LEARNING METHODS; PERFORMANCE GAIN; TEMPORAL AGGREGATION; TEMPORAL MODELING;

AUDIO ACOUSTICS; LEARNING SYSTEMS;

FEATURE EXTRACTION;

EID: 77955560086 PISSN: 01678655 EISSN: None Source Type: Journal
DOI: 10.1016/j.patrec.2009.12.036 Document Type: Article

Times cited : (21)

References (42)

1
- 77955556919
- Self-optimized spectral correlation method for background music identification
- Abe, M.; Nishiguchi, M.; 2002. Self-optimized spectral correlation method for background music identification. In: Proc. IEEE ICME'02, Lausanne, pp. 333-336.
- (2002) Proc. IEEE ICME'02, Lausanne , pp. 333-336
- Abe, M.¹ Nishiguchi, M.²

2
- 2942747947
- Representing musical genre: A state of the art
- J. Aucouturier, and F. Pachet Representing musical genre: A state of the art J. New Music Res. 32 1 2003 1 12
- (2003) J. New Music Res. , vol.32 , Issue.1 , pp. 1-12
- Aucouturier, J.¹ Pachet, F.²

3
- 34547505718
- Audio information retrieval using semantic similarity
- Barrington, L.; Chan, A.; Turnbull, D.; Lanckriet, G.; 2007. Audio information retrieval using semantic similarity. In: IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP), Vol. 2, pp. 725-728.
- (2007) IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP) , vol.2 , pp. 725-728
- Barrington, L.¹ Chan, A.² Turnbull, D.³ Lanckriet, G.⁴

4
- 33751531805
- Aggregate features and adaboost for music classification
- J. Bergstra, N. Casagrande, D. Erhan, D. Eck, and B. Kégl Aggregate features and adaboost for music classification Machine Learning 65 2-3 2006 473 484
- (2006) Machine Learning , vol.65 , Issue.23 , pp. 473-484
- Bergstra, J.¹ Casagrande, N.² Erhan, D.³ Eck, D.⁴ Kégl, B.⁵

5
- 0003517937
- Wiley-Interscience
- P. Bloomfield Fourier Analysis of Time Series: An Introduction (Wiley Series in Probability and Statistics) 2000 Wiley-Interscience
- (2000) Fourier Analysis of Time Series: An Introduction (Wiley Series in Probability and Statistics)
- Bloomfield, P.¹

6
- 34247610490
- A database of german emotional speech
- Burkhardt, F.; Paeschke, A.; Rolfes, M.; Sendlmeiser, W.; Weiss, B.; 2005. A database of german emotional speech. In: Interspeech Proceedings.
- (2005) Interspeech Proceedings
- Burkhardt, F.¹ Paeschke, A.² Rolfes, M.³ Sendlmeiser, W.⁴ Weiss, B.⁵

7
- 84873550748
- Frame-level audio feature extraction using adaboost
- University of London London
- N. Casagrande, D. Eck, and B. Kégl Frame-level audio feature extraction using adaboost Proc. 6th Internat. Conf. on Music Information Retrieval 2005 University of London London 345 350
- (2005) Proc. 6th Internat. Conf. on Music Information Retrieval , pp. 345-350
- Casagrande, N.¹ Eck, D.² Kégl, B.³

8
- 84923878812
- Geometry in sound: A speech/music audio classifier inspired by an image classifier
- Barcelona, Spain
- Casagrande, N.; Eck, D.; Kégl, B.; 2005b. Geometry in sound: A speech/music audio classifier inspired by an image classifier. In: ICMC 2005, Barcelona, Spain.
- (2005) ICMC 2005
- Casagrande, N.¹ Eck, D.² Kégl, B.³

9
- 34247578210
- Where am I? Scene recognition for mobile robots using audio features
- Chu, S.; Narayanan, S.; Kuo, C.-C.J.; Matarić, M.J.; 2006. Where am I? Scene recognition for mobile robots using audio features. In: IEEE Internat. Conf. on Multimedia & Expo (ICME).
- (2006) IEEE Internat. Conf. on Multimedia & Expo (ICME)
- Chu, S.¹ Narayanan, S.² Kuo, C.-C.J.³ Matarić, M.J.⁴

10
- 0030649155
- Psychoacoustical roughness: Implementation of an optimized model
- P. Daniel, and R. Weber Psychoacoustical roughness: Implementation of an optimized model Acustica 83 1997 113 123
- (1997) Acustica , vol.83 , pp. 113-123
- Daniel, P.¹ Weber, R.²

11
- 0000029159
- Structure driven image database retrieval
- De Bonet, J.; Viola, P.; 1997. Structure driven image database retrieval. In: Advances in Neural Information Processing, Vol. 10.
- (1997) Advances in Neural Information Processing , vol.10
- De Bonet, J.¹ Viola, P.²

12
- 85085181752
- Classification of music signals in the visual domain
- Limerick, Ireland
- Deshpande, H.; Singh, R.; Nam, U.; 2001. Classification of music signals in the visual domain. In: Proc. COST G-6 Conf. on Digital Audio Effects (DAFX-01), Limerick, Ireland.
- (2001) Proc. COST G-6 Conf. on Digital Audio Effects (DAFX-01)
- Deshpande, H.¹ Singh, R.² Nam, U.³

13
- 10144252471
- A generative framework for real-time object detection and classification
- I. Fasel, B. Fortenberry, and J.R. Movellan A generative framework for real-time object detection and classification Comput. Vision Image Understanding 98 2005 182 210
- (2005) Comput. Vision Image Understanding , vol.98 , pp. 182-210
- Fasel, I.¹ Fortenberry, B.² Movellan, J.R.³

14
- 84892260159
- Springer-Verlag Berlin, Heidelberg, Germany
- H. Fastl, and E. Zwicker Psychoacoustics Facts and Models 1990 Springer-Verlag Berlin, Heidelberg, Germany
- (1990) Psychoacoustics Facts and Models
- Fastl, H.¹ Zwicker, E.²

15
- 0002978642
- Experiments with a new boosting algorithm
- Morgan Kaufmann, pp. 148-146
- Freund, Y.; Schapire, R.E.; 1996. Experiments with a new boosting algorithm. In: Proc. 13th Internat. Conf. on Machine Learning. Morgan Kaufmann, pp. 148-146.
- (1996) Proc. 13th Internat. Conf. on Machine Learning
- Freund, Y.¹ Schapire, R.E.²

16
- 0034164230
- Additive logistic regression: A statistical view of boosting
- J. Friedman, T. Hastie, and R. Tibshirani Additive logistic regression: A statistical view of boosting Ann. Statist. 28 2 2000 337 374
- (2000) Ann. Statist. , vol.28 , Issue.2 , pp. 337-374
- Friedman, J.¹ Hastie, T.² Tibshirani, R.³

17
- 0025110885
- Derivation of auditory filter shapes from notched-noise data
- B.R. Glasberg, and B.C.J. Moore Derivation of auditory filter shapes from notched-noise data Hearing Res. 47 1990 103 138
- (1990) Hearing Res. , vol.47 , pp. 103-138
- Glasberg, B.R.¹ Moore, B.C.J.²

18
- 0004215426
- Kluwer Academic Publishers
- F.W. Glover, and M. Laguna Tabu Search 1997 Kluwer Academic Publishers
- (1997) Tabu Search
- Glover, F.W.¹ Laguna, M.²

19
- 84891583348
- Wiley
- B. Gold, and N. Morgan Speech and Audio Signal Processing: Processing and Perception of Speech and Music 2000 Wiley
- (2000) Speech and Audio Signal Processing: Processing and Perception of Speech and Music
- Gold, B.¹ Morgan, N.²

20
- 34547940048
- Primitives-based evaluation and estimation of emotions in speech
- M. Grimm, K. Kroschel, E. Mower, and S. Narayanan Primitives-based evaluation and estimation of emotions in speech Speech Comm. 49 2007 787 800
- (2007) Speech Comm. , vol.49 , pp. 787-800
- Grimm, M.¹ Kroschel, K.² Mower, E.³ Narayanan, S.⁴

21
- 0004215702
- American Institute of Physics Press Woodbury, New York
- W.M. Hartmann Signals, Sound, and Sensation 1997 American Institute of Physics Press Woodbury, New York
- (1997) Signals, Sound, and Sensation
- Hartmann, W.M.¹

22
- 0022758627
- Filtering by repeated integration
- Heckbert, P.S.; 1986. Filtering by repeated integration. In: Internat. Conf. on Computer Graphics and Interactive Techniques, pp. 315-321.
- (1986) Internat. Conf. on Computer Graphics and Interactive Techniques , pp. 315-321
- Heckbert, P.S.¹

23
- 33646784009
- SOLAR: Sound object localization and retrieval in complex audio environments
- Hoiem, D.; Ke, Y.; Sukthankar, R.; 2005. SOLAR: Sound object localization and retrieval in complex audio environments. In: IEEE Internat. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 5, pp. 429-432.
- (2005) IEEE Internat. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) , vol.5 , pp. 429-432
- Hoiem, D.¹ Ke, Y.² Sukthankar, R.³

24
- 0003770709
- Kluwer Academic Boston
- J. Junqua, and J. Haton Robustness in Automatic Speech Recognition 1996 Kluwer Academic Boston
- (1996) Robustness in Automatic Speech Recognition
- Junqua, J.¹ Haton, J.²

25
- 0003456805
- Academic New York
- S.G. Mallat A Wavelet Tour of Signal Processing 1999 Academic New York
- (1999) A Wavelet Tour of Signal Processing
- Mallat, S.G.¹

26
- 0019728174
- Box-filtering techniques
- M.J. McDonnell Box-filtering techniques Comput. Graph. Image Process. 17 1 1981
- (1981) Comput. Graph. Image Process. , vol.17 , Issue.1
- McDonnell, M.J.¹

27
- 2942720260
- Features for audio and music classification
- Music Information Retrieval, Baltimore, MD, USA
- McKinney, M.F.; Breebaart, J.; 2003. Features for audio and music classification. In: ISMIR 2003, 4th Internat. Conf. on Music Information Retrieval, Baltimore, MD, USA.
- (2003) ISMIR 2003, 4th Internat. Conf.
- McKinney . M, F.¹ Breebaart, J.²

28
- 77955551578
- MPLab Tutorials.
- Movellan, J.R.; 2006. Tutorial on multinomial logistic regression. MPLab Tutorials. http://mplab.ucsd.edu.
- (2006) Tutorial on Multinomial Logistic Regression
- Movellan, J.R.¹

29
- 52049113229
- Automatic recognition of urban soundscenes
- S. Ntalampiras, I. Potamitis, and N. Fakotakis Automatic recognition of urban soundscenes G.A. Tsihrintzis, M. Virvou, R.J. Howlett, L.C. Jain, New Directions in Intelligent Interactive Multimedia Studies in Computational Intelligence vol. 142 2008 Springer 147 153
- (2008) Studies in Computational Intelligence , vol.142 , pp. 147-153
- Ntalampiras, S.¹ Potamitis, I.² Fakotakis, N.³

30
- 46749150988
- Exploring billions of audio features
- Eurasip (Ed.)
- Pachet, F.; Roy, P.; 2007. Exploring billions of audio features. In: Eurasip (Ed.), Proceedings of CBMI 07.
- (2007) Proceedings of CBMI 07
- Pachet, F.¹ Roy, P.²

31
- 0033329296
- Emotion in speech: Recognition and application to call centers
- Petrushin, V.; 1999. Emotion in speech: Recognition and application to call centers. In: Proc. Conf. on Artificial Neural Networks in Engineering (ANNIE '99).
- (1999) Proc. Conf. on Artificial Neural Networks in Engineering (ANNIE '99)
- Petrushin, V.¹

32
- 51649122549
- Auditory mood detection for social and educational robots
- Ruvolo, P.; Fasel, I.R.; Movellan, J.R.; 2008. Auditory mood detection for social and educational robots. In: ICRA, pp. 3551-3556.
- (2008) ICRA , pp. 3551-3556
- Ruvolo, P.¹ Fasel . I, R.² Movellan, J.R.³

33
- 67650269716
- Automatic cry detection in early childhood education settings
- Ruvolo, P.; Movellan, J.R.; 2008. Automatic cry detection in early childhood education settings. In: Proc. ICDL, pp. 204-208.
- (2008) Proc. ICDL , pp. 204-208
- Ruvolo, P.¹ Movellan, J.R.²

34
- 0031972902
- Tempo and beat analysis of acoustic musical signals
- E. Scheirer Tempo and beat analysis of acoustic musical signals J. Acoust. Soc. Amer. 103 1 1998 588 601
- (1998) J. Acoust. Soc. Amer. , vol.103 , Issue.1 , pp. 588-601
- Scheirer, E.¹

35
- 0030648077
- Construction and evaluation of a robust multifeature speech/music disciminator
- Scheirer, E.; Slaney, M.; 1997. Construction and evaluation of a robust multifeature speech/music disciminator. In: Proc. ICASSP.
- (1997) Proc. ICASSP
- Scheirer, E.¹ Slaney, M.²

36
- 0022246799
- Fast approximate realization of linear filters by translating cascading sum-box technique
- Shen, J.; Castan, S.; 1985. Fast approximate realization of linear filters by translating cascading sum-box technique. In: Proc. CVPR, pp. 678-680.
- (1985) Proc. CVPR , pp. 678-680
- Shen, J.¹ Castan, S.²

37
- 33745194491
- On multi-scale fourier transform analysis of speech signals
- Tyagi, V.; Bourlard, H.; 2003. On multi-scale fourier transform analysis of speech signals. IDIAP Research Report 03-32.
- (2003) IDIAP Research Report 03-32
- Tyagi, V.¹ Bourlard, H.²

38
- 0010053023
- Automatic musical genre classification of audio signals
- Bloomington, IN, USA
- Tzanetakis, G.; Essl, G.; Cook, P.; 2001. Automatic musical genre classification of audio signals. In: Proc. Internat. Symp. on Music Information Retrieval (ISMIR), Bloomington, IN, USA, pp. 205-210.
- (2001) Proc. Internat. Symp. on Music Information Retrieval (ISMIR) , pp. 205-210
- Tzanetakis, G.¹ Essl, G.² Cook, P.³

39
- 0003450542
- Springer-Verlag Heidelberg, DE
- V.N. Vapnik The Nature of Statistical Learning Theory 1995 Springer-Verlag Heidelberg, DE
- (1995) The Nature of Statistical Learning Theory
- Vapnik, V.N.¹

40
- 2142812371
- Robust real-time object detection
- P. Viola, and M. Jones Robust real-time object detection Internat. J. Comput. Vision. 57 2 2004 137 154
- (2004) Internat. J. Comput. Vision. , vol.57 , Issue.2 , pp. 137-154
- Viola, P.¹ Jones, M.²

41
- 0016036633
- Sharpness as an attribute of the timbre of steady sounds
- G. von Bismarck Sharpness as an attribute of the timbre of steady sounds Acustica 30 1974 159 172
- (1974) Acustica , vol.30 , pp. 159-172
- Von Bismarck, G.¹

42
- 0030242072
- Content-based classification, search, and retrieval of audio
- E. Wold, T. Blum, D. Keislar, and J. Wheaton Content-based classification, search, and retrieval of audio IEEE Multimedia 3 2 1996
- (1996) IEEE Multimedia , vol.3 , Issue.2
- Wold, E.¹ Blum, T.² Keislar, D.³ Wheaton, J.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.