메뉴 건너뛰기




Volumn 17, Issue 1, 2009, Pages 174-186

Temporal integration for audio classification with application to musical instrument classification

Author keywords

Alignment kernels; Audio classification; Music information retrieval (MIR); Musical instrument recognition; Support vector machine (SVM); Temporal feature integration

Indexed keywords

AUDIO CLASSIFICATION; CLASSIFICATION SYSTEM; EXPERIMENTAL STUDIES; INSTRUMENT RECOGNITION; MUSIC INFORMATION RETRIEVAL; MUSIC INFORMATION RETRIEVAL (MIR); MUSIC SIMILARITY; MUSICAL AUDIO; MUSICAL GENRE; MUSICAL PHRASE; STATE OF THE ART; SUPPORT VECTOR MACHINE (SVM); TEMPORAL FEATURE INTEGRATION; TEMPORAL INTEGRATION; TEMPORAL PROPERTY; TIME HORIZONS; VARIABLE FRAME LENGTH;

EID: 70350482320     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2008.2007613     Document Type: Article
Times cited : (110)

References (53)
  • 1
    • 34547529353 scopus 로고    scopus 로고
    • Automatic bass line transcription from streaming polyphonic audio
    • M. Ryynänen and A. Klapuri, "Automatic bass line transcription from streaming polyphonic audio," in Proc. IEEE ICASSP, Honolulu, HI, Apr. 2007, pp. IV-1437-IV-1440.
    • (2007) Proc. IEEE ICASSP, Honolulu, HI, Apr.
    • Ryynänen, M.1    Klapuri, A.2
  • 2
    • 0013301483 scopus 로고    scopus 로고
    • A study of musical instrument classification using Gaussian mixture models and support vector machines
    • Cambridge, MA
    • J. Marques and P. Moreno, "A study of musical instrument classification using Gaussian mixture models and support vector machines," Tech. Rep. Compaq Computer Corp., Cambridge, MA, 1999.
    • (1999) Tech. Rep. Compaq Computer Corp.
    • Marques, J.1    Moreno, P.2
  • 7
    • 0037503637 scopus 로고    scopus 로고
    • Timbre recognition with combined stationary and temporal features
    • S. Dubnov and X. Rodet, "Timbre recognition with combined stationary and temporal features," in Proc. Int. Comput. Music Conf., 1998.
    • (1998) Proc. Int. Comput. Music Conf.
    • Dubnov, S.1    Rodet, X.2
  • 8
    • 84873606114 scopus 로고    scopus 로고
    • Polyphonic instrument recognition using spectral clustering
    • L. G. Martins, J. J. Burred, G. Tzanetakis, and M. Lagrange, "Polyphonic instrument recognition using spectral clustering," in Proc. ISMIR, 2007, pp. 213-218.
    • (2007) Proc. ISMIR , pp. 213-218
    • Martins, L.G.1    Burred, J.J.2    Tzanetakis, G.3    Lagrange, M.4
  • 9
    • 84873430088 scopus 로고    scopus 로고
    • Instrument classification using hidden Markov models
    • M. Eichner, M. Wolff, and R. Hoffmann, "Instrument classification using hidden Markov models," in Proc. ISMIR, 2006, pp. 349-350.
    • (2006) Proc. ISMIR , pp. 349-350
    • Eichner, M.1    Wolff, M.2    Hoffmann, R.3
  • 10
    • 85084302182 scopus 로고    scopus 로고
    • Automatic musical genre classification of audio signals
    • G. Tzanetakis, G. Essl, and P. Cook, "Automatic musical genre classification of audio signals," in Proc. ISMIR, 2001.
    • (2001) Proc. ISMIR
    • Tzanetakis, G.1    Essl, G.2    Cook, P.3
  • 11
    • 84873528643 scopus 로고    scopus 로고
    • Song-level features and SVMs for music classification
    • M. Mandel and D. Ellis, "Song-level features and SVMs for music classification," in Proc. ISMIR, 2005, pp. 594-599.
    • (2005) Proc. ISMIR , pp. 594-599
    • Mandel, M.1    Ellis, D.2
  • 12
    • 33644624152 scopus 로고    scopus 로고
    • On the modeling of time information for automatic genre recognition systems in audio signals
    • N. Scaringella and G. Zoia, "On the modeling of time information for automatic genre recognition systems in audio signals," in Proc. ISMIR, 2005, pp. 666-671.
    • (2005) Proc. ISMIR , pp. 666-671
    • Scaringella, N.1    Zoia, G.2
  • 17
    • 79956265554 scopus 로고    scopus 로고
    • Musical instrument recognition using ica-based transform of features and discriminatively trained hmms
    • A. Eronen, "Musical instrument recognition using ica-based transform of features and discriminatively trained hmms," in Proc. 7th Int. Symp. Signal Process. and its Applicat., 2003, pp. 133-136.
    • (2003) Proc. 7th Int. Symp. Signal Process. and its Applicat , pp. 133-136
    • Eronen, A.1
  • 18
    • 46249124867 scopus 로고    scopus 로고
    • Musical instrument recognizer "instrogram" and its application to music retrieval based on instrumentation similarity
    • T. Kitahara, M. Goto, K. Komatani, T. Ogata, and H. G. Okuno, "Musical instrument recognizer "instrogram" and its application to music retrieval based on instrumentation similarity," in Proc. IEEE Int. Symp. Multimedia, 2006, pp. 265-274.
    • (2006) Proc. IEEE Int. Symp. Multimedia , pp. 265-274
    • Kitahara, T.1    Goto, M.2    Komatani, K.3    Ogata, T.4    Okuno, H.G.5
  • 19
    • 84893499976 scopus 로고    scopus 로고
    • On-line handwriting recognition with support vector machines-A kernel approach
    • C. Bahlmann, B. Haasdonk, and H. Burkhardt, "On-line handwriting recognition with support vector machines-A kernel approach," in Proc. 8th IWFHR, 2002, pp. 49-54.
    • (2002) Proc. 8th IWFHR , pp. 49-54
    • Bahlmann, C.1    Haasdonk, B.2    Burkhardt, H.3
  • 21
    • 34547535371 scopus 로고    scopus 로고
    • A kernel for time series based on global alignments
    • Apr.
    • M. Cuturi, J.-P. Vert, O. Birkenes, and T. Matsui, "A kernel for time series based on global alignments," in Proc. IEEE ICASSP, Apr. 2007, vol.2, pp. 413-416.
    • (2007) Proc. IEEE ICASSP , vol.2 , pp. 413-416
    • Cuturi, M.1    Vert, J.-P.2    Birkenes, O.3    Matsui, T.4
  • 22
    • 0017595342 scopus 로고
    • Multidimensional perceptual scaling of musical timbres
    • J. Grey, "Multidimensional perceptual scaling of musical timbres," J. Acoust. Soc. Amer., pp. 1270-1277, 1977.
    • (1977) J. Acoust. Soc. Amer. , pp. 1270-1277
    • Grey, J.1
  • 23
    • 33744991719 scopus 로고    scopus 로고
    • Instrument recognition in polyphonic music based on automatic taxonomies
    • Jan.
    • S. Essid, G. Richard, and B. David, "Instrument recognition in polyphonic music based on automatic taxonomies," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.1, pp. 68-80, Jan. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.1 , pp. 68-80
    • Essid, S.1    Richard, G.2    David, B.3
  • 24
    • 0029547750 scopus 로고
    • Automatic source identification of monophonic musical instrument sounds
    • I. Kaminskyj and A. Materka, "Automatic source identification of monophonic musical instrument sounds," in Proc. IEEE Int. Conf. Neural Netw., 1995, pp. 189-194.
    • (1995) Proc. IEEE Int. Conf. Neural Netw , pp. 189-194
    • Kaminskyj, I.1    Materka, A.2
  • 25
    • 0003407268 scopus 로고    scopus 로고
    • Sound-source recognition: A theory and computational model
    • Cambridge
    • K. D. Martin, "Sound-source recognition: A theory and computational model," Ph.D. dissertation, Mass. Inst. Technol., Cambridge, 1999.
    • (1999) Ph.D. dissertation, Mass. Inst. Technol.
    • Martin, K.D.1
  • 27
    • 33846174759 scopus 로고    scopus 로고
    • Instrument identification in polyphonic music: Feature weighting with mixed sounds, pitch-dependent timber modeling, and use of musical context
    • T. Kitahara, M. Goto, K. Komatani, T. Ogata, and H. Okuno, "Instrument identification in polyphonic music: Feature weighting with mixed sounds, pitch-dependent timber modeling, and use of musical context," in Proc. ISMIR, 2005, pp. 558-563.
    • (2005) Proc. ISMIR , pp. 558-563
    • Kitahara, T.1    Goto, M.2    Komatani, K.3    Ogata, T.4    Okuno, H.5
  • 29
    • 85208828224 scopus 로고    scopus 로고
    • Methodology and tools for the evaluation of automatic onset detection algorithms in music
    • P. Leveau, L. Daudet, and G. Richard, "Methodology and tools for the evaluation of automatic onset detection algorithms in music," in Proc. ISMIR, 2004, pp. 72-75.
    • (2004) Proc. ISMIR , pp. 72-75
    • Leveau, P.1    Daudet, L.2    Richard, G.3
  • 30
    • 2942528770 scopus 로고    scopus 로고
    • On the use of phase and energy for musical onset detection in the complex domain
    • Jun.
    • J. Bello, C. Duxburry,M. Davies, and M. Sandler, "On the use of phase and energy for musical onset detection in the complex domain," IEEE Signal Process. Lett., vol.11, no.6, pp. 553-556, Jun. 2004.
    • (2004) IEEE Signal Process. Lett. , vol.11 , Issue.6 , pp. 553-556
    • Bello, J.1    Duxburry, C.2    Davies, M.3    Sandler, M.4
  • 31
    • 0035100239 scopus 로고    scopus 로고
    • Feature dependence in the automatic identification of musical woodwind instruments
    • J. C. Brown, O. Houix, and S. McAdams, "Feature dependence in the automatic identification of musical woodwind instruments," J. Acoust. Soc. Amer., pp. 1064-1072, 2000.
    • (2000) J. Acoust. Soc. Amer. , pp. 1064-1072
    • Brown, J.C.1    Houix, O.2    McAdams, S.3
  • 32
    • 84884300840 scopus 로고    scopus 로고
    • Musical instrument recognition based on class pairwise feature selection
    • S. Essid, G. Richard, and B. David, "Musical instrument recognition based on class pairwise feature selection," in Proc. ISMIR, 2004, pp. 560-567.
    • (2004) Proc. ISMIR , pp. 560-567
    • Essid, S.1    Richard, G.2    David, B.3
  • 33
    • 33947647554 scopus 로고    scopus 로고
    • Automatic classification of large musical instrument databases using hierarchical classifiers with inertia ratio maximization
    • G. Peeters, "Automatic classification of large musical instrument databases using hierarchical classifiers with inertia ratio maximization," in Proc. 115th AES Convention, 2003.
    • (2003) Proc. 115th AES Convention
    • Peeters, G.1
  • 34
    • 33644626634 scopus 로고    scopus 로고
    • A Large Set of Audio Features for Sound Description (Similarity and Classification) in the Cuidado Project
    • G. Peeters, "A Large Set of Audio Features for Sound Description (Similarity and Classification) in the Cuidado Project," Tech. Rep. IRCAM, 2004.
    • (2004) Tech. Rep. IRCAM
    • Peeters, G.1
  • 35
    • 70350505412 scopus 로고    scopus 로고
    • ISO/IEC information technology - Multimedia content description interface - Part 4: Audio
    • ISO/IEC, "Information Technology - Multimedia Content Description Interface - Part 4: Audio", , Jun. 2001, Int. Standard ISO/IEC FDIS 15938-15944: 2001(E).
    • (2001) Jun. 2001, Int. Standard ISO/IEC FDIS, (E) , pp. 15938-15944
  • 36
    • 0030648077 scopus 로고    scopus 로고
    • Construction and evaluation of a robust multifeature speech/music discriminator
    • Munich, Germany
    • E. Scheirer and M. Slaney, "Construction and evaluation of a robust multifeature speech/music discriminator," in Proc. ICASSP '97, Munich, Germany, 1997, pp. 1331-1334.
    • (1997) Proc. ICASSP '97 , pp. 1331-1334
    • Scheirer, E.1    Slaney, M.2
  • 38
    • 33745561205 scopus 로고    scopus 로고
    • An introduction to feature and variable selection
    • I. Guyon and A. Elisseeff, "An introduction to feature and variable selection," J. Mach. Learn. Res., pp. 1157-1182, 2003.
    • (2003) J. Mach. Learn. Res. , pp. 1157-1182
    • Guyon, I.1    Elisseeff, A.2
  • 41
    • 0003243224 scopus 로고    scopus 로고
    • Probabilistic outputs for support vector machines and comparison to regularized likelihood methods
    • A. Smola, P. Bartlett, B. Schölkopf, and D. Schuurmans, Eds. Cambridge, MA: MIT Press
    • J. Platt, "Probabilistic outputs for support vector machines and comparison to regularized likelihood methods," in Advances in Large Margin Classifiers, A. Smola, P. Bartlett, B. Schölkopf, and D. Schuurmans, Eds. Cambridge, MA: MIT Press, 1999.
    • (1999) Advances in Large Margin Classifiers
    • Platt, J.1
  • 44
    • 84873533162 scopus 로고    scopus 로고
    • An investigation of feature models for music genre classification using the support vector classifier
    • A. Meng and J. Shawe-Taylor, "An investigation of feature models for music genre classification using the support vector classifier," in Proc. ISMIR, 2005, pp. 604-609.
    • (2005) Proc. ISMIR , pp. 604-609
    • Meng, A.1    Shawe-Taylor, J.2
  • 46
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Feb.
    • L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol.77, no.2, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 47
    • 33645791324 scopus 로고    scopus 로고
    • What hmms can do
    • J. A. Bilmes, "What hmms can do," IEICE - Trans. Inf. Syst., vol.E89-D, no.3, pp. 869-891, 2006.
    • (2006) IEICE - Trans. Inf. Syst. , vol.E89-D , Issue.3 , pp. 869-891
    • Bilmes, J.A.1
  • 51
    • 70350505411 scopus 로고    scopus 로고
    • [Online]. Available:
    • T. Schneider and A. Neumaier, Arfit [Online]. Available: http://www.gps.caltech.edu/tapio/arfit/
    • Arfit
    • Schneider, T.1    Neumaier, A.2
  • 53
    • 70350472122 scopus 로고    scopus 로고
    • Murphy K.
    • [Online]. Available:
    • K. Murphy,HMMMatlab Toolbox [Online]. Available: http://www.cs. ubc.ca/murphyk/Software/HMM/hmm.html.
    • HMMMatlab Toolbox


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.