Volume 13, Issue 2, 2011, Pages 303-319

A survey of audio-based music classification and annotation

Author keywords

Acoustic signal processing; classification algorithms; feature extraction; music information retrieval

Indexed keywords

AUDIO-BASED; CLASSIFICATION ALGORITHMS; CLASSIFICATION TASKS; FUNDAMENTAL PROBLEM; GENRE CLASSIFICATION; INSTRUMENT RECOGNITION; MUSIC CLASSIFICATION; MUSIC DATA; MUSIC INDUSTRY; MUSIC INFORMATION RETRIEVAL; RAPID DEVELOPMENT; RECENT PROGRESS; RESEARCH AREAS; RESEARCH COMMUNITIES;

EID: 79952972450     PISSN: 15209210     EISSN: None     Source Type: Journal    
DOI: 10.1109/TMM.2010.2098858     Document Type: Article
Times cited: 377

References (149)
  • 1
    • 0036648502
    • Musical genre classification of audio signals
    • DOI 10.1109/TSA.2002.800560, PII 1011092002800560
    • G. Tzanetakis and P. Cook, "Musical genre classification of audio signals," IEEE Trans. Speech Audio Process., vol. 10, no. 5, pp. 293-302, 2002. (Pubitemid 34950067)
    • (2002) IEEE Transactions on Speech and Audio Processing , vol.10 , Issue.5 , pp. 293-302
    • Tzanetakis, G.1    Cook, P.2
  • 2
    • 1542439119
    • A comparative study of content-based music genre classification
    • T. Li, M. Ogihara, and Q. Li, "A comparative study of content-based music genre classification," in Proc. SIGIR, 2003.
    • (2003) Proc. SIGIR
    • Li, T.1    Ogihara, M.2    Li, Q.3
  • 3
    • 84873536955
    • Evaluation of feature extractors and psychoacoustic transformations for music genre classification
    • T. Lidy and A. Rauber, "Evaluation of feature extractors and psychoacoustic transformations for music genre classification," in Proc. Int. Conf. Music Information Retrieval, 2005.
    • (2005) Proc. Int. Conf. Music Information Retrieval
    • Lidy, T.1    Rauber, A.2
  • 4
    • 84873533162
    • An investigation of feature models for music genre classification using the support vector classifier
    • A. Meng and J. Shawe-Taylor, "An investigation of feature models for music genre classification using the support vector classifier," in Proc. Int. Conf. Music Information Retrieval, 2005.
    • (2005) Proc. Int. Conf. Music Information Retrieval
    • Meng, A.1    Shawe-Taylor, J.2
  • 5
    • 15544385732
    • Automatic feature extraction for classifying audio data
    • DOI 10.1007/s10994-005-5824-7
    • I. Mierswa and K. Morik, "Automatic feature extraction for classifying audio data," Mach. Learn., vol. 58, pp. 127-149, 2005. (Pubitemid 40400635)
    • (2005) Machine Learning , vol.58 , Issue.2-3 , pp. 127-149
    • Mierswa, I.1    Morik, K.2
  • 6
    • 33644624152
    • On the modelling of time information for automatic genre recognition systems in audio signals
    • N. Scaringella and G. Zoia, "On the modelling of time information for automatic genre recognition systems in audio signals," in Proc. Int. Conf. Music Information Retrieval, 2005.
    • (2005) Proc. Int. Conf. Music Information Retrieval
    • Scaringella, N.1    Zoia, G.2
  • 7
    • 17044405097
    • Fast recognition of musical genres using RBF networks
    • DOI 10.1109/TKDE.2005.62
    • D. Turnbull and C. Elkan, "Fast recognition of musical genres using RBF networks," IEEE Trans. Knowl. Data Eng., vol. 17, no. 4, pp. 580-584, 2005. (Pubitemid 40495598)
    • (2005) IEEE Transactions on Knowledge and Data Engineering , vol.17 , Issue.4 , pp. 580-584
    • Turnbull, D.1    Elkan, C.2
  • 9
    • 33751531805
    • Aggregate features and ADABOOST for music classification
    • DOI 10.1007/s10994-006-9019-7, Special Issue on Machine Learning in and for Music
    • J. Bergstra, N. Casagrande, D. Erhan, D. Eck, and B. Kegl, "Aggregate features and AdaBoost for music classification," Mach. Learn., vol. 65, no. 2-3, pp. 473-484, 2006. (Pubitemid 44836054)
    • (2006) Machine Learning , vol.65 , Issue.2-3 , pp. 473-484
    • Bergstra, J.1    Casagrande, N.2    Erhan, D.3    Eck, D.4    Kegl, B.5
  • 11
    • 33749540712
    • Understandable models of music collections based on exhaustive feature generation with temporal statistics
    • F. Mörchen, I. Mierswa, and A. Ultsch, "Understandable models of music collections based on exhaustive feature generation with temporal statistics," in Proc. ACM SIGKDD, 2006.
    • (2006) Proc. ACM SIGKDD
    • Mörchen, F.1    Mierswa, I.2    Ultsch, A.3
  • 12
    • 33646739998
    • Toward intelligent music information retrieval
    • T. Li and M. Ogihara, "Toward intelligent music information retrieval," IEEE Trans. Multimedia, vol. 8, no. 3, pp. 564-574, 2006.
    • (2006) IEEE Trans. Multimedia , vol.8 , Issue.3 , pp. 564-574
    • Li, T.1    Ogihara, M.2
  • 13
    • 33845675053
    • Towards effective content-based music retrieval with multiple acoustic feature combination
    • DOI 10.1109/TMM.2006.884618
    • J. Shen, J. Shepherd, and A. Ngu, "Towards effective content-based music retrieval with multiple acoustic feature combination," IEEE Trans. Multimedia, vol. 8, no. 6, pp. 1179-1189, 2006. (Pubitemid 44955683)
    • (2006) IEEE Transactions on Multimedia , vol.8 , Issue.6 , pp. 1179-1189
    • Shen, J.1    Shepherd, J.2    Ngu, A.H.H.3
  • 14
    • 84873596118
    • Improving genre classification by combination of audio and symbolic descriptors using a transcription system
    • T. Lidy, A. Rauber, A. Pertusa, and J. Inesta, "Improving genre classification by combination of audio and symbolic descriptors using a transcription system," in Proc. Int. Conf. Music Information Retrieval, 2007.
    • (2007) Proc. Int. Conf. Music Information Retrieval
    • Lidy, T.1    Rauber, A.2    Pertusa, A.3    Inesta, J.4
  • 15
    • 49549085544
    • Temporal feature integration for music genre classification
    • A. Meng, P. Ahrendt, and J. Larsen, "Temporal feature integration for music genre classification," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 5, pp. 1654-1664, 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.5 , pp. 1654-1664
    • Meng, A.1    Ahrendt, P.2    Larsen, J.3
  • 17
    • 37249002308
    • Content-based information fusion for semi-supervised music genre classification
    • DOI 10.1109/TMM.2007.911305
    • Y. Song and C. Zhang, "Content based information fusion for semisupervised music genre classification," IEEE Trans. Multimedia, vol. 10, no. 1, pp. 145-152, 2008. (Pubitemid 350281203)
    • (2008) IEEE Transactions on Multimedia , vol.10 , Issue.1 , pp. 145-152
    • Song, Y.1    Zhang, C.2
  • 18
    • 67349176070
    • Automatic music genre classification based on modulation spectral analysis of spectral and cepstral features
    • C.-H. Lin, J.-L. Shih, K.-M. Yu, and H.-S. Lin, "Automatic music genre classification based on modulation spectral analysis of spectral and cepstral features," IEEE Trans. Multimedia, vol. 11, no. 4, pp. 670-682, 2009.
    • (2009) IEEE Trans. Multimedia , vol.11 , Issue.4 , pp. 670-682
    • Lin, C.-H.1    Shih, J.-L.2    Yu, K.-M.3    Lin, H.-S.4
  • 19
    • 84873668627
    • Music genre classification using locality preserving non-negative tensor factorization and sparse representations
    • I. Panagakis, C. Kotropoulos, and G. R. Arce, "Music genre classification using locality preserving non-negative tensor factorization and sparse representations," in Proc. Int. Conf. Music Information Retrieval, 2009.
    • (2009) Proc. Int. Conf. Music Information Retrieval
    • Panagakis, I.1    Kotropoulos, C.2    Arce, G.R.3
  • 21
    • 69949146648
    • Music retrieval by detecting mood via computational media aesthetics
    • Y. Feng, Y. Zhuang, and Y. Pan, "Music retrieval by detecting mood via computational media aesthetics," in Proc. Int. Conf. Web Intelligence, 2003.
    • (2003) Proc. Int. Conf. Web Intelligence
    • Feng, Y.1    Zhuang, Y.2    Pan, Y.3
  • 28
    • 39649089103
    • Score-independent audio features for description of music expression
    • L. Mion and G. D. Poli, "Score-independent audio features for description of music expression," IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 2, pp. 458-466, 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.2 , pp. 458-466
    • Mion, L.1    Poli, G.D.2
  • 34
  • 36
    • 33746916789
    • Support vector machine active learning for music retrieval
    • DOI 10.1007/s00530-006-0032-2
    • M. Mandel, G. Poliner, and D. Ellis, "Support vector machine active learning for music retrieval," Multimedia Syst., vol. 12, no. 1, pp. 3-13, 2006. (Pubitemid 44199912)
    • (2006) Multimedia Systems , vol.12 , Issue.1 , pp. 3-13
    • Mandel, M.I.1    Poliner, G.E.2    Ellis, D.P.W.3
  • 37
    • 33744976289
    • Automatic singer recognition of popular music recordings via estimation and modeling of solo vocal signals
    • DOI 10.1109/TSA.2005.854091
    • W.-H. Tsai and H.-M. Wang, "Automatic singer recognition of popular music recordings via estimation and modeling of solo vocal signals," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 1, pp. 330-341, 2006. (Pubitemid 43863478)
    • (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.1 , pp. 330-341
    • Tsai, W.-H.1    Wang, H.-M.2
  • 38
    • 37849032910
    • Exploring vibrato-motivated acoustic features for singer identification
    • T.-L. Nwe and H. Li, "Exploring vibrato-motivated acoustic features for singer identification," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 2, pp. 519-530, 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.2 , pp. 519-530
    • Nwe, T.-L.1    Li, H.2
  • 39
    • 75649095708
    • A novel framework for efficient automated singer identification in large music databases
    • J. Shen, J. Shepherd, B. Cui, and K.-L. Tan, "A novel framework for efficient automated singer identification in large music databases," ACM Trans. Inf. Syst., vol. 27, no. 3, pp. 1-31, 2009.
    • (2009) ACM Trans. Inf. Syst. , vol.27 , Issue.3 , pp. 1-31
    • Shen, J.1    Shepherd, J.2    Cui, B.3    Tan, K.-L.4
  • 41
    • 0033005508
    • Computer identification of musical instruments using pattern recognition with cepstral coefficients as features
    • J. C. Brown, "Computer identification of musical instruments using pattern recognition with cepstral coefficients as features," J. Acoust. Soc. Amer., vol. 105, pp. 1933-1941, 1999.
    • (1999) J. Acoust. Soc. Amer. , vol.105 , pp. 1933-1941
    • Brown, J.C.1
  • 42
    • 17444399233
    • Musical instrument timbres classification with spectral features
    • DOI 10.1155/S1110865703210118
    • G. Agostini, M. Longari, and E. Pollastri, "Musical instrument timbres classification with spectral features," EURASIP J. Appl. Signal Process., vol. 2003, no. 1, pp. 5-14, 2003. (Pubitemid 41283787)
    • (2003) Eurasip Journal on Applied Signal Processing , vol.2003 , Issue.1 , pp. 5-14
    • Agostini, G.1    Longari, M.2    Pollastri, E.3
  • 44
    • 33744983081
    • Musical instrument recognition by pairwise classification strategies
    • DOI 10.1109/TSA.2005.860842
    • S. Essid, G. Richard, and B. David, "Musical instrument recognition by pairwise classification strategies," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 4, pp. 1401-1412, 2006. (Pubitemid 46552930)
    • (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.4 , pp. 1401-1412
    • Essid, S.1    Richard, G.2    David, B.3
  • 45
    • 33744991719
    • Instrument recognition in polyphonic music based on automatic taxonomies
    • DOI 10.1109/TSA.2005.860351
    • S. Essid, G. Richard, and B. David, "Instrument recognition in polyphonic music based on automatic taxonomies," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 1, pp. 68-80, 2006. (Pubitemid 43863454)
    • (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.1 , pp. 68-80
    • Essid, S.1    Richard, G.2    David, B.3
  • 46
    • 33846220762
    • Instrument identification in polyphonic music: Feature weighting to minimize influence of sound overlaps
    • T. Kitahara, M. Goto, K. Komatani, T. Ogata, and H. Okuno, "Instrument identification in polyphonic music: Feature weighting to minimize influence of sound overlaps," EURASIP J. Appl. Signal Process., vol. 2007, no. 1, pp. 155-155, 2007.
    • (2007) EURASIP J. Appl. Signal Process. , vol.2007 , Issue.1 , pp. 155-155
    • Kitahara, T.1    Goto, M.2    Komatani, K.3    Ogata, T.4    Okuno, H.5
  • 49
    • 84873605129
    • Scalability, generality and temporal aspects in automatic recognition of predominant musical instruments in polyphonic music
    • F. Fuhrmann, M. Haro, and P. Herrera, "Scalability, generality and temporal aspects in automatic recognition of predominant musical instruments in polyphonic music," in Proc. Int. Conf. Music Information Retrieval, 2009.
    • (2009) Proc. Int. Conf. Music Information Retrieval
    • Fuhrmann, F.1    Haro, M.2    Herrera, P.3
  • 50
    • 84873649320
    • Automatic identification of instrument classes in polyphonic and poly-instrument audio
    • P. Hamel, S. Wood, and D. Eck, "Automatic identification of instrument classes in polyphonic and poly-instrument audio," in Proc. Int. Conf. Music Information Retrieval, 2009.
    • (2009) Proc. Int. Conf. Music Information Retrieval
    • Hamel, P.1    Wood, S.2    Eck, D.3
  • 51
  • 58
    • 57049179026
    • Autotagger: A model for predicting social tags from acoustic features on large music databases
    • T. Bertin-Mahieux, D. Eck, F. Maillet, and P. Lamere, "Autotagger: A model for predicting social tags from acoustic features on large music databases," J. New Music Res., vol. 37, no. 2, pp. 115-135, 2008.
    • (2008) J. New Music Res. , vol.37 , Issue.2 , pp. 115-135
    • Bertin-Mahieux, T.1    Eck, D.2    Maillet, F.3    Lamere, P.4
  • 60
    • 63049114780
    • Music information retrieval using social tags and audio
    • M. Levy and M. Sandler, "Music information retrieval using social tags and audio," IEEE Trans. Multimedia, vol. 11, no. 3, pp. 383-395, 2009.
    • (2009) IEEE Trans. Multimedia , vol.11 , Issue.3 , pp. 383-395
    • Levy, M.1    Sandler, M.2
  • 67
    • 85032752479
    • Automatic genre classification of music content-A survey
    • N. Scaringella, G. Zoia, and D. Mlynek, "Automatic genre classification of music content-A survey," IEEE Signal Process. Mag., vol. 23, no. 2, pp. 133-141, 2006.
    • (2006) IEEE Signal Process. Mag. , vol.23 , Issue.2 , pp. 133-141
    • Scaringella, N.1    Zoia, G.2    Mlynek, D.3
  • 69
    • 64649105397
    • Content-based music information retrieval: Current directions and future challenges
    • M. Casey, R. Veltkamp, M. Goto, M. Leman, C. Rhodes, and M. Slaney, "Content-based music information retrieval: Current directions and future challenges," Proc. IEEE, vol. 96, no. 4, pp. 668-696, 2008.
    • (2008) Proc. IEEE , vol.96 , Issue.4 , pp. 668-696
    • Casey, M.1    Veltkamp, R.2    Goto, M.3    Leman, M.4    Rhodes, C.5    Slaney, M.6
  • 72
    • 33947683446
    • Musical instrument classification using non-negative matrix factorization algorithms and subset feature selection
    • E. Benetos, M. Kotti, and C. Kotropoulos, "Musical instrument classification using non-negative matrix factorization algorithms and subset feature selection," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, 2006.
    • (2006) Proc. Int. Conf. Acoustics, Speech, Signal Processing
    • Benetos, E.1    Kotti, M.2    Kotropoulos, C.3
  • 74
    • 2542463254
    • Audio classification based on MPEG-7 spectral basis representation
    • H. G. Kim, N. Moreau, and T. Sikora, "Audio classification based on MPEG-7 spectral basis representation," IEEE Trans. Circuits Syst. Video Technol., vol. 14, no. 5, pp. 716-725, 2004.
    • (2004) IEEE Trans. Circuits Syst. Video Technol. , vol.14 , Issue.5 , pp. 716-725
    • Kim, H.G.1    Moreau, N.2    Sikora, T.3
  • 76
    • 85008016199
    • Audio classification and categorization based on wavelets and support vector machine
    • C. C. Lin, S. H. Chen, T. K. Truong, and Y. Chang, "Audio classification and categorization based on wavelets and support vector machine," IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 644-651, 2005.
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.5 , pp. 644-651
    • Lin, C.C.1    Chen, S.H.2    Truong, T.K.3    Chang, Y.4
  • 78
    • 77953504754
    • Stereo panning information for music information retrieval tasks
    • G. Tzanetakis, L. Martins, K. McNally, and R. Jones, "Stereo panning information for music information retrieval tasks," J. Audio Eng. Soc., vol. 58, no. 5, pp. 409-417, 2010.
    • (2010) J. Audio Eng. Soc. , vol.58 , Issue.5 , pp. 409-417
    • Tzanetakis, G.1    Martins, L.2    McNally, K.3    Jones, R.4
  • 79
    • 0038376759
    • Content-based organization and visualization of music archives
    • E. Pampalk, A. Rauber, and D. Merkl, "Content-based organization and visualization of music archives," in Proc. ACM Multimedia, 2002.
    • (2002) Proc. ACM Multimedia
    • Pampalk, E.1    Rauber, A.2    Merkl, D.3
  • 81
    • 79952944449
    • Grove Music Online. [Online]. Available:
    • L. Macy, Grove Music Online. [Online]. Available: http://www.oxfordmusiconline.com/public/book/omo-gmo.
    • Macy, L.1
  • 82
    • 0026057076
    • Calculation of a constant Q spectral transform
    • J. C. Brown, "Calculation of a constant Q spectral transform," J. Acoust. Soc. Amer., vol. 89, no. 1, pp. 425-434, 1991.
    • (1991) J. Acoust. Soc. Amer. , vol.89 , Issue.1 , pp. 425-434
    • Brown, J.C.1
  • 84
  • 86
    • 33745000971
    • Improving timbre similarity: How high is the sky
    • J. Aucouturier and F. Pachet, "Improving timbre similarity: How high is the sky?," J. Negative Results Speech Audio Sci, vol. 1, no. 1, pp. 1-13, 2004.
    • (2004) J. Negative Results Speech Audio Sci , vol.1 , Issue.1 , pp. 1-13
    • Aucouturier, J.1    Pachet, F.2
  • 89
    • 39649095304
    • A general framework of progressive filtering and its application to query by singing/humming
    • J.-S. Jang and H.-R. Lee, "A general framework of progressive filtering and its application to query by singing/humming," IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 2, pp. 350-358, 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.2 , pp. 350-358
    • Jang, J.-S.1    Lee, H.-R.2
  • 90
    • 39649093552
    • Challenging uncertainty in query by humming systems: A fingerprinting approach
    • E. Unal, E. Chew, P. Georgiou, and S. Narayanan, "Challenging uncertainty in query by humming systems: A fingerprinting approach," IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 2, pp. 359-371, 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.2 , pp. 359-371
    • Unal, E.1    Chew, E.2    Georgiou, P.3    Narayanan, S.4
  • 91
    • 84873472146
    • The song remains the same: Identifying versions of the same piece using tonal descriptors
    • E. Gomez and P. Herrera, "The song remains the same: Identifying versions of the same piece using tonal descriptors," in Proc. Int. Conf. Music Information Retrieval, 2006.
    • (2006) Proc. Int. Conf. Music Information Retrieval
    • Gomez, E.1    Herrera, P.2
  • 92
    • 84873535580
    • A query-by-example technique for retrieving cover versions of popular songs with similar melodies
    • W. H. Tsai, H. M. Yu, and H. M. Wang, "A query-by-example technique for retrieving cover versions of popular songs with similar melodies," in Proc. Int. Conf. Music Information Retrieval, 2005.
    • (2005) Proc. Int. Conf. Music Information Retrieval
    • Tsai, W.H.1    Yu, H.M.2    Wang, H.M.3
  • 93
    • 70350074065
    • Chroma binary similarity and local alignment applied to cover song identification
    • J. Serra, E. Gomez, P. Herrera, and X. Serra, "Chroma binary similarity and local alignment applied to cover song identification," IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 6, pp. 1138-1151, 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.6 , pp. 1138-1151
    • Serra, J.1    Gomez, E.2    Herrera, P.3    Serra, X.4
  • 94
    • 0034319894
    • A computationally efficient multipitch analysis model
    • T. Tolonen and M. Karjalainen, "A computationally efficient multipitch analysis model," IEEE Trans. Speech Audio Process., vol. 8, no. 6, pp. 708-716, 2000.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.6 , pp. 708-716
    • Tolonen, T.1    Karjalainen, M.2
  • 95
    • 77955516770
    • Multiple fundamental frequency estimation based on harmonicity and spectral smoothness
    • A. Klapuri, "Multiple fundamental frequency estimation based on harmonicity and spectral smoothness," IEEE Trans. Speech Audio Process., 2000.
    • (2000) IEEE Trans. Speech Audio Process.
    • Klapuri, A.1
  • 97
    • 85099848325
    • Realtime chord recognition of musical sound: A system using common lisp music
    • T. Fujishima, "Realtime chord recognition of musical sound: A system using common lisp music," in Proc. Int. Computer Music Conf., 1999, pp. 464-467.
    • (1999) Proc. Int. Computer Music Conf. , pp. 464-467
    • Fujishima, T.1
  • 98
    • 36549057588
    • Ph.D. dissertation, Dept. Technol., Universitat Pompeu Fabra, Barcelona, Spain
    • E. Gomez, "Tonal description of music audio signals," Ph.D. dissertation, Dept. Technol., Universitat Pompeu Fabra, Barcelona, Spain, 2006.
    • (2006) Tonal Description of Music Audio Signals
    • Gomez, E.1
  • 99
    • 84873459578
    • A mid-level melody-based representation for calculating audio similarity
    • M. Marolt, "A mid-level melody-based representation for calculating audio similarity," in Proc. Int. Conf. Music Information Retrieval, 2006.
    • (2006) Proc. Int. Conf. Music Information Retrieval
    • Marolt, M.1
  • 103
    • 33745688777
    • Estimating the tonality of polyphonic audio files: Cognitive versus machine learning modelling strategies
    • E. Gomez and P. Herrera, "Estimating the tonality of polyphonic audio files: Cognitive versus machine learning modelling strategies," in Proc. Int. Conf. Music Information Retrieval, 2004.
    • (2004) Proc. Int. Conf. Music Information Retrieval
    • Gomez, E.1    Herrera, P.2
  • 104
    • 84923550097
    • Automatic chord recognition using enhanced pitch class profile
    • K. Lee, "Automatic chord recognition using enhanced pitch class profile," in Proc. Int. Computer Music Conf., 2006.
    • (2006) Proc. Int. Computer Music Conf.
    • Lee, K.1
  • 105
    • 84873605856
    • Audio-based cover song retrieval using approximate chord sequences: Testing shifts, gaps, swaps and beats
    • J. P. Bello, "Audio-based cover song retrieval using approximate chord sequences: Testing shifts, gaps, swaps and beats," in Proc. Int. Conf. Music Information Retrieval, 2007.
    • (2007) Proc. Int. Conf. Music Information Retrieval
    • Bello, J.P.1
  • 110
  • 114
    • 0032203257
    • Gradient-based learning applied to document recognition
    • Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," Proc. IEEE, vol. 86, no. 11, pp. 2278-2324, 1998.
    • (1998) Proc. IEEE , vol.86 , Issue.11 , pp. 2278-2324
    • Lecun, Y.1    Bottou, L.2    Bengio, Y.3    Haffner, P.4
  • 116
    • 84926662675
    • Nearest neighbor pattern classification
    • T. Cover and P. Hart, "Nearest neighbor pattern classification," IEEE Trans. Inf. Theory, vol. 13, no. 1, pp. 21-27, 1967.
    • (1967) IEEE Trans. Inf. Theory , vol.13 , Issue.1 , pp. 21-27
    • Cover, T.1    Hart, P.2
  • 119
    • 3042597440
    • Learning multi-label scene classification
    • DOI 10.1016/j.patcog.2004.03.009, PII S0031320304001074
    • M. Boutell, X. Shen, J. Luo, and C. Brown, "Learning multi-label semantic scene classification," Pattern Recognit., vol. 37, no. 9, pp. 1757-1771, 2004. (Pubitemid 38804465)
    • (2004) Pattern Recognition , vol.37 , Issue.9 , pp. 1757-1771
    • Boutell, M.R.1    Luo, J.2    Shen, X.3    Brown, C.M.4
  • 122
    • 39649092019
    • Music genre classification using nonnegative matrix factorization-based features
    • A. Holzapfel and Y. Stylianou, "Music genre classification using nonnegative matrix factorization-based features," IEEE Trans. Audio, Speech, Lang. Processing, vol. 16, no. 2, pp. 424-434, 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Processing , vol.16 , Issue.2 , pp. 424-434
    • Holzapfel, A.1    Stylianou, Y.2
  • 126
    • 72449148704
    • Improving automatic music tag annotation using stacked generalization of probabilistic SVM outputs
    • S. R. Ness, A. Theocharis, G. Tzanetakis, and L. G. Martins, "Improving automatic music tag annotation using stacked generalization of probabilistic SVM outputs," in Proc. ACM Multimedia, 2009.
    • (2009) Proc. ACM Multimedia
    • Ness, S.R.1    Theocharis, A.2    Tzanetakis, G.3    Martins, L.G.4
  • 127
    • 0031211090
    • A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting
    • Y. Freund and R. E. Schapire, "A decision-theoretic generalization of on-line learning and an application to boosting," J. Comput. Syst. Sci., vol. 55, no. 1, pp. 119-139, 1997. (Pubitemid 127433398)
    • (1997) Journal of Computer and System Sciences , vol.55 , Issue.1 , pp. 119-139
    • Freund, Y.1    Schapire, R.E.2
  • 130
    • 0029378080
    • Spectral shape analysis in the central auditory system
    • K. Wang and S. A. Shamma, "Spectral shape analysis in the central auditory system," IEEE Trans. Speech Audio Process., vol. 3, no. 5, pp. 382-396, 1995.
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.5 , pp. 382-396
    • Wang, K.1    Shamma, S.A.2
  • 132
    • 0001175419
    • Experimental studies of the elements of expression in music
    • K. Hevner, "Experimental studies of the elements of expression in music," Amer. J. Psychol., vol. 48, no. 2, pp. 246-268, 1936.
    • (1936) Amer. J. Psychol. , vol.48 , Issue.2 , pp. 246-268
    • Hevner, K.1
  • 134
    • 0141827752
    • On the dimensional and hierarchical structure of affect
    • A. Tellegen, D. Watson, and L. A. Clark, "On the dimensional and hierarchical structure of affect," Psychol. Sci., vol. 10, no. 4, pp. 297-303, 1999.
    • (1999) Psychol. Sci. , vol.10 , Issue.4 , pp. 297-303
    • Tellegen, A.1    Watson, D.2    Clark, L.A.3
  • 135
    • 84873596722
    • Exploring mood metadata: Relationships with genre, artist and usage metadata
    • X. Hu and J. S. Downie, "Exploring mood metadata: Relationships with genre, artist and usage metadata," in Proc. Int. Conf. Music Information Retrieval, 2007.
    • (2007) Proc. Int. Conf. Music Information Retrieval
    • Hu, X.1    Downie, J.S.2
  • 137
    • 71149087300
    • Learning dictionaries of stable autoregressive models for audio scene analysis
    • Y. Cho and L. K. Saul, "Learning dictionaries of stable autoregressive models for audio scene analysis," in Proc. Int. Conf. Machine Learning, 2009.
    • (2009) Proc. Int. Conf. Machine Learning
    • Cho, Y.1    Saul, L.K.2
  • 138
    • 70350446810
    • Improving multilabel analysis of music titles: A large-scale validation of the correction approach
    • F. Pachet and P. Roy, "Improving multilabel analysis of music titles: A large-scale validation of the correction approach," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 2, pp. 335-343, 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.2 , pp. 335-343
    • Pachet, F.1    Roy, P.2
  • 139
    • 46149118255
    • A discriminative Kernel-based approach to rank images from text queries
    • D. Grangier and S. Bengio, "A discriminative Kernel-based approach to rank images from text queries," IEEE Trans. Pattern Anal. Mach. Intell., vol. 30, no. 8, pp. 1371-1384, 2007.
    • (2007) IEEE Trans. Pattern Anal. Mach. Intell. , vol.30 , Issue.8 , pp. 1371-1384
    • Grangier, D.1    Bengio, S.2
  • 141
    • 79952959991
    • Semi-Supervised Learning Literature Survey. [Online]. Available:
    • J. Zhu, Semi-Supervised Learning Literature Survey. [Online]. Available: http://pages.cs.wisc.edu/~jerryzhu/research/ssl/semireview.html.
    • Zhu, J.1
  • 147
    • 57049173452
    • Scanning the dial: The rapid recognition of music genres
    • R. Gjerdingen and D. Perrott, "Scanning the dial: The rapid recognition of music genres," J. New Music Res., vol. 37, no. 2, pp. 93-100, 2008.
    • (2008) J. New Music Res. , vol.37 , Issue.2 , pp. 93-100
    • Gjerdingen, R.1    Perrott, D.2
  • 149
    • 55749112518
    • How many beans make five? The consensus problem in music-genre classification and a new evaluation method for single-genre categorisation systems
    • G. Wiggins and T. Crawford, "How many beans make five? The consensus problem in music-genre classification and a new evaluation method for single-genre categorisation systems," in Proc. Int. Conf. Music Information Retrieval, 2007.
    • (2007) Proc. Int. Conf. Music Information Retrieval
    • Wiggins, G.1    Crawford, T.2


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.