메뉴 건너뛰기




Volumn 22, Issue 2, 2014, Pages 556-575

Automatic chord estimation from audio: A review of the state of the art

Author keywords

Expert systems; Knowledge based systems; Machine learning; Music information retrieval; Supervised learning

Indexed keywords

DATA PROCESSING; EXPERT SYSTEMS; FEATURE EXTRACTION; INFORMATION RETRIEVAL; KNOWLEDGE BASED SYSTEMS; LEARNING SYSTEMS; SUPERVISED LEARNING; ARTIFICIAL INTELLIGENCE; DATA MINING;

EID: 84897934386     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASLP.2013.2294580     Document Type: Article
Times cited : (69)

References (105)
  • 1
    • 84897949418 scopus 로고    scopus 로고
    • 6th ed. Milwaukee, WI, USA, Hal LeonardCorp.
    • Various, The Real Book 6th ed. Milwaukee, WI, USA, Hal LeonardCorp., 2004.
    • (2004) Various, The Real Book
  • 3
    • 34547495486 scopus 로고    scopus 로고
    • Identifying 'cover songs' wisth chroma features and dynamic programming beat tracking
    • D. Ellis and G. Poliner, "Identifying 'cover songs' wisth chroma features and dynamic programming beat tracking," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2007, pp. 1429-1433.
    • Proc. Int. Conf. Acoust., Speech, Signal Process., 2007 , pp. 1429-1433
    • Ellis, D.1    Poliner, G.2
  • 4
    • 84873472146 scopus 로고    scopus 로고
    • The song remains the same: Identifying versions of the same piece using tonal descriptors
    • E. Gómez and P. Herrera, "The song remains the same: Identifying versions of the same piece using tonal descriptors," in Proc. 7th Int.Soc. Music Inf. Retrieval, 2006, pp. 180-185.
    • Proc. 7th Int.Soc. Music Inf. Retrieval, 2006 , pp. 180-185
    • Gómez, E.1    Herrera, P.2
  • 6
    • 24744446189 scopus 로고    scopus 로고
    • Key, chord, and rhythm tracking of popular music recordings
    • DOI 10.1162/0148926054798205
    • A. Shenoy and Y. Wang, "Key, chord, and rhythm tracking of popular music recordings," J. Comput. Music, vol. 29, no. 3, pp. 75-86, 2005. (Pubitemid 41294792)
    • (2005) Computer Music Journal , vol.29 , Issue.3 , pp. 75-86
    • Shenoy, A.1    Wang, Y.2
  • 7
    • 47749132959 scopus 로고    scopus 로고
    • Acoustic chord transcription and key extraction from audio using key-dependent HMMs trained on synthesized audio
    • Feb.
    • K. Lee and M. Slaney, "Acoustic chord transcription and key extraction from audio using key-dependent HMMs trained on synthesized audio,"IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 2, pp. 291-301, Feb. 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.2 , pp. 291-301
    • Lee, K.1    Slaney, M.2
  • 9
    • 70449674483 scopus 로고    scopus 로고
    • Genre classification using chords and stochastic language models
    • C. Perez-Sancho, D. Rizo, and J. Inesta, "Genre classification using chords and stochastic language models," Connect. Sci., vol. 21, no. 2-3,pp. 145-159, 2009.
    • (2009) Connect. Sci. , vol.21 , Issue.2-3 , pp. 145-159
    • Perez-Sancho, C.1    Rizo, D.2    Inesta, J.3
  • 11
    • 81155126217 scopus 로고    scopus 로고
    • Integrating additional chord information into HMM-based lyrics-to-audio alignment
    • Jan.
    • M. Mauch, H. Fujihara, and M. Goto, "Integrating additional chord information into HMM-based lyrics-to-audio alignment," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 200-210, Jan. 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.1 , pp. 200-210
    • Mauch, M.1    Fujihara, H.2    Goto, M.3
  • 12
    • 84873473666 scopus 로고    scopus 로고
    • Lyrics-to-audio alignment and phrase-level segmentation using incomplete internet-style chord annotations
    • M. Mauch, H. Fujihara, and M. Goto, "Lyrics-to-audio alignment and phrase-level segmentation using incomplete internet-style chord annotations,"in Proc. 7th Sound Music Comput. Conf., 2010, pp. 9-16.
    • Proc. 7th Sound Music Comput. Conf., 2010 , pp. 9-16
    • Mauch, M.1    Fujihara, H.2    Goto, M.3
  • 14
    • 79960524345 scopus 로고    scopus 로고
    • Using online chord databases to enhance chord recognition
    • M. McVicar, Y. Ni, R. Santos-Rodriguez, and T. De Bie, "Using online chord databases to enhance chord recognition," J. New Music Res., vol. 40, no. 2, pp. 139-152, 2011.
    • (2011) J. New Music Res. , vol.40 , Issue.2 , pp. 139-152
    • McVicar, M.1    Ni, Y.2    Santos-Rodriguez, R.3    De Bie, T.4
  • 18
    • 70149120051 scopus 로고    scopus 로고
    • Influences of signal processing, tone profiles, and chord progressions on a model for estimating the musical keyfrom audio
    • K. Noland and M. Sandler, "Influences of signal processing, tone profiles, and chord progressions on a model for estimating the musical keyfrom audio," J. Comput. Music, vol. 33, no. 1, pp. 42-56, 2009.
    • (2009) J. Comput. Music , vol.33 , Issue.1 , pp. 42-56
    • Noland, K.1    Sandler, M.2
  • 20
    • 36048990431 scopus 로고    scopus 로고
    • Mathematical representation of joint time-chroma distributions
    • G. Wakefield, "Mathematical representation of joint time-chroma distributions,"in Proc. Int. Symp. Opt. Sci., Eng. Instrum., 1999, vol. 99,pp. 18-23.
    • Proc. Int. Symp. Opt. Sci., Eng. Instrum., 1999 , vol.99 , pp. 18-23
    • Wakefield, G.1
  • 21
    • 84953652991 scopus 로고
    • Circularity in judgments of relative pitch
    • R. Shepard, "Circularity in judgments of relative pitch," J. Acoust. Soc. Amer., vol. 36, p. 2346, 1964.
    • (1964) J. Acoust. Soc. Amer. , vol.36 , pp. 2346
    • Shepard, R.1
  • 25
    • 84871610071 scopus 로고    scopus 로고
    • A music scene analysis system with the MRF-based information integration scheme
    • K. Kashino and N. Hagita, "A music scene analysis system with the MRF-based information integration scheme," in Proc. 13th Int. Conf. Pattern Recogn., 1996, vol. 2, pp. 725-729.
    • Proc. 13th Int. Conf. Pattern Recogn., 1996 , vol.2 , pp. 725-729
    • Kashino, K.1    Hagita, N.2
  • 28
    • 85099848325 scopus 로고    scopus 로고
    • Realtime chord recognition of musical sound: A system using Common Lisp Music
    • T. Fujishima, "Realtime chord recognition of musical sound: A system using Common Lisp Music," in Proc. Int. Comput. Music Conf., 1999, pp. 464-467.
    • Proc. Int. Comput. Music Conf., 1999 , pp. 464-467
    • Fujishima, T.1
  • 30
    • 0009448302 scopus 로고
    • Über den anschaulichen inhalt der quantentheoretischen kinematik und mechanik
    • W. Heisenberg, "Über den anschaulichen inhalt der quantentheoretischen kinematik und mechanik," Zeitschrift für Physik A Hadrons and Nuclei, vol. 43, no. 3, pp. 172-198, 1927.
    • (1927) Zeitschrift für Physik A Hadrons and Nuclei , vol.43 , Issue.3 , pp. 172-198
    • Heisenberg, W.1
  • 31
    • 0026057076 scopus 로고
    • Calculation of a Constant-Q spectral transform
    • J. Brown, "Calculation of a Constant-Q spectral transform," J. Acoust.Soc. Amer., vol. 89, no. 1, pp. 425-434, 1991.
    • (1991) J. Acoust.Soc. Amer. , vol.89 , Issue.1 , pp. 425-434
    • Brown, J.1
  • 39
    • 84863755854 scopus 로고    scopus 로고
    • Separation of a monaural audio signal into harmonic/percussive components by complementary diffusion on spectrogram
    • N. Ono, K. Miyamoto, J. Le Roux, H. Kameoka, and S. Sagayama, "Separation of a monaural audio signal into harmonic/percussive components by complementary diffusion on spectrogram," in Proc. Euro.Signal Process. Conf., 2008, pp. 445-450.
    • Proc. Euro.Signal Process. Conf., 2008 , pp. 445-450
    • Ono, N.1    Miyamoto, K.2    Le Roux, J.3    Kameoka, H.4    Sagayama, S.5
  • 42
    • 84873462366 scopus 로고    scopus 로고
    • Automatic chord recognition from audio using an HMM with supervised learning
    • K. Lee and M. Slaney, "Automatic chord recognition from audio using an HMM with supervised learning," in Proc. 7th Int. Soc. Music Inf. Retrieval, 2006, pp. 133-137.
    • Proc. 7th Int. Soc. Music Inf. Retrieval, 2006 , pp. 133-137
    • Lee, K.1    Slaney, M.2
  • 44
    • 77955798097 scopus 로고    scopus 로고
    • Simultaneous estimation of chords and musical context from audio
    • Aug.
    • M. Mauch and S. Dixon, "Simultaneous estimation of chords and musical context from audio," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 6, pp. 1280-1289, Aug. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.6 , pp. 1280-1289
    • Mauch, M.1    Dixon, S.2
  • 45
    • 70350627374 scopus 로고    scopus 로고
    • A novel chroma representation of polyphonic music based on multiple pitch tracking techniques
    • M. Varewyck, J. Pauwels, and J. Martens, "A novel chroma representation of polyphonic music based on multiple pitch tracking techniques,"in Proc. 16th Int. Conf. Multimedia, 2008, pp. 667-670.
    • Proc. 16th Int. Conf. Multimedia, 2008 , pp. 667-670
    • Varewyck, M.1    Pauwels, J.2    Martens, J.3
  • 47
    • 13444289674 scopus 로고    scopus 로고
    • Chord segmentation and recognition using em-trained Hidden Markov Models
    • A. Sheh and D. Ellis, "Chord segmentation and recognition using em-trained Hidden Markov Models," in Proc. 4th Int. Soc. Music Inf. Retrieval, 2003, pp. 183-189.
    • Proc. 4th Int. Soc. Music Inf. Retrieval, 2003 , pp. 183-189
    • Sheh, A.1    Ellis, D.2
  • 48
    • 84866503117 scopus 로고    scopus 로고
    • Automatic chord identification using a quantised chromagram
    • C. Harte and M. Sandler, "Automatic chord identification using a quantised chromagram," in Proc. Audio Eng. Soc., 2005, pp. 291-301.
    • Proc. Audio Eng. Soc., 2005 , pp. 291-301
    • Harte, C.1    Sandler, M.2
  • 52
    • 0032653176 scopus 로고    scopus 로고
    • Real-time beat tracking for drumless audio signals: Chord change detection for musical decisions
    • M. Goto and Y. Muraoka, "Real-time beat tracking for drumless audio signals: Chord change detection for musical decisions," Speech Commun., vol. 27, no. 3, pp. 311-335, 1999.
    • (1999) Speech Commun. , vol.27 , Issue.3 , pp. 311-335
    • Goto, M.1    Muraoka, Y.2
  • 55
    • 84905192591 scopus 로고    scopus 로고
    • Exploring common variations in state of the art chord recognition systems
    • T. Cho, R. Weiss, and J. Bello, "Exploring common variations in state of the art chord recognition systems," in Proc. Sound Music Comput. Conf., 2010, vol. 1.
    • Proc. Sound Music Comput. Conf., 2010 , vol.1
    • Cho, T.1    Weiss, R.2    Bello, J.3
  • 56
    • 2342600462 scopus 로고    scopus 로고
    • Ph.D. dissertation, Mass. Inst. of Technol., Cambridge, MA, USA
    • E. Chew, "Towards a mathematical model of tonality," Ph.D. dissertation, Mass. Inst. of Technol., Cambridge, MA, USA, 2000.
    • (2000) Towards a Mathematical Model of Tonality
    • Chew, E.1
  • 57
    • 51849168570 scopus 로고    scopus 로고
    • A unified system for chord transcription and key extraction using Hidden Markov Models
    • K. Lee and M. Slaney, "A unified system for chord transcription and key extraction using Hidden Markov Models," in Proc. Int. Conf. Music Inf. Retrieval, 2007, pp. 245-250.
    • Proc. Int. Conf. Music Inf. Retrieval, 2007 , pp. 245-250
    • Lee, K.1    Slaney, M.2
  • 58
    • 47749142870 scopus 로고    scopus 로고
    • A system for automatic chord transcription from audio using genre-specific Hidden Markov Models
    • K. Lee, "A system for automatic chord transcription from audio using genre-specific Hidden Markov Models," Adaptive Multimedial Retrieval: Retrieval, User, and Semantics, pp. 134-146, 2008.
    • (2008) Adaptive Multimedial Retrieval: Retrieval, User, and Semantics , pp. 134-146
    • Lee, K.1
  • 63
    • 0001927585 scopus 로고
    • On information and sufficiency
    • S. Kullback and R. Leibler, "On information and sufficiency," Ann. Math. Stat., vol. 22, no. 1, pp. 79-86, 1951.
    • (1951) Ann. Math. Stat. , vol.22 , Issue.1 , pp. 79-86
    • Kullback, S.1    Leibler, R.2
  • 67
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Feb.
    • L. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.1
  • 68
    • 0030287048 scopus 로고    scopus 로고
    • The expectation-maximization algorithm
    • Nov.
    • T. Moon, "The expectation-maximization algorithm," IEEE Signal Process. Mag., vol. 13, no. 6, pp. 47-60, Nov. 1996.
    • (1996) IEEE Signal Process. Mag. , vol.13 , Issue.6 , pp. 47-60
    • Moon, T.1
  • 69
    • 84905907351 scopus 로고    scopus 로고
    • Real-time implementation of HMM-based chord estimation in musical audio
    • T. Cho and J. Bello, "Real-time implementation of HMM-based chord estimation in musical audio," in Proc. Int. Comput. Music Conf., 2009,pp. 16-21.
    • Proc. Int. Comput. Music Conf., 2009 , pp. 16-21
    • Cho, T.1    Bello, J.2
  • 74
    • 84873631003 scopus 로고    scopus 로고
    • Use of hidden Markov models and factored language models for automatic chord recognition
    • M. Khadkevich and M. Omologo, "Use of hidden Markov models and factored language models for automatic chord recognition," in Proc. Int. Soc. Music Inf. Retrieval Conf., 2009, pp. 561-566.
    • Proc. Int. Soc. Music Inf. Retrieval Conf., 2009 , pp. 561-566
    • Khadkevich, M.1    Omologo, M.2
  • 75
    • 84873597170 scopus 로고    scopus 로고
    • A vocabulary-free infinity-gram model for nonparametric Bayesian chord progression analysis
    • K. Yoshii and M. Goto, "A vocabulary-free infinity-gram model for nonparametric Bayesian chord progression analysis," in Proc. 12th Int. Soc. Music Inf. Retrieval Conf., 2011, pp. 645-650.
    • Proc. 12th Int. Soc. Music Inf. Retrieval Conf., 2011 , pp. 645-650
    • Yoshii, K.1    Goto, M.2
  • 78
    • 79960524345 scopus 로고    scopus 로고
    • Using online chord databases to enhance chord recognition
    • M. McVicar, Y. Ni, R. Santos-Rodriguez, and T. De Bie, "Using online chord databases to enhance chord recognition," J. New Music Res., vol. 40, no. 2, pp. 139-152, 2011.
    • (2011) J. New Music Res. , vol.40 , Issue.2 , pp. 139-152
    • McVicar, M.1    Ni, Y.2    Santos-Rodriguez, R.3    De Bie, T.4
  • 80
    • 84873542451 scopus 로고    scopus 로고
    • Learning harmonic relationships in digital audio with Dirichlet-based Hidden Markov Models
    • J. Burgoyne and L. Saul, "Learning harmonic relationships in digital audio with Dirichlet-based Hidden Markov Models," in Proc. Int. Conf. Music Inf. Retrieval, 2005, pp. 438-443.
    • Proc. Int. Conf. Music Inf. Retrieval, 2005 , pp. 438-443
    • Burgoyne, J.1    Saul, L.2
  • 82
    • 84873595823 scopus 로고    scopus 로고
    • Approximate note transcription for the improved identification of difficult chords
    • M. Mauch and S. Dixon, "Approximate note transcription for the improved identification of difficult chords," in Proc. 11th Int. Soc. Music Inf. Retrieval Conf., 2010, pp. 135-140.
    • Proc. 11th Int. Soc. Music Inf. Retrieval Conf., 2010 , pp. 135-140
    • Mauch, M.1    Dixon, S.2
  • 83
    • 2942735564 scopus 로고    scopus 로고
    • A large-scale evaluation of acoustic and subjective music-similarity measures
    • A. Berenzweig, B. Logan, D. Ellis, and B. Whitman, "A large-scale evaluation of acoustic and subjective music-similarity measures," J. Comput. Music, vol. 28, no. 2, pp. 63-76, 2004.
    • (2004) J. Comput. Music , vol.28 , Issue.2 , pp. 63-76
    • Berenzweig, A.1    Logan, B.2    Ellis, D.3    Whitman, B.4
  • 85
    • 84919774094 scopus 로고    scopus 로고
    • New York, NY, USA: Oxford Univ. Press
    • F. Lerdahl, Tonal pitch space. New York, NY, USA: Oxford Univ. Press, 2005.
    • (2005) Tonal Pitch Space
    • Lerdahl, F.1
  • 93
    • 84887364231 scopus 로고    scopus 로고
    • Understanding effects of subjectivity in measuring chord estimation accuracy
    • Dec.
    • Y. Ni, M. McVicar, R. Santos-Rodriguez, and T. De Bie, "Understanding effects of subjectivity in measuring chord estimation accuracy," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no.12, pp. 2607-2615, Dec. 2013.
    • (2013) IEEE Trans. Audio, Speech, Lang. Process. , vol.21 , Issue.12 , pp. 2607-2615
    • Ni, Y.1    McVicar, M.2    Santos-Rodriguez, R.3    De Bie, T.4
  • 94
    • 77950265627 scopus 로고    scopus 로고
    • System and methods for recognizing sound and music signals in high noise and distortion
    • U.S. patent 6,990,453, Jan. 24
    • A. Wang and J. Smith, "System and methods for recognizing sound and music signals in high noise and distortion," U.S. patent 6,990,453, Jan. 24, 2006, III.
    • (2006)
    • Wang, A.1    Smith, J.2
  • 95
    • 75149146477 scopus 로고    scopus 로고
    • A unified probabilistic model for polyphonic music analysis
    • D. Temperley, "A unified probabilistic model for polyphonic music analysis," J. New Music Res., vol. 38, no. 1, pp. 3-18, 2009.
    • (2009) J. New Music Res. , vol.38 , Issue.1 , pp. 3-18
    • Temperley, D.1
  • 96
    • 78650975513 scopus 로고    scopus 로고
    • Sonic Visualiser: An open source application for viewing, analysing, and annotating music audiofiles
    • C. Cannam, C. Landone, and M. Sandler, "Sonic Visualiser: An open source application for viewing, analysing, and annotating music audiofiles," in Proc. ACM Multimedia 2010 Int. Conf., Oct. 2010, pp.1467-1468.
    • Proc. ACM Multimedia 2010 Int. Conf., Oct. 2010 , pp. 1467-1468
    • Cannam, C.1    Landone, C.2    Sandler, M.3
  • 103
    • 0037252945 scopus 로고    scopus 로고
    • Amazon.Com recommendations: Item-to-item collaborative filtering
    • Jan./Feb.
    • G. Linden, B. Smith, and J. York, "Amazon.com recommendations: Item-to-item collaborative filtering," IEEE Internet Comput., vol. 7,no. 1, pp. 76-80, Jan./Feb. 2003.
    • (2003) IEEE Internet Comput. , vol.7 , Issue.1 , pp. 76-80
    • Linden, G.1    Smith, B.2    York, J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.