메뉴 건너뛰기




Volumn 18, Issue 3, 2010, Pages 649-662

Towards timbre-invariant audio features for harmony-based music

Author keywords

Audio matching; Chroma feature; Mel frequency cepstral coefficient (MFCC); Music retrieval; Pitch feature; Timbre invariance

Indexed keywords

AUDIO FEATURES; CEPSTRAL COEFFICIENTS; CHROMA FEATURES; MEL-FREQUENCY CEPSTRAL COEFFICIENT (MFCC); MEL-FREQUENCY CEPSTRAL COEFFICIENTS; MUSIC RETRIEVAL; RETRIEVAL APPLICATIONS; SPECTRAL COMPONENTS; STATE OF THE ART;

EID: 76949093273     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2010.2041394     Document Type: Article
Times cited : (91)

References (39)
  • 1
    • 13144282752 scopus 로고    scopus 로고
    • Audio thumbnailing of popular music using chroma-based representations
    • Feb.
    • M. A. Bartsch and G. H. Wakefield, "Audio thumbnailing of popular music using chroma-based representations," IEEE Trans. Multimedia, vol.7, no.1, pp. 96-104, Feb. 2005.
    • (2005) IEEE Trans. Multimedia , vol.7 , Issue.1 , pp. 96-104
    • Bartsch, M.A.1    Wakefield, G.H.2
  • 2
    • 36549057588 scopus 로고    scopus 로고
    • Ph.D. dissertation, Universitat Pompeu Fabra (UPF), Barcelona, Spain
    • E. Gómez, "Tonal description of music audio signals," Ph.D. dissertation, Universitat Pompeu Fabra (UPF), Barcelona, Spain, 2006.
    • (2006) Tonal Description of Music Audio Signals
    • Gómez, E.1
  • 4
    • 55249094688 scopus 로고    scopus 로고
    • Synchronization of music data in score-, MIDI- and PCM-format
    • V. Arifi, M. Clausen, F. Kurth, and M. Müller, "Synchronization of music data in score-, MIDI- and PCM-format," Comput. in Musicol., vol.13, pp. 9-33, 2004.
    • (2004) Comput. in Musicol. , vol.13 , pp. 9-33
    • Arifi, V.1    Clausen, M.2    Kurth, F.3    Müller, M.4
  • 5
    • 33747191724 scopus 로고    scopus 로고
    • Music score alignment and computer accompaniment
    • R. Dannenberg and C. Raphael, "Music score alignment and computer accompaniment," Commun. ACM, Special Iss., vol.49, no.8, pp. 39-43, 2006.
    • (2006) Commun. ACM, Special Iss. , vol.49 , Issue.8 , pp. 39-43
    • Dannenberg, R.1    Raphael, C.2
  • 9
    • 85032762641 scopus 로고    scopus 로고
    • Semantic segmentation and summarization of music: Methods based on tonality and recurrent structure
    • Mar.
    • W. Chai, "Semantic segmentation and summarization of music: Methods based on tonality and recurrent structure," IEEE Signal Process. Mag., vol.23, no.2, pp. 124-132, Mar. 2006.
    • (2006) IEEE Signal Process. Mag. , vol.23 , Issue.2 , pp. 124-132
    • Chai, W.1
  • 11
    • 34147158903 scopus 로고    scopus 로고
    • A chorus section detection method for musical audio signals and its application to a music listening station
    • Sep.
    • M. Goto, "A chorus section detection method for musical audio signals and its application to a music listening station," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.5, pp. 1783-1794, Sep. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.5 , pp. 1783-1794
    • Goto, M.1
  • 12
    • 33846198007 scopus 로고    scopus 로고
    • Towards structural analysis of audio recordings in the presence of musical variations
    • M. Müller and F. Kurth, "Towards structural analysis of audio recordings in the presence of musical variations," EURASIP J. Adv. Signal Process., vol.1, 2007, Article ID 89686.
    • (2007) EURASIP J. Adv. Signal Process. , vol.1 , pp. 89686
    • Müller, M.1    Kurth, F.2
  • 13
    • 84873584066 scopus 로고    scopus 로고
    • Sequence representation of music structure using higherorder similarity matrix and maximum-likelihood approach
    • Vienna, Austria
    • G. Peeters, "Sequence representation of music structure using higherorder similarity matrix and maximum-likelihood approach," in Proc. Int. Conf. Music Inf. Retrieval (ISMIR), Vienna, Austria, 2007, pp. 35-40.
    • (2007) Proc. Int. Conf. Music Inf. Retrieval (ISMIR) , pp. 35-40
    • Peeters, G.1
  • 14
    • 68149101231 scopus 로고    scopus 로고
    • Algorithms for determining and labelling approximate hierarchical self-similarity
    • Vienna, Austria
    • C. Rhodes and M. Casey, "Algorithms for determining and labelling approximate hierarchical self-similarity," in Proc. Int. Conf. Music Inf. Retrieval (ISMIR), Vienna, Austria, 2007, pp. 41-46.
    • (2007) Proc. Int. Conf. Music Inf. Retrieval (ISMIR) , pp. 41-46
    • Rhodes, C.1    Casey, M.2
  • 15
    • 84873443097 scopus 로고    scopus 로고
    • Song intersection by approximate nearest neighbor search
    • Victoria, BC, Canada
    • M. Casey and M. Slaney, "Song intersection by approximate nearest neighbor search," in Proc. Int. Conf. Music Inf. Retrieval (ISMIR), Victoria, BC, Canada, 2006, pp. 144-149.
    • (2006) Proc. Int. Conf. Music Inf. Retrieval (ISMIR) , pp. 144-149
    • Casey, M.1    Slaney, M.2
  • 17
    • 70350074065 scopus 로고    scopus 로고
    • Chroma binary similarity and local alignment applied to cover song identification
    • Aug.
    • P. H. J. Serrà, E. Gómez, and X. Serra, "Chroma binary similarity and local alignment applied to cover song identification," IEEE Trans. Audio, Speech, Lang. Process., vol.16, no.6, pp. 1138-1151, Aug. 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.6 , pp. 1138-1151
    • Serrà, P.H.J.1    Gómez, E.2    Serra, X.3
  • 21
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Aug.
    • S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Process., vol.ASSP-28, no.4, pp. 357-366, Aug. 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 25
    • 47749132959 scopus 로고    scopus 로고
    • Acoustic chord transcription and key extraction from audio using key-dependent HMMs trained on synthesized audio
    • Feb.
    • K. Lee and M. Slaney, "Acoustic chord transcription and key extraction from audio using key-dependent HMMs trained on synthesized audio," IEEE Trans. Audio, Speech, Lang. Process., vol.16, no.2, pp. 291-301, Feb. 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.2 , pp. 291-301
    • Lee, K.1    Slaney, M.2
  • 26
    • 0026057076 scopus 로고
    • Calculation of a constant Q spectral transform
    • J. C. Brown, "Calculation of a constant Q spectral transform," J. Acoust. Soc. Amer., vol.89, no.1, 1991.
    • (1991) J. Acoust. Soc. Amer. , vol.89 , Issue.1
    • Brown, J.C.1
  • 27
    • 39649094860 scopus 로고    scopus 로고
    • Multipitch analysis of polyphonic music and speech signals using an auditory model
    • Feb.
    • A. Klapuri, "Multipitch analysis of polyphonic music and speech signals using an auditory model," IEEE Trans. Audio, Speech, Lang. Process., vol.16, no.2, pp. 255-266, Feb. 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.2 , pp. 255-266
    • Klapuri, A.1
  • 28
    • 84873453261 scopus 로고    scopus 로고
    • The sonic visualiser: A visualisation platform for semantic descriptors from musical signals
    • Victoria, BC, Canada
    • C. Cannam, C. Landone, M. Sandler, and J. P. Bello, "The sonic visualiser: A visualisation platform for semantic descriptors from musical signals," in Proc. Int. Conf. Music Inf. Retrieval (ISMIR), Victoria, BC, Canada, 2006.
    • (2006) Proc. Int. Conf. Music Inf. Retrieval (ISMIR)
    • Cannam, C.1    Landone, C.2    Sandler, M.3    Bello, J.P.4
  • 29
    • 84873572465 scopus 로고    scopus 로고
    • MIR in Matlab (II): A toolbox for musical feature extraction from audio
    • Vienna, Austria
    • O. Lartillot and P. Toiviainen, "MIR in Matlab (II): A toolbox for musical feature extraction from audio," in Proc. Int. Conf. Music Inf. Retrieval (ISMIR), Vienna, Austria, 2007, pp. 127-130.
    • (2007) Proc. Int. Conf. Music Inf. Retrieval (ISMIR) , pp. 127-130
    • Lartillot, O.1    Toiviainen, P.2
  • 31
    • 0009985115 scopus 로고    scopus 로고
    • Mel frequency cepstral coefficients for music modeling
    • Plymouth, MA
    • B. Logan, "Mel frequency cepstral coefficients for music modeling," in Proc. Int. Conf. Music Inf. Retrieval (ISMIR), Plymouth, MA, 2000, pp. 11-23.
    • (2000) Proc. Int. Conf. Music Inf. Retrieval (ISMIR) , pp. 11-23
    • Logan, B.1
  • 32
    • 0036648502 scopus 로고    scopus 로고
    • Musical genre classification of audio signals
    • Jul.
    • G. Tzanetakis and P. Cook, "Musical genre classification of audio signals," IEEE Trans. Speech Audio Process., vol.10, no.4, pp. 293-302, Jul. 2002.
    • (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.4 , pp. 293-302
    • Tzanetakis, G.1    Cook, P.2
  • 33
    • 0033690881 scopus 로고    scopus 로고
    • Musical instrument recognition using cepstral coefficients and temporal features
    • Istanbul, Turkey
    • A. Eronen and A. Klapuri, "Musical instrument recognition using cepstral coefficients and temporal features," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Istanbul, Turkey, 2000, pp. 753-756.
    • (2000) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 753-756
    • Eronen, A.1    Klapuri, A.2
  • 34
    • 13444292995 scopus 로고    scopus 로고
    • Contentbased music structure analysis with applications to music semantics understanding
    • New York
    • N. C. Maddage, C. Xu, M. S. Kankanhalli, and X. Shao, "Contentbased music structure analysis with applications to music semantics understanding," in Proc. ACM Int. Conf. Multimedia, New York, 2004, pp. 112-119.
    • (2004) Proc. ACM Int. Conf. Multimedia , pp. 112-119
    • Maddage, N.C.1    Xu, C.2    Kankanhalli, M.S.3    Shao, X.4
  • 38


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.