메뉴 건너뛰기




Volumn 19, Issue 8, 2011, Pages 2385-2397

A conditional random field framework for robust and scalable audio-to-score matching

Author keywords

Audio signal processing; conditional random fields (CRFs); machine learning; music; music to score alignment; Viterbi algorithm

Indexed keywords

CONDITIONAL RANDOM FIELD; MACHINE-LEARNING; MUSIC; MUSIC-TO-SCORE ALIGNMENT; VITERBI;

EID: 80052630625     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2011.2134092     Document Type: Article
Times cited : (67)

References (43)
  • 1
    • 85137018710 scopus 로고
    • An on-line algorithm for real-time accompaniment
    • R. B. Dannenberg, "An on-line algorithm for real-time accompaniment," in Proc. ICMC, 1984, pp. 193-198.
    • (1984) Proc. ICMC , pp. 193-198
    • Dannenberg, R.B.1
  • 2
    • 0040983679 scopus 로고
    • Score following by temporal pattern
    • J. D. Vantomme, "Score following by temporal pattern," Comput. Music. J., vol. 19, no. 3, pp. 50-59, 1995.
    • (1995) Comput. Music. J. , vol.19 , Issue.3 , pp. 50-59
    • Vantomme, J.D.1
  • 3
    • 85067755171 scopus 로고    scopus 로고
    • Score following: State of the art and new developments
    • N. Orio, S. Lemouton, and D. Schwarz, "Score following: State of the art and new developments," in Proc. NIME, 2003, pp. 36-41.
    • (2003) Proc. NIME , pp. 36-41
    • Orio, N.1    Lemouton, S.2    Schwarz, D.3
  • 5
    • 41649111308 scopus 로고    scopus 로고
    • A classifier-based approach to score-guided source separation of musical audio
    • DOI 10.1162/comj.2008.32.1.51
    • C. Raphael, "A classifier-based approach to score-guided source separation of musical audio," Comput. Music. J., vol. 32, no. 1, pp. 51-59, 2008. (Pubitemid 351479759)
    • (2008) Computer Music Journal , vol.32 , Issue.1 , pp. 51-59
    • Raphael, C.1
  • 6
    • 85159692534 scopus 로고    scopus 로고
    • Score-performance matching using HMMs
    • P. Cano, A. Loscos, and J. Bonada, "Score-performance matching using HMMs," in Proc. ICMC, 1999, pp. 441-444.
    • (1999) Proc. ICMC , pp. 441-444
    • Cano, P.1    Loscos, A.2    Bonada, J.3
  • 7
    • 0032679638 scopus 로고    scopus 로고
    • Automatic segmentation of acoustic musical signals using hidden Markov models
    • Apr.
    • C. Raphael, "Automatic segmentation of acoustic musical signals using hidden Markov models," IEEE Trans. Pattern Anal. Machine Intell., vol. 21, no. 4, pp. 360-370, Apr. 1999.
    • (1999) IEEE Trans. Pattern Anal. Machine Intell. , vol.21 , Issue.4 , pp. 360-370
    • Raphael, C.1
  • 8
    • 85159602457 scopus 로고    scopus 로고
    • Enhanced vocal performance tracking using multiple information sources
    • L. Grubb and R. B. Dannenberg, "Enhanced vocal performance tracking using multiple information sources," in Proc. ICMC, 1998, pp. 37-44.
    • (1998) Proc. ICMC , pp. 37-44
    • Grubb, L.1    Dannenberg, R.B.2
  • 9
    • 33947679415 scopus 로고    scopus 로고
    • Realtime audio to score alignment for polyphonic music instruments using sparse non-negative constraints and hierarchical HMMs
    • A. Cont, "Realtime audio to score alignment for polyphonic music instruments using sparse non-negative constraints and hierarchical HMMs," in Proc. IEEE ICASSP, 2006, pp. 245-248.
    • (2006) Proc. IEEE ICASSP , pp. 245-248
    • Cont, A.1
  • 10
    • 85077067841 scopus 로고
    • Score following using the sung voice
    • M. Puckette, "Score following using the sung voice," in Proc. ICMC, 1995, pp. 175-178.
    • (1995) Proc. ICMC , pp. 175-178
    • Puckette, M.1
  • 11
    • 77951621999 scopus 로고    scopus 로고
    • A coupled duration-focused architecture for real-time music-to-score alignment
    • Jun.
    • A. Cont, "A coupled duration-focused architecture for real-time music-to-score alignment," IEEE Trans. Pattern Anal. Mach. Intell., vol. 32, no. 6, pp. 974-987, Jun. 2010.
    • (2010) IEEE Trans. Pattern Anal. Mach. Intell. , vol.32 , Issue.6 , pp. 974-987
    • Cont, A.1
  • 12
    • 84873661147 scopus 로고    scopus 로고
    • A discrete filterbank approach to audio to score matching for score following
    • N. Montecchio and N. Orio, "A discrete filterbank approach to audio to score matching for score following," in Proc. ISMIR, 2009, pp. 495-500.
    • (2009) Proc. ISMIR , pp. 495-500
    • Montecchio, N.1    Orio, N.2
  • 13
    • 84945120882 scopus 로고    scopus 로고
    • Polyphonic audio matching and alignment for music retrieval
    • N. Hu, R. B. Dannenberg, and G. Tzanetakis, "Polyphonic audio matching and alignment for music retrieval," in Proc. IEEE WASPAA, 2003, pp. 185-188.
    • (2003) Proc. IEEE WASPAA , pp. 185-188
    • Hu, N.1    Dannenberg, R.B.2    Tzanetakis, G.3
  • 14
    • 51449092591 scopus 로고    scopus 로고
    • Path-constrained partial music synchronization
    • M. Müller and D. Appelt, "Path-constrained partial music synchronization," in Proc. IEEE ICASSP, 2008, pp. 65-68.
    • (2008) Proc. IEEE ICASSP , pp. 65-68
    • Müller, M.1    Appelt, D.2
  • 15
    • 70349223849 scopus 로고    scopus 로고
    • High resolution audio synchronization using chroma onset features
    • S. Ewert, M. Müller, and P. Grosche, "High resolution audio synchronization using chroma onset features," in Proc. IEEE ICASSP, 2009, pp. 1869-1872.
    • (2009) Proc. IEEE ICASSP , pp. 1869-1872
    • Ewert, S.1    Müller, M.2    Grosche, P.3
  • 16
    • 33846138142 scopus 로고    scopus 로고
    • Match: A music alignment tool chest
    • S. Dixon and G. Widmer, "Match: A music alignment tool chest," in Proc. ISMIR, 2005, pp. 192-197.
    • (2005) Proc. ISMIR , pp. 192-197
    • Dixon, S.1    Widmer, G.2
  • 17
    • 33645380532 scopus 로고    scopus 로고
    • Improving polyphonic and poly-instrumental music to score alignment
    • F. Soulez, X. Rodet, and D. Schwarz, "Improving polyphonic and poly-instrumental music to score alignment," in Proc. ISMIR, 2003, pp. 143-148.
    • (2003) Proc. ISMIR , pp. 143-148
    • Soulez, F.1    Rodet, X.2    Schwarz, D.3
  • 19
    • 84873428975 scopus 로고    scopus 로고
    • An efficient multiscale approach to audio synchronization
    • M. Müller, H. Mattes, and F. Kurth, "An efficient multiscale approach to audio synchronization," in Proc. ISMIR, 2006.
    • (2006) Proc. ISMIR
    • Müller, M.1    Mattes, H.2    Kurth, F.3
  • 20
    • 80051657095 scopus 로고    scopus 로고
    • A multi-pass algorithm for accurate audio-to-score alignment
    • B. Niedermayer and G. Widmer, "A multi-pass algorithm for accurate audio-to-score alignment," in Proc. ISMIR, 2010, pp. 417-422.
    • (2010) Proc. ISMIR , pp. 417-422
    • Niedermayer, B.1    Widmer, G.2
  • 21
    • 79960505828 scopus 로고    scopus 로고
    • Handling repeats and jumps in score-performance synchronization
    • C. Fremerey, M. Müller, and M. Clausen, "Handling repeats and jumps in score-performance synchronization," in Proc. ISMIR, 2010, pp. 243-248.
    • (2010) Proc. ISMIR , pp. 243-248
    • Fremerey, C.1    Müller, M.2    Clausen, M.3
  • 23
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Feb.
    • L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 24
    • 29344441970 scopus 로고    scopus 로고
    • Modeling form for on-line following of musical performances
    • Proceedings of the 20th National Conference on Artificial Intelligence and the 17th Innovative Applications of Artificial Intelligence Conference, AAAI-05/IAAI-05
    • B. Pardo and W. Birmingham, "Modeling form for on-line following of musical performances," in Proc. Nat. Conf. Artif. Intell., 2005, pp. 1018-1023. (Pubitemid 43006740)
    • (2005) Proceedings of the National Conference on Artificial Intelligence , vol.2 , pp. 1018-1023
    • Pardo, B.1    Birmingham, W.2
  • 25
    • 85159445654 scopus 로고    scopus 로고
    • A stochastic method of tracking a vocal performer
    • L. Grubb and R. Dannenberg, "A stochastic method of tracking a vocal performer," in Proc. ICMC, 1997, pp. 301-308.
    • (1997) Proc. ICMC , pp. 301-308
    • Grubb, L.1    Dannenberg, R.2
  • 26
    • 84873572185 scopus 로고    scopus 로고
    • A probabilistic framework for matching music representations
    • Vienna, Austria
    • P. Peeling, A. T. Cemgil, and S. Godsill, "A probabilistic framework for matching music representations," in Proc. ISMIR, Vienna, Austria, 2007, pp. 267-272.
    • (2007) Proc. ISMIR , pp. 267-272
    • Peeling, P.1    Cemgil, A.T.2    Godsill, S.3
  • 27
    • 33751528957 scopus 로고    scopus 로고
    • Aligning music audio with symbolic scores using a hybrid graphical model
    • DOI 10.1007/s10994-006-8415-3, Special Issue on Machine Learning in and for Music
    • C. Raphael, "Aligning music audio with symbolic scores using a hybrid graphical model," Mach. Learn. J., vol. 65, pp. 389-409, 2006. (Pubitemid 44836050)
    • (2006) Machine Learning , vol.65 , Issue.2-3 , pp. 389-409
    • Raphael, C.1
  • 29
    • 0142192295 scopus 로고    scopus 로고
    • Conditional random fields: Probabilistic models for segmenting and labeling sequence data
    • J. Lafferty, A. McCallum, and F. Pereira, "Conditional random fields: Probabilistic models for segmenting and labeling sequence data," in Proc. ICML, 2001.
    • (2001) Proc. ICML
    • Lafferty, J.1    McCallum, A.2    Pereira, F.3
  • 30
    • 78049392004 scopus 로고    scopus 로고
    • Cyclic tempogram-A mid-level tempo representation for music signals
    • Mar.
    • P. Grosche, M. Müller, and F.Kurth, "Cyclic tempogram-A mid-level tempo representation for music signals," in Proc. IEEE ICASSP, Mar. 2010, pp. 5522-5525.
    • (2010) Proc. IEEE ICASSP , pp. 5522-5525
    • Grosche, P.1    Müller, M.2    Kurth, F.3
  • 32
    • 0031271066 scopus 로고    scopus 로고
    • Belief networks, hidden Markov models, and Markov random fields: A unifying view
    • P. Smyth, "Belief networks, hidden Markov models, and Markov random fields: A unifying view," in Pattern Recogn. Lett., 1998, vol. 18, pp. 1261-1268.
    • (1998) Pattern Recogn. Lett. , vol.18 , pp. 1261-1268
    • Smyth, P.1
  • 33
    • 26944481683 scopus 로고    scopus 로고
    • Department of computer and information science Univ. of Pennsylvania, Philadelphia, Tech. Rep. MS-CIS-04-21
    • H. M. Wallach, Conditional random fields: An introduction, department of computer and information science Univ. of Pennsylvania, Philadelphia, 2004, Tech. Rep. MS-CIS-04-21.
    • (2004) Conditional Random Fields: An Introduction
    • Wallach, H.M.1
  • 34
    • 0032119668 scopus 로고    scopus 로고
    • The hierarchical hidden Markov model: Analysis and applications
    • S. Fine and Y. Singer, "The hierarchical hidden Markov model: Analysis and applications," in Mach. Learn. Conf., 1998, pp. 41-62.
    • (1998) Mach. Learn. Conf. , pp. 41-62
    • Fine, S.1    Singer, Y.2
  • 35
    • 18844398069 scopus 로고    scopus 로고
    • Capacity and complexity of HMM duration modeling techniques
    • DOI 10.1109/LSP.2005.845598
    • M. T. Johnson, "Capacity and complexity of HMM duration modeling techniques," IEEE Signal Process. Lett., vol. 12, no. 5, pp. 407-410, May 2005. (Pubitemid 40679646)
    • (2005) IEEE Signal Processing Letters , vol.12 , Issue.5 , pp. 407-410
    • Johnson, M.T.1
  • 36
    • 0028793271 scopus 로고
    • Time discrimination in a monotonic, isochronous sequence
    • A. Friberg and J. Sundberg, "Time discrimination in a monotonic, isochronous sequence," J. Acoust. Soc. Amer., vol. 98, pp. 2525-2531, 1995.
    • (1995) J. Acoust. Soc. Amer. , vol.98 , pp. 2525-2531
    • Friberg, A.1    Sundberg, J.2
  • 37
    • 85065358752 scopus 로고    scopus 로고
    • On tempo tracking: Tempogram representation and Kalman filtering
    • A. T. Cemgil, H. J. Kappen, P. Desain, and H. Honing, "On tempo tracking: Tempogram representation and Kalman filtering," J. New Music Res., vol. 28, no. 4, pp. 259-273, 2001.
    • (2001) J. New Music Res. , vol.28 , Issue.4 , pp. 259-273
    • Cemgil, A.T.1    Kappen, H.J.2    Desain, P.3    Honing, H.4
  • 38
    • 33646741047 scopus 로고    scopus 로고
    • Precise pitch profile feature extraction from musical audio for key detection
    • Jun.
    • Y. Zhu and M. Kankanhalli, "Precise pitch profile feature extraction from musical audio for key detection," IEEE Trans. Multimedia, vol. 8, no. 3, pp. 575-584, Jun. 2006.
    • (2006) IEEE Trans. Multimedia , vol.8 , Issue.3 , pp. 575-584
    • Zhu, Y.1    Kankanhalli, M.2
  • 39
    • 78049385456 scopus 로고    scopus 로고
    • A comparative study of tonal acoustic features for a symbolic level music-to-score alignment
    • C. Joder, S. Essid, and G. Richard, "A comparative study of tonal acoustic features for a symbolic level music-to-score alignment," in Proc. IEEE ICASSP, 2010, pp. 409-412.
    • (2010) Proc. IEEE ICASSP , pp. 409-412
    • Joder, C.1    Essid, S.2    Richard, G.3
  • 41
    • 80051648102 scopus 로고    scopus 로고
    • An improved hierarchical approach for music-to-symbolic score alignment
    • Utrecht, Holland, Aug.
    • C. Joder, S. Essid, and G. Richard, "An improved hierarchical approach for music-to-symbolic score alignment," in Proc. ISMIR, Utrecht, Holland, Aug. 2010, pp. 39-44.
    • (2010) Proc. ISMIR , pp. 39-44
    • Joder, C.1    Essid, S.2    Richard, G.3
  • 42
    • 77955826141 scopus 로고    scopus 로고
    • Multipitch estimation of piano sounds using a new probabilistic spectral smoothness principle
    • Aug
    • V. Emiya, R. Badeau, and B. David, "Multipitch estimation of piano sounds using a new probabilistic spectral smoothness principle," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 6, pp. 1643-1654, Aug. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.6 , pp. 1643-1654
    • Emiya, V.1    Badeau, R.2    David, B.3
  • 43
    • 0141623871 scopus 로고    scopus 로고
    • RWCmusic database: Popular, classical, and jazz music databases
    • M. Goto, H. Hashiguchi, T. Nishimura, and R. Oka, "RWCmusic database: Popular, classical, and jazz music databases," in Proc. ISMIR, 2002, pp. 287-288.
    • (2002) Proc. ISMIR , pp. 287-288
    • Goto, M.1    Hashiguchi, H.2    Nishimura, T.3    Oka, R.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.