Volume 5, Issue 6, 2011, Pages 1159-1169

Transcribing multi-instrument polyphonic music with hierarchical eigeninstruments

Author keywords

Eigeninstruments; Music; Non negative matrix factorization (NMF); Polyphonic transcription; Subspace

Indexed keywords

EIGENINSTRUMENTS; LEVELS OF ABSTRACTION; LINEAR MANIFOLD; MODEL PARAMETERS; MUSIC; MUSIC RECORDING; NON-NEGATIVE MATRIX FACTORIZATION (NMF); NON-NEGATIVE MATRIX FACTORIZATION ALGORITHMS; POLYPHONIC MUSIC; PRIOR KNOWLEDGE; PROBABILISTIC MODELS; SINGLE-CHANNEL; SUBSPACE

EID: 80053031254     PISSN: 19324553     EISSN: None     Source Type: Journal    
DOI: 10.1109/JSTSP.2011.2162395     Document Type: Article
Times cited : (59)

References (44)
  • 1
    • 0028561099
    • Positive matrix factorization: A non-negative factor model with optimal utilization of error estimates of data values
    • P. Paatero and U. Tapper, "Positive matrix factorization: A non-negative factor model with optimal utilization of error estimates of data values," Environmetrics, vol. 5, no. 2, pp. 111-126, 1994.
    • (1994) Environmetrics , vol.5 , Issue.2 , pp. 111-126
    • Paatero, P.1    Tapper, U.2
  • 2
    • 0033592606
    • Learning the parts of objects by nonnegative matrix factorization
    • D. D. Lee and H. S. Seung, "Learning the parts of objects by nonnegative matrix factorization," Nature, vol. 401, no. 6755, pp. 788-791, 1999.
    • (1999) Nature , vol.401 , Issue.6755 , pp. 788-791
    • Lee, D.D.1    Seung, H.S.2
  • 3
    • 0002629270
    • Maximum likelihood from incomplete data via the EM algorithm
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., vol. 39, no. 1, pp. 1-38, 1977.
    • (1977) J. R. Statist. Soc. , vol.39 , Issue.1 , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 5
    • 84900510076
    • Non-negative matrix factorization with sparseness constraints
    • P. O. Hoyer, "Non-negative matrix factorization with sparseness constraints," J. Mach. Learn. Res., vol. 5, pp. 1457-1469, 2004.
    • (2004) J. Mach. Learn. Res. , vol.5 , pp. 1457-1469
    • Hoyer, P.O.1
  • 7
    • 47649133016
    • Probabilistic latent variable models as non-negative factorizations
    • Article ID 947438
    • M. Shashanka, B. Raj, and P. Smaragdis, "Probabilistic latent variable models as non-negative factorizations," Comput. Intell. Neurosci., vol. 2008, 2008, Article ID 947438.
    • (2008) Comput. Intell. Neurosci. , vol.2008
    • Shashanka, M.1    Raj, B.2    Smaragdis, P.3
  • 8
    • 50249152311
    • Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria
    • Mar
    • T. Virtanen, "Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 1066-1074, Mar. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.3 , pp. 1066-1074
    • Virtanen, T.1
  • 10
    • 63249085556
    • Nonnegative matrix factorization with the Itakura-Saito divergence. With application to music analysis
    • C. Févotte, N. Bertin, and J. L. Durrieu, "Nonnegative matrix factorization with the Itakura-Saito divergence. With application to music analysis," Neural Comput., vol. 21, no. 3, pp. 793-830, 2009.
    • (2009) Neural Comput. , vol.21 , Issue.3 , pp. 793-830
    • Févotte, C.1    Bertin, N.2    Durrieu, J.L.3
  • 11
    • 44949110218
    • Single-channel speech separation using sparse non-negative matrix factorization
    • M. N. Schmidt and R. K. Olsson, "Single-channel speech separation using sparse non-negative matrix factorization," in Proc. Int. Conf. Spoken Lang. Process., 2006.
    • (2006) Proc. Int. Conf. Spoken Lang. Process.
    • Schmidt, M.N.1    Olsson, R.K.2
  • 12
    • 84858719009
    • A sparse non-parametric approach for single channel separation of known sounds
    • P. Smaragdis, M. Shashanka, and B. Raj, "A sparse non-parametric approach for single channel separation of known sounds," in Proc. Adv. Neural Inf. Process. Syst., 2009, pp. 1705-1713.
    • (2009) Proc. Adv. Neural Inf. Process. Syst. , pp. 1705-1713
    • Smaragdis, P.1    Shashanka, M.2    Raj, B.3
  • 14
  • 16
    • 80053048831
    • A probabilistic subspace model for polyphonic music transcription
    • G. Grindlay and D. P. W. Ellis, "A probabilistic subspace model for polyphonic music transcription," in Int. Conf. Music Inf. Retrieval, 2010, pp. 21-26.
    • (2010) Int. Conf. Music Inf. Retrieval , pp. 21-26
    • Grindlay, G.1    Ellis, D.P.W.2
  • 18
    • 84863690059
    • Separation of drums from polyphonic music using non-negative matrix factorization and support vector machine
    • M. Helén and T. Virtanen, "Separation of drums from polyphonic music using non-negative matrix factorization and support vector machine," in Proc. Eur. Signal Process. Conf., 2005.
    • (2005) Proc. Eur. Signal Process. Conf.
    • Helén, M.1    Virtanen, T.2
  • 19
    • 80052993673
    • Monophonic instrument sound segregation by clustering NMF components based on basis similarity and gain disjointness
    • K. Murao, M. Nakano, Y. Kitano, N. Ono, and S. Sagayama, "Monophonic instrument sound segregation by clustering NMF components based on basis similarity and gain disjointness," in Proc. Int. Soc. Music Inf. Retrieval Conf., 2010, pp. 375-380.
    • (2010) Proc. Int. Soc. Music Inf. Retrieval Conf. , pp. 375-380
    • Murao, K.1    Nakano, M.2    Kitano, Y.3    Ono, N.4    Sagayama, S.5
  • 20
    • 77952407197
    • Analysis of polyphonic audio using source-filter model and non-negative matrix factorization
    • T. Virtanen and A. Klapuri, "Analysis of polyphonic audio using source-filter model and non-negative matrix factorization," in Proc. Adv. Neural Inf. Process. Syst., 2006.
    • (2006) Proc. Adv. Neural Inf. Process. Syst.
    • Virtanen, T.1    Klapuri, A.2
  • 21
    • 84873616077
    • Musical instrument recognition in polyphonic audio using source-filter model for sound separation
    • T. Heittola, A. Klapuri, and T. Virtanen, "Musical instrument recognition in polyphonic audio using source-filter model for sound separation," in Proc. Int. Conf. Music Inf. Retrieval, 2009, pp. 327-332.
    • (2009) Proc. Int. Conf. Music Inf. Retrieval , pp. 327-332
    • Heittola, T.1    Klapuri, A.2    Virtanen, T.3
  • 22
    • 76949108729
    • Adaptive harmonic spectral decomposition for multiple pitch estimation
    • Mar
    • E. Vincent, N. Bertin, and R. Badeau, "Adaptive harmonic spectral decomposition for multiple pitch estimation," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 528-537, Mar. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.3 , pp. 528-537
    • Vincent, E.1    Bertin, N.2    Badeau, R.3
  • 23
    • 76949083547
    • Enforcing harmonicity and smoothness in Bayesian non-negative matrix factorization applied to polyphonic music transcription
    • Mar
    • N. Bertin, R. Badeau, and E. Vincent, "Enforcing harmonicity and smoothness in Bayesian non-negative matrix factorization applied to polyphonic music transcription," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 538-549, Mar. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.3 , pp. 538-549
    • Bertin, N.1    Badeau, R.2    Vincent, E.3
  • 24
    • 0036214787
    • YIN, a fundamental frequency estimator for speech and music
    • DOI 10.1121/1.1458024
    • A. de Cheveigné and H. Kawahara, "YIN, a fundamental frequency estimator for speech and music," J. Acoust. Soc. Amer., vol. 111, no. 4, pp. 1917-1930, 2002. (Pubitemid 34297247)
    • (2002) Journal of the Acoustical Society of America , vol.111 , Issue.4 , pp. 1917-1930
    • De Cheveigne, A.1    Kawahara, H.2
  • 25
    • 0347337997
    • Multiple fundamental frequency estimation based on harmonicity and spectral smoothness
    • Nov
    • A. Klapuri, "Multiple fundamental frequency estimation based on harmonicity and spectral smoothness," IEEE Trans. Speech Audio Process., vol. 11, no. 6, pp. 804-816, Nov. 2003.
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.6 , pp. 804-816
    • Klapuri, A.1
  • 26
    • 30844456955
    • Polyphonic music transcription by non-negative sparse coding of power spectra
    • S. A. Abdallah and M. D. Plumbley, "Polyphonic music transcription by non-negative sparse coding of power spectra," in Proc. Int. Conf. Music Inf. Retrieval, 2004, pp. 318-325.
    • (2004) Proc. Int. Conf. Music Inf. Retrieval , pp. 318-325
    • Abdallah, S.A.1    Plumbley, M.D.2
  • 27
    • 4644242508
    • A real-time music-scene-description system: Predominant-F0 estimation for detecting melody and bass lines in real-world audio signals
    • M. Goto, "A real-time music-scene-description system: Predominant-F0 estimation for detecting melody and bass lines in real-world audio signals," Speech Commun., vol. 43, no. 4, pp. 311-329, 2004.
    • (2004) Speech Commun. , vol.43 , Issue.4 , pp. 311-329
    • Goto, M.1
  • 28
    • 33846199251
    • A discriminative model for polyphonic piano transcription
    • Article ID 48317
    • G. Poliner and D. P. W. Ellis, "A discriminative model for polyphonic piano transcription," EURASIP J. Adv. Signal Process., 2007, Article ID 48317.
    • (2007) EURASIP J. Adv. Signal Process.
    • Poliner, G.1    Ellis, D.P.W.2
  • 29
    • 0003182324
    • Organization of hierarchical perceptual sounds: Music scene analysis with autonomous processing modules and a quantitative information integration mechanism
    • K. Kashino, K. Nakadai, T. Kinoshita, and H. Tanaka, "Organization of hierarchical perceptual sounds: Music scene analysis with autonomous processing modules and a quantitative information integration mechanism," in Proc. Int. Joint Conf. Artif. Intell., 1995, pp. 158-164.
    • (1995) Proc. Int. Joint Conf. Artif. Intell. , pp. 158-164
    • Kashino, K.1    Nakadai, K.2    Kinoshita, T.3    Tanaka, H.4
  • 31
    • 57849142765
    • Instrument-specific harmonic atoms for mid-level music representation
    • Jan
    • P. Leveau, E. Vincent, G. Richard, and L. Daudet, "Instrument-specific harmonic atoms for mid-level music representation," IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 1, pp. 116-128, Jan. 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.1 , pp. 116-128
    • Leveau, P.1    Vincent, E.2    Richard, G.3    Daudet, L.4
  • 32
    • 50249173884
    • A multipitch analyzer based on harmonic temporal structured clustering
    • Mar
    • H. Kameoka, T. Nishimoto, and S. Sagayama, "A multipitch analyzer based on harmonic temporal structured clustering," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 982-994, Mar. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.3 , pp. 982-994
    • Kameoka, H.1    Nishimoto, T.2    Sagayama, S.3
  • 39
    • 69249151355
    • Speech separation using speaker-adapted eigenvoice speech models
    • R. J. Weiss and D. P. W. Ellis, "Speech separation using speaker-adapted eigenvoice speech models," Comput. Speech Lang., vol. 24, no. 1, pp. 16-29, 2010.
    • (2010) Comput. Speech Lang. , vol.24 , Issue.1 , pp. 16-29
    • Weiss, R.J.1    Ellis, D.P.W.2
  • 40
    • 0030737323
    • Modeling the manifolds of images of handwritten digits
    • PII S1045922797002373
    • G. E. Hinton, P. Dayan, and M. Revow, "Modeling the manifolds of images of handwritten digits," IEEE Trans. Neural Netw., vol. 8, no. 1, pp. 65-74, Jan. 1997. (Pubitemid 127767781)
    • (1997) IEEE Transactions on Neural Networks , vol.8 , Issue.1 , pp. 65-74
    • Hinton, G.E.1    Dayan, P.2    Revow, M.3
  • 43
    • 33749074089
    • Polyphonic music transcription using note event modeling
    • DOI 10.1109/ASPAA.2005.1540233, 1540233, 2005 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
    • M. Ryynänen and A. Klapuri, "Polyphonic music transcription using note event modeling," in Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust., 2005, pp. 319-322. (Pubitemid 44461857)
    • (2005) IEEE Workshop on Applications of Signal Processing to Audio and Acoustics , pp. 319-322
    • Ryynanen, M.P.1    Klapuri, A.2
  • 44
    • 33646023117
    • An introduction to ROC analysis
    • T. Fawcett, "An introduction to ROC analysis," Pattern Recognit. Lett., vol. 27, no. 8, pp. 861-874, 2006.
    • (2006) Pattern Recognit. Lett. , vol.27 , Issue.8 , pp. 861-874
    • Fawcett, T.1


* This information was extracted and analyzed by KISTI from Elsevier's SCOPUS database.