Volume 20, Issue 3, 2012, Pages 717-730

A Nonparametric Bayesian Multipitch Analyzer Based on Infinite Latent Harmonic Allocation

Author keywords

Bayesian nonparametrics; Dirichlet process; infinite latent harmonic allocation (iLHA); multipitch analysis


EID: 85008529841     PISSN: 15587916     EISSN: 15587924     Source Type: Journal    
DOI: 10.1109/TASL.2011.2164530     Document Type: Article
Times cited: 32

References (49)
  • 2
    • 4644242508
    • A real-time music scene description system: Predominant-F0 estimation for detecting melody and bass lines in real-world audio signals
    • M. Goto, “A real-time music scene description system: Predominant-F0 estimation for detecting melody and bass lines in real-world audio signals,” Speech Commun., vol. 43, no. 4, pp. 311–329, 2004.
    • (2004) Speech Commun. , vol.43 , Issue.4 , pp. 311-329
    • Goto, M.1
  • 3
    • 4544303298
    • Separation of harmonic structures based on tied Gaussian mixture model and information criterion for concurrent sounds
    • H. Kameoka, T. Nishimoto, and S. Sagayama, “Separation of harmonic structures based on tied Gaussian mixture model and information criterion for concurrent sounds,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2004, vol. 4, pp. 297–300.
    • (2004) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , vol.4 , pp. 297-300
    • Kameoka, H.1    Nishimoto, T.2    Sagayama, S.3
  • 4
    • 50249173884
    • A multipitch analyzer based on harmonic temporal structured clustering
    • Mar.
    • H. Kameoka, T. Nishimoto, and S. Sagayama, “A multipitch analyzer based on harmonic temporal structured clustering,” IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 982–994, Mar. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.3 , pp. 982-994
    • Kameoka, H.1    Nishimoto, T.2    Sagayama, S.3
  • 8
    • 77955826141
    • Multipitch estimation of piano sounds using a new probabilistic spectral smoothness principle
    • Aug.
    • V. Emiya, R. Badeau, and B. David, “Multipitch estimation of piano sounds using a new probabilistic spectral smoothness principle,” IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 6, pp. 1643–1654, Aug. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.6 , pp. 1643-1654
    • Emiya, V.1    Badeau, R.2    David, B.3
  • 12
    • 76949105125
    • Generative spectrogram factorization models for polyphonic piano transcription
    • Mar.
    • P. H. Peeling, A. T. Cemgil, and S. J. Godsill, “Generative spectrogram factorization models for polyphonic piano transcription,” IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 519–527, Mar. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.3 , pp. 519-527
    • Peeling, P.H.1    Cemgil, A.T.2    Godsill, S.J.3
  • 14
    • 47649088496
    • Extended nonnegative tensor factorisation models for musical sound source separation
    • D. FitzGerald, M. Cranitch, and E. Coyle, “Extended nonnegative tensor factorisation models for musical sound source separation,” Comput. Intell. Neurosci., vol. 2008, 2008.
    • (2008) Comput. Intell. Neurosci. , vol.2008
    • FitzGerald, D.1    Cranitch, M.2    Coyle, E.3
  • 15
    • 76949083547
    • Enforcing harmonicity and smoothness in Bayesian non-negative matrix factorization applied to polyphonic music transcription
    • Mar.
    • N. Bertin, R. Badeau, and E. Vincent, “Enforcing harmonicity and smoothness in Bayesian non-negative matrix factorization applied to polyphonic music transcription,” IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 538–549, Mar. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.3 , pp. 538-549
    • Bertin, N.1    Badeau, R.2    Vincent, E.3
  • 16
    • 76949108729
    • Adaptive harmonic spectral decomposition for multiple pitch estimation
    • Mar.
    • E. Vincent, N. Bertin, and R. Badeau, “Adaptive harmonic spectral decomposition for multiple pitch estimation,” IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 528–537, Mar. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.3 , pp. 528-537
    • Vincent, E.1    Bertin, N.2    Badeau, R.3
  • 17
    • 70449658600
    • Realtime multiple pitch observation using sparse non-negative constraints
    • A. Cont, “Realtime multiple pitch observation using sparse non-negative constraints,” in Proc. 7th Int. Conf. Music Inf. Retrieval (ISMIR), 2006, pp. 206–211.
    • (2006) Proc. 7th Int. Conf. Music Inf. Retrieval (ISMIR) , pp. 206-211
    • Cont, A.1
  • 19
    • 63249085556
    • Nonnegative matrix factorization with the Itakura-Saito divergence: With application to music analysis
    • C. Févotte, N. Bertin, and J.-L. Durrieu, “Nonnegative matrix factorization with the Itakura-Saito divergence: With application to music analysis,” Neural Comput., vol. 21, no. 3, pp. 793–830, 2009.
    • (2009) Neural Comput. , vol.21 , Issue.3 , pp. 793-830
    • Févotte, C.1    Bertin, N.2    Durrieu, J.-L.3
  • 22
    • 2642557862
    • A connectionist approach to transcription of polyphonic piano music
    • M. Marolt, “A connectionist approach to transcription of polyphonic piano music,” IEEE Trans. Multimedia, vol. 6, no. 3, pp. 439–449, 2004.
    • (2004) IEEE Trans. Multimedia , vol.6 , Issue.3 , pp. 439-449
    • Marolt, M.1
  • 23
    • 39649094860
    • Multipitch analysis of polyphonic music and speech signals using an auditory model
    • A. Klapuri, “Multipitch analysis of polyphonic music and speech signals using an auditory model,” IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 2, pp. 255–266, 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.2 , pp. 255-266
    • Klapuri, A.1
  • 24
    • 84873444806
    • Multiple fundamental frequency estimation by summing harmonic amplitudes
    • A. Klapuri, “Multiple fundamental frequency estimation by summing harmonic amplitudes,” in Proc. 7th Int. Conf. Music Inf. Retrieval (ISMIR), 2006, pp. 216–221.
    • (2006) Proc. 7th Int. Conf. Music Inf. Retrieval (ISMIR) , pp. 216-221
    • Klapuri, A.1
  • 25
    • 0034319894
    • A computationally efficient multipitch analysis model
    • Nov.
    • T. Tolonen and M. Karjalainen, “A computationally efficient multipitch analysis model,” IEEE Trans. Speech Audio Process., vol. 8, no. 6, pp. 708–716, Nov. 2000.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.6 , pp. 708-716
    • Tolonen, T.1    Karjalainen, M.2
  • 27
    • 34047272516
    • Automatic piano transcription using frequency and time-domain information
    • Nov.
    • J. P. Bello, L. Daudet, and M. B. Sandler, “Automatic piano transcription using frequency and time-domain information,” IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 6, pp. 2242–2251, Nov. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.6 , pp. 2242-2251
    • Bello, J.P.1    Daudet, L.2    Sandler, M.B.3
  • 28
    • 84872697855
    • Extraction of the melody pitch contour from polyphonic audio
    • [Online]. Available: http://www.musicir.org/evaluation/mirex-results/articles/melody/dressler.pdf
    • K. Dressler, “Extraction of the melody pitch contour from polyphonic audio,” in Proc. 2nd Music Inf. Retrieval Eval. eXchange (MIREX), 2005 [Online]. Available: http://www.musicir.org/evaluation/mirex-results/articles/melody/dressler.pdf.
    • (2005) Proc. 2nd Music Inf. Retrieval Eval. eXchange (MIREX)
    • Dressler, K.1
  • 30
    • 76949096499
    • Source/filter model for unsupervised main melody extraction from polyphonic audio signals
    • Mar.
    • J.-L. Durrieu, G. Richard, B. David, and C. Févotte, “Source/filter model for unsupervised main melody extraction from polyphonic audio signals,” IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 564–575, Mar. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.3 , pp. 564-575
    • Durrieu, J.-L.1    Richard, G.2    David, B.3    Févotte, C.4
  • 33
    • 0141607824
    • Latent Dirichlet allocation
    • D. Blei, A. Ng, and M. Jordan, “Latent Dirichlet allocation,” J. Mach. Learn. Res., vol. 3, pp. 993–1022, 2003.
    • (2003) J. Mach. Learn. Res. , vol.3 , pp. 993-1022
    • Blei, D.1    Ng, A.2    Jordan, M.3
  • 35
    • 47649133016
    • Probabilistic latent variable models as non-negative factorizations
    • M. Shashanka, B. Raj, and P. Smaragdis, “Probabilistic latent variable models as non-negative factorizations,” Comput. Intell. Neurosci., vol. 2008, 2008.
    • (2008) Comput. Intell. Neurosci. , vol.2008
    • Shashanka, M.1    Raj, B.2    Smaragdis, P.3
  • 37
    • 84898964031
    • A variational Bayesian framework for graphical models
    • H. Attias, “A variational Bayesian framework for graphical models,” in Adv. Neural Inf. Process. Syst. (NIPS), 2000, pp. 209–215.
    • (2000) Adv. Neural Inf. Process. Syst. (NIPS) , pp. 209-215
    • Attias, H.1
  • 40
    • 0001120413
    • Bayesian analysis of some nonparametric problems
    • T. Ferguson, “Bayesian analysis of some nonparametric problems,” Ann. Statist., vol. 1, no. 2, pp. 209–230, 1973.
    • (1973) Ann. Statist. , vol.1 , Issue.2 , pp. 209-230
    • Ferguson, T.1
  • 41
    • 0000720609
    • A constructive definition of Dirichlet priors
    • J. Sethuraman, “A constructive definition of Dirichlet priors,” Statist. Sinica, vol. 4, pp. 639–650, 1994.
    • (1994) Statist. Sinica , vol.4 , pp. 639-650
    • Sethuraman, J.1
  • 42
    • 1842816362
    • Gibbs sampling methods for stick-breaking priors
    • H. Ishwaran and L. F. James, “Gibbs sampling methods for stick-breaking priors,” J. Amer. Statist. Assoc., vol. 96, no. 453, pp. 161–173, 2001.
    • (2001) J. Amer. Statist. Assoc. , vol.96 , Issue.453 , pp. 161-173
    • Ishwaran, H.1    James, L.F.2
  • 43
    • 0021412027
    • Vector quantization
    • Apr.
    • R. Gray, “Vector quantization,” IEEE ASSP Mag., vol. 1, no. 2, pp. 4–29, Apr. 1984.
    • (1984) IEEE ASSP Mag. , vol.1 , Issue.2 , pp. 4-29
    • Gray, R.1
  • 47
    • 67650107019
    • Second-order latent-space variational Bayes for approximate Bayesian inference
    • J. Sung, Z. Ghahramani, and S.-Y. Bang, “Second-order latent-space variational Bayes for approximate Bayesian inference,” IEEE Signal Process. Lett., vol. 15, pp. 918–921, 2008.
    • (2008) IEEE Signal Process. Lett. , vol.15 , pp. 918-921
    • Sung, J.1    Ghahramani, Z.2    Bang, S.-Y.3
  • 49
    • 84873420337
    • Content-based musical similarity computation using the hierarchical Dirichlet process
    • M. Hoffman, D. Blei, and P. Cook, “Content-based musical similarity computation using the hierarchical Dirichlet process,” in Proc. 9th Int. Conf. Music Inf. Retrieval (ISMIR), 2008, pp. 349–354.
    • (2008) Proc. 9th Int. Conf. Music Inf. Retrieval (ISMIR) , pp. 349-354
    • Hoffman, M.1    Blei, D.2    Cook, P.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.