SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 20, Issue 3, 2012, Pages 717-730

A Nonparametric Bayesian Multipitch Analyzer Based on Infinite Latent Harmonic Allocation

(2) Yoshii, Kazuyoshi a Goto, Masataka a

a NATIONAL INSTITUTE OF ADVANCED INDUSTRIAL SCIENCE AND TECHNOLOGY AIST (Japan)

Author keywords

Bayesian nonparametrics; Dirichlet process; infinite latent harmonic allocation (iLHA); multipitch analysis

Indexed keywords

EID: 85008529841 PISSN: 15587916 EISSN: 15587924 Source Type: Journal
DOI: 10.1109/TASL.2011.2164530 Document Type: Article

Times cited : (32)

References (49)

1
- 80053454686
- Bayesian nonparametric models
- New York: Springer
- P. Orbanz and Y. W. Teh, “Bayesian nonparametric models,” in Encyclopedia of Machine Learning. New York: Springer, 2010.
- (2010) Encyclopedia of Machine Learning
- Orbanz, P.¹ Teh, Y.W.²

2
- 4644242508
- A real-time music scene description system: Predomi-nant-F0 estimation for detecting melody and bass lines in real-world audio signals
- M. Goto, “A real-time music scene description system: Predomi-nant-F0 estimation for detecting melody and bass lines in real-world audio signals,” Speech Commun., vol. 43, no. 4, pp. 311–329, 2004.
- (2004) Speech Commun. , vol.43 , Issue.4 , pp. 311-329
- Goto, M.¹

3
- 4544303298
- Separation of harmonic structures based on tied Gaussian mixture model and information criterion for concurrent sounds
- H. Kameoka, T. Nishimoto, and S. Sagayama, “Separation of harmonic structures based on tied Gaussian mixture model and information criterion for concurrent sounds,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2004, vol. 4, pp. 297–300.
- (2004) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , vol.4 , pp. 297-300
- Kameoka, H.¹ Nishimoto, T.² Sagayama, S.³

4
- 50249173884
- A multipitch analyzer based on harmonic temporal structured clustering
- Mar.
- H. Kameoka, T. Nishimoto, and S. Sagayama, “A multipitch analyzer based on harmonic temporal structured clustering,” IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 982–994, Mar. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.3 , pp. 982-994
- Kameoka, H.¹ Nishimoto, T.² Sagayama, S.³

5
- 84945140011
- Automatic transcription of piano music
- C. Raphael, “Automatic transcription of piano music,” in Proc. 3rd Int. Conf. Music Inf. Retrieval (ISMIR), 2002, pp. 161–166.
- (2002) Proc. 3rd Int. Conf. Music Inf. Retrieval (ISMIR) , pp. 161-166
- Raphael, C.¹

6
- 33845250972
- A generative model for music transcription
- Mar.
- A. T. Cemgil, H. J. Kappen, and D. Barber, “A generative model for music transcription,” IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 2, pp. 679–694, Mar. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.2 , pp. 679-694
- Cemgil, A.T.¹ Kappen, H.J.² Barber, D.³

7
- 84873573353
- Multiple pitch transcription using DBN-based musicological models
- S. A. Raczynski, E. Vincent, F. Bimbot, and S. Sagayama, “Multiple pitch transcription using DBN-based musicological models,” in Proc. 11th Int. Conf. Music Inf. Retrieval (ISMIR), 2010, pp. 363–368.
- (2010) Proc. 11th Int. Conf. Music Inf. Retrieval (ISMIR) , pp. 363-368
- Raczynski, S.A.¹ Vincent, E.² Bimbot, F.³ Sagayama, S.⁴

8
- 77955826141
- Multipitch estimation of piano sounds using a new probabilistic spectral smoothness principle
- Aug.
- V. Emiya, R. Badeau, and B. David, “Multipitch estimation of piano sounds using a new probabilistic spectral smoothness principle,” IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 6, pp. 1643–1654, Aug. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.6 , pp. 1643-1654
- Emiya, V.¹ Badeau, R.² David, B.³

9
- 0001093042
- Algorithms for non-negative matrix factorization
- D. D. Lee and H. S. Seung, “Algorithms for non-negative matrix factorization,” in Proc. Adv. Neural Inf. Process. Syst. (NIPS), 2000, pp. 556–562.
- (2000) Proc. Adv. Neural Inf. Process. Syst. (NIPS) , pp. 556-562
- Lee, D.D.¹ Seung, H.S.²

10
- 84945116938
- Non-negative matrix factorization for polyphonic music transcription
- P. Smaragdis and J. C. Brown, “Non-negative matrix factorization for polyphonic music transcription,” in Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust. (WASPAA), 2003, pp. 177–180.
- (2003) Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust. (WASPAA) , pp. 177-180
- Smaragdis, P.¹ Brown, J.C.²

11
- 51449111646
- Bayesian extensions to nonnegative matrix factorisation for audio signal modelling
- T. O. Virtanen, A. T. Cemgil, and S. J. Godsill, “Bayesian extensions to nonnegative matrix factorisation for audio signal modelling,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2008, pp. 45–48.
- (2008) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 45-48
- Virtanen, T.O.¹ Cemgil, A.T.² Godsill, S.J.³

12
- 76949105125
- Generative spectrogram factorization models for polyphonic piano transcription
- Mar.
- P. H. Peeling, A. T. Cemgil, and S. J. Godsill, “Generative spectrogram factorization models for polyphonic piano transcription,” IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 519–527, Mar. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.3 , pp. 519-527
- Peeling, P.H.¹ Cemgil, A.T.² Godsill, S.J.³

13
- 84873578051
- Multipitch analysis with harmonic non-negative matrix approximation
- S. A. Raczynski, N. Ono, and S. Sagayama, “Multipitch analysis with harmonic non-negative matrix approximation,” in Proc. 6th Int. Conf. Music Inf. Retrieval (ISMIR), 2007, pp. 381–386.
- (2007) Proc. 6th Int. Conf. Music Inf. Retrieval (ISMIR) , pp. 381-386
- Raczynski, S.A.¹ Ono, N.² Sagayama, S.³

14
- 47649088496
- Extended nonnegative tensor factorisation models for musical sound source separation
- D. FitzGerald, M. Cranitch, and E. Coyle, “Extended nonnegative tensor factorisation models for musical sound source separation,” Comput. Intell. Neurosci., vol. 2008, 2008.
- (2008) Comput. Intell. Neurosci. , vol.2008
- FitzGerald, D.¹ Cranitch, M.² Coyle, E.³

15
- 76949083547
- Enforcing harmonicity and smoothness in Bayesian non-negative matrix factorization applied to polyphonic music transcription
- Mar.
- N. Bertin, R. Badeau, and E. Vincent, “Enforcing harmonicity and smoothness in Bayesian non-negative matrix factorization applied to polyphonic music transcription,” IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 538–549, Mar. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.3 , pp. 538-549
- Bertin, N.¹ Badeau, R.² Vincent, E.³

16
- 76949108729
- Adaptive harmonic spectral decomposition for multiple pitch estimation
- Mar.
- E. Vincent, N. Bertin, and R. Badeau, “Adaptive harmonic spectral decomposition for multiple pitch estimation,” IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 528–537, Mar. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.3 , pp. 528-537
- Vincent, E.¹ Bertin, N.² Badeau, R.³

17
- 70449658600
- Realtime multiple pitch observation using sparse non-negative constraints
- A. Cont, “Realtime multiple pitch observation using sparse non-negative constraints,” in Proc. 7th Int. Conf. Music Inf. Retrieval (ISMIR), 2006, pp. 206–211.
- (2006) Proc. 7th Int. Conf. Music Inf. Retrieval (ISMIR) , pp. 206-211
- Cont, A.¹

18
- 70349396097
- Complex NMF: A new sparse representation for acoustic signals
- H. Kameoka, T. Nishimoto, and S. Sagayama, “Complex NMF: A new sparse representation for acoustic signals,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2009, pp. 45–48.
- (2009) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 45-48
- Kameoka, H.¹ Nishimoto, T.² Sagayama, S.³

19
- 63249085556
- Nonnegative matrix factorization with the Itakura-Saito divergence: With application to music analysis
- C. Févotte, N. Bertin, and J. -L. Durrieu, “Nonnegative matrix factorization with the Itakura-Saito divergence: With application to music analysis,” Neural Comput., vol. 21, no. 3, pp. 793–830, 2009.
- (2009) Neural Comput. , vol.21 , Issue.3 , pp. 793-830
- Févotte, C.¹ Bertin, N.² Durrieu, J.-L.³

20
- 77956538800
- Bayesian nonparametric matrix factorization for recorded music
- M. Hoffman, D. Blei, and P. Cook, “Bayesian nonparametric matrix factorization for recorded music,” in Proc. 27th Int. Conf. Mach. Learn. (ICML), 2010, pp. 439–446.
- (2010) Proc. 27th Int. Conf. Mach. Learn. (ICML) , pp. 439-446
- Hoffman, M.¹ Blei, D.² Cook, P.³

21
- 85008567194
- A. Klapuri and M. Davy, Eds. New York: Springer
- SignalProcessingMethodsforMusic Transcription, A. Klapuri and M. Davy, Eds. New York: Springer, 2010.
- (2010) SignalProcessingMethodsforMusic Transcription

22
- 2642557862
- A connectionist approach to transcription of polyphonic piano music
- M. Marolt, “A connectionist approach to transcription of polyphonic piano music,” IEEE Trans. Multimedia, vol. 6, no. 3, pp. 439–449, 2004.
- (2004) IEEE Trans. Multimedia , vol.6 , Issue.3 , pp. 439-449
- Marolt, M.¹

23
- 39649094860
- Multipitch analysis of polyphonic music and speech signals using an auditory model
- A. Klapuri, “Multipitch analysis of polyphonic music and speech signals using an auditory model,” IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 2, pp. 255–266, 2008.
- (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.2 , pp. 255-266
- Klapuri, A.¹

24
- 84873444806
- Multiple fundamental frequency estimation by summing harmonic amplitudes
- A. Klapuri, “Multiple fundamental frequency estimation by summing harmonic amplitudes,” in Proc. 7th Int. Conf. Music Inf. Retrieval (ISMIR), 2006, pp. 216–221.
- (2006) Proc. 7th Int. Conf. Music Inf. Retrieval (ISMIR) , pp. 216-221
- Klapuri, A.¹

25
- 0034319894
- A computationally efficient multipitch analysis model
- Nov.
- T. Tolonen and M. Karjalainen, “A computationally efficient multipitch analysis model,” IEEE Trans. Speech Audio Process., vol. 8, no. 6, pp. 708–716, Nov. 2000.
- (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.6 , pp. 708-716
- Tolonen, T.¹ Karjalainen, M.²

26
- 51449099172
- Multiple fundamental frequency estimation using Gaussian smoothness
- A. Pertusa and J. M. Inesta, “Multiple fundamental frequency estimation using Gaussian smoothness,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2008, pp. 105–108.
- (2008) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 105-108
- Pertusa, A.¹ Inesta, J.M.²

27
- 34047272516
- Automatic piano transcription using frequency and time-domain information
- Nov.
- J. P. Bello, L. Daudet, and M. B. Sandler, “Automatic piano transcription using frequency and time-domain information,” IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 6, pp. 2242–2251, Nov. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.6 , pp. 2242-2251
- Bello, J.P.¹ Daudet, L.² Sandler, M.B.³

28
- 84872697855
- Extraction of the melody pitch contour from polyphonic audio
- [Online]. Available: http://www.musicir.org/evalua-tion/mirex-results/articles/melody/dressler.pdf
- K. Dressler, “Extraction of the melody pitch contour from polyphonic audio,” in Proc. 2nd Music Inf. Retrieval Eval. eXchange (MIREX), 2005 [Online]. Available: http://www.musicir.org/evalua-tion/mirex-results/articles/melody/dressler.pdf.
- (2005) Proc. 2nd Music Inf. Retrieval Eval. eXchange (MIREX)
- Dressler, K.¹

29
- 84873440865
- Transcription of the singing melody in polyphonic music
- M. P. Ryynänen and A. P. Klapuri, “Transcription of the singing melody in polyphonic music,” in 7th Int. Conf. Music Inf. Retrieval (ISMIR), 2006, pp. 206–211.
- (2006) 7th Int. Conf. Music Inf. Retrieval (ISMIR) , pp. 206-211
- Ryynänen, M.P.¹ Klapuri, A.P.²

30
- 76949096499
- Source/filter model for unsupervised main melody extraction from polyphonic audio signals
- Mar.
- J. -L. Durrieu, G. Richard, B. David, and C. Févotte, “Source/filter model for unsupervised main melody extraction from polyphonic audio signals,” IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 564–575, Mar. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.3 , pp. 564-575
- Durrieu, J.-L.¹ Richard, G.² David, B.³ Févotte, C.⁴

31
- 48849095345
- Melody transcription from music audio: Approaches and evaluation
- May
- G. E. Poliner, D. P. Ellis, A. F. Ehmann, E. Gómez, S. Streich, and B. Ong, “Melody transcription from music audio: Approaches and evaluation,” IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 4, pp. 1247–1256, May 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.4 , pp. 1247-1256
- Poliner, G.E.¹ Ellis, D.P.² Ehmann, A.F.³ Gómez, E.⁴ Streich, S.⁵ Ong, B.⁶

32
- 84872741459
- Finding latent sources in recorded music with a shift-invariant HDP
- M. Hoffman, D. Blei, and P. Cook, “Finding latent sources in recorded music with a shift-invariant HDP,” in Proc. 12th Int. Conf. Digital Audio Effects (DAFX), 2009.
- (2009) Proc. 12th Int. Conf. Digital Audio Effects (DAFX)
- Hoffman, M.¹ Blei, D.² Cook, P.³

33
- 0141607824
- Latent Dirichlet allocation
- D. Blei, A. Ng, and M. Jordan, “Latent Dirichlet allocation,” Mach. Learn. Res., vol. 3, pp. 993–1022, 2003.
- (2003) Mach. Learn. Res. , vol.3 , pp. 993-1022
- Blei, D.¹ Ng, A.² Jordan, M.³

34
- 85026972772
- Probabilistic latent semantic indexing
- T. Hofmann and J. Puzicha, “Probabilistic latent semantic indexing,” in Proc. 22nd Int. Conf. Res. Develop. Inf. Retrieval (SIGIR), 1999, pp. 50–57.
- (1999) Proc. 22nd Int. Conf. Res. Develop. Inf. Retrieval (SIGIR) , pp. 50-57
- Hofmann, T.¹ Puzicha, J.²

35
- 47649133016
- Probabilistic latent variable models as non-negative factorizations
- M. Shashanka, B. Raj, and P. Smaragdis, “Probabilistic latent variable models as non-negative factorizations,” Comput. Intell. Neuosci., vol. 2008, 2008.
- (2008) Comput. Intell. Neuosci. , vol.2008
- Shashanka, M.¹ Raj, B.² Smaragdis, P.³

36
- 84885621082
- Relation between pLSA and NMF and implications
- E. Gaussier and C. Goutte, “Relation between pLSA and NMF and implications,” in Proc. 28th Int. Conf. Res. Develop. Inf. Retrieval (SIGIR), 2005, pp. 601–602.
- (2005) Proc. 28th Int. Conf. Res. Develop. Inf. Retrieval (SIGIR) , pp. 601-602
- Gaussier, E.¹ Goutte, C.²

37
- 84898964031
- A variational Bayesian framework for graphical models
- H. Attias, “A variational Bayesian framework for graphical models,” in Adv. Neural Inf. Process. Syst. (NIPS), 2000, pp. 209–215.
- (2000) Adv. Neural Inf. Process. Syst. (NIPS) , pp. 209-215
- Attias, H.¹

38
- 0004087397
- Probabilistic inference using Markov chain Monte Carlo methods
- Tech. Rep. CRG-TR-93-1
- R. M. Neal, “Probabilistic inference using Markov chain Monte Carlo methods,” Dept. of Comput. Sci., Univ. of Toronto, Toronto, ON, Canada, Tech. Rep. CRG-TR-93-1, 1993.
- (1993) Dept. of Comput. Sci., Univ. of Toronto, Toronto, ON, Canada
- Neal, R.M.¹

39
- 77956217715
- Dirichlet processes
- New York: Springer
- Y. W. Teh, “Dirichlet processes,” in Encyclopedia of Machine Learning. New York: Springer, 2010.
- (2010) Encyclopedia of Machine Learning
- Teh, Y.W.¹

40
- 0001120413
- Bayesian analysis of some nonparametric problems
- T. Ferguson, “Bayesian analysis of some nonparametric problems,” Ann. Statist., vol. 1, no. 2, pp. 209–230, 1973.
- (1973) Ann. Statist. , vol.1 , Issue.2 , pp. 209-230
- Ferguson, T.¹

41
- 0000720609
- A constructive definition of Dirichlet priors
- J. Sethuraman, “A constructive definition of Dirichlet priors,” Statist. Sinica, vol. 4, pp. 639–650, 1994.
- (1994) Statist. Sinica , vol.4 , pp. 639-650
- Sethuraman, J.¹

42
- 1842816362
- Gibbs sampling methods for stick-breaking priors
- H. Ishwaran and L. F. James, “Gibbs sampling methods for stick-breaking priors,” J. Amer. Statist. Assoc., vol. 96, no. 453, pp. 161–173, 2001.
- (2001) J. Amer. Statist. Assoc. , vol.96 , Issue.453 , pp. 161-173
- Ishwaran, H.¹ James, L.F.²

43
- 0021412027
- Vector quantization
- Apr.
- R. Gray, “Vector quantization,” IEEE ASSP Mag., vol. 1, no. 2, pp. 4–29, Apr. 1984.
- (1984) IEEE ASSP Mag. , vol.1 , Issue.2 , pp. 4-29
- Gray, R.¹

44
- 33749249312
- Hierarchical Dirichlet processes
- Y. W. Teh, M. I. Jordan, M. J. Beal, and D. M. Blei, “Hierarchical Dirichlet processes,” J. Amer. Statist. Assoc., vol. 101, no. 476, pp. 1566–1581, 2006.
- (2006) J. Amer. Statist. Assoc. , vol.101 , Issue.476 , pp. 1566-1581
- Teh, Y.W.¹ Jordan, M.I.² Beal, M.J.³ Blei, D.M.⁴

45
- 85162054776
- Collapsed variational inference for HDP
- Y. W. Teh, K. Kurihara, and M. Welling, “Collapsed variational inference for HDP,” in Proc. Adv. Neural Inf. Process. Syst. (NIPS), 2007.
- (2007) Proc. Adv. Neural Inf. Process. Syst. (NIPS)
- Teh, Y.W.¹ Kurihara, K.² Welling, M.³

46
- 56549085814
- Latent-space variational Bayes
- Dec.
- J. Sung, Z. Ghahramani, and S. -Y. Bang, “Latent-space variational Bayes,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 30, no. 12, pp. 2236–2242, Dec. 2008.
- (2008) IEEE Trans. Pattern Anal. Mach. Intell. , vol.30 , Issue.12 , pp. 2236-2242
- Sung, J.¹ Ghahramani, Z.² Bang, S.-Y.³

47
- 67650107019
- Second-order latent-space variational Bayes for approximate Bayesian inference
- J. Sung, Z. Ghahramani, and S. -Y. Bang, “Second-order latent-space variational Bayes for approximate Bayesian inference,” IEEE Signal Process. Lett., vol. 15, pp. 918–921, 2008.
- (2008) IEEE Signal Process. Lett. , vol.15 , pp. 918-921
- Sung, J.¹ Ghahramani, Z.² Bang, S.-Y.³

48
- 84971574509
- RWC music database: Popular, classical, and jazz music database
- M. Goto, H. Hashiguchi, T. Nishimura, and R. Oka, “RWC music database: Popular, classical, and jazz music database,” in Proc. 3th Int. Conf. Music Inf. Retrieval (ISMIR), 2002, pp. 287–288.
- (2002) Proc. 3th Int. Conf. Music Inf. Retrieval (ISMIR) , pp. 287-288
- Goto, M.¹ Hashiguchi, H.² Nishimura, T.³ Oka, R.⁴

49
- 84873420337
- Content-based musical similarity computation using the hierarchical Dirichlet process
- M. Hoffman, D. Blei, and P. Cook, “Content-based musical similarity computation using the hierarchical Dirichlet process,” in Proc. 9th Int. Conf. Music Inf. Retrieval (ISMIR), 2008, pp. 349–354.
- (2008) Proc. 9th Int. Conf. Music Inf. Retrieval (ISMIR) , pp. 349-354
- Hoffman, M.¹ Blei, D.² Cook, P.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.