



Volume 16, Issue 4, 2008, Pages 766-778

Unsupervised single-channel music source separation by average harmonic structure modeling

Author keywords

Clustering; Harmonic structure; Multipitch estimation; Single channel source separation

Indexed keywords

CLUSTERING; CONCURRENT SOUNDS; HARMONIC STRUCTURE; LISTENING QUALITIES; MIXED SIGNALS; MULTIPITCH ESTIMATION; MUSICAL SIGNALS; NON-NEGATIVE MATRIX FACTORIZATIONS; SIDE EFFECTS; SINGLE-CHANNEL SOURCE SEPARATION; SOURCE SEPARATIONS;

EID: 64849117345     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2008.919073     Document Type: Article
Times cited : (81)

References (59)
  • 1
    • 0042826822
    • Independent component analysis: Algorithms and applications
    • A. Hyvärinen and E. Oja, "Independent component analysis: Algorithms and applications," Neural Netw., vol. 13, pp. 411-430, 2000.
    • (2000) Neural Netw , vol.13 , pp. 411-430
    • Hyvärinen, A.1    Oja, E.2
  • 2
    • 0036753896
    • Geometric source separation: Merging convolutive source separation with geometric beamforming
    • Sep
    • L. C. Parra and C. V. Alvino, "Geometric source separation: Merging convolutive source separation with geometric beamforming," IEEE Trans. Speech Audio Process., vol. 10, no. 6, pp. 352-362, Sep. 2002.
    • (2002) IEEE Trans. Speech Audio Process , vol.10 , Issue.6 , pp. 352-362
    • Parra, L.C.1    Alvino, C.V.2
  • 3
    • 0032212942
    • Blind separation of convolved mixtures in the frequency domain
    • P. Smaragdis, "Blind separation of convolved mixtures in the frequency domain," Neurocomputing, vol. 22, pp. 21-34, 1998.
    • (1998) Neurocomputing , vol.22 , pp. 21-34
    • Smaragdis, P.1
  • 4
    • 84899018983
    • Blind source separation via multinode sparse representation
    • M. Zibulevsky, P. Kisilev, Y. Y. Zeevi, and B. Pearlmutter, "Blind source separation via multinode sparse representation," in Proc. NIPS, 2002, pp. 1049-1056.
    • (2002) Proc. NIPS , pp. 1049-1056
    • Zibulevsky, M.1    Kisilev, P.2    Zeevi, Y.Y.3    Pearlmutter, B.4
  • 5
    • 0032624821
    • Blind source separation of more sources than mixtures using overcomplete representations
    • Jun
    • T.-W. Lee, M. S. Lewicki, M. Girolami, and T. J. Sejnowski, "Blind source separation of more sources than mixtures using overcomplete representations," IEEE Signal Process. Lett., vol. 6, no. 4, pp. 87-90, Jun. 1999.
    • (1999) IEEE Signal Process. Lett , vol.6 , Issue.4 , pp. 87-90
    • Lee, T.-W.1    Lewicki, M.S.2    Girolami, M.3    Sejnowski, T.J.4
  • 6
    • 34347404627
    • A Bayesian approach for blind separation of sparse sources
    • Nov
    • C. Fevotte and S.J. Godsill, "A Bayesian approach for blind separation of sparse sources," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 6, pp. 2174-2188, Nov. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.6 , pp. 2174-2188
    • Fevotte, C.1    Godsill, S.J.2
  • 7
    • 34047253222
    • A method for separation of overlapping partials based on similarity of temporal envelopes in multichannel mixtures
    • May
    • H. Viste and G. Evangelista, "A method for separation of overlapping partials based on similarity of temporal envelopes in multichannel mixtures," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 3, pp. 1051-1061, May 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.3 , pp. 1051-1061
    • Viste, H.1    Evangelista, G.2
  • 8
    • 0141813716
    • Multi-channel source separation by factorial HMMs
    • pp. I-664-I-667
    • M. J. Reyes-Gomez, B. Raj, and D. Ellis, "Multi-channel source separation by factorial HMMs," in Proc. ICASSP, 2003, pp. I-664-I-667.
    • (2003) Proc. ICASSP , pp. I-664-I-667
    • Reyes-Gomez, M.J.1    Raj, B.2    Ellis, D.3
  • 9
    • 84898946024
    • One microphone source separation
    • S. T. Roweis, "One microphone source separation," in Proc. NIPS, 2001, pp. 15-19.
    • (2001) Proc. NIPS , pp. 15-19
    • Roweis, S.T.1
  • 10
    • 84898957327
    • Audio-visual sound separation via hidden Markov models
    • J. Hershey and M. Casey, "Audio-visual sound separation via hidden Markov models," in Proc. NIPS, 2002, pp. 1173-1180.
    • (2002) Proc. NIPS , pp. 1173-1180
    • Hershey, J.1    Casey, M.2
  • 11
    • 84899014722
    • A probabilistic approach to single channel blind signal separation
    • G.-J. Jang and T.-W. Lee, "A probabilistic approach to single channel blind signal separation," in Proc. NIPS, 2003, pp. 1173-1180.
    • (2003) Proc. NIPS , pp. 1173-1180
    • Jang, G.-J.1    Lee, T.-W.2
  • 12
    • 84873538214
    • Separation of vocal from polyphonic audio recordings
    • S. Vembu and S. Baumann, "Separation of vocal from polyphonic audio recordings," in Proc. ISMIR, 2005, pp. 337-344.
    • (2005) Proc. ISMIR , pp. 337-344
    • Vembu, S.1    Baumann, S.2
  • 13
    • 84863690059
    • Separation of drums from polyphonic music using non-negative matrix factorization and support vector machine
    • Istanbul, Turkey, CD-ROM
    • M. Helen and T. Virtanen, "Separation of drums from polyphonic music using non-negative matrix factorization and support vector machine," in Proc. EUSIPCO, Istanbul, Turkey, 2005, CD-ROM.
    • (2005) Proc. EUSIPCO
    • Helen, M.1    Virtanen, T.2
  • 15
    • 33744978751
    • Musical source separation using time-frequency source priors
    • Jan
    • E. Vincent, "Musical source separation using time-frequency source priors," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 1, pp. 91-98, Jan. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.1 , pp. 91-98
    • Vincent, E.1
  • 16
    • 33745683213
    • Single-channel mixture decomposition using Bayesian harmonic models
    • E. Vincent and M. D. Plumbley, "Single-channel mixture decomposition using Bayesian harmonic models," in Proc. ICA, 2006, pp. 722-730.
    • (2006) Proc. ICA , pp. 722-730
    • Vincent, E.1    Plumbley, M.D.2
  • 17
    • 33745725520
    • Harmonic source separation using prestored spectra
    • M. Bay and J. W. Beauchamp, "Harmonic source separation using prestored spectra," in Proc. ICA, 2006, pp. 561-568.
    • (2006) Proc. ICA , pp. 561-568
    • Bay, M.1    Beauchamp, J.W.2
  • 18
    • 84898936783
    • Blind one-microphone speech separation: A spectral learning approach
    • F. R. Bach and M. I. Jordan, "Blind one-microphone speech separation: A spectral learning approach," in Proc. NIPS, 2005, pp. 65-72.
    • (2005) Proc. NIPS , pp. 65-72
    • Bach, F.R.1    Jordan, M.I.2
  • 19
    • 17444393861
    • Methods for separation of harmonic sound sources using sinusoidal modeling
    • presented at the, Munich, Germany, unpublished
    • T. Tolonen, "Methods for separation of harmonic sound sources using sinusoidal modeling," presented at the AES 106th Convention, Munich, Germany, 1999, unpublished.
    • (1999) AES 106th Convention
    • Tolonen, T.1
  • 20
    • 0033707902
    • Separation of harmonic sound sources using sinusoidal modeling
    • T. Virtanen and A. Klapuri, "Separation of harmonic sound sources using sinusoidal modeling," in Proc. ICASSP, 2000, pp. II-765-II-768.
  • 21
    • 85138804853
    • Algorithm for the separation of harmonic sounds with time-frequency smoothness constraint
    • T. Virtanen, "Algorithm for the separation of harmonic sounds with time-frequency smoothness constraint," in Proc. DAFx, 2003, pp. 35-40.
    • (2003) Proc. DAFx , pp. 35-40
    • Virtanen, T.1
  • 22
    • 33845951451
    • Separation of synchronous pitched notes by spectral filtering of harmonics
    • Sep
    • M. R. Every and J. E. Szymanski, "Separation of synchronous pitched notes by spectral filtering of harmonics," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1845-1856, Sep. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.5 , pp. 1845-1856
    • Every, M.R.1    Szymanski, J.E.2
  • 23
    • 50549089895
    • Separation of singing voice from music accompaniment for monaural recordings
    • May
    • Y. Li and D. L. Wang, "Separation of singing voice from music accompaniment for monaural recordings," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 4, pp. 1475-1487, May 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.4 , pp. 1475-1487
    • Li, Y.1    Wang, D.L.2
  • 25
    • 0028531926
    • Computational auditory scene analysis
    • G. J. Brown and M. P. Cooke, "Computational auditory scene analysis," Comput. Speech Lang., vol. 8, pp. 297-336, 1994.
    • (1994) Comput. Speech Lang , vol.8 , pp. 297-336
    • Brown, G.J.1    Cooke, M.P.2
  • 26
    • 33745729246
    • Musical audio stream separation by non-negative matrix factorization
    • Glasgow, U.K
    • B. Wang and M. D. Plumbley, "Musical audio stream separation by non-negative matrix factorization," in Proc. DMRN Summer Conf., Glasgow, U.K., 2005, pp. 23-24.
    • (2005) Proc. DMRN Summer Conf , pp. 23-24
    • Wang, B.1    Plumbley, M.D.2
  • 27
    • 50249152311
    • Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria
    • Mar
    • T. Virtanen, "Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 1066-1074, Mar. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.3 , pp. 1066-1074
    • Virtanen, T.1
  • 28
    • 33144463127
    • Unsupervised analysis of polyphonic music using sparse coding
    • S. A. Abdallah and M. D. Plumbley, "Unsupervised analysis of polyphonic music using sparse coding," IEEE Trans. Neural Netw., vol. 17, no. 1, pp. 179-196, 2006.
    • (2006) IEEE Trans. Neural Netw , vol.17 , Issue.1 , pp. 179-196
    • Abdallah, S.A.1    Plumbley, M.D.2
  • 29
    • 34547493724
    • Separation of music signals by harmonic structure modeling
    • Y. Zhang and C. Zhang, "Separation of music signals by harmonic structure modeling," in Proc. NIPS, 2006, pp. 1617-1624.
    • (2006) Proc. NIPS , pp. 1617-1624
    • Zhang, Y.1    Zhang, C.2
  • 31
    • 85159261800
    • Separation of mixed audio sources by independent subspace analysis
    • M. A. Casey and A. Westner, "Separation of mixed audio sources by independent subspace analysis," in Proc. ICMC, 2000, pp. 154-161.
    • (2000) Proc. ICMC , pp. 154-161
    • Casey, M.A.1    Westner, A.2
  • 32
    • 32844468881
    • Extraction of drum tracks from polyphonic music using independent subspace analysis
    • C. Uhle, C. Dittmar, and T. Sporer, "Extraction of drum tracks from polyphonic music using independent subspace analysis," in Proc. ICA, 2003, pp. 843-848.
    • (2003) Proc. ICA , pp. 843-848
    • Uhle, C.1    Dittmar, C.2    Sporer, T.3
  • 33
    • 49749085046
    • Single-mixture audio source separation by subspace decomposition of Hilbert spectrum
    • M. K. I. Molla and K. Hirose, "Single-mixture audio source separation by subspace decomposition of Hilbert spectrum," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 893-900, Mar. 2007.
  • 34
    • 0033592606
    • Learning the parts of objects by nonnegative matrix factorization
    • D. D. Lee and H. S. Seung, "Learning the parts of objects by nonnegative matrix factorization," Nature, vol. 401, pp. 788-791, 1999.
    • (1999) Nature , vol.401 , pp. 788-791
    • Lee, D.D.1    Seung, H.S.2
  • 35
    • 33745711481
    • Monaural music source separation: Non-negativity, sparseness, and shift-invariance
    • M. Kim and S. Choi, "Monaural music source separation: Non-negativity, sparseness, and shift-invariance," in ICA, 2006, pp. 617-624.
    • (2006) ICA , pp. 617-624
    • Kim, M.1    Choi, S.2
  • 36
    • 48149090146
    • Estimating single-channel source separation masks: Relevance vector machine classifiers vs. pitch-based masking
    • Oct
    • R. Weiss and D. Ellis, "Estimating single-channel source separation masks: Relevance vector machine classifiers vs. pitch-based masking," in Proc. Workshop Statistical Perceptual Audition (SAPA'06), Oct. 2006, pp. 31-36.
    • (2006) Proc. Workshop Statistical Perceptual Audition (SAPA'06) , pp. 31-36
    • Weiss, R.1    Ellis, D.2
  • 37
    • 64849112529
    • Model-based audio source separation
    • E. Vincent, M. G. Jafari, S. A. Abdallah, M. D. Plumbley, and M. E. Davies, "Model-based audio source separation," Queen Mary Univ. of London, London, U.K., Tech. Rep. C4DM-TR-05-01, 2006.
  • 38
    • 0013301483
    • A Study of musical instrument classification using Gaussian mixture models and support vector machines
    • Cambridge Res. Lab., Tech. Rep. Series CRL/4
    • J. Marques and P. Moreno, "A Study of musical instrument classification using Gaussian mixture models and support vector machines," Cambridge Res. Lab., Tech. Rep. Series CRL/4, 1999.
    • (1999)
    • Marques, J.1    Moreno, P.2
  • 39
    • 0008094967
    • 3rd ed. Sacramento, CA: Brooks Cole, California State Univ
    • D. E. Hall, Musical Acoustics, 3rd ed. Sacramento, CA: Brooks Cole, California State Univ., 2002.
    • (2002) Musical Acoustics
    • Hall, D.E.1
  • 40
    • 7044269007
    • Pitch-dependent musical instrument identification and its application to musical sound ontology
    • New York: Springer
    • T. Kitahara, M. Goto, and H. G. Okuno, "Pitch-dependent musical instrument identification and its application to musical sound ontology," in Developments in Applied Artificial Intelligence. New York: Springer, 2003.
    • (2003) Developments in Applied Artificial Intelligence
    • Kitahara, T.1    Goto, M.2    Okuno, H.G.3
  • 43
    • 33745001646
    • Application of missing feature theory to the recognition of musical instruments in polyphonic audio
    • J. Eggink and G. J. Brown, "Application of missing feature theory to the recognition of musical instruments in polyphonic audio," in Proc. ISMIR, 2003, pp. 125-131.
    • (2003) Proc. ISMIR , pp. 125-131
    • Eggink, J.1    Brown, G.J.2
  • 44
    • 64849105653
    • Musical sound modeling with sinusoids plus noise
    • X. Serra, "Musical sound modeling with sinusoids plus noise," in Musical Signal Processing, C. Roads, S. Pope, A. Piccialli, and G. De Poli, Eds. London, U.K.: Swets & Zeitlinger, 1997.
  • 46
    • 0005005638
    • Online, Available
    • "Sound quality assessment material (SQAM)," [Online]. Available: http://www.ebu.ch/en/technical/publications/tech3000-series/tech3253/
    • Sound quality assessment material (SQAM)
  • 47
    • 0012739478
    • Musical sound signal analysis/synthesis: Sinusoidal+residual and elementary waveform models
    • presented at the, Online, Available:, unpublished
    • X. Rodet, "Musical sound signal analysis/synthesis: Sinusoidal+residual and elementary waveform models," presented at the IEEE Time-Frequency and Time-Scale Workshop, 1997 [Online]. Available: http://recherche.ircam.fr/equipes/analyse-synthese/listePublications/articlesRodet/TFTS97/TFTS97-ASP.ps, unpublished.
    • (1997) IEEE Time-Frequency and Time-Scale Workshop
    • Rodet, X.1
  • 48
    • 84906261797
    • PARSHL: An analysis/synthesis program for non-harmonic sounds based on a sinusoidal representation
    • J. O. Smith and X. Serra, "PARSHL: An analysis/synthesis program for non-harmonic sounds based on a sinusoidal representation," in Proc. ICMC, 1987, pp. 290-297.
    • (1987) Proc. ICMC , pp. 290-297
    • Smith, J.O.1    Serra, X.2
  • 49
    • 33645360635
    • Bayesian analysis of polyphonic Western tonal music
    • M. Davy, S. Godsill, and J. Idier, "Bayesian analysis of polyphonic Western tonal music," J. Acoust. Soc. Amer., vol. 119, no. 4, pp. 2498-2517, 2006.
    • (2006) J. Acoust. Soc. Amer , vol.119 , Issue.4 , pp. 2498-2517
    • Davy, M.1    Godsill, S.2    Idier, J.3
  • 50
    • 0000120766
    • Estimating the dimension of a model
    • G. Schwarz, "Estimating the dimension of a model," Ann. Statist., vol. 6, pp. 461-464, 1978.
    • (1978) Ann. Statist , vol.6 , pp. 461-464
    • Schwarz, G.1
  • 51
    • 47649111947
    • Melody extraction and musical onset detection via probabilistic models of framewise STFT peak data
    • May
    • H. Thornburg, R. J. Leistikow, and J. Berger, "Melody extraction and musical onset detection via probabilistic models of framewise STFT peak data," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 4, pp. 1257-1272, May 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.4 , pp. 1257-1272
    • Thornburg, H.1    Leistikow, R.J.2    Berger, J.3
  • 53
    • 84873444806
    • Multiple fundamental frequency estimation by summing harmonic amplitudes
    • A. Klapuri, "Multiple fundamental frequency estimation by summing harmonic amplitudes," in Proc. ISMIR, 2006, pp. 216-221.
    • (2006) Proc. ISMIR , pp. 216-221
    • Klapuri, A.1
  • 54
    • 9444241023
    • Clustering in knowledge embedded space
    • Y. Zhang, C. Zhang, and S. Wang, "Clustering in knowledge embedded space," in Proc. ECML, 2003, pp. 480-491.
    • (2003) Proc. ECML , pp. 480-491
    • Zhang, Y.1    Zhang, C.2    Wang, S.3
  • 55
    • 79951586633
    • Auditory model inversion for sound separation
    • M. Slaney, D. Naar, and R. F. Lyon, "Auditory model inversion for sound separation," in Proc. ICASSP, 1994, pp. 77-80.
    • (1994) Proc. ICASSP , pp. 77-80
    • Slaney, M.1    Naar, D.2    Lyon, R.F.3
  • 57
    • 33745729265
    • Rennes, France, IRISA Tech. Rep. 1706, Apr, Online, Available
    • C. Fevotte, R. Gribonval, and E. Vincent, "BSS EVAL Toolbox User Guide," Rennes, France, IRISA Tech. Rep. 1706, Apr. 2005 [Online]. Available: http://www.irisa.fr/metiss/bss_eval/
    • (2005) BSS EVAL Toolbox User Guide
    • Fevotte, C.1    Gribonval, R.2    Vincent, E.3
  • 58
    • 64849094782
    • BSS ORACLE Toolbox User Guide Version 1.0
    • E. Vincent and R. Gribonval, "BSS ORACLE Toolbox User Guide Version 1.0," 2005 [Online]. Available: http://www.irisa.fr/metiss/bss_oracle/
  • 59
    • 19544361927
    • Univ. of Jyväskylä, Kopijyvä, Jyväskylä, Finland, Online, Available
    • T. Eerola and P. Toiviainen, "MIDI Toolbox: MATLAB tools for music research," Univ. of Jyväskylä, Kopijyvä, Jyväskylä, Finland, 2004 [Online]. Available: http://www.jyu.fi/musica/miditoolbox/
    • (2004) MIDI Toolbox: MATLAB tools for music research
    • Eerola, T.1    Toiviainen, P.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.