메뉴 건너뛰기




Volumn 16, Issue 2, 2008, Pages 278-290

Normalized cuts for predominant melodic source separation

Author keywords

Computational auditory scene analysis (CASA); Music information retrieval (MIR); Normalized cut; Sinusoidal modeling; Spectral clustering

Indexed keywords

COMPUTATIONAL AUDITORY SCENE ANALYSIS (CASA); MUSIC INFORMATION RETRIEVAL (MIR); NORMALIZED CUT; SINUSOIDAL MODELING; SPECTRAL CLUSTERING;

EID: 64849087459     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2007.909260     Document Type: Article
Times cited : (32)

References (42)
  • 2
    • 0036648502 scopus 로고    scopus 로고
    • Musical genre classification of audio signals
    • Jul
    • G. Tzanetakis and P. Cook, "Musical genre classification of audio signals," IEEE Trans. Speech Audio Process., vol. 10, no. 5, pp. 293-302, Jul. 2002.
    • (2002) IEEE Trans. Speech Audio Process , vol.10 , Issue.5 , pp. 293-302
    • Tzanetakis, G.1    Cook, P.2
  • 3
    • 33745000971 scopus 로고    scopus 로고
    • Improving timbre similarity: How high is the skv?
    • J.-J. Aucouturier and F. Pachet, "Improving timbre similarity: How high is the skv?," J. Neg. Results Speech Audio Set, vol. 1, no. 1, pp. 1-13, 2004.
    • (2004) J. Neg. Results Speech Audio Set , vol.1 , Issue.1 , pp. 1-13
    • Aucouturier, J.-J.1    Pachet, F.2
  • 4
    • 0029456574 scopus 로고
    • Query by humming: Musical information retrieval in an audio database
    • A. Ghias, J. Logan, D. Chamberlin, andB. Smith, "Query by humming: Musical information retrieval in an audio database," ACM Multimedia, pp. 213-236, 1995.
    • (1995) ACM Multimedia , pp. 213-236
    • Ghias, A.1    Logan, J.2    Chamberlin, D.3    andB4    Smith5
  • 9
    • 84873538214 scopus 로고    scopus 로고
    • Separation of vocals from polyphonic audio recordings
    • London, U.K
    • S. Vembu and S. Baumann, "Separation of vocals from polyphonic audio recordings," in Proc. Int. Conf. Music Inf. Retrieval (ISMIR), London, U.K., 2005, pp. 337-344.
    • (2005) Proc. Int. Conf. Music Inf. Retrieval (ISMIR) , pp. 337-344
    • Vembu, S.1    Baumann, S.2
  • 13
    • 64849093548 scopus 로고    scopus 로고
    • D. Rosenthal and H. Okuno, Eds, Mahwah, NJ: Lawrence Erlbaum Associates
    • D. Rosenthal and H. Okuno, Eds., Computational Auditoty Scene Anal - ysis. Mahwah, NJ: Lawrence Erlbaum Associates, 1998.
    • (1998) Computational Auditoty Scene Anal - ysis
  • 15
    • 33744978751 scopus 로고    scopus 로고
    • Musical source separation using time-frequency priors
    • Jan
    • E. Vincent, "Musical source separation using time-frequency priors," IEEE Trans. Audio, Speech, Lang, Pwcess., vol. 14, no. 1, pp. 91-98, Jan. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang, Pwcess , vol.14 , Issue.1 , pp. 91-98
    • Vincent, E.1
  • 16
    • 64849095171 scopus 로고    scopus 로고
    • S. T. Roweis, One microphone source separation, in Proc. Neural Inf. Process. Syst. (NIPS), 2000, pp. 793-799. [17] J. Shi and J. Malik, Normalized cuts and image segmentation, IEEE Trans. Pattern Anal. Mach. Intell., 22. no. 8. pp. 888-905, Aug. 2000.
    • S. T. Roweis, "One microphone source separation," in Proc. Neural Inf. Process. Syst. (NIPS), 2000, pp. 793-799. [17] J. Shi and J. Malik, "Normalized cuts and image segmentation," IEEE Trans. Pattern Anal. Mach. Intell., vol. 22. no. 8. pp. 888-905, Aug. 2000.
  • 18
    • 84883096856 scopus 로고    scopus 로고
    • Unsupervised content discovery in composite audio
    • R. Cai, L. Lu, and A. Hanjalic, "Unsupervised content discovery in composite audio," in Proc. ACM Multimedia, 2005, pp. 628-637.
    • (2005) Proc. ACM Multimedia , pp. 628-637
    • Cai, R.1    Lu, L.2    Hanjalic, A.3
  • 20
    • 64849098287 scopus 로고    scopus 로고
    • F. Bach and M. 1. Jordan, Blind one-microphone speech separation: A spectral learning approach, in Proc. Neural Inf. Process. Syst. (NIPS). Vancouver, BC, Canada, 2004, pp. 65-72.
    • F. Bach and M. 1. Jordan, "Blind one-microphone speech separation: A spectral learning approach," in Proc. Neural Inf. Process. Syst. (NIPS). Vancouver, BC, Canada, 2004, pp. 65-72.
  • 21
    • 33749317042 scopus 로고    scopus 로고
    • Learning spectral clustering, with application to speech separation, j
    • F. R. Bach and M. I. Jordan, "Learning spectral clustering, with application to speech separation," j. Mach. Learn. Res., vol. 7, pp. 1963-2001, 2006.
    • (2006) Mach. Learn. Res , vol.7 , pp. 1963-2001
    • Bach, F.R.1    Jordan, M.I.2
  • 24
    • 84863772450 scopus 로고
    • Speech analysis/synthesis based on a sinusoidal representation
    • Aug
    • R. McAulay and T. Quatieri, "Speech analysis/synthesis based on a sinusoidal representation," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-34, no. 4, pp. 744-754, Aug. 1986.
    • (1986) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-34 , Issue.4 , pp. 744-754
    • McAulay, R.1    Quatieri, T.2
  • 25
    • 64149087955 scopus 로고    scopus 로고
    • Enhancing the tracking of partials for the sinusoidal modeling of polyphonic sounds
    • Jul
    • M. Lagrange, S. Marchand, and J. Rault, "Enhancing the tracking of partials for the sinusoidal modeling of polyphonic sounds," IEEE Trans. Acoust., Speech, Signal Process., vol. 15, no. 5, pp. 1625-1634, Jul. 2007.
    • (2007) IEEE Trans. Acoust., Speech, Signal Process , vol.15 , Issue.5 , pp. 1625-1634
    • Lagrange, M.1    Marchand, S.2    Rault, J.3
  • 26
    • 0032022514 scopus 로고    scopus 로고
    • Accuracy of frequency estimates using the phase vocoder
    • Mar
    • M. S. Puckette and J. C. Brown, "Accuracy of frequency estimates using the phase vocoder," IEEE Trans. Speech Audio Process., vol. 6, no. 2, pp. 166-176, Mar. 1998.
    • (1998) IEEE Trans. Speech Audio Process , vol.6 , Issue.2 , pp. 166-176
    • Puckette, M.S.1    Brown, J.C.2
  • 27
    • 84862626296 scopus 로고    scopus 로고
    • On the equivalence of phase-based methods for the estimation of instantaneous frequency
    • S. Marchand and M. Lagrange, "On the equivalence of phase-based methods for the estimation of instantaneous frequency," in Proc. Eur. Conf. Signal Pwcess. (EUSIPCO'06), 2006.
    • (2006) Proc. Eur. Conf. Signal Pwcess. (EUSIPCO'06)
    • Marchand, S.1    Lagrange, M.2
  • 28
    • 34249885836 scopus 로고    scopus 로고
    • M. Lagrange and S. Marchand, Estimating the instantaneous frequency of sinusoidal components using phase-based methods, j. Audio Eng. Soc, 55, no. 1, pp. 385-397, May 2007.
    • M. Lagrange and S. Marchand, "Estimating the instantaneous frequency of sinusoidal components using phase-based methods," j. Audio Eng. Soc, vol. 55, no. 1, pp. 385-397, May 2007.
  • 29
    • 33646822007 scopus 로고    scopus 로고
    • Design criteria for simple sinusoidal parameter estimation based on quadratic interpolation of FFT magnitude peaks
    • San Francisco, CA, Oct, preprint 6256
    • M. Abe and J. O. Smith, "Design criteria for simple sinusoidal parameter estimation based on quadratic interpolation of FFT magnitude peaks," in Proc. 117th Conv. Audio Eng. Soc, San Francisco, CA, Oct. 2004, preprint 6256.
    • (2004) Proc. 117th Conv. Audio Eng. Soc
    • Abe, M.1    Smith, J.O.2
  • 30
    • 33645308277 scopus 로고    scopus 로고
    • High resolution spectral analysis of mixtures of complex exponentials modulated bv polynomials
    • Apr
    • R. Badeau, B. David, and G. Richard, "High resolution spectral analysis of mixtures of complex exponentials modulated bv polynomials," 'IEEE Trans. Signal Process., vol. 54, no. 4, pp. 1341-1350, Apr. 2006.
    • (2006) IEEE Trans. Signal Process , vol.54 , Issue.4 , pp. 1341-1350
    • Badeau, R.1    David, B.2    Richard, G.3
  • 31
    • 0033707902 scopus 로고    scopus 로고
    • Separation of harmonic sound sources using sinusoidal modeling
    • T. Virtanen and A. Klapuri, "Separation of harmonic sound sources using sinusoidal modeling," in Proc. ICASSP, 2000, vol. 2, pp. 765-768.
    • (2000) Proc. ICASSP , vol.2 , pp. 765-768
    • Virtanen, T.1    Klapuri, A.2
  • 32
    • 64849113697 scopus 로고    scopus 로고
    • Unsupervised classification techniques for multipitch estimation
    • preprint 6037
    • J. Rosier and Y. Grenier, "Unsupervised classification techniques for multipitch estimation," in Proc. 116th Conv. Audio Eng. Soc, 2004, preprint 6037.
    • (2004) Proc. 116th Conv. Audio Eng. Soc
    • Rosier, J.1    Grenier, Y.2
  • 34
    • 84872699414 scopus 로고    scopus 로고
    • Assessing the quality of the extraction and tracking of sinusoidal components: Towards an evaluation methodology
    • preprint 5524
    • M. Lagrange and S. Marchand, "Assessing the quality of the extraction and tracking of sinusoidal components: Towards an evaluation methodology," in Proc. Digital Audio Effects (DAFx'06) Conf, 2006, pp. 239-245', preprint 5524.
    • (2006) Proc. Digital Audio Effects (DAFx'06) Conf , pp. 239-245
    • Lagrange, M.1    Marchand, S.2
  • 36
    • 84873444806 scopus 로고    scopus 로고
    • Multiple fundamental frequency estimation by summing harmonic amplitudes
    • Victoria, BC, Canada
    • A. Klapuri, "Multiple fundamental frequency estimation by summing harmonic amplitudes," in Proc. Int. Conf. Music Inf. Retrieval (ISMIR), Victoria, BC, Canada, 2006, pp. 216-221.
    • (2006) Proc. Int. Conf. Music Inf. Retrieval (ISMIR) , pp. 216-221
    • Klapuri, A.1
  • 37
    • 64849094125 scopus 로고    scopus 로고
    • P. Boersma and D. Weenink, Praat: Doing phonetics bv computer Version 4.5.06, retrieved Dec. 13, 2006, Online, Available
    • P. Boersma and D. Weenink, "Praat: Doing phonetics bv computer (Version 4.5.06)," retrieved Dec. 13, 2006. [Online], Available: http:// www.praat.org/
  • 38
    • 0019053271 scopus 로고
    • Experiments in syllable-based recognition of continuous speech
    • Aug
    • S. Davis and P. Mermelstein, "Experiments in syllable-based recognition of continuous speech," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-28 , Issue.4 , pp. 357-366
    • Davis, S.1    Mermelstein, P.2
  • 40
    • 0033692661 scopus 로고    scopus 로고
    • Blind separation of disjoint orthogonal signals: Demixing N sources from 2 mixtures
    • A. Jourjine, S. Richard, and O. Yilmaz, "Blind separation of disjoint orthogonal signals: Demixing N sources from 2 mixtures," in Proc. ICASSP, 2000, pp. 2985-2988.
    • (2000) Proc. ICASSP , pp. 2985-2988
    • Jourjine, A.1    Richard, S.2    Yilmaz, O.3
  • 41
    • 84945129845 scopus 로고    scopus 로고
    • Frequency-domain source identification and manipulation in stereo mixes for enhancement, suppression, and re-panning applications
    • New Paltz, NY
    • C. Avendano, "Frequency-domain source identification and manipulation in stereo mixes for enhancement, suppression, and re-panning applications," in Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust. (WASPAA), New Paltz, NY, 2003, pp. 55-58.
    • (2003) Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust. (WASPAA) , pp. 55-58
    • Avendano, C.1
  • 42
    • 84866511945 scopus 로고    scopus 로고
    • Semi-automatic mono to stereo up-mixing using sound source formation
    • Vienna, May, preprint 7042
    • M. Lagrange, L. G. Martins, and G. Tzanetakis, "Semi-automatic mono to stereo up-mixing using sound source formation," in Proc, 122th Conv. Audio Eng. Soc, Vienna, May 2007, preprint 7042.
    • (2007) Proc, 122th Conv. Audio Eng. Soc
    • Lagrange, M.1    Martins, L.G.2    Tzanetakis, G.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.