SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 16, Issue 2, 2008, Pages 278-290

Normalized cuts for predominant melodic source separation

(4) Lagrange, Mathieu a Martins, Luis Gustavo b Murdoch, Jennifer a Tzanetakis, George a

Author keywords

Computational auditory scene analysis (CASA); Music information retrieval (MIR); Normalized cut; Sinusoidal modeling; Spectral clustering

Indexed keywords

COMPUTATIONAL AUDITORY SCENE ANALYSIS (CASA); MUSIC INFORMATION RETRIEVAL (MIR); NORMALIZED CUT; SINUSOIDAL MODELING; SPECTRAL CLUSTERING;

COMPUTER VISION; ELECTRONIC MUSICAL INSTRUMENTS; IMAGE SEGMENTATION; INFORMATION RETRIEVAL; INFORMATION SERVICES; PATIENT REHABILITATION;

COMPUTER MUSIC;

EID: 64849087459 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2007.909260 Document Type: Article

Times cited : (32)

References (42)

1
- 0038535979
- A classification of musical genre
- F. Pachet and D. Cazaly, "A classification of musical genre," in Proc. RIAO Content-Based Multimedia Inf. Access Conf., 2000, pp. 1238-1245.
- (2000) Proc. RIAO Content-Based Multimedia Inf. Access Conf , pp. 1238-1245
- Pachet, F.¹ Cazaly, D.²

2
- 0036648502
- Musical genre classification of audio signals
- Jul
- G. Tzanetakis and P. Cook, "Musical genre classification of audio signals," IEEE Trans. Speech Audio Process., vol. 10, no. 5, pp. 293-302, Jul. 2002.
- (2002) IEEE Trans. Speech Audio Process , vol.10 , Issue.5 , pp. 293-302
- Tzanetakis, G.¹ Cook, P.²

3
- 33745000971
- Improving timbre similarity: How high is the skv?
- J.-J. Aucouturier and F. Pachet, "Improving timbre similarity: How high is the skv?," J. Neg. Results Speech Audio Set, vol. 1, no. 1, pp. 1-13, 2004.
- (2004) J. Neg. Results Speech Audio Set , vol.1 , Issue.1 , pp. 1-13
- Aucouturier, J.-J.¹ Pachet, F.²

4
- 0029456574
- Query by humming: Musical information retrieval in an audio database
- A. Ghias, J. Logan, D. Chamberlin, andB. Smith, "Query by humming: Musical information retrieval in an audio database," ACM Multimedia, pp. 213-236, 1995.
- (1995) ACM Multimedia , pp. 213-236
- Ghias, A.¹ Logan, J.² Chamberlin, D.³ andB⁴ Smith⁵

5
- 2942750514
- The MUSART testbed for query-by-humming evaluation
- R. B. Dannenberg, W. P. Birmingham, G. Tzanetakis, C. Meek, N. Hu, and B. Pardo, "The MUSART testbed for query-by-humming evaluation," Comput. Music J., vol. 28, no. 2, pp. 34-48,2004.
- (2004) Comput. Music J , vol.28 , Issue.2 , pp. 34-48
- Dannenberg, R.B.¹ Birmingham, W.P.² Tzanetakis, G.³ Meek, C.⁴ Hu, N.⁵ Pardo, B.⁶

6
- 84873476754
- Towards quantifying the album effect in artist identification
- Y. Kim, D. Williamson, and S. Pilli, "Towards quantifying the album effect in artist identification," in Proc. Int. Conf. Music Inf. Retrieval (ISMIR), 2006, pp. 393-394.
- (2006) Proc. Int. Conf. Music Inf. Retrieval (ISMIR) , pp. 393-394
- Kim, Y.¹ Williamson, D.² Pilli, S.³

7
- 41649099242
- Singing voice separation from monaural recordings
- Y. Li and D. Wang, "Singing voice separation from monaural recordings," in Proc. Int. Conf. Music Inf. Retrieval (ISMIR), 2006. pp. 176-179.
- (2006) Proc. Int. Conf. Music Inf. Retrieval (ISMIR) , pp. 176-179
- Li, Y.¹ Wang, D.²

8
- 33745686986
- One microphone singing voice separation using source-adapted models
- New Paltz. NY
- A. Ozerov, P. Philippe, R. Gribonval, and F. Bimbot, "One microphone singing voice separation using source-adapted models," in Proc. IEEE Workshop Applicat. Signal Pwcess. Audio Acoust. (WASPAA), New Paltz. NY, 2005, pp. 90-93.
- (2005) Proc. IEEE Workshop Applicat. Signal Pwcess. Audio Acoust. (WASPAA) , pp. 90-93
- Ozerov, A.¹ Philippe, P.² Gribonval, R.³ Bimbot, F.⁴

9
- 84873538214
- Separation of vocals from polyphonic audio recordings
- London, U.K
- S. Vembu and S. Baumann, "Separation of vocals from polyphonic audio recordings," in Proc. Int. Conf. Music Inf. Retrieval (ISMIR), London, U.K., 2005, pp. 337-344.
- (2005) Proc. Int. Conf. Music Inf. Retrieval (ISMIR) , pp. 337-344
- Vembu, S.¹ Baumann, S.²

10
- 48849095345
- Melody transcription from music audio: Approaches and evaluation
- May
- G. Polmer, D. Ellis, A. Ehmann, E. Gomez, S. Stieich, and B. Ong, "Melody transcription from music audio: Approaches and evaluation," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 4, pp. 1247-1256, May 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.4 , pp. 1247-1256
- Polmer, G.¹ Ellis, D.² Ehmann, A.³ Gomez, E.⁴ Stieich, S.⁵ Ong, B.⁶

11
- 0003684441
- Cambridge, MA: MIT Press
- A. Bregman, Auditory Scene Analysis: The Perceptual Organization of Sound. Cambridge, MA: MIT Press, 1990.
- (1990) Auditory Scene Analysis: The Perceptual Organization of Sound
- Bregman, A.¹

12
- 34547507985
- Sound source tracking and formation using normalized cuts
- Honolulu, HI
- M. Lagrange and G. Tzanetakis, "Sound source tracking and formation using normalized cuts," in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Honolulu, HI, 2007, pp. 1-61-1-64.
- (2007) Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP)
- Lagrange, M.¹ Tzanetakis, G.²

13
- 64849093548
- D. Rosenthal and H. Okuno, Eds, Mahwah, NJ: Lawrence Erlbaum Associates
- D. Rosenthal and H. Okuno, Eds., Computational Auditoty Scene Anal - ysis. Mahwah, NJ: Lawrence Erlbaum Associates, 1998.
- (1998) Computational Auditoty Scene Anal - ysis

14
- 71049180205
- D. Wang and G. J. Brown, Eds, New York: Wiley
- D. Wang and G. J. Brown, Eds., Computational Auditoty Scene Analysis: Principles, Algorithms and Applications. New York: Wiley, 2006.
- (2006) Computational Auditoty Scene Analysis: Principles, Algorithms and Applications

15
- 33744978751
- Musical source separation using time-frequency priors
- Jan
- E. Vincent, "Musical source separation using time-frequency priors," IEEE Trans. Audio, Speech, Lang, Pwcess., vol. 14, no. 1, pp. 91-98, Jan. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang, Pwcess , vol.14 , Issue.1 , pp. 91-98
- Vincent, E.¹

16
- 64849095171
- S. T. Roweis, One microphone source separation, in Proc. Neural Inf. Process. Syst. (NIPS), 2000, pp. 793-799. [17] J. Shi and J. Malik, Normalized cuts and image segmentation, IEEE Trans. Pattern Anal. Mach. Intell., 22. no. 8. pp. 888-905, Aug. 2000.
- S. T. Roweis, "One microphone source separation," in Proc. Neural Inf. Process. Syst. (NIPS), 2000, pp. 793-799. [17] J. Shi and J. Malik, "Normalized cuts and image segmentation," IEEE Trans. Pattern Anal. Mach. Intell., vol. 22. no. 8. pp. 888-905, Aug. 2000.

17
- 14944367313
- Minimal-impact audio-based personal archives
- New York
- D. Ellis and K. Lee, "Minimal-impact audio-based personal archives," in Proc. ACM Workshop Continuous A rchival and Retrieval of Personal Experience (CARPE), New York, 2004, pp. 744-747.
- (2004) Proc. ACM Workshop Continuous A rchival and Retrieval of Personal Experience (CARPE) , pp. 744-747
- Ellis, D.¹ Lee, K.²

18
- 84883096856
- Unsupervised content discovery in composite audio
- R. Cai, L. Lu, and A. Hanjalic, "Unsupervised content discovery in composite audio," in Proc. ACM Multimedia, 2005, pp. 628-637.
- (2005) Proc. ACM Multimedia , pp. 628-637
- Cai, R.¹ Lu, L.² Hanjalic, A.³

19
- 85069867844
- Audio segmentation by singular value clustering
- S. Dubnov and T. Appel, "Audio segmentation by singular value clustering," in Proc. Int. Conf. Computer Music (ICMC), 2004, pp. 454-457.
- (2004) Proc. Int. Conf. Computer Music (ICMC) , pp. 454-457
- Dubnov, S.¹ Appel, T.²

20
- 64849098287
- F. Bach and M. 1. Jordan, Blind one-microphone speech separation: A spectral learning approach, in Proc. Neural Inf. Process. Syst. (NIPS). Vancouver, BC, Canada, 2004, pp. 65-72.
- F. Bach and M. 1. Jordan, "Blind one-microphone speech separation: A spectral learning approach," in Proc. Neural Inf. Process. Syst. (NIPS). Vancouver, BC, Canada, 2004, pp. 65-72.

21
- 33749317042
- Learning spectral clustering, with application to speech separation, j
- F. R. Bach and M. I. Jordan, "Learning spectral clustering, with application to speech separation," j. Mach. Learn. Res., vol. 7, pp. 1963-2001, 2006.
- (2006) Mach. Learn. Res , vol.7 , pp. 1963-2001
- Bach, F.R.¹ Jordan, M.I.²

22
- 0141743471
- Harmonicity and dynamics based audio separation
- S. Srinivasan and M. Kankanhalli, "Harmonicity and dynamics based audio separation," in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP'03), 2003, vol. 5, pp. 640-643.
- (2003) Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP'03) , vol.5 , pp. 640-643
- Srinivasan, S.¹ Kankanhalli, M.²

23
- 4544366081
- Auditory blobs
- S. Srinivasan, "Auditory blobs," in Proc. Int. Conf. Acoust., Speech, Signal Pwcess. (ICASSP'04), 2004, vol. 4, pp. 313-316.
- (2004) Proc. Int. Conf. Acoust., Speech, Signal Pwcess. (ICASSP'04) , vol.4 , pp. 313-316
- Srinivasan, S.¹

24
- 84863772450
- Speech analysis/synthesis based on a sinusoidal representation
- Aug
- R. McAulay and T. Quatieri, "Speech analysis/synthesis based on a sinusoidal representation," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-34, no. 4, pp. 744-754, Aug. 1986.
- (1986) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-34 , Issue.4 , pp. 744-754
- McAulay, R.¹ Quatieri, T.²

25
- 64149087955
- Enhancing the tracking of partials for the sinusoidal modeling of polyphonic sounds
- Jul
- M. Lagrange, S. Marchand, and J. Rault, "Enhancing the tracking of partials for the sinusoidal modeling of polyphonic sounds," IEEE Trans. Acoust., Speech, Signal Process., vol. 15, no. 5, pp. 1625-1634, Jul. 2007.
- (2007) IEEE Trans. Acoust., Speech, Signal Process , vol.15 , Issue.5 , pp. 1625-1634
- Lagrange, M.¹ Marchand, S.² Rault, J.³

26
- 0032022514
- Accuracy of frequency estimates using the phase vocoder
- Mar
- M. S. Puckette and J. C. Brown, "Accuracy of frequency estimates using the phase vocoder," IEEE Trans. Speech Audio Process., vol. 6, no. 2, pp. 166-176, Mar. 1998.
- (1998) IEEE Trans. Speech Audio Process , vol.6 , Issue.2 , pp. 166-176
- Puckette, M.S.¹ Brown, J.C.²

27
- 84862626296
- On the equivalence of phase-based methods for the estimation of instantaneous frequency
- S. Marchand and M. Lagrange, "On the equivalence of phase-based methods for the estimation of instantaneous frequency," in Proc. Eur. Conf. Signal Pwcess. (EUSIPCO'06), 2006.
- (2006) Proc. Eur. Conf. Signal Pwcess. (EUSIPCO'06)
- Marchand, S.¹ Lagrange, M.²

28
- 34249885836
- M. Lagrange and S. Marchand, Estimating the instantaneous frequency of sinusoidal components using phase-based methods, j. Audio Eng. Soc, 55, no. 1, pp. 385-397, May 2007.
- M. Lagrange and S. Marchand, "Estimating the instantaneous frequency of sinusoidal components using phase-based methods," j. Audio Eng. Soc, vol. 55, no. 1, pp. 385-397, May 2007.

29
- 33646822007
- Design criteria for simple sinusoidal parameter estimation based on quadratic interpolation of FFT magnitude peaks
- San Francisco, CA, Oct, preprint 6256
- M. Abe and J. O. Smith, "Design criteria for simple sinusoidal parameter estimation based on quadratic interpolation of FFT magnitude peaks," in Proc. 117th Conv. Audio Eng. Soc, San Francisco, CA, Oct. 2004, preprint 6256.
- (2004) Proc. 117th Conv. Audio Eng. Soc
- Abe, M.¹ Smith, J.O.²

30
- 33645308277
- High resolution spectral analysis of mixtures of complex exponentials modulated bv polynomials
- Apr
- R. Badeau, B. David, and G. Richard, "High resolution spectral analysis of mixtures of complex exponentials modulated bv polynomials," 'IEEE Trans. Signal Process., vol. 54, no. 4, pp. 1341-1350, Apr. 2006.
- (2006) IEEE Trans. Signal Process , vol.54 , Issue.4 , pp. 1341-1350
- Badeau, R.¹ David, B.² Richard, G.³

31
- 0033707902
- Separation of harmonic sound sources using sinusoidal modeling
- T. Virtanen and A. Klapuri, "Separation of harmonic sound sources using sinusoidal modeling," in Proc. ICASSP, 2000, vol. 2, pp. 765-768.
- (2000) Proc. ICASSP , vol.2 , pp. 765-768
- Virtanen, T.¹ Klapuri, A.²

32
- 64849113697
- Unsupervised classification techniques for multipitch estimation
- preprint 6037
- J. Rosier and Y. Grenier, "Unsupervised classification techniques for multipitch estimation," in Proc. 116th Conv. Audio Eng. Soc, 2004, preprint 6037.
- (2004) Proc. 116th Conv. Audio Eng. Soc
- Rosier, J.¹ Grenier, Y.²

33
- 64849102257
- PCM to MIDI transposition
- L. Martins and A. Ferreira, "PCM to MIDI transposition," in Proc. Audio Eng. Soc. (AES), 2002.
- (2002) Proc. Audio Eng. Soc. (AES)
- Martins, L.¹ Ferreira, A.²

34
- 84872699414
- Assessing the quality of the extraction and tracking of sinusoidal components: Towards an evaluation methodology
- preprint 5524
- M. Lagrange and S. Marchand, "Assessing the quality of the extraction and tracking of sinusoidal components: Towards an evaluation methodology," in Proc. Digital Audio Effects (DAFx'06) Conf, 2006, pp. 239-245', preprint 5524.
- (2006) Proc. Digital Audio Effects (DAFx'06) Conf , pp. 239-245
- Lagrange, M.¹ Marchand, S.²

35
- 84892200847
- A. Klapuri and M. Davy, Eds, New York: Springer
- A. Klapuri and M. Davy, Eds., Signal Processing Methods for Music Transcription. New York: Springer, 2006.
- (2006) Signal Processing Methods for Music Transcription

36
- 84873444806
- Multiple fundamental frequency estimation by summing harmonic amplitudes
- Victoria, BC, Canada
- A. Klapuri, "Multiple fundamental frequency estimation by summing harmonic amplitudes," in Proc. Int. Conf. Music Inf. Retrieval (ISMIR), Victoria, BC, Canada, 2006, pp. 216-221.
- (2006) Proc. Int. Conf. Music Inf. Retrieval (ISMIR) , pp. 216-221
- Klapuri, A.¹

37
- 64849094125
- P. Boersma and D. Weenink, Praat: Doing phonetics bv computer Version 4.5.06, retrieved Dec. 13, 2006, Online, Available
- P. Boersma and D. Weenink, "Praat: Doing phonetics bv computer (Version 4.5.06)," retrieved Dec. 13, 2006. [Online], Available: http:// www.praat.org/

38
- 0019053271
- Experiments in syllable-based recognition of continuous speech
- Aug
- S. Davis and P. Mermelstein, "Experiments in syllable-based recognition of continuous speech," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
- (1980) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-28 , Issue.4 , pp. 357-366
- Davis, S.¹ Mermelstein, P.²

39
- 0003957032
- 2nd ed. San Mateo, CA: Morgan Kaufmann
- I. Witten and E. Frank, Data Mining: Practical Machine Learning Tools and Techniques, 2nd ed. San Mateo, CA: Morgan Kaufmann, 2005.
- (2005) Data Mining: Practical Machine Learning Tools and Techniques
- Witten, I.¹ Frank, E.²

40
- 0033692661
- Blind separation of disjoint orthogonal signals: Demixing N sources from 2 mixtures
- A. Jourjine, S. Richard, and O. Yilmaz, "Blind separation of disjoint orthogonal signals: Demixing N sources from 2 mixtures," in Proc. ICASSP, 2000, pp. 2985-2988.
- (2000) Proc. ICASSP , pp. 2985-2988
- Jourjine, A.¹ Richard, S.² Yilmaz, O.³

41
- 84945129845
- Frequency-domain source identification and manipulation in stereo mixes for enhancement, suppression, and re-panning applications
- New Paltz, NY
- C. Avendano, "Frequency-domain source identification and manipulation in stereo mixes for enhancement, suppression, and re-panning applications," in Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust. (WASPAA), New Paltz, NY, 2003, pp. 55-58.
- (2003) Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust. (WASPAA) , pp. 55-58
- Avendano, C.¹

42
- 84866511945
- Semi-automatic mono to stereo up-mixing using sound source formation
- Vienna, May, preprint 7042
- M. Lagrange, L. G. Martins, and G. Tzanetakis, "Semi-automatic mono to stereo up-mixing using sound source formation," in Proc, 122th Conv. Audio Eng. Soc, Vienna, May 2007, preprint 7042.
- (2007) Proc, 122th Conv. Audio Eng. Soc
- Lagrange, M.¹ Martins, L.G.² Tzanetakis, G.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.