SCOPUS Information Retrieval Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 18, Issue 3, 2010, Pages 564-575

Source/filter model for unsupervised main melody extraction from polyphonic audio signals

(4) Durrieu, Jean Louis; Richard, Gaël; David, Bertrand; Févotte, Cédric

Author keywords

Blind audio source separation; Expectation-maximization (EM) algorithm; Gaussian scaled mixture model (GSMM); Main melody extraction; Maximum likelihood; Music; Non-negative matrix factorization (NMF); Source/filter model; Spectral analysis

Indexed keywords

AUDIO SOURCE SEPARATION; BLIND AUDIO SOURCE SEPARATION; GAUSSIANS; MAIN MELODY EXTRACTION; MIXTURE MODEL; NON-NEGATIVE MATRIX FACTORIZATION (NMF); NONNEGATIVE MATRIX FACTORIZATION; SPECTRAL ANALYSIS;

AUDIO ACOUSTICS; AUDIO SYSTEMS; FACTORIZATION; LIGHT MEASUREMENT; MAXIMUM LIKELIHOOD ESTIMATION; MIXTURES; SEPARATION; SIGNAL ANALYSIS; SPECTRUM ANALYSIS; SPECTRUM ANALYZERS;

BLIND SOURCE SEPARATION;

EID: 76949096499 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2010.2041114 Document Type: Article

Times cited: (142)

References (27)

1
- 51449109542
- Query by humming of midi and audio using locality sensitive hashing
- Las Vegas, NV Apr.
- M. Ryynänen and A. Klapuri, "Query by humming of midi and audio using locality sensitive hashing," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Las Vegas, NV, Apr. 2008, pp. 2249-2252.
- (2008) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 2249-2252
- Ryynänen, M.¹ Klapuri, A.²

2
- 84873584066
- Sequence representation of music structure using higher-order similarity matrix and maximum-likelihood approach
- G. Peeters, "Sequence representation of music structure using higher-order similarity matrix and maximum-likelihood approach," in Proc. Int. Conf. Music Inf. Retrieval, 2007.
- (2007) Proc. Int. Conf. Music Inf. Retrieval
- Peeters, G.¹

3
- 70350074065
- Chroma binary similarity and local alignment applied to cover song identification
- Aug.
- J. Serra, E. Gomez, P. Herrera, and X. Serra, "Chroma binary similarity and local alignment applied to cover song identification," IEEE Trans. Audio, Speech, Lang. Process., vol.16, no.6, pp. 1138-1151, Aug. 2008.
- (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.6 , pp. 1138-1151
- Serra, J.¹ Gomez, E.² Herrera, P.³ Serra, X.⁴

4
- 0033677009
- Robust predominant-F0 estimation method for real-time detection of melody and bass lines in CD recordings
- M. Goto, "Robust predominant-F0 estimation method for real-time detection of melody and bass lines in CD recordings," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2000, vol.2, pp. 757-760.
- (2000) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , vol.2 , pp. 757-760
- Goto, M.¹

5
- 64649090924
- Ph.D. dissertation, Univ. of Coimbra, Coimbra, Portugal
- R. Paiva, "Melody detection in polyphonic audio," Ph.D. dissertation, Univ. of Coimbra, Coimbra, Portugal, 2007.
- (2007) Melody Detection in Polyphonic Audio
- Paiva, R.¹

6
- 84873440865
- Transcription of the singing melody in polyphonic music
- M. P. Ryynänen and A. P. Klapuri, "Transcription of the singing melody in polyphonic music," in Proc. Int. Conf. Music Inf. Retrieval, 2006.
- (2006) Proc. Int. Conf. Music Inf. Retrieval
- Ryynänen, M.P.¹ Klapuri, A.P.²

7
- 84873560057
- A classification approach to melody transcription
- G. Poliner and D. Ellis, "A classification approach to melody transcription," in Proc. Int. Conf. Music Inf. Retrieval, 2005, pp. 161-166.
- (2005) Proc. Int. Conf. Music Inf. Retrieval , pp. 161-166
- Poliner, G.¹ Ellis, D.²

8
- 85020963348
- Transcription of vocal melodies using voice characteristics and algorithm fusion
- C. Sutton, E. Vincent, M. Plumbley, and J. Bello, "Transcription of vocal melodies using voice characteristics and algorithm fusion," in Proc. Music Inf. Retrieval Eval. eXchange, 2006.
- (2006) Proc. Music Inf. Retrieval Eval. eXchange
- Sutton, C.¹ Vincent, E.² Plumbley, M.³ Bello, J.⁴

9
- 63249085556
- Nonnegative matrix factorization with the Itakura-Saito divergence: With application to music analysis
- Mar.
- C. Févotte, N. Bertin, and J.-L. Durrieu, "Nonnegative matrix factorization with the Itakura-Saito divergence: With application to music analysis," Neural Comput., vol.21, no.3, pp. 793-830, Mar. 2009.
- (2009) Neural Comput. , vol.21 , Issue.3 , pp. 793-830
- Févotte, C.¹ Bertin, N.² Durrieu, J.-L.³

10
- 33744968614
- Audio source separation with a single sensor
- Jan.
- L. Benaroya, F. Bimbot, and R. Gribonval, "Audio source separation with a single sensor," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.1, pp. 191-199, Jan. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.1 , pp. 191-199
- Benaroya, L.¹ Bimbot, F.² Gribonval, R.³

11
- 51449094735
- Adaptation of Bayesian models for single-channel source separation and its application to voice/music separation in popular songs
- Jul.
- A. Ozerov, P. Philippe, F. Bimbot, and R. Gribonval, "Adaptation of Bayesian models for single-channel source separation and its application to voice/music separation in popular songs," IEEE Trans. Audio, Speech, Lang. Process., vol.15, no.5, pp. 1564-1578, Jul. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.5 , pp. 1564-1578
- Ozerov, A.¹ Philippe, P.² Bimbot, F.³ Gribonval, R.⁴

12
- 0003418124
- New York: Mouton De Gruyter
- G. Fant, Acoustic Theory of Speech Production. New York: Mouton De Gruyter, 1970.
- (1970) Acoustic Theory of Speech Production
- Fant, G.¹

13
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- A. Dempster, N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc. Ser. B (Methodological), vol.39, pp. 1-38, 1977.
- (1977) J. R. Statist. Soc. Ser. B (Methodological) , vol.39 , pp. 1-38
- Dempster, A.¹ Laird, N.² Rubin, D.³

14
- 0001093042
- Algorithms for non-negative matrix factorization
- D. D. Lee and H. S. Seung, "Algorithms for non-negative matrix factorization," in Proc. Neural Inf. Process. Syst., 2000, pp. 556-562.
- (2000) Proc. Neural Inf. Process. Syst. , pp. 556-562
- Lee, D.D.¹ Seung, H.S.²

15
- 50249152311
- Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria
- Mar.
- T. Virtanen, "Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria," IEEE Trans. Audio, Speech, Lang. Process., vol.15, no.3, pp. 1066-1074, Mar. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.3 , pp. 1066-1074
- Virtanen, T.¹

16
- 48849095345
- Melody transcription from music audio: Approaches and evaluation
- May
- G. Poliner, D. Ellis, A. Ehmann, E. Gómez, S. Streich, and B. Ong, "Melody transcription from music audio: Approaches and evaluation," IEEE Trans. Audio, Speech, Lang. Process., vol.15, no.4, pp. 1247-1256, May 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.4 , pp. 1247-1256
- Poliner, G.¹ Ellis, D.² Ehmann, A.³ Gómez, E.⁴ Streich, S.⁵ Ong, B.⁶

17
- 51449108099
- Singer melody extraction in polyphonic signals using source separation methods
- J.-L. Durrieu, G. Richard, and B. David, "Singer melody extraction in polyphonic signals using source separation methods," in IEEE Int. Conf. Acoust., Speech, Signal Process., 2008, pp. 169-172.
- (2008) IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 169-172
- Durrieu, J.-L.¹ Richard, G.² David, B.³

18
- 84872960154
- Multiple f0 estimation in polyphonic music (mirex 2008)
- C. Cao and M. Li, "Multiple f0 estimation in polyphonic music (mirex 2008)," in Proc. Music Inf. Retrieval Evaluation eXchange, 2008.
- (2008) Proc. Music Inf. Retrieval Evaluation eXchange
- Cao, C.¹ Li, M.²

19
- 76949096483
- Tracking melody in polyphonic audio. mirex 2008
- P. Cancela, "Tracking melody in polyphonic audio. mirex 2008," in Proc. Music Inf. Retrieval Evaluation eXchange, 2008.
- (2008) Proc. Music Inf. Retrieval Evaluation eXchange
- Cancela, P.¹

20
- 76949090165
- Melody extraction using harmonic matching
- V. Rao and P. Rao, "Melody extraction using harmonic matching," in Proc. Music Inf. Retrieval Evaluation eXchange, 2008.
- (2008) Proc. Music Inf. Retrieval Evaluation eXchange
- Rao, V.¹ Rao, P.²

21
- 84872697855
- Extraction of the melody pitch contour from polyphonic audio
- K. Dressler, "Extraction of the melody pitch contour from polyphonic audio," in Proc. Music Inf. Retrieval Evaluation eXchange, 2005.
- (2005) Proc. Music Inf. Retrieval Evaluation eXchange
- Dressler, K.¹

22
- 54049086684
- Accompaniment separation and karaoke application based on automatic melody transcription
- M. Ryynänen, T. Virtanen, J. Paulus, and A. Klapuri, "Accompaniment separation and karaoke application based on automatic melody transcription," in Proc. IEEE Int. Conf. Multimedia Expo, 2008, pp. 1417-1420.
- (2008) Proc. IEEE Int. Conf. Multimedia Expo , pp. 1417-1420
- Ryynänen, M.¹ Virtanen, T.² Paulus, J.³ Klapuri, A.⁴

23
- 84946031315
- Combining pitch-based inference and non-negative spectrogram factorization in separating vocals from polyphonic music
- Percept. Audition, Brisbane, Australia Sep.
- T. Virtanen, A. Mesaros, and M. Ryynänen, "Combining pitch-based inference and non-negative spectrogram factorization in separating vocals from polyphonic music," in ISCA Tutorial Res. Workshop Statist. Percept. Audition, Brisbane, Australia, Sep. 2008.
- (2008) ISCA Tutorial Res. Workshop Statist. Percept. Audition
- Virtanen, T.¹ Mesaros, A.² Ryynänen, M.³

24
- 70349466738
- An iterative approach to monaural musical mixture de-soloing
- J.-L. Durrieu, G. Richard, and B. David, "An iterative approach to monaural musical mixture de-soloing," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2009, pp. 105-108.
- (2009) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 105-108
- Durrieu, J.-L.¹ Richard, G.² David, B.³

25
- 33744975847
- Performance measurement in blind audio source separation
- DOI 10.1109/TSA.2005.858005
- E. Vincent, R. Gribonval, and C. Févotte, "Performance measurement in blind audio source separation," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.4, pp. 1462-1469, Jul. 2006. (Pubitemid 46547636)
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.4 , pp. 1462-1469
- Vincent, E.¹ Gribonval, R.² Fevotte, C.³

26
- 0025321354
- Analysis, synthesis, and perception of voice quality variations among female and male talkers
- D. Klatt and L. Klatt, "Analysis, synthesis, and perception of voice quality variations among female and male talkers," J. Acoust. Soc. Amer., vol.87, no.2, pp. 820-857, 1990.
- (1990) J. Acoust. Soc. Amer. , vol.87 , Issue.2 , pp. 820-857
- Klatt, D.¹ Klatt, L.²

27
- 4344592923
- Ph.D. dissertation, Université de Paris 6, Paris, France
- N. Henrich, "Etude de la source glottique en voix parlée et chantée," Ph.D. dissertation, Université de Paris 6, Paris, France, 2001.
- (2001) Etude de la Source Glottique en Voix Parlée et Chantée
- Henrich, N.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.