Volume 18, Issue 3, 2010, Pages 564-575

Source/filter model for unsupervised main melody extraction from polyphonic audio signals

Author keywords

Blind audio source separation; Expectation-maximization (EM) algorithm; Gaussian scaled mixture model (GSMM); Main melody extraction; Maximum likelihood; Music; Non-negative matrix factorization (NMF); Source/filter model; Spectral analysis

Indexed keywords

AUDIO SOURCE SEPARATION; BLIND AUDIO SOURCE SEPARATION; GAUSSIANS; MAIN MELODY EXTRACTION; MIXTURE MODEL; NON-NEGATIVE MATRIX FACTORIZATION (NMF); NONNEGATIVE MATRIX FACTORIZATION; SPECTRAL ANALYSIS

EID: 76949096499; PISSN: 1558-7916; EISSN: None; Source Type: Journal
DOI: 10.1109/TASL.2010.2041114; Document Type: Article
Times cited: 142

References (27)
  • 1
    • 51449109542
    • Query by humming of MIDI and audio using locality sensitive hashing
    • M. Ryynänen and A. Klapuri, "Query by humming of MIDI and audio using locality sensitive hashing," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Las Vegas, NV, Apr. 2008, pp. 2249-2252.
    • (2008) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., pp. 2249-2252
    • Ryynänen, M.; Klapuri, A.
  • 2
    • 84873584066
    • Sequence representation of music structure using higher-order similarity matrix and maximum-likelihood approach
    • G. Peeters, "Sequence representation of music structure using higher-order similarity matrix and maximum-likelihood approach," in Proc. Int. Conf. Music Inf. Retrieval, 2007.
    • (2007) Proc. Int. Conf. Music Inf. Retrieval
    • Peeters, G.
  • 3
    • 70350074065
    • Chroma binary similarity and local alignment applied to cover song identification
    • J. Serra, E. Gomez, P. Herrera, and X. Serra, "Chroma binary similarity and local alignment applied to cover song identification," IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 6, pp. 1138-1151, Aug. 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 6, pp. 1138-1151
    • Serra, J.; Gomez, E.; Herrera, P.; Serra, X.
  • 4
    • 0033677009
    • Robust predominant-F0 estimation method for real-time detection of melody and bass lines in CD recordings
    • M. Goto, "Robust predominant-F0 estimation method for real-time detection of melody and bass lines in CD recordings," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2000, vol. 2, pp. 757-760.
    • (2000) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., vol. 2, pp. 757-760
    • Goto, M.
  • 5
    • 64649090924
    • Melody detection in polyphonic audio
    • R. Paiva, "Melody detection in polyphonic audio," Ph.D. dissertation, Univ. of Coimbra, Coimbra, Portugal, 2007.
    • (2007) Melody Detection in Polyphonic Audio
    • Paiva, R.
  • 9
    • 63249085556
    • Nonnegative matrix factorization with the Itakura-Saito divergence: With application to music analysis
    • C. Févotte, N. Bertin, and J.-L. Durrieu, "Nonnegative matrix factorization with the Itakura-Saito divergence: With application to music analysis," Neural Comput., vol. 21, no. 3, pp. 793-830, Mar. 2009.
    • (2009) Neural Comput., vol. 21, no. 3, pp. 793-830
    • Févotte, C.; Bertin, N.; Durrieu, J.-L.
  • 11
    • 51449094735
    • Adaptation of Bayesian models for single-channel source separation and its application to voice/music separation in popular songs
    • A. Ozerov, P. Philippe, F. Bimbot, and R. Gribonval, "Adaptation of Bayesian models for single-channel source separation and its application to voice/music separation in popular songs," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 5, pp. 1564-1578, Jul. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 5, pp. 1564-1578
    • Ozerov, A.; Philippe, P.; Bimbot, F.; Gribonval, R.
  • 14
    • 0001093042
    • Algorithms for non-negative matrix factorization
    • D. D. Lee and H. S. Seung, "Algorithms for non-negative matrix factorization," in Proc. Neural Inf. Process. Syst., 2000, pp. 556-562.
    • (2000) Proc. Neural Inf. Process. Syst., pp. 556-562
    • Lee, D.D.; Seung, H.S.
  • 15
    • 50249152311
    • Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria
    • T. Virtanen, "Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 1066-1074, Mar. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 1066-1074
    • Virtanen, T.
  • 22
    • 54049086684
    • Accompaniment separation and karaoke application based on automatic melody transcription
    • M. Ryynänen, T. Virtanen, J. Paulus, and A. Klapuri, "Accompaniment separation and karaoke application based on automatic melody transcription," in Proc. IEEE Int. Conf. Multimedia Expo, 2008, pp. 1417-1420.
    • (2008) Proc. IEEE Int. Conf. Multimedia Expo, pp. 1417-1420
    • Ryynänen, M.; Virtanen, T.; Paulus, J.; Klapuri, A.
  • 23
    • 84946031315
    • Combining pitch-based inference and non-negative spectrogram factorization in separating vocals from polyphonic music
    • T. Virtanen, A. Mesaros, and M. Ryynänen, "Combining pitch-based inference and non-negative spectrogram factorization in separating vocals from polyphonic music," in ISCA Tutorial Res. Workshop Statist. Percept. Audition, Brisbane, Australia, Sep. 2008.
    • (2008) ISCA Tutorial Res. Workshop Statist. Percept. Audition
    • Virtanen, T.; Mesaros, A.; Ryynänen, M.
  • 26
    • 0025321354
    • Analysis, synthesis, and perception of voice quality variations among female and male talkers
    • D. Klatt and L. Klatt, "Analysis, synthesis, and perception of voice quality variations among female and male talkers," J. Acoust. Soc. Amer., vol. 87, no. 2, pp. 820-857, 1990.
    • (1990) J. Acoust. Soc. Amer., vol. 87, no. 2, pp. 820-857
    • Klatt, D.; Klatt, L.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.