메뉴 건너뛰기




Volumn 20, Issue 4, 2012, Pages 1118-1133

A general flexible framework for the handling of prior information in audio source separation

Author keywords

Audio source separation; Expectation maximization; Local Gaussian model; Nonnegative matrix factorization

Indexed keywords

AUDIO SOURCE SEPARATION; EXPECTATION MAXIMIZATION; EXPECTATION-MAXIMIZATION ALGORITHMS; FLEXIBLE FRAMEWORK; LOCAL GAUSSIAN MODELING; NEW EFFICIENT METHOD; NONNEGATIVE MATRIX FACTORIZATION; SEPARATION PROBLEMS;

EID: 84897584695     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2011.2172425     Document Type: Article
Times cited : (238)

References (57)
  • 7
    • 51449094735 scopus 로고    scopus 로고
    • Adaptation of Bayesian models for single-channel source separation and its application to voice/music separation in popular songs
    • Jul.
    • A. Ozerov, P. Philippe, F. Bimbot, and R. Gribonval, "Adaptation of Bayesian models for single-channel source separation and its application to voice/music separation in popular songs," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 5, pp. 1564-1578, Jul. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.5 , pp. 1564-1578
    • Ozerov, A.1    Philippe, P.2    Bimbot, F.3    Gribonval, R.4
  • 9
    • 63249085556 scopus 로고    scopus 로고
    • Nonnegative matrix factorization with the Itakura-Saito divergence. With application to music analysis
    • Mar.
    • C. Févotte, N. Bertin, and J.-L. Durrieu, "Nonnegative matrix factorization with the Itakura-Saito divergence. With application to music analysis," Neural Comput., vol. 21, no. 3, pp. 793-830, Mar. 2009.
    • (2009) Neural Comput. , vol.21 , Issue.3 , pp. 793-830
    • Févotte, C.1    Bertin, N.2    Durrieu, J.-L.3
  • 13
    • 76949094445 scopus 로고    scopus 로고
    • Multichannel nonnegative matrix factorization in convolutive mixtures for audio source separation
    • Mar.
    • A. Ozerov and C. Févotte, "Multichannel nonnegative matrix factorization in convolutive mixtures for audio source separation," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 550-563, Mar. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.3 , pp. 550-563
    • Ozerov, A.1    Févotte, C.2
  • 14
    • 76949108729 scopus 로고    scopus 로고
    • Adaptive harmonic spectral decomposition for multiple pitch estimation
    • Mar.
    • E. Vincent, N. Bertin, and R. Badeau, "Adaptive harmonic spectral decomposition for multiple pitch estimation," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 528-537, Mar. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.3 , pp. 528-537
    • Vincent, E.1    Bertin, N.2    Badeau, R.3
  • 15
    • 76949083547 scopus 로고    scopus 로고
    • Enforcing harmonicity and smoothness in Bayesian non-negative matrix factorization applied to polyphonic music transcription
    • Mar.
    • N. Bertin, R. Badeau, and E. Vincent, "Enforcing harmonicity and smoothness in Bayesian non-negative matrix factorization applied to polyphonic music transcription," IEEE Trans. Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 538-549, Mar. 2010.
    • (2010) IEEE Trans. Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.3 , pp. 538-549
    • Bertin, N.1    Badeau, R.2    Vincent, E.3
  • 16
    • 76949096499 scopus 로고    scopus 로고
    • Source/filter model for unsupervised main melody extraction from polyphonic audio signals
    • Mar.
    • J. L. Durrieu, G. Richard, B. David, and C. Févotte, "Source/filter model for unsupervised main melody extraction from polyphonic audio signals," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 564-575, Mar. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.3 , pp. 564-575
    • Durrieu, J.L.1    Richard, G.2    David, B.3    Févotte, C.4
  • 19
    • 77955675017 scopus 로고    scopus 로고
    • Under-determined reverberant audio source separation using a full-rank spatial covariance model
    • Sep.
    • N. Q. K. Duong, E. Vincent, and R. Gribonval, "Under-determined reverberant audio source separation using a full-rank spatial covariance model," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 7, pp. 1830-1840, Sep. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.7 , pp. 1830-1840
    • Duong, N.Q.K.1    Vincent, E.2    Gribonval, R.3
  • 20
    • 0002629270 scopus 로고
    • Maximum likelihood from incomplete data via the EM algorithm
    • Methodological
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., ser. B, vol. 39, Methodological, pp. 1-38, 1977.
    • (1977) J. R. Statist. Soc., Ser. B , vol.39 , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 22
    • 57949113893 scopus 로고    scopus 로고
    • Component separation with flexible models - Application to multichannel astrophysical observations
    • Oct.
    • J.-F. Cardoso, M. Le Jeune, J. Delabrouille, M. Betoule, and G. Patanchon, "Component separation with flexible models - Application to multichannel astrophysical observations," IEEE J. Sel. Topics Signal Process., vol. 2, no. 5, pp. 735-746, Oct. 2008.
    • (2008) IEEE J. Sel. Topics Signal Process. , vol.2 , Issue.5 , pp. 735-746
    • Cardoso, J.-F.1    Le Jeune, M.2    Delabrouille, J.3    Betoule, M.4    Patanchon, G.5
  • 23
    • 47649088496 scopus 로고    scopus 로고
    • Extended nonnegative tensor factorisation models for musical sound source separation
    • New York: Hindawi
    • D. FitzGerald, M. Cranitch, and E. Coyle, "Extended nonnegative tensor factorisation models for musical sound source separation," in Computational Intelligence and Neuroscience. New York: Hindawi., 2008, vol. 2008.
    • (2008) Computational Intelligence and Neuroscience , vol.2008
    • FitzGerald, D.1    Cranitch, M.2    Coyle, E.3
  • 26
    • 85032751591 scopus 로고
    • Linear and quadratic time-frequency signal representations
    • Apr.
    • F. Hlawatsch and G. F. Boudreaux-Bartels, "Linear and quadratic time-frequency signal representations," IEEE Signal Process. Mag., vol. 9, no. 2, pp. 21-67, Apr. 1992.
    • (1992) IEEE Signal Process. Mag. , vol.9 , Issue.2 , pp. 21-67
    • Hlawatsch, F.1    Boudreaux-Bartels, G.F.2
  • 27
    • 3142694930 scopus 로고    scopus 로고
    • Blind separation of speech mixtures via time-frequency masking
    • Jul.
    • O. Yilmaz and S. Rickard, "Blind separation of speech mixtures via time-frequency masking," IEEE Trans. Signal Process., vol. 52, no. 7, pp. 1830-1847, Jul. 2004.
    • (2004) IEEE Trans. Signal Process. , vol.52 , Issue.7 , pp. 1830-1847
    • Yilmaz, O.1    Rickard, S.2
  • 28
    • 40949145095 scopus 로고    scopus 로고
    • Grouping separated frequency components by estimating propagation model parameters in frequency-domain blind source separation
    • Jul.
    • H. Sawada, S. Araki, R. Mukai, and S. Makino, "Grouping separated frequency components by estimating propagation model parameters in frequency-domain blind source separation," IEEE Trans. Trans. Audio, Speech, Lang. Process., vol. 15, no. 5, pp. 1592-1604, Jul. 2007.
    • (2007) IEEE Trans. Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.5 , pp. 1592-1604
    • Sawada, H.1    Araki, S.2    Mukai, R.3    Makino, S.4
  • 33
    • 35048843291 scopus 로고    scopus 로고
    • Non-negative matrix factor deconvolution; extraction of multiple sound sources from monophonic inputs
    • P. Smaragdis, "Non-negative matrix factor deconvolution; extraction of multiple sound sources from monophonic inputs.," in Proc. 5th Int. Conf. Ind. Compon. Anal., Granada, Spain, Sep. 2004, pp. 494-499.
    • Proc. 5th Int. Conf. Ind. Compon. Anal., Granada, Spain, Sep. 2004 , pp. 494-499
    • Smaragdis, P.1
  • 34
    • 50249152311 scopus 로고    scopus 로고
    • Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria
    • Mar.
    • T. Virtanen, "Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 1066-1074, Mar. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.3 , pp. 1066-1074
    • Virtanen, T.1
  • 36
    • 85132893595 scopus 로고    scopus 로고
    • Independent vector analysis for convolutive blind speech separation
    • New York: Springer
    • I. Lee, T. Kim, and T.-W. Lee, "Independent vector analysis for convolutive blind speech separation," in Blind Speech Separation. New York: Springer, 2007, pp. 169-192.
    • (2007) Blind Speech Separation , pp. 169-192
    • Lee, I.1    Kim, T.2    Lee, T.-W.3
  • 38
    • 67650927380 scopus 로고    scopus 로고
    • Bayesian inference in non-negative matrix factorization models
    • A. T. Cemgil, "Bayesian inference in non-negative matrix factorization models," Comput. Intell. Neurosci., no. Article ID 785152, 2009.
    • (2009) Comput. Intell. Neurosci. , pp. 785152
    • Cemgil, A.T.1
  • 40
    • 85008544097 scopus 로고    scopus 로고
    • Model-based expectation-maximization source separation and localization
    • Feb.
    • M. I. Mandel, R. J. Weiss, and D. Ellis, "Model-based expectation-maximization source separation and localization," IEEE Trans. Trans. Audio, Speech, Lang. Process., vol. 18, no. 2, pp. 382-394, Feb. 2010.
    • (2010) IEEE Trans. Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.2 , pp. 382-394
    • Mandel, M.I.1    Weiss, R.J.2    Ellis, D.3
  • 42
  • 43
    • 79951625775 scopus 로고    scopus 로고
    • NMF with time-frequency activations to model nonstationary audio events
    • May
    • R. Hennequin, R. Badeau, and B. David, "NMF with time-frequency activations to model nonstationary audio events," IEEE Trans. Trans. Audio, Speech, Lang. Process., vol. 19, no. 4, pp. 744-753, May 2011.
    • (2011) IEEE Trans. Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.4 , pp. 744-753
    • Hennequin, R.1    Badeau, R.2    David, B.3
  • 45
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Feb.
    • L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 46
    • 84900510076 scopus 로고    scopus 로고
    • Non-negative matrix factorization with sparseness constraints
    • P. O. Hoyer, "Non-negative matrix factorization with sparseness constraints," J. Mach. Learn. Res., vol. 5, pp. 1457-1469, 2004.
    • (2004) J. Mach. Learn. Res. , vol.5 , pp. 1457-1469
    • Hoyer, P.O.1
  • 49
    • 72949120419 scopus 로고    scopus 로고
    • A robust method to count and locate audio sources in a multichannel underdetermined mixture
    • Jan.
    • S. Arberet, R. Gribonval, and F. Bimbot, "A robust method to count and locate audio sources in a multichannel underdetermined mixture," IEEE Trans. Signal Process., vol. 58, no. 1, pp. 121-133, Jan. 2010.
    • (2010) IEEE Trans. Signal Process. , vol.58 , Issue.1 , pp. 121-133
    • Arberet, S.1    Gribonval, R.2    Bimbot, F.3
  • 52
    • 64849117714 scopus 로고    scopus 로고
    • Transcription and separation of drum signals from polyphonic music
    • Mar.
    • O. Gillet and G. Richard, "Transcription and separation of drum signals from polyphonic music," IEEE Trans. Trans. Audio, Speech, Lang. Process., vol. 16, no. 3, pp. 529-540, Mar. 2008.
    • (2008) IEEE Trans. Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.3 , pp. 529-540
    • Gillet, O.1    Richard, G.2
  • 53
    • 2442437071 scopus 로고    scopus 로고
    • RWC music database: Music genre database and musical instrument sound databases
    • [Online]. Available
    • M. Goto, H. Hashiguchi, T. Nishimura, and R. Oka, "RWC music database: Music genre database and musical instrument sound databases," in Proc. 5th Int. Symp. Music Inf. Retrieval (ISMIR), 2004, pp. 229-230 [Online]. Available: http://staff.aist.go.jp/m.goto/RWC-MDB/
    • Proc. 5th Int. Symp. Music Inf. Retrieval (ISMIR), 2004 , pp. 229-230
    • Goto, M.1    Hashiguchi, H.2    Nishimura, T.3    Oka, R.4
  • 55
    • 69249151355 scopus 로고    scopus 로고
    • Speech separation using speaker-adapted eigenvoice speech models
    • R. Weiss and D. Ellis, "Speech separation using speaker-adapted eigenvoice speech models," Comput. Speech Lang., vol. 24, no. 1, pp. 16-29, 2010.
    • (2010) Comput. Speech Lang. , vol.24 , Issue.1 , pp. 16-29
    • Weiss, R.1    Ellis, D.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.