SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 20, Issue 4, 2012, Pages 1118-1133

A general flexible framework for the handling of prior information in audio source separation

(3) Ozerov, Alexey a Vincent, Emmanuel a Bimbot, Frédéric b

a INRIA (France)

b CAMPUS DE BEAULIEU (France)

Author keywords

Audio source separation; Expectation maximization; Local Gaussian model; Nonnegative matrix factorization

Indexed keywords

AUDIO SOURCE SEPARATION; EXPECTATION MAXIMIZATION; EXPECTATION-MAXIMIZATION ALGORITHMS; FLEXIBLE FRAMEWORK; LOCAL GAUSSIAN MODELING; NEW EFFICIENT METHOD; NONNEGATIVE MATRIX FACTORIZATION; SEPARATION PROBLEMS;

ALGORITHMS;

SOURCE SEPARATION;

EID: 84897584695 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2011.2172425 Document Type: Article

Times cited : (240)

References (57)

1
- 77955698250
- Probabilistic modeling paradigms for audio source separation
- Hershey, PA: IGI Global, ch. 7
- E. Vincent, M. Jafari, S. A. Abdallah, M. D. Plumbley, and M. E. Davies, "Probabilistic modeling paradigms for audio source separation," in Machine Audition: Principles, Algorithms and Systems. Hershey, PA: IGI Global, 2010, ch. 7, pp. 162-185.
- (2010) Machine Audition: Principles, Algorithms and Systems , pp. 162-185
- Vincent, E.¹ Jafari, M.² Abdallah, S.A.³ Plumbley, M.D.⁴ Davies, M.E.⁵

2
- 4644369641
- New EM algorithms for source separation and deconvolution
- H. Attias, "New EM algorithms for source separation and deconvolution," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'03), 2003, pp. 297-300.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'03), 2003 , pp. 297-300
- Attias, H.¹

3
- 84904366663
- Blind separation of speech mixtures based on nonstationarity
- D.-T. Pham, C. Servière, and H. Boumaraf, "Blind separation of speech mixtures based on nonstationarity," in Proc. 7th Int. Symp. Signal Process. and Its Applicat., 2003, pp. II-73-II-76.
- Proc. 7th Int. Symp. Signal Process. and Its Applicat., 2003
- Pham, D.-T.¹ Servière, C.² Boumaraf, H.³

4
- 30844456955
- Polyphonic transcription by non-negative sparse coding of power spectra
- S. A. Abdallah and M. D. Plumbley, "Polyphonic transcription by non-negative sparse coding of power spectra," in Proc. 5th Int. Symp. Music Inf. Retrieval (ISMIR'04), Oct. 2004, pp. 318-325.
- Proc. 5th Int. Symp. Music Inf. Retrieval (ISMIR'04), Oct. 2004 , pp. 318-325
- Abdallah, S.A.¹ Plumbley, M.D.²

5
- 33749066379
- Maximum likelihood approach for blind audio source separation using time-frequency Gaussian source models
- C. Févotte and J.-F. Cardoso, "Maximum likelihood approach for blind audio source separation using time-frequency Gaussian source models," in Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust. (WASPAA'05), Mohonk, NY, Oct. 2005, pp. 78-81.
- Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust. (WASPAA'05), Mohonk, NY, Oct. 2005 , pp. 78-81
- Févotte, C.¹ Cardoso, J.-F.²

6
- 33744968614
- Audio source separation with a single sensor
- DOI 10.1109/TSA.2005.854110
- L. Benaroya, F. Bimbot, and R. Gribonval, "Audio source separation with a single sensor," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 1, pp. 191-199, Jan. 2006. (Pubitemid 43863465)
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.1 , pp. 191-199
- Benaroya, L.¹ Bimbot, F.² Gribonval, R.³

7
- 51449094735
- Adaptation of Bayesian models for single-channel source separation and its application to voice/music separation in popular songs
- Jul.
- A. Ozerov, P. Philippe, F. Bimbot, and R. Gribonval, "Adaptation of Bayesian models for single-channel source separation and its application to voice/music separation in popular songs," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 5, pp. 1564-1578, Jul. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.5 , pp. 1564-1578
- Ozerov, A.¹ Philippe, P.² Bimbot, F.³ Gribonval, R.⁴

8
- 51449107582
- Evaluation of several strategies for single sensor speech/music separation
- R. Blouet, G. Rapaport, I. Cohen, and C. Févotte, "Evaluation of several strategies for single sensor speech/music separation," in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP'08), Las Vegas, NV, Apr. 2008, pp. 37-40.
- Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP'08), Las Vegas, NV, Apr. 2008 , pp. 37-40
- Blouet, R.¹ Rapaport, G.² Cohen, I.³ Févotte, C.⁴

9
- 63249085556
- Nonnegative matrix factorization with the Itakura-Saito divergence. With application to music analysis
- Mar.
- C. Févotte, N. Bertin, and J.-L. Durrieu, "Nonnegative matrix factorization with the Itakura-Saito divergence. With application to music analysis," Neural Comput., vol. 21, no. 3, pp. 793-830, Mar. 2009.
- (2009) Neural Comput. , vol.21 , Issue.3 , pp. 793-830
- Févotte, C.¹ Bertin, N.² Durrieu, J.-L.³

10
- 67149113245
- Underdetermined instantaneous audio source separation via local Gaussian modeling
- E. Vincent, S. Arberet, and R. Gribonval, "Underdetermined instantaneous audio source separation via local Gaussian modeling," in Proc. Int. Conf. Ind. Compon. Anal. Blind Source Separat. (ICA'09), 2009, pp. 775-782.
- Proc. Int. Conf. Ind. Compon. Anal. Blind Source Separat. (ICA'09), 2009 , pp. 775-782
- Vincent, E.¹ Arberet, S.² Gribonval, R.³

11
- 67149141481
- Blind spectral- GMM estimation for underdetermined instantaneous audio source separation
- S. Arberet, A. Ozerov, R. Gribonval, and F. Bimbot, "Blind spectral- GMM estimation for underdetermined instantaneous audio source separation," in Proc. Int. Conf. Ind. Compon. Anal. Blind Source Separat. (ICA'09), 2009, pp. 751-758.
- Proc. Int. Conf. Ind. Compon. Anal. Blind Source Separat. (ICA'09), 2009 , pp. 751-758
- Arberet, S.¹ Ozerov, A.² Gribonval, R.³ Bimbot, F.⁴

12
- 77950116181
- Factorial scaled hidden Markov model for polyphonic audio representation and source separation
- A. Ozerov, C. Févotte, and M. Charbit, "Factorial scaled hidden Markov model for polyphonic audio representation and source separation," in Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust. (WASPAA'09), Oct. 18-21, 2009, pp. 121-124.
- Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust. (WASPAA'09), Oct. 18-21, 2009 , pp. 121-124
- Ozerov, A.¹ Févotte, C.² Charbit, M.³

13
- 76949094445
- Multichannel nonnegative matrix factorization in convolutive mixtures for audio source separation
- Mar.
- A. Ozerov and C. Févotte, "Multichannel nonnegative matrix factorization in convolutive mixtures for audio source separation," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 550-563, Mar. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.3 , pp. 550-563
- Ozerov, A.¹ Févotte, C.²

14
- 76949108729
- Adaptive harmonic spectral decomposition for multiple pitch estimation
- Mar.
- E. Vincent, N. Bertin, and R. Badeau, "Adaptive harmonic spectral decomposition for multiple pitch estimation," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 528-537, Mar. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.3 , pp. 528-537
- Vincent, E.¹ Bertin, N.² Badeau, R.³

15
- 76949083547
- Enforcing harmonicity and smoothness in Bayesian non-negative matrix factorization applied to polyphonic music transcription
- Mar.
- N. Bertin, R. Badeau, and E. Vincent, "Enforcing harmonicity and smoothness in Bayesian non-negative matrix factorization applied to polyphonic music transcription," IEEE Trans. Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 538-549, Mar. 2010.
- (2010) IEEE Trans. Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.3 , pp. 538-549
- Bertin, N.¹ Badeau, R.² Vincent, E.³

16
- 76949096499
- Source/filter model for unsupervised main melody extraction from polyphonic audio signals
- Mar.
- J. L. Durrieu, G. Richard, B. David, and C. Févotte, "Source/filter model for unsupervised main melody extraction from polyphonic audio signals," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 564-575, Mar. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.3 , pp. 564-575
- Durrieu, J.L.¹ Richard, G.² David, B.³ Févotte, C.⁴

17
- 78650302778
- Nonnegative matrix factorization and spatial covariance model for under-determined reverberant audio source separation
- S. Arberet, A. Ozerov, N. Duong, E. Vincent, R. Gribonval, F. Bimbot, and P. Vandergheynst, "Nonnegative matrix factorization and spatial covariance model for under-determined reverberant audio source separation," in Proc. 10th Int. Conf. Inf. Sci., Signal Process. their Applicat. (ISSPA'10), 2010, pp. 1-4.
- Proc. 10th Int. Conf. Inf. Sci., Signal Process. Their Applicat. (ISSPA'10), 2010 , pp. 1-4
- Arberet, S.¹ Ozerov, A.² Duong, N.³ Vincent, E.⁴ Gribonval, R.⁵ Bimbot, F.⁶ Vandergheynst, P.⁷

18
- 78349253923
- Under-determined reverberant audio source separation using local observed covariance and auditory-motivated time-frequency representation
- N. Q. K. Duong, E. Vincent, and R. Gribonval, "Under-determined reverberant audio source separation using local observed covariance and auditory-motivated time-frequency representation," in Proc. 9th Int. Conf. Latent Variable Anal. Signal Separat. (LVA/ICA'10), Saint-Malo, France, Sep. 27-30, 2010, pp. 73-80.
- Proc. 9th Int. Conf. Latent Variable Anal. Signal Separat. (LVA/ICA'10), Saint-Malo, France, Sep. 27-30, 2010 , pp. 73-80
- Duong, N.Q.K.¹ Vincent, E.² Gribonval, R.³

19
- 77955675017
- Under-determined reverberant audio source separation using a full-rank spatial covariance model
- Sep.
- N. Q. K. Duong, E. Vincent, and R. Gribonval, "Under-determined reverberant audio source separation using a full-rank spatial covariance model," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 7, pp. 1830-1840, Sep. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.7 , pp. 1830-1840
- Duong, N.Q.K.¹ Vincent, E.² Gribonval, R.³

20
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- Methodological
- A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., ser. B, vol. 39, Methodological, pp. 1-38, 1977.
- (1977) J. R. Statist. Soc., Ser. B , vol.39 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

21
- 80051637530
- Multichannel non-negative tensor factorization with structured constraints for user-guided audio source separation
- A. Ozerov, C. Févotte, R. Blouet, and J.-L. Durrieu, "Multichannel non-negative tensor factorization with structured constraints for user-guided audio source separation," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'11), Prague, Czech Republic, May 2011, pp. 257-260.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'11), Prague, Czech Republic, May 2011 , pp. 257-260
- Ozerov, A.¹ Févotte, C.² Blouet, R.³ Durrieu, J.-L.⁴

22
- 57949113893
- Component separation with flexible models - Application to multichannel astrophysical observations
- Oct.
- J.-F. Cardoso, M. Le Jeune, J. Delabrouille, M. Betoule, and G. Patanchon, "Component separation with flexible models - Application to multichannel astrophysical observations," IEEE J. Sel. Topics Signal Process., vol. 2, no. 5, pp. 735-746, Oct. 2008.
- (2008) IEEE J. Sel. Topics Signal Process. , vol.2 , Issue.5 , pp. 735-746
- Cardoso, J.-F.¹ Le Jeune, M.² Delabrouille, J.³ Betoule, M.⁴ Patanchon, G.⁵

23
- 47649088496
- Extended nonnegative tensor factorisation models for musical sound source separation
- New York: Hindawi
- D. FitzGerald, M. Cranitch, and E. Coyle, "Extended nonnegative tensor factorisation models for musical sound source separation," in Computational Intelligence and Neuroscience. New York: Hindawi., 2008, vol. 2008.
- (2008) Computational Intelligence and Neuroscience , vol.2008
- FitzGerald, D.¹ Cranitch, M.² Coyle, E.³

24
- 78349269206
- A general modular framework for audio source separation
- A. Ozerov, E. Vincent, and F. Bimbot, "A general modular framework for audio source separation," in Proc. 9th Int. Conf. Latent Variable Anal. Signal Separat. (LVA/ICA'10), Saint-Malo, France, Sep. 27-30, 2010, pp. 33-40.
- Proc. 9th Int. Conf. Latent Variable Anal. Signal Separat. (LVA/ICA'10), Saint-Malo, France, Sep. 27-30, 2010 , pp. 33-40
- Ozerov, A.¹ Vincent, E.² Bimbot, F.³

25
- 84938332231
- Available
- A. Ozerov, E. Vincent, and F. Bimbot, Flexible Audio Source Separation Toolbox (FASST) [Online]. Available: http://bass-db.gforge.inria.fr/fasst/
- Flexible Audio Source Separation Toolbox (FASST) [Online]
- Ozerov, A.¹ Vincent, E.² Bimbot, F.³

26
- 85032751591
- Linear and quadratic time-frequency signal representations
- Apr.
- F. Hlawatsch and G. F. Boudreaux-Bartels, "Linear and quadratic time-frequency signal representations," IEEE Signal Process. Mag., vol. 9, no. 2, pp. 21-67, Apr. 1992.
- (1992) IEEE Signal Process. Mag. , vol.9 , Issue.2 , pp. 21-67
- Hlawatsch, F.¹ Boudreaux-Bartels, G.F.²

27
- 3142694930
- Blind separation of speech mixtures via time-frequency masking
- Jul.
- O. Yilmaz and S. Rickard, "Blind separation of speech mixtures via time-frequency masking," IEEE Trans. Signal Process., vol. 52, no. 7, pp. 1830-1847, Jul. 2004.
- (2004) IEEE Trans. Signal Process. , vol.52 , Issue.7 , pp. 1830-1847
- Yilmaz, O.¹ Rickard, S.²

28
- 40949145095
- Grouping separated frequency components by estimating propagation model parameters in frequency-domain blind source separation
- Jul.
- H. Sawada, S. Araki, R. Mukai, and S. Makino, "Grouping separated frequency components by estimating propagation model parameters in frequency-domain blind source separation," IEEE Trans. Trans. Audio, Speech, Lang. Process., vol. 15, no. 5, pp. 1592-1604, Jul. 2007.
- (2007) IEEE Trans. Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.5 , pp. 1592-1604
- Sawada, H.¹ Araki, S.² Mukai, R.³ Makino, S.⁴

29
- 78349244502
- The 2010 signal separation evaluation campaign (SiSEC2010): Audio source separation
- S. Araki, A. Ozerov, V. Gowreesunker, H. Sawada, F. Theis, G. Nolte, D. Lutter, and N. Duong, "The 2010 signal separation evaluation campaign (SiSEC2010): Audio source separation," in Proc. 9th Int. Conf. Latent Variable Anal. Signal Separat. (LVA/ICA'10), Saint-Malo, France, Sep. 2010, pp. 114-122.
- Proc. 9th Int. Conf. Latent Variable Anal. Signal Separat. (LVA/ICA'10), Saint-Malo, France, Sep. 2010 , pp. 114-122
- Araki, S.¹ Ozerov, A.² Gowreesunker, V.³ Sawada, H.⁴ Theis, F.⁵ Nolte, G.⁶ Lutter, D.⁷ Duong, N.⁸

30
- 67149088353
- The 2008 signal separation evaluation campaign: A community-based approach to large-scale evaluation
- E. Vincent, S. Araki, and P. Bofilld, "The 2008 signal separation evaluation campaign: A community-based approach to large-scale evaluation," in Proc. Int. Conf. Ind. Compon. Anal. Signal Separat. (ICA'09), 2009, pp. 734-741.
- Proc. Int. Conf. Ind. Compon. Anal. Signal Separat. (ICA'09), 2009 , pp. 734-741
- Vincent, E.¹ Araki, S.² Bofilld, P.³

31
- 0030676410
- Maximum likelihood for blind separation and deconvolution of noisy signals using mixture models
- E. Moulines, J.-F. Cardoso, and E. Gassiat, "Maximum likelihood for blind separation and deconvolution of noisy signals using mixture models," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'97), Apr. 1997, pp. 3617-3620.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'97), Apr. 1997 , pp. 3617-3620
- Moulines, E.¹ Cardoso, J.-F.² Gassiat, E.³

32
- 77957745677
- Blind separation and dereverberation of speech mixtures by joint optimization
- Jan.
- T. Yoshioka, T. Nakatani, M. Miyoshi, and H. Okuno, "Blind separation and dereverberation of speech mixtures by joint optimization," IEEE Trans. Trans. Audio, Speech, Lang. Process., vol. 19, no. 1, pp. 69-84, Jan. 2010.
- (2010) IEEE Trans. Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.1 , pp. 69-84
- Yoshioka, T.¹ Nakatani, T.² Miyoshi, M.³ Okuno, H.⁴

33
- 35048843291
- Non-negative matrix factor deconvolution; extraction of multiple sound sources from monophonic inputs
- P. Smaragdis, "Non-negative matrix factor deconvolution; extraction of multiple sound sources from monophonic inputs.," in Proc. 5th Int. Conf. Ind. Compon. Anal., Granada, Spain, Sep. 2004, pp. 494-499.
- Proc. 5th Int. Conf. Ind. Compon. Anal., Granada, Spain, Sep. 2004 , pp. 494-499
- Smaragdis, P.¹

34
- 50249152311
- Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria
- Mar.
- T. Virtanen, "Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 1066-1074, Mar. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.3 , pp. 1066-1074
- Virtanen, T.¹

35
- 34547508917
- Analysis of musical instrument sounds by source-filter decay model
- A. Klapuri, "Analysis of musical instrument sounds by source-filter decay model," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'07), 2007, vol. 1, pp. 53-56.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'07), 2007 , vol.1 , pp. 53-56
- Klapuri, A.¹

36
- 85132893595
- Independent vector analysis for convolutive blind speech separation
- New York: Springer
- I. Lee, T. Kim, and T.-W. Lee, "Independent vector analysis for convolutive blind speech separation," in Blind Speech Separation. New York: Springer, 2007, pp. 169-192.
- (2007) Blind Speech Separation , pp. 169-192
- Lee, I.¹ Kim, T.² Lee, T.-W.³

37
- 51449100115
- Efficient model-based speech separation and denoising using non-negative subspace analysis
- S. J. Rennie, J. R. Hershey, and P. A. Olsen, "Efficient model-based speech separation and denoising using non-negative subspace analysis," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'08), 2008, pp. 1833-1836.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'08), 2008 , pp. 1833-1836
- Rennie, S.J.¹ Hershey, J.R.² Olsen, P.A.³

38
- 67650927380
- Bayesian inference in non-negative matrix factorization models
- A. T. Cemgil, "Bayesian inference in non-negative matrix factorization models," Comput. Intell. Neurosci., no. Article ID 785152, 2009.
- (2009) Comput. Intell. Neurosci. , pp. 785152
- Cemgil, A.T.¹

39
- 0038705102
- One microphone source separation
- Cambridge, MA: MIT Press
- S. T. Roweis, "One microphone source separation," in Advances in Neural Information Processing Systems 13. Cambridge, MA: MIT Press, 2000, pp. 793-799.
- (2000) Advances in Neural Information Processing Systems 13 , pp. 793-799
- Roweis, S.T.¹

40
- 85008544097
- Model-based expectation-maximization source separation and localization
- Feb.
- M. I. Mandel, R. J. Weiss, and D. Ellis, "Model-based expectation-maximization source separation and localization," IEEE Trans. Trans. Audio, Speech, Lang. Process., vol. 18, no. 2, pp. 382-394, Feb. 2010.
- (2010) IEEE Trans. Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.2 , pp. 382-394
- Mandel, M.I.¹ Weiss, R.J.² Ellis, D.³

41
- 4344616431
- The three easy routes to independent component analysis; contrasts and geometry
- J.-F. Cardoso, "The three easy routes to independent component analysis; contrasts and geometry," in Proc. Int. Conf. Ind. Compon. Anal. Blind Source Separat. (ICA'01), San Diego, CA, Dec. 2001, pp. 1-6.
- Proc. Int. Conf. Ind. Compon. Anal. Blind Source Separat. (ICA'01), San Diego, CA, Dec. 2001 , pp. 1-6
- Cardoso, J.-F.¹

42
- 50249173884
- A multipitch analyzer based on harmonic temporal structured clustering
- Mar.
- H. Kameoka, T. Nishimoto, and S. Sagayama, "A multipitch analyzer based on harmonic temporal structured clustering," IEEE Trans. Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 982-994, Mar. 2007.
- (2007) IEEE Trans. Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.3 , pp. 982-994
- Kameoka, H.¹ Nishimoto, T.² Sagayama, S.³

43
- 79951625775
- NMF with time-frequency activations to model nonstationary audio events
- May
- R. Hennequin, R. Badeau, and B. David, "NMF with time-frequency activations to model nonstationary audio events," IEEE Trans. Trans. Audio, Speech, Lang. Process., vol. 19, no. 4, pp. 744-753, May 2011.
- (2011) IEEE Trans. Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.4 , pp. 744-753
- Hennequin, R.¹ Badeau, R.² David, B.³

44
- 33847601702
- Separation of singing and piano sounds
- Y. Meron and K. Hirose, "Separation of singing and piano sounds," in Proc. Int. Conf. Spoken Lang. Process., 1998.
- Proc. Int. Conf. Spoken Lang. Process., 1998
- Meron, Y.¹ Hirose, K.²

45
- 0024610919
- A tutorial on hidden Markov models and selected applications in speech recognition
- Feb.
- L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
- (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
- Rabiner, L.R.¹

46
- 84900510076
- Non-negative matrix factorization with sparseness constraints
- P. O. Hoyer, "Non-negative matrix factorization with sparseness constraints," J. Mach. Learn. Res., vol. 5, pp. 1457-1469, 2004.
- (2004) J. Mach. Learn. Res. , vol.5 , pp. 1457-1469
- Hoyer, P.O.¹

47
- 10944227316
- Sparse coding and NMF
- J. Eggert and E. Körner, "Sparse coding and NMF," in Proc. Int. Joint Conf. Neural Netw. (IJCNN'04), 2004, pp. 2529-2533.
- Proc. Int. Joint Conf. Neural Netw. (IJCNN'04), 2004 , pp. 2529-2533
- Eggert, J.¹ Körner, E.²

48
- 33744975847
- Performance measurement in blind audio source separation
- DOI 10.1109/TSA.2005.858005
- E. Vincent, R. Gribonval, and C. Fevotte, "Performance measurement in blind audio source separation," IEEE Trans. Trans. Audio, Speech, Lang. Process., vol. 14, no. 4, pp. 1462-1469, Jul. 2006. (Pubitemid 46547636)
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.4 , pp. 1462-1469
- Vincent, E.¹ Gribonval, R.² Fevotte, C.³

49
- 72949120419
- A robust method to count and locate audio sources in a multichannel underdetermined mixture
- Jan.
- S. Arberet, R. Gribonval, and F. Bimbot, "A robust method to count and locate audio sources in a multichannel underdetermined mixture," IEEE Trans. Signal Process., vol. 58, no. 1, pp. 121-133, Jan. 2010.
- (2010) IEEE Trans. Signal Process. , vol.58 , Issue.1 , pp. 121-133
- Arberet, S.¹ Gribonval, R.² Bimbot, F.³

50
- 80051609944
- Multi-source TDOA estimation using SNR-based angular spectra
- C. Blandin, E. Vincent, and A. Ozerov, "Multi-source TDOA estimation using SNR-based angular spectra," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'11), Prague, Czech Republic, May 2011, pp. 2616-2619.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'11), Prague, Czech Republic, May 2011 , pp. 2616-2619
- Blandin, C.¹ Vincent, E.² Ozerov, A.³

51
- 38149130641
- Complex nonconvex lp norm minimization for underdetermined source separation
- E. Vincent, "Complex nonconvex lp norm minimization for underdetermined source separation," in Proc. Int. Conf. Ind. Compon. Anal. Blind Source Separat. (ICA'07), 2007, pp. 430-437.
- Proc. Int. Conf. Ind. Compon. Anal. Blind Source Separat. (ICA'07), 2007 , pp. 430-437
- Vincent, E.¹

52
- 64849117714
- Transcription and separation of drum signals from polyphonic music
- Mar.
- O. Gillet and G. Richard, "Transcription and separation of drum signals from polyphonic music," IEEE Trans. Trans. Audio, Speech, Lang. Process., vol. 16, no. 3, pp. 529-540, Mar. 2008.
- (2008) IEEE Trans. Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.3 , pp. 529-540
- Gillet, O.¹ Richard, G.²

53
- 2442437071
- RWC music database: Music genre database and musical instrument sound databases
- [Online]. Available
- M. Goto, H. Hashiguchi, T. Nishimura, and R. Oka, "RWC music database: Music genre database and musical instrument sound databases," in Proc. 5th Int. Symp. Music Inf. Retrieval (ISMIR), 2004, pp. 229-230 [Online]. Available: http://staff.aist.go.jp/m.goto/RWC-MDB/
- Proc. 5th Int. Symp. Music Inf. Retrieval (ISMIR), 2004 , pp. 229-230
- Goto, M.¹ Hashiguchi, H.² Nishimura, T.³ Oka, R.⁴

54
- 84866037355
- Using the FASST source separation toolbox for noise robust speech recognition
- A. Ozerov and E. Vincent, "Using the FASST source separation toolbox for noise robust speech recognition," in Proc. Int. Workshop Mach. Listening in Multisource Environments (CHiME 2011), Florence, Italy, Sep. 2011, pp. 86-87.
- Proc. Int. Workshop Mach. Listening in Multisource Environments (CHiME 2011), Florence, Italy, Sep. 2011 , pp. 86-87
- Ozerov, A.¹ Vincent, E.²

55
- 69249151355
- Speech separation using speaker-adapted eigenvoice speech models
- R. Weiss and D. Ellis, "Speech separation using speaker-adapted eigenvoice speech models," Comput. Speech Lang., vol. 24, no. 1, pp. 16-29, 2010.
- (2010) Comput. Speech Lang. , vol.24 , Issue.1 , pp. 16-29
- Weiss, R.¹ Ellis, D.²

56
- 77950138969
- Multi-voice polyphonic music transcription using eigeninstruments
- G. Grindlay and D. Ellis, "Multi-voice polyphonic music transcription using eigeninstruments," in Proc. IEEEWorkshop Applicat. Signal Process. Audio Acoust. (WASPAA'09), 2009, pp. 53-56.
- Proc. IEEEWorkshop Applicat. Signal Process. Audio Acoust. (WASPAA'09), 2009 , pp. 53-56
- Grindlay, G.¹ Ellis, D.²

57
- 84897663010
- Single sensor source separation using multiple-window STFT representation
- L. Benaroya, R. Blouet, C. Févotte, and I. Cohen, "Single sensor source separation using multiple-window STFT representation," in Proc. Int. Workshop Acoust. Echo Noise Control (IWAENC'06), Paris, France, Sep. 12-14, 2006.
- Proc. Int. Workshop Acoust. Echo Noise Control (IWAENC'06), Paris, France, Sep. 12-14, 2006
- Benaroya, L.¹ Blouet, R.² Févotte, C.³ Cohen, I.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.