SCOPUS Information Retrieval Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 15, Issue 6, 2007, Pages 1766-1776

Soft mask methods for single-channel speaker separation

(2) Reddy, A.M. (a); Raj, B. (b)

a MCGILL UNIVERSITY (Canada)

b MITSUBISHI ELECTRIC RESEARCH LABORATORIES (United States)

Author keywords

Signal separation; Soft masks; Speaker separation

Indexed keywords

ACOUSTIC SIGNALS; BINARY MASKS; CURRENT TECHNIQUES; FREQUENCY COMPONENTS; HARD MASKS; MASK METHODS; MIXED SIGNALS; NATIVE FORMS; RELIABLE COMPONENTS; SIGNAL SEPARATION; SINGLE CHANNELS; SOFT MASKS; SPEAKER SEPARATION; SPECTRAL COMPONENTS; SPECTROGRAM; SPEECH SIGNALS; SUBBANDS;

APPROXIMATION ALGORITHMS; SEPARATION;

SIGNAL PROCESSING;

EID: 56249144712 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2007.901310 Document Type: Article

Times cited : (121)

References (28)

1
- 0004079608
- Cambridge, MA: MIT Press
- S. Handel, Listening. Cambridge, MA: MIT Press, 1989.
- (1989) Listening
- Handel, S.¹

2
- 80052339383
- Some experiments on the recognition of speech, with one and two ears
- E. C. Cherry, "Some experiments on the recognition of speech, with one and two ears," J. Acoust. Soc. Amer., vol. 25, pp. 975-979, 1953.
- (1953) J. Acoust. Soc. Amer , vol.25 , pp. 975-979
- Cherry, E.C.¹

3
- 0003479143
- Modeling auditory processing and organization
- Ph.D. dissertation, Dept Comput. Sci, Univ. Sheffield, Sheffield, U.K
- M. P. Cooke, "Modeling auditory processing and organization," Ph.D. dissertation, Dept Comput. Sci., Univ. Sheffield, Sheffield, U.K., 1991.
- (1991)
- Cooke, M.P.¹

4
- 0003684441
- Cambridge, MA: MIT Press
- A. S. Bregman, Auditory Scene Analysis: The Perceptual Organization of Sound. Cambridge, MA: MIT Press, 1990.
- (1990) Auditory Scene Analysis: The Perceptual Organization of Sound
- Bregman, A.S.¹

5
- 0003982501
- A Theory and computational model of auditory monaural sound separation
- Ph.D. dissertation, Elect. Eng. Dept, Stanford Univ, Stanford, CA
- M. Weintraub, "A Theory and computational model of auditory monaural sound separation," Ph.D. dissertation, Elect. Eng. Dept., Stanford Univ., Stanford, CA, 1985.
- (1985)
- Weintraub, M.¹

6
- 0017004953
- Separation of speech from interfering speech by means of harmonic selection
- T. W. Parsons, "Separation of speech from interfering speech by means of harmonic selection," J. Acoust. Soc. Amer., vol. 60, no. 4, pp. 911-918, 1976.
- (1976) J. Acoust. Soc. Amer , vol.60 , Issue.4 , pp. 911-918
- Parsons, T.W.¹

7
- 0028531926
- Computational auditory scene analysis
- Oct
- G. J. Brown and M. Cooke, "Computational auditory scene analysis," Comput. Speech Lang., vol. 8, pp. 297-336, Oct. 1994.
- (1994) Comput. Speech Lang , vol.8 , pp. 297-336
- Brown, G.J.¹ Cooke, M.²

8
- 0032682770
- Separation of speech from interfering sounds based on oscillatory correlation
- May
- D. L. Wang and G. J. Brown, "Separation of speech from interfering sounds based on oscillatory correlation," IEEE Trans. Neural Netw., vol. 10, pp. 684-697, May 1999.
- (1999) IEEE Trans. Neural Netw , vol.10 , pp. 684-697
- Wang, D.L.¹ Brown, G.J.²

9
- 0004251557
- Event formation and separation of musical sound
- Ph.D. dissertation, Ctr. Comput. Res. Music Acoust, Dept. Music, Stanford Univ, Stanford, CA
- D. K. Mellinger, "Event formation and separation of musical sound," Ph.D. dissertation, Ctr. Comput. Res. Music Acoust., Dept. Music, Stanford Univ., Stanford, CA, 1991.
- (1991)
- Mellinger, D.K.¹

10
- 77950189256
- A perceptual representation of sound for auditory signal separation
- presented at the 123rd Meeting Acoust. Soc. Amer., Salt Lake City, UT, unpublished
- D. P. W. Ellis and B. L. Vercoe, "A perceptual representation of sound for auditory signal separation," presented at the 123rd Meeting Acoust. Soc. Amer., Salt Lake City, UT, 1992, unpublished.
- (1992) 123rd Meeting Acoust. Soc. Amer
- Ellis, D.P.W.¹ Vercoe, B.L.²

11
- 0003383455
- A computer implementation of psychoacoustic rules
- Jerusalem, Israel
- D. P. W. Ellis, "A computer implementation of psychoacoustic rules," in Proc. 12th Int. Conf. Pattern Recognition, Jerusalem, Israel, 1994.
- (1994) Proc. 12th Int. Conf. Pattern Recognition
- Ellis, D.P.W.¹

12
- 33749051687
- Blind one-microphone speech separation: A spectral learning approach
- New York
- F. R. Bach and M. Jordan, "Blind one-microphone speech separation: A spectral learning approach," in Proc. Adv. Neural Inf. Process. Syst. (NIPS), New York, 2004, vol. 17.
- (2004) Proc. Adv. Neural Inf. Process. Syst. (NIPS) , vol.17
- Bach, F.R.¹ Jordan, M.²

13
- 84898946024
- One microphone source separation
- S. T. Roweis, "One microphone source separation," Adv. Neural Inf. Process. Syst., vol. 13, pp. 793-799, 2001.
- (2001) Adv. Neural Inf. Process. Syst , vol.13 , pp. 793-799
- Roweis, S.T.¹

14
- 0025681008
- Hidden Markov model decomposition of speech and noise
- A. Varga and R. Moore, "Hidden Markov model decomposition of speech and noise," in Proc. IEEE ICASSP, 1990, pp. 845-848.
- (1990) Proc. IEEE ICASSP , pp. 845-848
- Varga, A.¹ Moore, R.²

15
- 0027622731
- Cepstral parameter compensation for HMM recognition in noise
- Jul
- M. J. F. Gales and S. J. Young, "Cepstral parameter compensation for HMM recognition in noise," Speech Commun., vol. 12, pp. 231-239, Jul. 1993.
- (1993) Speech Commun , vol.12 , pp. 231-239
- Gales, M.J.F.¹ Young, S.J.²

16
- 4544247508
- Multiband audio modeling for single-channel acoustic source separation
- Montreal, QC, Canada
- M. J. Reyes-Gomez, D. P. W. Ellis, and N. Jojic, "Multiband audio modeling for single-channel acoustic source separation," in Proc. IEEE ICASSP, Montreal, QC, Canada, 2004, pp. 641-644.
- (2004) Proc. IEEE ICASSP , pp. 641-644
- Reyes-Gomez, M.J.¹ Ellis, D.P.W.² Jojic, N.³

17
- 64549161708
- Fast monaural separation of speech
- Copenhagen, Denmark
- N. H. Pontoppidan and M. Dyrholm, "Fast monaural separation of speech," in Proc. AES 23rd Int. Conf., Copenhagen, Denmark, 2003.
- (2003) Proc. AES 23rd Int. Conf
- Pontoppidan, N.H.¹ Dyrholm, M.²

18
- 18744390181
- From missing data to maybe useful data: Soft data modelling for noise robust ASR
- Stratford-Upon-Avon, Apr
- A. C. Morris, J. Barker, and H. Bourlard, "From missing data to maybe useful data: Soft data modelling for noise robust ASR," in Proc. Workshop Upon Innovation in Speech Processing, Stratford-Upon-Avon, Apr. 2001, pp. 153-164.
- (2001) Proc. Workshop Upon Innovation in Speech Processing , pp. 153-164
- Morris, A.C.¹ Barker, J.² Bourlard, H.³

19
- 85009074940
- A minimum mean squared error estimator for single channel speaker separation
- A. M. Reddy and B. Raj, "A minimum mean squared error estimator for single channel speaker separation," in Interspeech, 2004, pp. 2445-2448.
- (2004) Interspeech , pp. 2445-2448
- Reddy, A.M.¹ Raj, B.²

20
- 84947980404
- Soft mask estimation for single channel speaker separation
- A. M. Reddy and B. Raj, "Soft mask estimation for single channel speaker separation," in Proc. ISCA Tutorial Res. Workshop Statist. Percept. Audio Process., 2004.
- (2004) Proc. ISCA Tutorial Res. Workshop Statist. Percept. Audio Process
- Reddy, A.M.¹ Raj, B.²

21
- 85009230793
- Factorial models and re-filtering for speech separation and denoising
- S. T. Roweis, "Factorial models and re-filtering for speech separation and denoising," in Eurospeech, 2003, vol. 7, no. 6, pp. 1009-1012.
- (2003) Eurospeech , vol.7 , Issue.6 , pp. 1009-1012
- Roweis, S.T.¹

22
- 8344232372
- A maximum likelihood approach to single-channel source separation
- G.-J. Jang and T.-W. Lee, "A maximum likelihood approach to single-channel source separation," J. Mach. Learn. Res., vol. 4, pp. 1365-1392, 2003.
- (2003) J. Mach. Learn. Res , vol.4 , pp. 1365-1392
- Jang, G.-J.¹ Lee, T.-W.²

23
- 84899014722
- A probabilistic approach to single channel blind signal separation
- Cambridge, MA: MIT Press
- G.-J. Jang and T.-W. Lee, "A probabilistic approach to single channel blind signal separation," in NIPS. Cambridge, MA: MIT Press, 2003, vol. 15.
- (2003) NIPS , vol.15
- Jang, G.-J.¹ Lee, T.-W.²

24
- 35048843291
- Non-negative matrix factor deconvolution; Extraction of multiple sound sources from monophonic inputs
- Sep
- P. Smaragdis, "Non-negative matrix factor deconvolution; Extraction of multiple sound sources from monophonic inputs," in Proc. Int. Congr. Ind. Compon. Anal. Blind Signal Separation, Sep. 2004, vol. 3195/2004, pp. 494-499.
- (2004) Proc. Int. Congr. Ind. Compon. Anal. Blind Signal Separation , vol.3195 , Issue.2004 , pp. 494-499
- Smaragdis, P.¹

25
- 50249152311
- Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria
- Mar
- T. Virtanen, "Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 1066-1074, Mar. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.3 , pp. 1066-1074
- Virtanen, T.¹

26
- 34948876301
- Audio-visual sound separation via hidden Markov models
- J. Hershey and M. Casey, "Audio-visual sound separation via hidden Markov models," Proc. Neural Inf. Process. Syst., pp. 1173-1180, 2001.
- (2001) Proc. Neural Inf. Process. Syst , pp. 1173-1180
- Hershey, J.¹ Casey, M.²

27
- 0000935895
- An introduction to variational methods for graphical models
- M. I. Jordan, Ed. Norwell, MA: Kluwer, To appear
- M. Jordan, Z. Ghahramani, S. T. Jaakkola, and L. K. Saul, "An introduction to variational methods for graphical models," in Learning in Graphical Models, M. I. Jordan, Ed. Norwell, MA: Kluwer, To appear.
- Learning in Graphical Models
- Jordan, M.¹ Ghahramani, Z.² Jaakkola, S.T.³ Saul, L.K.⁴

28
- 0031268341
- Factorial hidden Markov models
- Z. Ghahramani and M. Jordan, "Factorial hidden Markov models," Mach. Learn., vol. 29, 1997.
- (1997) Mach. Learn , vol.29
- Ghahramani, Z.¹ Jordan, M.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.