메뉴 건너뛰기




Volumn 15, Issue 6, 2007, Pages 1766-1776

Soft mask methods for single-channel speaker separation

Author keywords

Signal separation; Soft masks; Speaker separation

Indexed keywords

ACOUSTIC SIGNALS; BINARY MASKS; CURRENT TECHNIQUES; FREQUENCY COMPONENTS; HARD MASKS; MASK METHODS; MIXED SIGNALS; NATIVE FORMS; RELIABLE COMPONENTS; SIGNAL SEPARATION; SINGLE CHANNELS; SOFT MASKS; SPEAKER SEPARATION; SPECTRAL COMPONENTS; SPECTROGRAM; SPEECH SIGNALS; SUBBANDS;

EID: 56249144712     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2007.901310     Document Type: Article
Times cited : (121)

References (28)
  • 1
    • 0004079608 scopus 로고
    • Cambridge, MA, MIT Press
    • S. Handel, Listening. Cambridge, MA.: MIT Press, 1989.
    • (1989) Listening
    • Handel, S.1
  • 2
    • 80052339383 scopus 로고
    • Some experiments on the recognition of speech, with one and two ears
    • E. C. Cherry, "Some experiments on the recognition of speech, with one and two ears," J. Acoust. Soc. Amer., vol. 25, pp. 975-979, 1953.
    • (1953) J. Acoust. Soc. Amer , vol.25 , pp. 975-979
    • Cherry, E.C.1
  • 3
    • 0003479143 scopus 로고
    • Modeling auditory processing and organization,
    • Ph.D. dissertation, Dept Comput. Sci, Univ. Sheffield, Sheffield, U.K
    • M. P. Cooke, "Modeling auditory processing and organization," Ph.D. dissertation, Dept Comput. Sci., Univ. Sheffield, Sheffield, U.K., 1991.
    • (1991)
    • Cooke, M.P.1
  • 5
    • 0003982501 scopus 로고
    • A Theory and computational model of auditory monaural sound separation,
    • Ph.D. dissertation, Elect. Eng. Dept, Stanford Univ, Stanford, CA
    • M. Weintraub, "A Theory and computational model of auditory monaural sound separation," Ph.D. dissertation, Elect. Eng. Dept., Stanford Univ., Stanford, CA, 1985.
    • (1985)
    • Weintraub, M.1
  • 6
    • 0017004953 scopus 로고
    • Separation of speech from interfering speech by means of harmonic selection
    • T. W. Parsons, "Separation of speech from interfering speech by means of harmonic selection," J. Acoust. Soc. Amer., vol. 60, no. 4, pp. 911-918, 1976.
    • (1976) J. Acoust. Soc. Amer , vol.60 , Issue.4 , pp. 911-918
    • Parsons, T.W.1
  • 7
    • 0028531926 scopus 로고
    • Computational auditory scene analysis
    • Oct
    • G. J. Brown and M. Cooke, "Computational auditory scene analysis," Comput. Speech Lang., vol. 8, pp. 297-336, Oct. 1994.
    • (1994) Comput. Speech Lang , vol.8 , pp. 297-336
    • Brown, G.J.1    Cooke, M.2
  • 8
    • 0032682770 scopus 로고    scopus 로고
    • Separation of speech from interfering sounds based on oscillatory correlation
    • May
    • D. L. Wang and G. J. Brown, "Separation of speech from interfering sounds based on oscillatory correlation," IEEE Trans. Neural Netw., vol. 10, pp. 684-697, May 1999.
    • (1999) IEEE Trans. Neural Netw , vol.10 , pp. 684-697
    • Wang, D.L.1    Brown, G.J.2
  • 9
    • 0004251557 scopus 로고
    • Event formation and separation of musical sound,
    • Ph.D. dissertation, Ctr. Comput. Res. Music Acoust, Dept. Music, Stanford Univ, Stanford, CA
    • D. K. Mellinger, "Event formation and separation of musical sound," Ph.D. dissertation, Ctr. Comput. Res. Music Acoust., Dept. Music, Stanford Univ., Stanford, CA, 1991.
    • (1991)
    • Mellinger, D.K.1
  • 10
    • 77950189256 scopus 로고
    • A perceptual representation of sound for auditory signal separation
    • presented at the, Soc. Amer, Salt Lake City, UT, unpublished
    • D. P. W. Ellis and B. L. Vercoe, "A perceptual representation of sound for auditory signal separation," presented at the 123rd Meeting Acoust. Soc. Amer., Salt Lake City, UT, 1992, unpublished.
    • (1992) 123rd Meeting Acoust
    • Ellis, D.P.W.1    Vercoe, B.L.2
  • 11
    • 0003383455 scopus 로고
    • A computer implementation of psychoacoustic rules
    • Jerusalem, Israel
    • D. P. W. Ellis, "A computer implementation of psychoacoustic rules," in Proc. 12th Int. Conf. Pattern Recognition, Jerusalem, Israel, 1994.
    • (1994) Proc. 12th Int. Conf. Pattern Recognition
    • Ellis, D.P.W.1
  • 12
    • 33749051687 scopus 로고    scopus 로고
    • Blind one-microphone speech separation: A spectral learning approach
    • New York
    • F. R. Bach and M. Jordon, "Blind one-microphone speech separation: A spectral learning approach," in Proc. Adv. Neural Inf. Process. Syst. (NIPS), New York, 2004, vol. 17.
    • (2004) Proc. Adv. Neural Inf. Process. Syst. (NIPS) , vol.17
    • Bach, F.R.1    Jordon, M.2
  • 13
    • 84898946024 scopus 로고    scopus 로고
    • One microphone source separation
    • S. T. Roweis, "One microphone source separation," Adv. Neural Inf. Process. Syst., vol. 13, pp. 793-799, 2001.
    • (2001) Adv. Neural Inf. Process. Syst , vol.13 , pp. 793-799
    • Roweis, S.T.1
  • 14
    • 0025681008 scopus 로고
    • Hidden Markov model decomposition of speech and noise
    • A. Varga and R. Moore, "Hidden Markov model decomposition of speech and noise," in Proc. IEEE ICASSP, 1990, pp. 845-848.
    • (1990) Proc. IEEE ICASSP , pp. 845-848
    • Varga, A.1    Moore, R.2
  • 15
    • 0027622731 scopus 로고
    • Cepstral parameter compensation for HMM recognition in noise
    • Jul
    • M. J. F. Gales and S. J. Young, "Cepstral parameter compensation for HMM recognition in noise," Speech Commun., vol. 12, pp. 231-239, Jul. 1993.
    • (1993) Speech Commun , vol.12 , pp. 231-239
    • Gales, M.J.F.1    Young, S.J.2
  • 16
    • 4544247508 scopus 로고    scopus 로고
    • Multiband audio modeling for single-channel acoustic source separation
    • Montreal, QC, Canada
    • M. J. Reyes-Gomez, D. P. W. Ellis, and N. Jojic, "Multiband audio modeling for single-channel acoustic source separation," in Proc. IEEE ICASSP, Montreal, QC, Canada, 2004, pp. 641-644.
    • (2004) Proc. IEEE ICASSP , pp. 641-644
    • Reyes-Gomez, M.J.1    Ellis, D.P.W.2    Jojic, N.3
  • 18
    • 18744390181 scopus 로고    scopus 로고
    • From missing data to maybe useful data: Soft data modelling for noise robust ASR
    • Stratford-Upon-Avon, Apr
    • A. C. Morris, J. Barker, and H. Bourlard, "From missing data to maybe useful data: Soft data modelling for noise robust ASR," in Proc. Workshop Upon Innovation in Speech Processing, Stratford-Upon-Avon, Apr. 2001, pp. 153-164.
    • (2001) Proc. Workshop Upon Innovation in Speech Processing , pp. 153-164
    • Morris, A.C.1    Barker, J.2    Bourlard, H.3
  • 19
    • 85009074940 scopus 로고    scopus 로고
    • A minimum mean squared error estimator for single channel speaker separation
    • A. M. Reddy and B. Raj, "A minimum mean squared error estimator for single channel speaker separation," in Interspeech, 2004, pp. 2445-2448.
    • (2004) Interspeech , pp. 2445-2448
    • Reddy, A.M.1    Raj, B.2
  • 21
    • 85009230793 scopus 로고    scopus 로고
    • Factorial models and re-filtering for speech separation and denoising
    • S. T. Roweis, "Factorial models and re-filtering for speech separation and denoising," in Eurospeech, 2003, vol. 7, no. 6, pp. 1009-1012.
    • (2003) Eurospeech , vol.7 , Issue.6 , pp. 1009-1012
    • Roweis, S.T.1
  • 22
    • 8344232372 scopus 로고    scopus 로고
    • A maximum likelihood approach to single-channel source separation
    • G.-J. Jang and T.-W. Lee, "A maximum likelihood approach to single-channel source separation," J. Mach. Learn. Res., vol. 4, pp. 1365-1392, 2003.
    • (2003) J. Mach. Learn. Res , vol.4 , pp. 1365-1392
    • Jang, G.-J.1    Lee, T.-W.2
  • 23
    • 84899014722 scopus 로고    scopus 로고
    • A probabilistic approach to single channel blind signal separation
    • Cambridge, MA:MIT Press
    • G.-J. Jang and T.-W. Lee, "A probabilistic approach to single channel blind signal separation," in NIPS. Cambridge, MA:MIT Press, 2003, vol. 15.
    • (2003) NIPS , vol.15
    • Jang, G.-J.1    Lee, T.-W.2
  • 24
    • 35048843291 scopus 로고    scopus 로고
    • Non-negative matrix factor deconvolution; Extraction of multiple sound sources from monophonic inputs
    • Sep
    • P. Smaragdis, "Non-negative matrix factor deconvolution; Extraction of multiple sound sources from monophonic inputs," in Proc. Int. Congr. Ind. Compon. Anal. Blind Signal Separation, Sep. 2004, vol. 3195/2004, pp. 494-499.
    • (2004) Proc. Int. Congr. Ind. Compon. Anal. Blind Signal Separation , vol.3195 , Issue.2004 , pp. 494-499
    • Smaragdis, P.1
  • 25
    • 50249152311 scopus 로고    scopus 로고
    • Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria
    • Mar
    • T.Virtanen, " Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 1066-1074, Mar. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.3 , pp. 1066-1074
    • Virtanen, T.1
  • 26
    • 34948876301 scopus 로고    scopus 로고
    • Audio-visual sound separation via hidden Markov models
    • J. Hershey and M. Casey, "Audio-visual sound separation via hidden Markov models," Proc. Neural Inf. Process. Syst., pp. 1173-1180, 2001.
    • (2001) Proc. Neural Inf. Process. Syst , pp. 1173-1180
    • Hershey, J.1    Casey, M.2
  • 27
    • 0000935895 scopus 로고    scopus 로고
    • An introduction to variational methods for graphical methods
    • M. I. Jordan, Ed. Norwell, MA: Kluwer, To appear
    • M. Jordan, Z. Ghahramani, S. T. Jaakkola, and L. K. Saul, "An introduction to variational methods for graphical methods," in Learning in Graphical Models, M. I. Jordan, Ed. Norwell, MA: Kluwer, To appear.
    • Learning in Graphical Models
    • Jordan, M.1    Ghahramani, Z.2    Jaakkola, S.T.3    Saul, L.K.4
  • 28
    • 0031268341 scopus 로고    scopus 로고
    • Factorial hidden Markov models
    • Z. Ghahramani and M. Jordan, "Factorial hidden Markov models," Mach. Learn., vol. 29, 1997.
    • (1997) Mach. Learn , vol.29
    • Ghahramani, Z.1    Jordan, M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.