Volume 12, Issue 4, 2008, Pages 332-353

Time-Frequency Masking for Speech Separation and Its Potential for Hearing Aid Design

Author keywords

computational auditory scene analysis; hearing aids; ideal binary mask; time frequency masking

Indexed keywords

ALGORITHM; AUDITORY DISCRIMINATION; AUDITORY MASKING; HEARING AID; HUMAN; NOISE REDUCTION; REVIEW; SPEECH ANALYSIS; SPEECH DISCRIMINATION;

EID: 56249144201     PISSN: 10847138     EISSN: 19405588     Source Type: Journal    
DOI: 10.1177/1084713808326455     Document Type: Article
Times cited: 156

References (88)
  • 2
    • 33748523481
    • Determination of the potential benefit of time-frequency gain manipulation
    • Anzalone M. C. Calandruccio L. Doherty K. A. Carney L. H. (2006). Determination of the potential benefit of time-frequency gain manipulation. Ear and Hearing, 27, 480–492.
    • (2006) Ear and Hearing , vol.27 , pp. 480-492
    • Anzalone, M.C.1    Calandruccio, L.2    Doherty, K.A.3    Carney, L.H.4
  • 7
    • 34247223586
    • Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors
    • Araki S. Sawada H. Mukai R. Makino S. (2007). Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors. Signal Processing, 87, 1833–1847.
    • (2007) Signal Processing , vol.87 , pp. 1833-1847
    • Araki, S.1    Sawada, H.2    Mukai, R.3    Makino, S.4
  • 11
    • 0002706411
    • Modeling human sound-source localization and the cocktail-party-effect
    • Bodden M. (1993). Modeling human sound-source localization and the cocktail-party-effect. Acta Acustica, 1, 43–55.
    • (1993) Acta Acustica , vol.1 , pp. 43-55
    • Bodden, M.1
  • 17
    • 33845354768
    • Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation
    • Brungart D. Chang P. S. Simpson B. D. Wang D. L. (2006). Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation. Journal of the Acoustical Society of America, 120, 4007–4018.
    • (2006) Journal of the Acoustical Society of America , vol.120 , pp. 4007-4018
    • Brungart, D.1    Chang, P.S.2    Simpson, B.D.3    Wang, D.L.4
  • 20
    • 0028413241
    • Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor
    • Cappe O. (1994). Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor. IEEE Transactions on Speech and Audio Processing, 2, 345–349.
    • (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , pp. 345-349
    • Cappe, O.1
  • 23
    • 0035342414
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • Cooke M. Green P. Josifovski L. Vizinho A. (2001). Robust automatic speech recognition with missing and unreliable acoustic data. Speech Communication, 34, 267–285.
    • (2001) Speech Communication , vol.34 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 27
    • 33746589116
    • Speech source separation in convolutive environments using space-time-frequency analysis
    • Article 38412
    • Dubnov S. Tabrikian J. Arnon-Targan M. (2006). Speech source separation in convolutive environments using space-time-frequency analysis. EURASIP Journal on Applied Signal Processing, 2006, Article 38412, 11 pages.
    • (2006) EURASIP Journal on Applied Signal Processing , vol.2006 , pp. 11
    • Dubnov, S.1    Tabrikian, J.2    Arnon-Targan, M.3
  • 30
    • 0023922474
    • Excess masking among listeners with a sensorineural hearing loss
    • Gagne J.-P. (1988). Excess masking among listeners with a sensorineural hearing loss. Journal of the Acoustical Society of America, 83, 2311–2321.
    • (1988) Journal of the Acoustical Society of America , vol.83 , pp. 2311-2321
    • Gagne, J.-P.1
  • 33
    • 84998077855
    • (Ellis A. J., Trans., 2nd English ed.). New York: Dover
    • Helmholtz H. (1863). On the sensation of tone (Ellis A. J., Trans., 2nd English ed.). New York: Dover.
    • (1863) On the sensation of tone
    • Helmholtz, H.1
  • 35
    • 4644265990
    • Monaural speech segregation based on pitch tracking and amplitude modulation
    • Hu G. Wang D. L. (2004). Monaural speech segregation based on pitch tracking and amplitude modulation. IEEE Transactions on Neural Networks, 15, 1135–1150.
    • (2004) IEEE Transactions on Neural Networks , vol.15 , pp. 1135-1150
    • Hu, G.1    Wang, D.L.2
  • 36
    • 46049084696
    • An auditory scene analysis approach to monaural speech segregation
    • In Hansler E. Schmidt G. (Eds.) Heidelberg, Germany: Springer
    • Hu G. Wang D. L. (2006). An auditory scene analysis approach to monaural speech segregation. In Hansler E. Schmidt G. (Eds.), Topics in acoustic echo and noise control (pp. 485–515). Heidelberg, Germany: Springer.
    • (2006) Topics in acoustic echo and noise control , pp. 485-515
    • Hu, G.1    Wang, D.L.2
  • 37
    • 49249107353
    • Segregation of unvoiced speech from nonspeech interference
    • Hu G. Wang D. L. (2008). Segregation of unvoiced speech from nonspeech interference. Journal of the Acoustical Society of America, 124, 1306–1319.
    • (2008) Journal of the Acoustical Society of America , vol.124 , pp. 1306-1319
    • Hu, G.1    Wang, D.L.2
  • 39
    • 0014568991
    • IEEE recommended practice for speech quality measurements
    • IEEE. (1969). IEEE recommended practice for speech quality measurements. IEEE Transactions on Audio and Electroacoustics, 17, 225–246.
    • (1969) IEEE Transactions on Audio and Electroacoustics , vol.17 , pp. 225-246
  • 41
    • 33751279943
    • Multichannel dynamic-range compression using digital frequency warping
    • Kates J. M. Arehart K. H. (2005). Multichannel dynamic-range compression using digital frequency warping. EURASIP Journal on Applied Signal Processing, 18, 3003–3014.
    • (2005) EURASIP Journal on Applied Signal Processing , vol.18 , pp. 3003-3014
    • Kates, J.M.1    Arehart, K.H.2
  • 46
    • 41849093721
    • Effect of spectral resolution on the intelligibility of ideal binary masked speech
    • Li N. Loizou P. C. (2008a). Effect of spectral resolution on the intelligibility of ideal binary masked speech. Journal of the Acoustical Society of America, 123, EL59–EL64.
    • (2008) Journal of the Acoustical Society of America , vol.123 , pp. EL59-EL64
    • Li, N.1    Loizou, P.C.2
  • 47
    • 40749125179
    • Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction
    • Li N. Loizou P. C. (2008b). Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction. Journal of the Acoustical Society of America, 123, 1673–1682.
    • (2008) Journal of the Acoustical Society of America , vol.123 , pp. 1673-1682
    • Li, N.1    Loizou, P.C.2
  • 48
    • 40949108726
    • Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech
    • Li P. Guan Y. Xu B. Liu W. (2006). Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech. IEEE Transactions on Audio, Speech, and Language Processing, 14, 2014–2023.
    • (2006) IEEE Transactions on Audio, Speech, and Language Processing , vol.14 , pp. 2014-2023
    • Li, P.1    Guan, Y.2    Xu, B.3    Liu, W.4
  • 57
  • 59
    • 0028012490
    • Development of the hearing in noise test for the measurement of speech reception thresholds in quiet and in noise
    • Nilsson M. Soli S. Sullivan J. A. (1994). Development of the hearing in noise test for the measurement of speech reception thresholds in quiet and in noise. Journal of the Acoustical Society of America, 95, 1085–1099.
    • (1994) Journal of the Acoustical Society of America , vol.95 , pp. 1085-1099
    • Nilsson, M.1    Soli, S.2    Sullivan, J.A.3
  • 60
    • 2942539074
    • Techniques for handling convolutional distortion with “missing data” automatic speech recognition
    • Palomäki K. J. Brown G. J. Barker J. (2004). Techniques for handling convolutional distortion with “missing data” automatic speech recognition. Speech Communication, 43, 123–142.
    • (2004) Speech Communication , vol.43 , pp. 123-142
    • Palomäki, K.J.1    Brown, G.J.2    Barker, J.3
  • 63
    • 35648992055
    • Monophonic sound source separation with an unsupervised network of spiking neurones
    • Pichevar R. Rouat J. (2007). Monophonic sound source separation with an unsupervised network of spiking neurones. Neurocomputing, 71, 109–120.
    • (2007) Neurocomputing , vol.71 , pp. 109-120
    • Pichevar, R.1    Rouat, J.2
  • 64
    • 33845940172
    • A maximum likelihood estimation of vocal-tract-related filter characteristics for single channel speech separation
    • Article 84186
    • Radfar M. H. Dansereau R. M. Sayadiyan A. (2007). A maximum likelihood estimation of vocal-tract-related filter characteristics for single channel speech separation. EURASIP Journal on Audio, Speech, and Music Processing, 2007, Article 84186, 15 pages.
    • (2007) EURASIP Journal on Audio, Speech, and Music Processing , vol.2007 , pp. 15
    • Radfar, M.H.1    Dansereau, R.M.2    Sayadiyan, A.3
  • 75
    • 33750311718
    • Binary and ratio time-frequency masks for robust speech recognition
    • Srinivasan S. Roman N. Wang D. L. (2006). Binary and ratio time-frequency masks for robust speech recognition. Speech Communication, 48, 1486–1501.
    • (2006) Speech Communication , vol.48 , pp. 1486-1501
    • Srinivasan, S.1    Roman, N.2    Wang, D.L.3
  • 80
    • 0023985457
    • Beamforming: A versatile approach to spatial filtering
    • (April)
    • van Veen B. D. Buckley K. M. (1988, April). Beamforming: A versatile approach to spatial filtering. IEEE ASSP Magazine, pp. 4–24.
    • (1988) IEEE ASSP Magazine , pp. 4-24
    • van Veen, B.D.1    Buckley, K.M.2
  • 82
    • 84892233308
    • On ideal binary mask as the computational goal of auditory scene analysis
    • In Divenyi P. (Ed.) Norwell, MA: Kluwer Academic
    • Wang D. L. (2005). On ideal binary mask as the computational goal of auditory scene analysis. In Divenyi P. (Ed.), Speech separation by humans and machines (pp. 181–197). Norwell, MA: Kluwer Academic.
    • (2005) Speech separation by humans and machines , pp. 181-197
    • Wang, D.L.1
  • 83
    • 0032682770
    • Separation of speech from interfering sounds based on oscillatory correlation
    • Wang D. L. Brown G. J. (1999). Separation of speech from interfering sounds based on oscillatory correlation. IEEE Transactions on Neural Networks, 10, 684–697.
    • (1999) IEEE Transactions on Neural Networks , vol.10 , pp. 684-697
    • Wang, D.L.1    Brown, G.J.2
  • 88
    • 3142694930
    • Blind separation of speech mixtures via time-frequency masking
    • Yilmaz O. Rickard S. (2004). Blind separation of speech mixtures via time-frequency masking. IEEE Transactions on Signal Processing, 52, 1830–1847.
    • (2004) IEEE Transactions on Signal Processing , vol.52 , pp. 1830-1847
    • Yilmaz, O.1    Rickard, S.2


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.