메뉴 건너뛰기




Volumn 51, Issue 3, 2009, Pages 230-239

On the optimality of ideal binary time-frequency masks

Author keywords

Ideal binary mask; Ideal ratio mask; Optimality; Sound separation; Wiener filter

Indexed keywords

DATABASE SYSTEMS; SEPARATION; SIGNAL TO NOISE RATIO;

EID: 58149196390     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2008.09.001     Document Type: Article
Times cited : (140)

References (31)
  • 2
    • 0028531926 scopus 로고
    • Computational auditory scene analysis
    • Brown G.J., and Cooke M.P. Computational auditory scene analysis. Comput. Speech Lang. 8 (1994) 297-336
    • (1994) Comput. Speech Lang. , vol.8 , pp. 297-336
    • Brown, G.J.1    Cooke, M.P.2
  • 3
    • 33845354768 scopus 로고    scopus 로고
    • Isolating the energetic component of speech-on-speech masking with an ideal binary time-frequency mask
    • Brungart D., Chang P.S., Simpson B.D., and Wang D.L. Isolating the energetic component of speech-on-speech masking with an ideal binary time-frequency mask. J. Acoust. Soc. Amer. 120 (2006) 4007-4018
    • (2006) J. Acoust. Soc. Amer. , vol.120 , pp. 4007-4018
    • Brungart, D.1    Chang, P.S.2    Simpson, B.D.3    Wang, D.L.4
  • 5
    • 34249884500 scopus 로고    scopus 로고
    • Speech enhancement using the modified phase-opponency model
    • Deshmukh O.M., Espy-Wilson C.Y., and Carney L.H. Speech enhancement using the modified phase-opponency model. J. Acoust. Soc. Amer. 121 6 (2007) 3886-3898
    • (2007) J. Acoust. Soc. Amer. , vol.121 , Issue.6 , pp. 3886-3898
    • Deshmukh, O.M.1    Espy-Wilson, C.Y.2    Carney, L.H.3
  • 7
    • 58149177199 scopus 로고    scopus 로고
    • Goto, M., Hashiguchi, H., Nishimura, T., Oka, R., 2003. RWC music database: music genre database and musical instrument sound database. In: Internat. Conf. on Music Information Retrieval.
    • Goto, M., Hashiguchi, H., Nishimura, T., Oka, R., 2003. RWC music database: music genre database and musical instrument sound database. In: Internat. Conf. on Music Information Retrieval.
  • 8
    • 33744971131 scopus 로고    scopus 로고
    • Mask estimation for missing data speech recognition based on statistics of binaural interaction
    • Harding S., Barker J., and Brown G.J. Mask estimation for missing data speech recognition based on statistics of binaural interaction. IEEE Trans. Audio, Speech, Lang. Process. 14 1 (2006) 58-67
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.1 , pp. 58-67
    • Harding, S.1    Barker, J.2    Brown, G.J.3
  • 9
    • 0035681924 scopus 로고    scopus 로고
    • Hu, G., Wang, D.L., 2001. Speech segregation based on pitch tracking and amplitude modulation. In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.
    • Hu, G., Wang, D.L., 2001. Speech segregation based on pitch tracking and amplitude modulation. In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.
  • 10
    • 4644265990 scopus 로고    scopus 로고
    • Monaural speech segregation based on pitch tracking and amplitude modulation
    • Hu G., and Wang D.L. Monaural speech segregation based on pitch tracking and amplitude modulation. IEEE Trans. Neural Networks 15 5 (2004) 1135-1150
    • (2004) IEEE Trans. Neural Networks , vol.15 , Issue.5 , pp. 1135-1150
    • Hu, G.1    Wang, D.L.2
  • 11
    • 0035748878 scopus 로고    scopus 로고
    • Recognizing the component tones of a major chord
    • Hubbard T.L., and Datteri D.L. Recognizing the component tones of a major chord. Amer. J. Psychol. 114 4 (2001) 569-589
    • (2001) Amer. J. Psychol. , vol.114 , Issue.4 , pp. 569-589
    • Hubbard, T.L.1    Datteri, D.L.2
  • 12
    • 51449114976 scopus 로고    scopus 로고
    • Zero-crossing based time-frequency masking for sound segregation
    • Kim Y.-I., An S.J., and Kil R.M. Zero-crossing based time-frequency masking for sound segregation. Neural Inform. Process. - Lett. Rev. 10 (2006) 125-134
    • (2006) Neural Inform. Process. - Lett. Rev. , vol.10 , pp. 125-134
    • Kim, Y.-I.1    An, S.J.2    Kil, R.M.3
  • 13
    • 40749125179 scopus 로고    scopus 로고
    • Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction
    • Li N., and Loizou P.C. Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction. J. Acoust. Soc. Amer. 123 (2008) 1673-1682
    • (2008) J. Acoust. Soc. Amer. , vol.123 , pp. 1673-1682
    • Li, N.1    Loizou, P.C.2
  • 14
    • 34547539791 scopus 로고    scopus 로고
    • Li, Y., Wang, D.L., 2007. Pitch detection in polyphonic music using instrument tone models. In: IEEE Internat. Conf. on Acoustics, Speech, and Signal Processing, pp. II.481-484.
    • Li, Y., Wang, D.L., 2007. Pitch detection in polyphonic music using instrument tone models. In: IEEE Internat. Conf. on Acoustics, Speech, and Signal Processing, pp. II.481-484.
  • 15
    • 40949108726 scopus 로고    scopus 로고
    • Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech
    • Li P., Guan Y., Xu B., and Liu W. Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech. IEEE Trans. Audio, Speech, Lang. Process. 14 6 (2006) 2014-2023
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.6 , pp. 2014-2023
    • Li, P.1    Guan, Y.2    Xu, B.3    Liu, W.4
  • 16
    • 0018642851 scopus 로고
    • Enhancement and bandwidth compression of noisy speech
    • Lim J.S., and Oppenheim A.V. Enhancement and bandwidth compression of noisy speech. Proc. IEEE 67 12 (1979) 1586-1604
    • (1979) Proc. IEEE , vol.67 , Issue.12 , pp. 1586-1604
    • Lim, J.S.1    Oppenheim, A.V.2
  • 19
    • 33845940172 scopus 로고    scopus 로고
    • Radfar, M.H., Dansereau, R.M., Sayadiyan, A., 2007. A maximum likelihood estimation of vocal-tract-related filter characteristics for single channel speech separation. EURASIP Journal on Audio, Speech, and Music Processing 2007, Article ID 84186, p. 15.
    • Radfar, M.H., Dansereau, R.M., Sayadiyan, A., 2007. A maximum likelihood estimation of vocal-tract-related filter characteristics for single channel speech separation. EURASIP Journal on Audio, Speech, and Music Processing 2007, Article ID 84186, p. 15.
  • 20
    • 56249144712 scopus 로고    scopus 로고
    • Soft mask methods for single-channel speaker separation
    • Reddy A.M., and Raj B. Soft mask methods for single-channel speaker separation. IEEE Trans. Audio, Speech, Lang. Process. 25 6 (2007) 1766-1776
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.25 , Issue.6 , pp. 1766-1776
    • Reddy, A.M.1    Raj, B.2
  • 21
    • 0142026377 scopus 로고    scopus 로고
    • Speech segregation based on sound localization
    • Roman N., Wang D.L., and Brown G.J. Speech segregation based on sound localization. J. Acoust. Soc. Amer. 114 4 (2003) 2236-2252
    • (2003) J. Acoust. Soc. Amer. , vol.114 , Issue.4 , pp. 2236-2252
    • Roman, N.1    Wang, D.L.2    Brown, G.J.3
  • 22
    • 33750311718 scopus 로고    scopus 로고
    • Binary and ratio time-frequency masks for robust speech recognition
    • Srinivasan S., Roman N., and Wang D.L. Binary and ratio time-frequency masks for robust speech recognition. Speech Comm. 48 (2006) 1486-1501
    • (2006) Speech Comm. , vol.48 , pp. 1486-1501
    • Srinivasan, S.1    Roman, N.2    Wang, D.L.3
  • 26
    • 34247173529 scopus 로고    scopus 로고
    • Oracle estimators for the benchmarking of source separation algorithms
    • Vincent E., Gribonval R., and Plumbley M.D. Oracle estimators for the benchmarking of source separation algorithms. Signal Process. 87 (2007) 1933-1950
    • (2007) Signal Process. , vol.87 , pp. 1933-1950
    • Vincent, E.1    Gribonval, R.2    Plumbley, M.D.3
  • 27
    • 84892233308 scopus 로고    scopus 로고
    • On ideal binary masks as the computational goal of auditory scene analysis
    • Divenyi P. (Ed), Kluwer Academic, Boston, MA
    • Wang D.L. On ideal binary masks as the computational goal of auditory scene analysis. In: Divenyi P. (Ed). Speech Separation by Humans and Machines (2005), Kluwer Academic, Boston, MA 181-197
    • (2005) Speech Separation by Humans and Machines , pp. 181-197
    • Wang, D.L.1
  • 28
    • 0032682770 scopus 로고    scopus 로고
    • Separation of speech from interfering sounds based on oscillatory correlation
    • Wang D.L., and Brown G.J. Separation of speech from interfering sounds based on oscillatory correlation. IEEE Trans. Neural Networks 10 3 (1999) 684-697
    • (1999) IEEE Trans. Neural Networks , vol.10 , Issue.3 , pp. 684-697
    • Wang, D.L.1    Brown, G.J.2
  • 30
    • 58149204094 scopus 로고    scopus 로고
    • Weintraub, M., 1985. A theory and computational model of auditory monaural sound separation. Ph.D. Thesis, Stanford University, Department of Electrical Engineering.
    • Weintraub, M., 1985. A theory and computational model of auditory monaural sound separation. Ph.D. Thesis, Stanford University, Department of Electrical Engineering.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.