SCOPUS 정보 검색 플랫폼

Speech Communication

Volumn 51, Issue 3, 2009, Pages 230-239

On the optimality of ideal binary time-frequency masks

(2) Li, Yipeng a Wang, DeLiang a

a The Ohio State University (United States)

Author keywords

Ideal binary mask; Ideal ratio mask; Optimality; Sound separation; Wiener filter

Indexed keywords

DATABASE SYSTEMS; SEPARATION; SIGNAL TO NOISE RATIO;

IDEAL BINARY MASK; IDEAL RATIO MASK; OPTIMALITY; SOUND SEPARATION; WIENER FILTER;

ADAPTIVE FILTERING;

EID: 58149196390 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/j.specom.2008.09.001 Document Type: Article

Times cited : (140)

References (31)

1
- 0003684441
- MIT Press, Cambridge, MA
- Bregman A.S. Auditory Scene Analysis (1990), MIT Press, Cambridge, MA
- (1990) Auditory Scene Analysis
- Bregman, A.S.¹

2
- 0028531926
- Computational auditory scene analysis
- Brown G.J., and Cooke M.P. Computational auditory scene analysis. Comput. Speech Lang. 8 (1994) 297-336
- (1994) Comput. Speech Lang. , vol.8 , pp. 297-336
- Brown, G.J.¹ Cooke, M.P.²

3
- 33845354768
- Isolating the energetic component of speech-on-speech masking with an ideal binary time-frequency mask
- Brungart D., Chang P.S., Simpson B.D., and Wang D.L. Isolating the energetic component of speech-on-speech masking with an ideal binary time-frequency mask. J. Acoust. Soc. Amer. 120 (2006) 4007-4018
- (2006) J. Acoust. Soc. Amer. , vol.120 , pp. 4007-4018
- Brungart, D.¹ Chang, P.S.² Simpson, B.D.³ Wang, D.L.⁴

4
- 0003479143
- Cambridge University Press, Cambridge, UK
- Cooke M.P. Modeling Auditory Processing and Organization (1993), Cambridge University Press, Cambridge, UK
- (1993) Modeling Auditory Processing and Organization
- Cooke, M.P.¹

5
- 34249884500
- Speech enhancement using the modified phase-opponency model
- Deshmukh O.M., Espy-Wilson C.Y., and Carney L.H. Speech enhancement using the modified phase-opponency model. J. Acoust. Soc. Amer. 121 6 (2007) 3886-3898
- (2007) J. Acoust. Soc. Amer. , vol.121 , Issue.6 , pp. 3886-3898
- Deshmukh, O.M.¹ Espy-Wilson, C.Y.² Carney, L.H.³

6
- 84873856136
- Model-based scene analysis
- Wang D.L., and Brown G.J. (Eds), Wiley/IEEE Press, Hoboken, NJ
- Ellis D.P.W. Model-based scene analysis. In: Wang D.L., and Brown G.J. (Eds). Computational Auditory Scene Analysis: Principles, Algorithms, and Application (2006), Wiley/IEEE Press, Hoboken, NJ 115-146
- (2006) Computational Auditory Scene Analysis: Principles, Algorithms, and Application , pp. 115-146
- Ellis, D.P.W.¹

7
- 58149177199
- Goto, M., Hashiguchi, H., Nishimura, T., Oka, R., 2003. RWC music database: music genre database and musical instrument sound database. In: Internat. Conf. on Music Information Retrieval.
- Goto, M., Hashiguchi, H., Nishimura, T., Oka, R., 2003. RWC music database: music genre database and musical instrument sound database. In: Internat. Conf. on Music Information Retrieval.

8
- 33744971131
- Mask estimation for missing data speech recognition based on statistics of binaural interaction
- Harding S., Barker J., and Brown G.J. Mask estimation for missing data speech recognition based on statistics of binaural interaction. IEEE Trans. Audio, Speech, Lang. Process. 14 1 (2006) 58-67
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.1 , pp. 58-67
- Harding, S.¹ Barker, J.² Brown, G.J.³

9
- 0035681924
- Hu, G., Wang, D.L., 2001. Speech segregation based on pitch tracking and amplitude modulation. In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.
- Hu, G., Wang, D.L., 2001. Speech segregation based on pitch tracking and amplitude modulation. In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

10
- 4644265990
- Monaural speech segregation based on pitch tracking and amplitude modulation
- Hu G., and Wang D.L. Monaural speech segregation based on pitch tracking and amplitude modulation. IEEE Trans. Neural Networks 15 5 (2004) 1135-1150
- (2004) IEEE Trans. Neural Networks , vol.15 , Issue.5 , pp. 1135-1150
- Hu, G.¹ Wang, D.L.²

11
- 0035748878
- Recognizing the component tones of a major chord
- Hubbard T.L., and Datteri D.L. Recognizing the component tones of a major chord. Amer. J. Psychol. 114 4 (2001) 569-589
- (2001) Amer. J. Psychol. , vol.114 , Issue.4 , pp. 569-589
- Hubbard, T.L.¹ Datteri, D.L.²

12
- 51449114976
- Zero-crossing based time-frequency masking for sound segregation
- Kim Y.-I., An S.J., and Kil R.M. Zero-crossing based time-frequency masking for sound segregation. Neural Inform. Process. - Lett. Rev. 10 (2006) 125-134
- (2006) Neural Inform. Process. - Lett. Rev. , vol.10 , pp. 125-134
- Kim, Y.-I.¹ An, S.J.² Kil, R.M.³

13
- 40749125179
- Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction
- Li N., and Loizou P.C. Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction. J. Acoust. Soc. Amer. 123 (2008) 1673-1682
- (2008) J. Acoust. Soc. Amer. , vol.123 , pp. 1673-1682
- Li, N.¹ Loizou, P.C.²

14
- 34547539791
- Li, Y., Wang, D.L., 2007. Pitch detection in polyphonic music using instrument tone models. In: IEEE Internat. Conf. on Acoustics, Speech, and Signal Processing, pp. II.481-484.
- Li, Y., Wang, D.L., 2007. Pitch detection in polyphonic music using instrument tone models. In: IEEE Internat. Conf. on Acoustics, Speech, and Signal Processing, pp. II.481-484.

15
- 40949108726
- Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech
- Li P., Guan Y., Xu B., and Liu W. Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech. IEEE Trans. Audio, Speech, Lang. Process. 14 6 (2006) 2014-2023
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.6 , pp. 2014-2023
- Li, P.¹ Guan, Y.² Xu, B.³ Liu, W.⁴

16
- 0018642851
- Enhancement and bandwidth compression of noisy speech
- Lim J.S., and Oppenheim A.V. Enhancement and bandwidth compression of noisy speech. Proc. IEEE 67 12 (1979) 1586-1604
- (1979) Proc. IEEE , vol.67 , Issue.12 , pp. 1586-1604
- Lim, J.S.¹ Oppenheim, A.V.²

17
- 0003513556
- Prentice-Hall
- Oppenheim A.V., Schafer R.W., and Buck J.R. Discrete-Time Signal Processing. second ed. (1999), Prentice-Hall
- (1999) Discrete-Time Signal Processing. second ed.
- Oppenheim, A.V.¹ Schafer, R.W.² Buck, J.R.³

18
- 0022794595
- Analysis/synthesis filter bank design based on time domain aliasing cancellation
- Princen J.P., and Bradley A.B. Analysis/synthesis filter bank design based on time domain aliasing cancellation. IEEE Transactions on Acoustics, Speech, and Signal Processing 34 5 (1986) 1153-1161
- (1986) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.34 , Issue.5 , pp. 1153-1161
- Princen, J.P.¹ Bradley, A.B.²

19
- 33845940172
- Radfar, M.H., Dansereau, R.M., Sayadiyan, A., 2007. A maximum likelihood estimation of vocal-tract-related filter characteristics for single channel speech separation. EURASIP Journal on Audio, Speech, and Music Processing 2007, Article ID 84186, p. 15.
- Radfar, M.H., Dansereau, R.M., Sayadiyan, A., 2007. A maximum likelihood estimation of vocal-tract-related filter characteristics for single channel speech separation. EURASIP Journal on Audio, Speech, and Music Processing 2007, Article ID 84186, p. 15.

20
- 56249144712
- Soft mask methods for single-channel speaker separation
- Reddy A.M., and Raj B. Soft mask methods for single-channel speaker separation. IEEE Trans. Audio, Speech, Lang. Process. 25 6 (2007) 1766-1776
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.25 , Issue.6 , pp. 1766-1776
- Reddy, A.M.¹ Raj, B.²

21
- 0142026377
- Speech segregation based on sound localization
- Roman N., Wang D.L., and Brown G.J. Speech segregation based on sound localization. J. Acoust. Soc. Amer. 114 4 (2003) 2236-2252
- (2003) J. Acoust. Soc. Amer. , vol.114 , Issue.4 , pp. 2236-2252
- Roman, N.¹ Wang, D.L.² Brown, G.J.³

22
- 33750311718
- Binary and ratio time-frequency masks for robust speech recognition
- Srinivasan S., Roman N., and Wang D.L. Binary and ratio time-frequency masks for robust speech recognition. Speech Comm. 48 (2006) 1486-1501
- (2006) Speech Comm. , vol.48 , pp. 1486-1501
- Srinivasan, S.¹ Roman, N.² Wang, D.L.³

23
- 0004206760
- SIAM, Philadelphia, PA
- Strang G., and Nguyen T. Wavelets and Filter Banks (1996), SIAM, Philadelphia, PA
- (1996) Wavelets and Filter Banks
- Strang, G.¹ Nguyen, T.²

24
- 0003462953
- Wiley, New York
- van Trees H.L. Detection, Estimation, and Modulation Theory, Part I (1968), Wiley, New York
- (1968) Detection, Estimation, and Modulation Theory, Part I
- van Trees, H.L.¹

25
- 33744975847
- Performance measurement in blind audio source separation
- Vincent E., Gribonval R., and Fevotte C. Performance measurement in blind audio source separation. IEEE Trans. Audio, Speech, Lang. Process. 14 4 (2006) 1462-1469
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.4 , pp. 1462-1469
- Vincent, E.¹ Gribonval, R.² Fevotte, C.³

26
- 34247173529
- Oracle estimators for the benchmarking of source separation algorithms
- Vincent E., Gribonval R., and Plumbley M.D. Oracle estimators for the benchmarking of source separation algorithms. Signal Process. 87 (2007) 1933-1950
- (2007) Signal Process. , vol.87 , pp. 1933-1950
- Vincent, E.¹ Gribonval, R.² Plumbley, M.D.³

27
- 84892233308
- On ideal binary masks as the computational goal of auditory scene analysis
- Divenyi P. (Ed), Kluwer Academic, Boston, MA
- Wang D.L. On ideal binary masks as the computational goal of auditory scene analysis. In: Divenyi P. (Ed). Speech Separation by Humans and Machines (2005), Kluwer Academic, Boston, MA 181-197
- (2005) Speech Separation by Humans and Machines , pp. 181-197
- Wang, D.L.¹

28
- 0032682770
- Separation of speech from interfering sounds based on oscillatory correlation
- Wang D.L., and Brown G.J. Separation of speech from interfering sounds based on oscillatory correlation. IEEE Trans. Neural Networks 10 3 (1999) 684-697
- (1999) IEEE Trans. Neural Networks , vol.10 , Issue.3 , pp. 684-697
- Wang, D.L.¹ Brown, G.J.²

29
- 82255178542
- Wang D.L., and Brown G.J. (Eds), Wiley/IEEE Press, Hoboken, NJ
- In: Wang D.L., and Brown G.J. (Eds). Computational Auditory Scene Analysis: Principles, Algorithms, and Applications (2006), Wiley/IEEE Press, Hoboken, NJ
- (2006) Computational Auditory Scene Analysis: Principles, Algorithms, and Applications

30
- 58149204094
- Weintraub, M., 1985. A theory and computational model of auditory monaural sound separation. Ph.D. Thesis, Stanford University, Department of Electrical Engineering.
- Weintraub, M., 1985. A theory and computational model of auditory monaural sound separation. Ph.D. Thesis, Stanford University, Department of Electrical Engineering.

31
- 0003574794
- MIT Press, Cambridge, MA
- Wiener N. Extrapolation, Interpolation, and Smoothing of Stationary Time Series (1949), MIT Press, Cambridge, MA
- (1949) Extrapolation, Interpolation, and Smoothing of Stationary Time Series
- Wiener, N.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.