SCOPUS Information Search Platform

Journal of the Acoustical Society of America

Volume 136, Issue 2, 2014, Pages 892-902

Reconstruction techniques for improving the perceptual quality of binary masked speech

(3) Williamson, Donald S. a; Wang, Yuxuan a; Wang, Deliang a,b

a The Ohio State University (United States)

b OHIO STATE UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

BAYESIAN NETWORKS; SEPARATION; SPEECH;

DEEP NEURAL NETWORKS; NONNEGATIVE MATRIX FACTORIZATION; OBJECTIVE SPEECH QUALITY MEASURES; PERCEPTUAL EVALUATION OF SPEECH QUALITIES; RECONSTRUCTION TECHNIQUES; SHORT TIME FOURIER TRANSFORMS; SPARSE RECONSTRUCTION; TIME FREQUENCY DOMAIN;

QUALITY CONTROL;

ADVERSE EFFECTS; ALGORITHM; ARTIFICIAL NEURAL NETWORK; AUDITORY STIMULATION; BAYES THEOREM; COMPARATIVE STUDY; FOURIER ANALYSIS; HUMAN; NOISE; PERCEPTION; SIGNAL PROCESSING; SPEECH; SPEECH ANALYSIS; SPEECH AUDIOMETRY; SPEECH INTELLIGIBILITY; SPEECH PERCEPTION; STATISTICAL MODEL; TIME; VOICE;

ACOUSTIC STIMULATION; ALGORITHMS; AUDIOMETRY, SPEECH; BAYES THEOREM; FOURIER ANALYSIS; HUMANS; LINEAR MODELS; NEURAL NETWORKS (COMPUTER); NOISE; PERCEPTUAL MASKING; SIGNAL PROCESSING, COMPUTER-ASSISTED; SPEECH ACOUSTICS; SPEECH INTELLIGIBILITY; SPEECH PERCEPTION; SPEECH PRODUCTION MEASUREMENT; TIME FACTORS; VOICE QUALITY;

EID: 84905693981 PISSN: 00014966 EISSN: None Source Type: Journal
DOI: 10.1121/1.4884759 Document Type: Article

Times cited : (33)

References (55)

1
- 33748523481
- Determination of the potential benefit of time-frequency gain manipulation
- 10.1097/01.aud.0000233891.86809.df
- Anzalone, M. C., Calandruccio, L., Doherty, K. A., and Carney, L. H. (2006). " Determination of the potential benefit of time-frequency gain manipulation," Ear Hear. 27, 480-492. 10.1097/01.aud.0000233891.86809.df
- (2006) Ear Hear. , vol.27 , pp. 480-492
- Anzalone, M.C.¹ Calandruccio, L.² Doherty, K.A.³ Carney, L.H.⁴

2
- 33646759922
- Reducing musical noise by a fine-shift overlap-add method applied to source separation using a time-frequency mask
- Araki, S., Makino, S., Sawada, H., and Mukai, R. (2005). " Reducing musical noise by a fine-shift overlap-add method applied to source separation using a time-frequency mask," in Proceedings of ICASSP, Vol. 3, pp. 81-84.
- (2005) Proceedings of ICASSP , vol.3 , pp. 81-84
- Araki, S.¹ Makino, S.² Sawada, H.³ Mukai, R.⁴

3
- 38149032997
- Compressed sensing and source separation
- edited by M. E. Davies, C. J. James, S. Abdallah, and M. D. Plumbley (Springer Verlag, New York)
- Blumensath, T., and Davies, M. E. (2007). " Compressed sensing and source separation," in Independent Component Analysis and Blind Source Separation, edited by M. E. Davies, C. J. James, S. Abdallah, and M. D. Plumbley (Springer Verlag, New York), pp. 341-348.
- (2007) Independent Component Analysis and Blind Source Separation , pp. 341-348
- Blumensath, T.¹ Davies, M.E.²

4
- 0038120523
- Last viewed 5/30/13
- Boersma, P., and Weenink, D. (2012). " Praat: Doing phonetics by computer (Version 5.3.32)," Available: http://www.praat.org/ (Last viewed 5/30/13).
- (2012) Praat: Doing Phonetics by Computer (Version 5.3.32)
- Boersma, P.¹ Weenink, D.²

5
- 33845354768
- Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation
- 10.1121/1.2363929
- Brungart, D., Chang, P., Simpson, B., and Wang, D. (2006). " Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation," J. Acoust. Soc. Am. 120, 4007-4018. 10.1121/1.2363929
- (2006) J. Acoust. Soc. Am. , vol.120 , pp. 4007-4018
- Brungart, D.¹ Chang, P.² Simpson, B.³ Wang, D.⁴

6
- 33745604236
- Stable signal recovery from incomplete and inaccurate measurements
- 10.1002/cpa.20124
- Candes, E. J., Romberg, J., and Tao, T. (2006). " Stable signal recovery from incomplete and inaccurate measurements," Commun. Pure Appl. Math. 59, 1207-1223. 10.1002/cpa.20124
- (2006) Commun. Pure Appl. Math. , vol.59 , pp. 1207-1223
- Candes, E.J.¹ Romberg, J.² Tao, T.³

7
- 79954508213
- Improvement of intelligibility of ideal binary-masked noisy speech by adding background noise
- 10.1121/1.3559707
- Cao, S., Li, L., and Wu, X. (2011). " Improvement of intelligibility of ideal binary-masked noisy speech by adding background noise," J. Acoust. Soc. Am. 129, 2227-2236. 10.1121/1.3559707
- (2011) J. Acoust. Soc. Am. , vol.129 , pp. 2227-2236
- Cao, S.¹ Li, L.² Wu, X.³

8
- 78049381260
- Tech. Rep., Human Language Technologies, IBM
- Carmi, A., Gurfil, P., Kanevsky, D., and Ramabhadran, B. (2009). " ABCS: Approximate Bayesian compressed sensing," Tech. Rep., Human Language Technologies, IBM, pp. 1-18.
- (2009) ABCS: Approximate Bayesian Compressed Sensing , pp. 1-18
- Carmi, A.¹ Gurfil, P.² Kanevsky, D.³ Ramabhadran, B.⁴

9
- 56349098310
- Algorithms for orthogonal nonnegative matrix factorization
- Choi, S. (2008). " Algorithms for orthogonal nonnegative matrix factorization," in Proceedings IJCNN, pp. 1828-1832.
- (2008) Proceedings IJCNN , pp. 1828-1832
- Choi, S.¹

10
- 33746239350
- Extended SMART algorithms for non-negative matrix factorization
- Cichocki, A., Amari, S. I., Zdunek, R., Kompass, R., Hori, G., and He, Z. (2006). " Extended SMART algorithms for non-negative matrix factorization," in Proceedings of ICAISC, pp. 548-562.
- (2006) Proceedings of ICAISC , pp. 548-562
- Cichocki, A.¹ Amari, S.I.² Zdunek, R.³ Kompass, R.⁴ Hori, G.⁵ He, Z.⁶

11
- 33645712892
- Compressed sensing
- 10.1109/TIT.2006.871582
- Donoho, D. L. (2006). " Compressed sensing," IEEE Trans. Inf. Theory 52, 1289-1306. 10.1109/TIT.2006.871582
- (2006) IEEE Trans. Inf. Theory , vol.52 , pp. 1289-1306
- Donoho, D.L.¹

12
- 10944227316
- Sparse coding and NMF
- Eggert, J., and Korner, E. (2004). " Sparse coding and NMF," in IEEE Int. Conf. Neural Networks 4, 2529-2533.
- (2004) IEEE Int. Conf. Neural Networks , vol.4 , pp. 2529-2533
- Eggert, J.¹ Korner, E.²

13
- 33845584374
- Image denoising via learned dictionaries and sparse representation
- Elad, M., and Aharon, M. (2006a). " Image denoising via learned dictionaries and sparse representation," in IEEE Comput. Soc. Conf. Comput. Vision Pattern Recognit. 1, 895-900.
- (2006) IEEE Comput. Soc. Conf. Comput. Vision Pattern Recognit. , vol.1 , pp. 895-900
- Elad, M.¹ Aharon, M.²

14
- 33751379736
- Image denoising via sparse and redundant representations over learned dictionaries
- 10.1109/TIP.2006.881969
- Elad, M., and Aharon, M. (2006b). " Image denoising via sparse and redundant representations over learned dictionaries," IEEE Trans. Image Proc. 15, 3736-3745. 10.1109/TIP.2006.881969
- (2006) IEEE Trans. Image Proc. , vol.15 , pp. 3736-3745
- Elad, M.¹ Aharon, M.²

15
- 70349196731
- Using sparse representations for missing data imputation in noise robust speech recognition
- Gemmeke, J., and Cranen, B. (2008). " Using sparse representations for missing data imputation in noise robust speech recognition," in Proceedings of EUSIPCO, pp. 1-5.
- (2008) Proceedings of EUSIPCO , pp. 1-5
- Gemmeke, J.¹ Cranen, B.²

16
- 77949695902
- Compressive sensing for missing data imputation in noise robust speech recognition
- 10.1109/JSTSP.2009.2039171
- Gemmeke, J., Van Hamme, H., Cranen, B., and Boves, L. (2010). " Compressive sensing for missing data imputation in noise robust speech recognition," IEEE J. Sel. Top. Signal Process. 4, 272-287. 10.1109/JSTSP.2009.2039171
- (2010) IEEE J. Sel. Top. Signal Process. , vol.4 , pp. 272-287
- Gemmeke, J.¹ Van Hamme, H.² Cranen, B.³ Boves, L.⁴

17
- 84878596951
- Ph.D. thesis, Radboud University Nijmegen, The Netherlands
- Gemmeke, J. F. (2011). " Noise robust ASR: missing data techniques and beyond," Ph.D. thesis, Radboud University Nijmegen, The Netherlands, pp. 1-169.
- (2011) Noise Robust ASR: Missing Data Techniques and beyond , pp. 1-169
- Gemmeke, J.F.¹

18
- 84863733079
- Using sparse representations for exemplar based continuous digit recognition
- Gemmeke, J. F., ten Bosch, L., Boves, L., and Cranen, B. (2009). " Using sparse representations for exemplar based continuous digit recognition," in Proceedings of EUSIPCO, pp. 1755-1759.
- (2009) Proceedings of EUSIPCO , pp. 1755-1759
- Gemmeke, J.F.¹ Ten Bosch, L.² Boves, L.³ Cranen, B.⁴

19
- 79960657803
- Exemplar-based sparse representations for noise robust automatic speech recognition
- 10.1109/TASL.2011.2112350
- Gemmeke, J. F., Virtanen, T., and Hurmalainen, A. (2011). " Exemplar-based sparse representations for noise robust automatic speech recognition," IEEE Trans. Audio, Speech, Lang. Process. 19, 2067-2080. 10.1109/TASL.2011.2112350
- (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , pp. 2067-2080
- Gemmeke, J.F.¹ Virtanen, T.² Hurmalainen, A.³

20
- 84905701689
- Last viewed 5/30/13
- Grindlay, G. (2010). " NMFLib," Available: http://code.google.com/p/nmflib/ (Last viewed 5/30/13).
- (2010) NMFLib
- Grindlay, G.¹

21
- 84885412715
- An algorithm to improve speech recognition in noise for hearing-impaired listeners
- 10.1121/1.4820893
- Healy, E. W., Yoho, S. E., Wang, Y., and Wang, D. L. (2013). " An algorithm to improve speech recognition in noise for hearing-impaired listeners," J. Acoust. Soc. Am. 134, 3029-3038. 10.1121/1.4820893
- (2013) J. Acoust. Soc. Am. , vol.134 , pp. 3029-3038
- Healy, E.W.¹ Yoho, S.E.² Wang, Y.³ Wang, D.L.⁴

22
- 0003639435
- ITU-T
- ITU-T. (2001). " Perceptual evaluation of speech quality (PESQ), an objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs," ITU-T Recommendation P.862.
- (2001) Perceptual Evaluation of Speech Quality (PESQ), An Objective Method for End-to-end Speech Quality Assessment of Narrowband Telephone Networks and Speech Codecs , ITU-T Rec. P.862

23
- 70349093614
- An algorithm that improves speech intelligibility in noise for normal-hearing listeners
- 10.1121/1.3184603
- Kim, G., Lu, Y., Hu, Y., and Loizou, P. (2009). " An algorithm that improves speech intelligibility in noise for normal-hearing listeners," J. Acoust. Soc. Am. 126, 1486-1494. 10.1121/1.3184603
- (2009) J. Acoust. Soc. Am. , vol.126 , pp. 1486-1494
- Kim, G.¹ Lu, Y.² Hu, Y.³ Loizou, P.⁴

24
- 0033592606
- Learning the parts of objects by non-negative matrix factorization
- 10.1038/44565
- Lee, D., and Seung, H. S. (1999). " Learning the parts of objects by non-negative matrix factorization," Nature 401, 788-791. 10.1038/44565
- (1999) Nature , vol.401 , pp. 788-791
- Lee, D.¹ Seung, H.S.²

25
- 40749125179
- Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction
- 10.1121/1.2832617
- Li, N., and Loizou, P. (2008). " Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction," J. Acoust. Soc. Am. 123, 1673-1682. 10.1121/1.2832617
- (2008) J. Acoust. Soc. Am. , vol.123 , pp. 1673-1682
- Li, N.¹ Loizou, P.²

26
- 0018918171
- An algorithm for vector quantizer design
- 10.1109/TCOM.1980.1094577
- Linde, Y., Buzo, A., and Gray, R. M. (1980). " An algorithm for vector quantizer design," IEEE Trans. Commun. 28, 84-95. 10.1109/TCOM.1980.1094577
- (1980) IEEE Trans. Commun. , vol.28 , pp. 84-95
- Linde, Y.¹ Buzo, A.² Gray, R.M.³

27
- 51449112795
- Temporal smoothing of spectral masks in the cepstral domain for speech separation
- Madhu, N., Breithaupt, C., and Martin, R. (2008). " Temporal smoothing of spectral masks in the cepstral domain for speech separation," in Proceedings of ICASSP, pp. 45-48.
- (2008) Proceedings of ICASSP , pp. 45-48
- Madhu, N.¹ Breithaupt, C.² Martin, R.³

28
- 71149119964
- Online dictionary learning for sparse coding
- Mairal, J., Bach, F., Ponce, J., and Sapiro, G. (2009). " Online dictionary learning for sparse coding," International Conference on Machine Learning, pp. 689-696.
- (2009) International Conference on Machine Learning , pp. 689-696
- Mairal, J.¹ Bach, F.² Ponce, J.³ Sapiro, G.⁴

29
- 76749107542
- Online learning for matrix factorization and sparse coding
- Mairal, J., Bach, F., Ponce, J., and Sapiro, G. (2010). " Online learning for matrix factorization and sparse coding," J. Mach. Learn. Res. 11, 19-60.
- (2010) J. Mach. Learn. Res. , vol.11 , pp. 19-60
- Mairal, J.¹ Bach, F.² Ponce, J.³ Sapiro, G.⁴

30
- 39149089704
- Sparse representation for color image restoration
- 10.1109/TIP.2007.911828
- Mairal, J., Elad, M., and Sapiro, G. (2008). " Sparse representation for color image restoration," IEEE Trans. Image Process. 17, 53-69. 10.1109/TIP.2007.911828
- (2008) IEEE Trans. Image Process. , vol.17 , pp. 53-69
- Mairal, J.¹ Elad, M.² Sapiro, G.³

31
- 0003789815
- 5th ed. (Academic, San Diego, CA), Chap. 3
- Moore, B. C. J. (2003). An Introduction to the Psychology of Hearing, 5th ed. (Academic, San Diego, CA), Chap. 3, pp. 89-147.
- (2003) An Introduction to the Psychology of Hearing , pp. 89-147
- Moore, B.C.J.¹

32
- 84865693405
- A joint approach for single-channel speaker identification and speech separation
- 10.1109/TASL.2012.2208627
- Mowlaee, P., Saeidi, R., Christensen, M. G., Tan, Z., Kinnunen, T., Franti, P., and Jensen, S. H. (2012). " A joint approach for single-channel speaker identification and speech separation," IEEE Trans. Audio, Speech, Lang. Process. 20, 2586-2601. 10.1109/TASL.2012.2208627
- (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , pp. 2586-2601
- Mowlaee, P.¹ Saeidi, R.² Christensen, M.G.³ Tan, Z.⁴ Kinnunen, T.⁵ Franti, P.⁶ Jensen, S.H.⁷

33
- 0027814133
- Orthogonal matching pursuit: Recursive function approximation with applications to wavelet decomposition
- Pati, Y. C., Rezaiifar, R., and Krishnaprasad, P. S. (1993). " Orthogonal matching pursuit: Recursive function approximation with applications to wavelet decomposition," in Proceedings of the 27th Annual Asilomar Conference on Signals, Systems and Computers, Vol. 1, 40-44.
- (1993) Proceedings of the 27th Annual Asilomar Conference on Signals, Systems and Computers , vol.1 , pp. 40-44
- Pati, Y.C.¹ Rezaiifar, R.² Krishnaprasad, P.S.³

34
- 34250023466
- Monaural speech segregation based on fusion of source-driven with model-driven techniques
- 10.1016/j.specom.2007.04.007
- Radfar, M. H., Dansereau, R. M., and Sayadiyan, A. (2007). " Monaural speech segregation based on fusion of source-driven with model-driven techniques," Speech Commun. 49, 464-476. 10.1016/j.specom.2007.04.007
- (2007) Speech Commun. , vol.49 , pp. 464-476
- Radfar, M.H.¹ Dansereau, R.M.² Sayadiyan, A.³

35
- 4644336054
- Reconstruction of missing features for robust speech recognition
- 10.1016/j.specom.2004.03.007
- Raj, B., Seltzer, M. L., and Stern, R. M. (2004). " Reconstruction of missing features for robust speech recognition," Speech Commun. 43, 275-296. 10.1016/j.specom.2004.03.007
- (2004) Speech Commun. , vol.43 , pp. 275-296
- Raj, B.¹ Seltzer, M.L.² Stern, R.M.³

36
- 79959818117
- Non-negative matrix factorization based compensation of music for automatic speech recognition
- Raj, B., Virtanen, T., Chaudhuri, S., and Singh, R. (2010). " Non-negative matrix factorization based compensation of music for automatic speech recognition," in Proceedings of Interspeech, pp. 717-720.
- (2010) Proceedings of Interspeech , pp. 717-720
- Raj, B.¹ Virtanen, T.² Chaudhuri, S.³ Singh, R.⁴

37
- 0014568991
- IEEE recommended practice for speech quality measurements
- 10.1109/TAU.1969.1162058
- Rothauser, E. H., Chapman, W. D., Guttman, N., Hecker, M. H. L., Nordby, K. S., Silbiger, H. R., Urbanek, G. E., and Weinstock, M. (1969). " IEEE recommended practice for speech quality measurements," IEEE Trans. Audio Electroacoust. 17, 225-246. 10.1109/TAU.1969.1162058
- (1969) IEEE Trans. Audio Electroacoust. , vol.17 , pp. 225-246
- Rothauser, E.H.¹ Chapman, W.D.² Guttman, N.³ Hecker, M.H.L.⁴ Nordby, K.S.⁵ Silbiger, H.R.⁶ Urbanek, G.E.⁷ Weinstock, M.⁸

38
- 80053610626
- Exemplar-based sparse representation features: From TIMIT to LVCSR
- 10.1109/TASL.2011.2155060
- Sainath, T. N., Ramabhadran, B., Picheny, M., Nahamoo, D., and Kanevsky, D. (2011). " Exemplar-based sparse representation features: From TIMIT to LVCSR," IEEE Trans. Audio, Speech, Lang. Process. 19, 2598-2613. 10.1109/TASL.2011.2155060
- (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , pp. 2598-2613
- Sainath, T.N.¹ Ramabhadran, B.² Picheny, M.³ Nahamoo, D.⁴ Kanevsky, D.⁵

39
- 78651087442
- Tech. Report
- Schmidt, M. (2007). " Speech separation using non-negative feature and sparse non-negative matrix factorization," Tech. Report, pp. 1-15.
- (2007) Speech Separation Using Non-negative Feature and Sparse Non-negative Matrix Factorization , pp. 1-15
- Schmidt, M.¹

40
- 50249173994
- Linear regression on sparse features for single-channel speech separation
- Schmidt, M. N., and Olsson, R. K. (2007). " Linear regression on sparse features for single-channel speech separation," IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 26-29.
- (2007) IEEE Workshop on Applications of Signal Processing to Audio and Acoustics , pp. 26-29
- Schmidt, M.N.¹ Olsson, R.K.²

41
- 84898964201
- Algorithms for non-negative matrix factorization
- Seung, H. S., and Lee, D. (2001). " Algorithms for non-negative matrix factorization," Adv. Neural Inf. Process. Syst. 13, 556-562.
- (2001) Adv. Neural Inf. Process. Syst. , vol.13 , pp. 556-562
- Seung, H.S.¹ Lee, D.²

42
- 34547511508
- Sparse overcomplete decomposition for single channel speaker separation
- Shashanka, M. V. S., Raj, B., and Smaragdis, P. (2007). " Sparse overcomplete decomposition for single channel speaker separation," in Proceedings of ICASSP, pp. 641-644.
- (2007) Proceedings of ICASSP , pp. 641-644
- Shashanka, M.V.S.¹ Raj, B.² Smaragdis, P.³

43
- 35048843291
- Non negative matrix factor deconvolution: Extraction of multiple sound sources from monophonic inputs
- Smaragdis, P. (2004). " Non negative matrix factor deconvolution: extraction of multiple sound sources from monophonic inputs," Independent Component Analysis and Blind Signal Separation, pp. 494-499.
- (2004) Independent Component Analysis and Blind Signal Separation , pp. 494-499
- Smaragdis, P.¹

44
- 38049021850
- Convolutive speech bases and their application to supervised speech separation
- 10.1109/TASL.2006.876726
- Smaragdis, P. (2007). " Convolutive speech bases and their application to supervised speech separation," IEEE Trans. Audio, Speech, Lang. Process. 15, 1-12. 10.1109/TASL.2006.876726
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , pp. 1-12
- Smaragdis, P.¹

45
- 33750311718
- Binary and ratio time-frequency masks for robust speech recognition
- 10.1016/j.specom.2006.09.003
- Srinivasan, S., Roman, N., and Wang, D. L. (2006). " Binary and ratio time-frequency masks for robust speech recognition," Speech Commun. 48, 1486-1501. 10.1016/j.specom.2006.09.003
- (2006) Speech Commun. , vol.48 , pp. 1486-1501
- Srinivasan, S.¹ Roman, N.² Wang, D.L.³

46
- 79960916745
- An algorithm for intelligibility prediction of time frequency weighted noisy speech
- 10.1109/TASL.2011.2114881
- Taal, C. H., Hendriks, R. C., Heusdens, R., and Jensen, J. (2011). " An algorithm for intelligibility prediction of time frequency weighted noisy speech," IEEE Trans. Audio, Speech, Lang. Process. 19, 2125-2136. 10.1109/TASL.2011.2114881
- (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , pp. 2125-2136
- Taal, C.H.¹ Hendriks, R.C.² Heusdens, R.³ Jensen, J.⁴

47
- 50249152311
- Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria
- 10.1109/TASL.2006.885253
- Virtanen, T. (2007). " Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria," IEEE Trans. Audio, Speech, Lang. Process. 15, 1066-1074. 10.1109/TASL.2006.885253
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , pp. 1066-1074
- Virtanen, T.¹

48
- 84892233308
- On ideal binary mask as the computational goal of auditory scene analysis
- edited by P. Divenyi (Kluwer Academic, Norwell, MA)
- Wang, D. L. (2005). " On ideal binary mask as the computational goal of auditory scene analysis," in Speech Separation by Humans and Machines, edited by P. Divenyi (Kluwer Academic, Norwell, MA), pp. 181-197.
- (2005) Speech Separation by Humans and Machines , pp. 181-197
- Wang, D.L.¹

49
- 56249144201
- Time-frequency masking for speech separation and its potential for hearing aid design
- 10.1177/1084713808326455
- Wang, D. L. (2008). " Time-frequency masking for speech separation and its potential for hearing aid design," Trends Amplif. 12, 332-353. 10.1177/1084713808326455
- (2008) Trends Amplif. , vol.12 , pp. 332-353
- Wang, D.L.¹

50
- 82255178542
- Fundamentals of computational auditory scene analysis
- Eds. (Wiley-IEEE Press, Hoboken, NJ), Chap. 1
- Wang, D. L., and Brown, G., Eds. (2006). " Fundamentals of computational auditory scene analysis," in Computational Auditory Scene Analysis: Principles, Algorithms, and Applications (Wiley-IEEE Press, Hoboken, NJ), Chap. 1, pp. 1-37.
- (2006) Computational Auditory Scene Analysis: Principles, Algorithms, and Applications , pp. 1-37
- Wang, D.L.¹ Brown, G.²

51
- 64649103540
- Speech intelligibility in background noise with ideal binary time-frequency masking
- 10.1121/1.3083233
- Wang, D. L., Kjems, U., Pedersen, M. S., Boldt, J. B., and Lunner, T. (2009). " Speech intelligibility in background noise with ideal binary time-frequency masking," J. Acoust. Soc. Am. 125, 2336-2347. 10.1121/1.3083233
- (2009) J. Acoust. Soc. Am. , vol.125 , pp. 2336-2347
- Wang, D.L.¹ Kjems, U.² Pedersen, M.S.³ Boldt, J.B.⁴ Lunner, T.⁵

52
- 84870477511
- Exploring monaural features for classification-based speech segregation
- 10.1109/TASL.2012.2221459
- Wang, Y., Han, K., and Wang, D. L. (2013). " Exploring monaural features for classification-based speech segregation," IEEE Trans. Audio, Speech, Lang. Process. 21, 270-279. 10.1109/TASL.2012.2221459
- (2013) IEEE Trans. Audio, Speech, Lang. Process. , vol.21 , pp. 270-279
- Wang, Y.¹ Han, K.² Wang, D.L.³

53
- 84875678689
- Towards scaling up classification-based speech separation
- 10.1109/TASL.2013.2250961
- Wang, Y., and Wang, D. L. (2013). " Towards scaling up classification-based speech separation," IEEE Trans. Audio, Speech, Lang. Process. 21, 1381-1390. 10.1109/TASL.2013.2250961
- (2013) IEEE Trans. Audio, Speech, Lang. Process. , vol.21 , pp. 1381-1390
- Wang, Y.¹ Wang, D.L.²

54
- 51449092704
- Speech denoising using nonnegative matrix factorization with priors
- Wilson, K., Raj, B., Smaragdis, P., and Divakaran, A. (2008). " Speech denoising using nonnegative matrix factorization with priors," in Proceedings of ICASSP, pp. 4029-4032.
- (2008) Proceedings of ICASSP , pp. 4029-4032
- Wilson, K.¹ Raj, B.² Smaragdis, P.³ Divakaran, A.⁴

55
- 84859024513
- CASA-based robust speaker identification
- 10.1109/TASL.2012.2186803
- Zhao, X., Shao, Y., and Wang, D. L. (2012). " CASA-based robust speaker identification," IEEE Trans. Audio, Speech, Lang. Process. 20, 1608-1616. 10.1109/TASL.2012.2186803
- (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , pp. 1608-1616
- Zhao, X.¹ Shao, Y.² Wang, D.L.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.