SCOPUS Information Retrieval Platform

Journal of the Acoustical Society of America

Volume 132, Issue 5, 2012, Pages 3475-3483

A classification based approach to speech segregation

(2) Han, Kun (a); Wang, Deliang (a)

(a) Ohio State University (United States)

Author keywords

[No Author keywords available]

Indexed keywords

BACKGROUND NOISE; CLASSIFICATION ACCURACY; CLASSIFICATION APPROACH; CLASSIFICATION RESULTS; COMPUTATIONAL AUDITORY SCENE ANALYSIS; FALSE ALARM RATE; HIGH QUALITY; IDEAL BINARY MASK; INTRINSIC PROPERTY; SOUND SEGREGATION; SPEECH SEGREGATION; SYSTEMATIC EVALUATION; TARGET SPEECH; TIME FREQUENCY;

SPEECH COMMUNICATION; SPEECH INTELLIGIBILITY;

SPEECH PROCESSING;

ARTICLE; AUTOMATIC SPEECH RECOGNITION; CLASSIFICATION; COMPARATIVE STUDY; EVALUATION; FEMALE; HUMAN; MALE; NOISE; SIGNAL PROCESSING; SOUND DETECTION; SPEECH; SPEECH INTELLIGIBILITY; SUPPORT VECTOR MACHINE; TIME; VOICE;

FEMALE; HUMANS; MALE; NOISE; SIGNAL PROCESSING, COMPUTER-ASSISTED; SOUND SPECTROGRAPHY; SPEECH ACOUSTICS; SPEECH INTELLIGIBILITY; SPEECH RECOGNITION SOFTWARE; SUPPORT VECTOR MACHINES; TIME FACTORS; VOICE QUALITY;

EID: 84869105129 PISSN: 00014966 EISSN: None Source Type: Journal
DOI: 10.1121/1.4754541 Document Type: Article

Times cited: 99

References (29)

1
- 22944452794
- Applying support vector machines to imbalanced datasets
- Akbani, R., Kwek, S., and Japkowicz, N. (2004). Applying support vector machines to imbalanced datasets, in Proceedings of the 15th European Conference on Machine Learning, pp. 39-50.
- (2004) Proceedings of the 15th European Conference on Machine Learning, pp. 39-50
- Akbani, R.¹ Kwek, S.² Japkowicz, N.³

2
- 33748523481
- Determination of the potential benefit of time-frequency gain manipulation
- 10.1097/01.aud.0000233891.86809.df
- Anzalone, M. C., Calandruccio, L., Doherty, K. A., and Carney, L. H. (2006). Determination of the potential benefit of time-frequency gain manipulation, Ear Hear. 27, 480-492. 10.1097/01.aud.0000233891.86809.df
- (2006) Ear Hear., vol. 27, pp. 480-492
- Anzalone, M.C.¹ Calandruccio, L.² Doherty, K.A.³ Carney, L.H.⁴

3
- 84869143719
- praat: Doing phonetics by computer (version 4.5) [computer program], http://www.fon.hum.uva.nl/praat (Last viewed November 2010)
- Boersma, P., and Weenink, D. (2007). praat: Doing phonetics by computer (version 4.5) [computer program], http://www.fon.hum.uva.nl/praat (Last viewed November 2010).
- (2007)
- Boersma, P.¹ Weenink, D.²

4
- 18744396499
- Technical Report MSR-TR-2003-34, Microsoft Corp.
- Brank, J., Grobelnik, M., Milic-Frayling, N., and Mladenic, D. (2003). Training text classifiers with SVM on very few positive examples, Technical Report MSR-TR-2003-34, Microsoft Corp.
- (2003) Training Text Classifiers with SVM on Very Few Positive Examples
- Brank, J.¹ Grobelnik, M.² Milic-Frayling, N.³ Mladenic, D.⁴

5
- 0003684441
- (The MIT Press, Cambridge, MA), Chap.
- Bregman, A. S. (1990). Auditory Scene Analysis (The MIT Press, Cambridge, MA), Chap.
- (1990) Auditory Scene Analysis
- Bregman, A.S.¹

6
- 33845354768
- Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation
- 10.1121/1.2363929
- Brungart, D. S., Chang, P. S., Simpson, B. D., and Wang, D. L. (2006). Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation, J. Acoust. Soc. Am. 120, 4007-4018. 10.1121/1.2363929
- (2006) J. Acoust. Soc. Am., vol. 120, pp. 4007-4018
- Brungart, D.S.¹ Chang, P.S.² Simpson, B.D.³ Wang, D.L.⁴

7
- 0003710380
- http://www.csie.ntu.edu.tw/~cjlin/libsvm (Last viewed November 2010)
- Chang, C. C., and Lin, C. J. (2001). LIBSVM: A library for support vector machines, http://www.csie.ntu.edu.tw/~cjlin/libsvm (Last viewed November 2010).
- (2001) LIBSVM: A Library for Support Vector Machines
- Chang, C.C.¹ Lin, C.J.²

8
- 0003548585
- National Institute of Standards and Technology, NISTIR 4930
- Garofolo, J. S., Lamel, L. F., Fisher, W. M., Fiscus, J. G., Pallett, D. S., Dahlgren, N. L., and Zue, V. (1993). DARPA TIMIT acoustic-phonetic continuous speech corpus, National Institute of Standards and Technology, NISTIR 4930.
- (1993) DARPA TIMIT Acoustic-phonetic Continuous Speech Corpus
- Garofolo, J.S.¹ Lamel, L.F.² Fisher, W.M.³ Fiscus, J.G.⁴ Pallett, D.S.⁵ Dahlgren, N.L.⁶ Zue, V.⁷

9
- 67650246505
- (Prentice Hall, New York), Chap.
- Haykin, S. S. (2009). Neural Networks and Learning Machines (Prentice Hall, New York), Chap.
- (2009) Neural Networks and Learning Machines
- Haykin, S.S.¹

10
- 4644265990
- Monaural speech segregation based on pitch tracking and amplitude modulation
- 10.1109/TNN.2004.832812
- Hu, G., and Wang, D. L. (2004). Monaural speech segregation based on pitch tracking and amplitude modulation, IEEE Trans. Neural Netw. 15, 1135-1150. 10.1109/TNN.2004.832812
- (2004) IEEE Trans. Neural Netw., vol. 15, pp. 1135-1150
- Hu, G.¹ Wang, D.L.²

11
- 38849102154
- Auditory segmentation based on onset and offset analysis
- 10.1109/TASL.2006.881700
- Hu, G., and Wang, D. L. (2007). Auditory segmentation based on onset and offset analysis, IEEE Trans. Audio, Speech, Lang. Process. 15, 396-405. 10.1109/TASL.2006.881700
- (2007) IEEE Trans. Audio, Speech, Lang. Process., vol. 15, pp. 396-405
- Hu, G.¹ Wang, D.L.²

12
- 49249107353
- Segregation of unvoiced speech from nonspeech interference
- 10.1121/1.2939132
- Hu, G., and Wang, D. L. (2008). Segregation of unvoiced speech from nonspeech interference, J. Acoust. Soc. Am. 124, 1306-1319. 10.1121/1.2939132
- (2008) J. Acoust. Soc. Am., vol. 124, pp. 1306-1319
- Hu, G.¹ Wang, D.L.²

13
- 77955695149
- A tandem algorithm for pitch estimation and voiced speech segregation
- 10.1109/TASL.2010.2041110
- Hu, G., and Wang, D. L. (2010). A tandem algorithm for pitch estimation and voiced speech segregation, IEEE Trans. Audio, Speech, Lang. Process. 18, 2067-2079. 10.1109/TASL.2010.2041110
- (2010) IEEE Trans. Audio, Speech, Lang. Process., vol. 18, pp. 2067-2079
- Hu, G.¹ Wang, D.L.²

14
- 65249103478
- A supervised learning approach to monaural segregation of reverberant speech
- 10.1109/TASL.2008.2010633
- Jin, Z., and Wang, D. L. (2009). A supervised learning approach to monaural segregation of reverberant speech, IEEE Trans. Audio, Speech, Lang. Process. 17, 625-638. 10.1109/TASL.2008.2010633
- (2009) IEEE Trans. Audio, Speech, Lang. Process., vol. 17, pp. 625-638
- Jin, Z.¹ Wang, D.L.²

15
- 78049365070
- A multipitch tracking algorithm for noisy and reverberant speech
- Jin, Z., and Wang, D. L. (2010). A multipitch tracking algorithm for noisy and reverberant speech, in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 4218-4221.
- (2010) Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 4218-4221
- Jin, Z.¹ Wang, D.L.²

16
- 70349093614
- An algorithm that improves speech intelligibility in noise for normal-hearing listeners
- 10.1121/1.3184603
- Kim, G., Lu, Y., Hu, Y., and Loizou, P. C. (2009). An algorithm that improves speech intelligibility in noise for normal-hearing listeners, J. Acoust. Soc. Am. 126, 1486-1494. 10.1121/1.3184603
- (2009) J. Acoust. Soc. Am., vol. 126, pp. 1486-1494
- Kim, G.¹ Lu, Y.² Hu, Y.³ Loizou, P.C.⁴

17
- 40749125179
- Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction
- 10.1121/1.2832617
- Li, N., and Loizou, P. C. (2008). Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction, J. Acoust. Soc. Am. 123, 1673-1682. 10.1121/1.2832617
- (2008) J. Acoust. Soc. Am., vol. 123, pp. 1673-1682
- Li, N.¹ Loizou, P.C.²

18
- 0142056390
- Technical Report No. 2341, MRC Applied Psychology Unit
- Patterson, R. D., Nimmo-Smith, I., Holdsworth, J., and Rice, P. (1988). An efficient auditory filterbank based on the gammatone function, Technical Report No. 2341, MRC Applied Psychology Unit.
- (1988) An Efficient Auditory Filterbank Based on the Gammatone Function
- Patterson, R.D.¹ Nimmo-Smith, I.² Holdsworth, J.³ Rice, P.⁴

19
- 0003243224
- Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods
- (The MIT Press, Cambridge, MA)
- Platt, J. C. (1999). Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods, in Advances in Large Margin Classifiers (The MIT Press, Cambridge, MA), pp. 61-74.
- (1999) Advances in Large Margin Classifiers, pp. 61-74
- Platt, J.C.¹

20
- 0142026377
- Speech segregation based on sound localization
- 10.1121/1.1610463
- Roman, N., Wang, D. L., and Brown, G. J. (2003). Speech segregation based on sound localization, J. Acoust. Soc. Am. 114, 2236-2252. 10.1121/1.1610463
- (2003) J. Acoust. Soc. Am., vol. 114, pp. 2236-2252
- Roman, N.¹ Wang, D.L.² Brown, G.J.³

21
- 0014568991
- IEEE recommended practice for speech quality measurements
- Rothauser, E. H., Chapman, W. D., Guttman, N., Nordby, K. S., Silbiger, H. R., Urbanek, G. E., and Weinstock, M. (1969). IEEE recommended practice for speech quality measurements, IEEE Trans. Audio Electroacoust. 17, 227-246.
- (1969) IEEE Trans. Audio Electroacoust., vol. 17, pp. 227-246
- Rothauser, E.H.¹ Chapman, W.D.² Guttman, N.³ Nordby, K.S.⁴ Silbiger, H.R.⁵ Urbanek, G.E.⁶ Weinstock, M.⁷

22
- 70350565063
- On strategies for imbalanced text classification using SVM: A comparative study
- 10.1016/j.dss.2009.07.011
- Sun, A., Lim, E. P., and Liu, Y. (2009). On strategies for imbalanced text classification using SVM: A comparative study, Decision Support Syst. 48, 191-201. 10.1016/j.dss.2009.07.011
- (2009) Decision Support Syst., vol. 48, pp. 191-201
- Sun, A.¹ Lim, E.P.² Liu, Y.³

23
- 0038712550
- SNR estimation based on amplitude modulation analysis with applications to noise suppression
- 10.1109/TSA.2003.811542
- Tchorz, J., and Kollmeier, B. (2003). SNR estimation based on amplitude modulation analysis with applications to noise suppression, IEEE Trans. Speech Audio Process. 11, 184-192. 10.1109/TSA.2003.811542
- (2003) IEEE Trans. Speech Audio Process., vol. 11, pp. 184-192
- Tchorz, J.¹ Kollmeier, B.²

24
- 0003450542
- (Springer-Verlag, New York), Chap.
- Vapnik, V. N. (2000). The Nature of Statistical Learning Theory (Springer-Verlag, New York), Chap.
- (2000) The Nature of Statistical Learning Theory
- Vapnik, V.N.¹

25
- 84892233308
- On ideal binary mask as the computational goal of auditory scene analysis
- edited by P. Divenyi (Kluwer Academic, Dordrecht)
- Wang, D. L. (2005). On ideal binary mask as the computational goal of auditory scene analysis, in Speech Separation by Humans and Machines, edited by P. Divenyi (Kluwer Academic, Dordrecht), pp. 181-197.
- (2005) Speech Separation by Humans and Machines, pp. 181-197
- Wang, D.L.¹

26
- 82255178542
- Fundamentals of computational auditory scene analysis
- edited by D. L. Wang and G. J. Brown (Wiley and Sons, Hoboken, NJ), Chap.
- Wang, D. L., and Brown, G. J. (2006). Fundamentals of computational auditory scene analysis, in Computational Auditory Scene Analysis: Principles, Algorithms and Applications, edited by D. L. Wang and G. J. Brown (Wiley and Sons, Hoboken, NJ), Chap., pp. 1-37.
- (2006) Computational Auditory Scene Analysis: Principles, Algorithms and Applications, pp. 1-37
- Wang, D.L.¹ Brown, G.J.²

27
- 64649103540
- Speech intelligibility in background noise with ideal binary time-frequency masking
- 10.1121/1.3083233
- Wang, D. L., Kjems, U., Pedersen, M. S., Boldt, J. B., and Lunner, T. (2009). Speech intelligibility in background noise with ideal binary time-frequency masking, J. Acoust. Soc. Am. 125, 2336-2347. 10.1121/1.3083233
- (2009) J. Acoust. Soc. Am., vol. 125, pp. 2336-2347
- Wang, D.L.¹ Kjems, U.² Pedersen, M.S.³ Boldt, J.B.⁴ Lunner, T.⁵

28
- 20844441675
- KBA: Kernel boundary alignment considering imbalanced data distribution
- 10.1109/TKDE.2005.95
- Wu, G., and Chang, E. (2005). KBA: Kernel boundary alignment considering imbalanced data distribution, IEEE Trans. Knowl. Data Eng. 17, 786-795. 10.1109/TKDE.2005.95
- (2005) IEEE Trans. Knowl. Data Eng., vol. 17, pp. 786-795
- Wu, G.¹ Chang, E.²

29
- 0026041732
- Gender recognition from speech. Part I: Coarse analysis
- 10.1121/1.401663
- Wu, K., and Childers, D. G. (1991). Gender recognition from speech. Part I: Coarse analysis, J. Acoust. Soc. Am. 90, 1828-1840. 10.1121/1.401663
- (1991) J. Acoust. Soc. Am., vol. 90, pp. 1828-1840
- Wu, K.¹ Childers, D.G.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.