메뉴 건너뛰기




Volumn 132, Issue 5, 2012, Pages 3475-3483

A classification based approach to speech segregation

Author keywords

[No Author keywords available]

Indexed keywords

BACKGROUND NOISE; CLASSIFICATION ACCURACY; CLASSIFICATION APPROACH; CLASSIFICATION RESULTS; COMPUTATIONAL AUDITORY SCENE ANALYSIS; FALSE ALARM RATE; HIGH QUALITY; IDEAL BINARY MASK; INTRINSIC PROPERTY; SOUND SEGREGATION; SPEECH SEGREGATION; SYSTEMATIC EVALUATION; TARGET SPEECH; TIME FREQUENCY;

EID: 84869105129     PISSN: 00014966     EISSN: None     Source Type: Journal    
DOI: 10.1121/1.4754541     Document Type: Article
Times cited : (99)

References (29)
  • 2
    • 33748523481 scopus 로고    scopus 로고
    • Determination of the potential benefit of time-frequency gain manipulation
    • 10.1097/01.aud.0000233891.86809.df
    • Anzalone, M. C., Calandruccio, L., Doherty, K. A., and Carney, L. H. (2006). Determination of the potential benefit of time-frequency gain manipulation., Ear Hear. 27, 480-492. 10.1097/01.aud.0000233891.86809.df
    • (2006) Ear Hear. , vol.27 , pp. 480-492
    • Anzalone, M.C.1    Calandruccio, L.2    Doherty, K.A.3    Carney, L.H.4
  • 3
    • 84869143719 scopus 로고    scopus 로고
    • praat: Doing phonetics by computer (version 4.5) [computer program], http://www.fon.hum.uva.nl/praat (Last viewed November 2010)
    • Boersma, P., and Weenink, D. (2007). praat: Doing phonetics by computer (version 4.5) [computer program], http://www.fon.hum.uva.nl/praat (Last viewed November 2010).
    • (2007)
    • Boersma, P.1    Weenink, D.2
  • 6
    • 33845354768 scopus 로고    scopus 로고
    • Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation
    • 10.1121/1.2363929
    • Brungart, D. S., Chang, P. S., Simpson, B. D., and Wang, D. L. (2006). Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation., J. Acoust. Soc. Am. 120, 4007-4018. 10.1121/1.2363929
    • (2006) J. Acoust. Soc. Am. , vol.120 , pp. 4007-4018
    • Brungart, D.S.1    Chang, P.S.2    Simpson, B.D.3    Wang, D.L.4
  • 10
    • 4644265990 scopus 로고    scopus 로고
    • Monaural speech segregation based on pitch tracking and amplitude modulation
    • 10.1109/TNN.2004.832812
    • Hu, G., and Wang, D. L. (2004). Monaural speech segregation based on pitch tracking and amplitude modulation., IEEE Trans. Neural Netw. 15, 1135-1150. 10.1109/TNN.2004.832812
    • (2004) IEEE Trans. Neural Netw. , vol.15 , pp. 1135-1150
    • Hu, G.1    Wang, D.L.2
  • 11
    • 38849102154 scopus 로고    scopus 로고
    • Auditory segmentation based on onset and offset analysis
    • 10.1109/TASL.2006.881700
    • Hu, G., and Wang, D. L. (2007). Auditory segmentation based on onset and offset analysis., IEEE Trans. Audio, Speech, Lang. Process. 15, 396-405. 10.1109/TASL.2006.881700
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , pp. 396-405
    • Hu, G.1    Wang, D.L.2
  • 12
    • 49249107353 scopus 로고    scopus 로고
    • Segregation of unvoiced speech from nonspeech interference
    • 10.1121/1.2939132
    • Hu, G., and Wang, D. L. (2008). Segregation of unvoiced speech from nonspeech interference., J. Acoust. Soc. Am. 124, 1306-1319. 10.1121/1.2939132
    • (2008) J. Acoust. Soc. Am. , vol.124 , pp. 1306-1319
    • Hu, G.1    Wang, D.L.2
  • 13
    • 77955695149 scopus 로고    scopus 로고
    • A tandem algorithm for pitch estimation and voiced speech segregation
    • 10.1109/TASL.2010.2041110
    • Hu, G., and Wang, D. L. (2010). A tandem algorithm for pitch estimation and voiced speech segregation., IEEE Trans. Audio, Speech, Lang. Process. 18, 2067-2079. 10.1109/TASL.2010.2041110
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , pp. 2067-2079
    • Hu, G.1    Wang, D.L.2
  • 14
    • 65249103478 scopus 로고    scopus 로고
    • A supervised learning approach to monaural segregation of reverberant speech
    • 10.1109/TASL.2008.2010633
    • Jin, Z., and Wang, D. L. (2009). A supervised learning approach to monaural segregation of reverberant speech., IEEE Trans. Audio, Speech, Lang. Process. 17, 625-638. 10.1109/TASL.2008.2010633
    • (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , pp. 625-638
    • Jin, Z.1    Wang, D.L.2
  • 16
    • 70349093614 scopus 로고    scopus 로고
    • An algorithm that improves speech intelligibility in noise for normal-hearing listeners
    • 10.1121/1.3184603
    • Kim, G., Lu, Y., Hu, Y., and Loizou, P. C. (2009). An algorithm that improves speech intelligibility in noise for normal-hearing listeners., J. Acoust. Soc. Am. 126, 1486-1494. 10.1121/1.3184603
    • (2009) J. Acoust. Soc. Am. , vol.126 , pp. 1486-1494
    • Kim, G.1    Lu, Y.2    Hu, Y.3    Loizou, P.C.4
  • 17
    • 40749125179 scopus 로고    scopus 로고
    • Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction
    • 10.1121/1.2832617
    • Li, N., and Loizou, P. C. (2008). Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction., J. Acoust. Soc. Am. 123, 1673-1682. 10.1121/1.2832617
    • (2008) J. Acoust. Soc. Am. , vol.123 , pp. 1673-1682
    • Li, N.1    Loizou, P.C.2
  • 19
    • 0003243224 scopus 로고    scopus 로고
    • Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods
    • in (The MIT Press, Cambridge, MA)
    • Platt, J. C. (1999). Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods., in Advances in Large Margin Classifiers (The MIT Press, Cambridge, MA), pp. 61-74.
    • (1999) Advances in Large Margin Classifiers , pp. 61-74
    • Platt, J.C.1
  • 20
    • 0142026377 scopus 로고    scopus 로고
    • Speech segregation based on sound localization
    • 10.1121/1.1610463
    • Roman, N., Wang, D. L., and Brown, G. J. (2003). Speech segregation based on sound localization., J. Acoust. Soc. Am. 114, 2236-2252. 10.1121/1.1610463
    • (2003) J. Acoust. Soc. Am. , vol.114 , pp. 2236-2252
    • Roman, N.1    Wang, D.L.2    Brown, G.J.3
  • 22
    • 70350565063 scopus 로고    scopus 로고
    • On strategies for imbalanced text classification using SVM: A comparative study
    • 10.1016/j.dss.2009.07.011
    • Sun, A., Lim, E. P., and Liu, Y. (2009). On strategies for imbalanced text classification using SVM: A comparative study., Decision Support Syst. 48, 191-201. 10.1016/j.dss.2009.07.011
    • (2009) Decision Support Syst. , vol.48 , pp. 191-201
    • Sun, A.1    Lim, E.P.2    Liu, Y.3
  • 23
    • 0038712550 scopus 로고    scopus 로고
    • SNR estimation based on amplitude modulation analysis with applications to noise suppression
    • 10.1109/TSA.2003.811542
    • Tchorz, J., and Kollmeier, B. (2003). SNR estimation based on amplitude modulation analysis with applications to noise suppression., IEEE Trans. Speech Audio Process. 11, 184-192. 10.1109/TSA.2003.811542
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , pp. 184-192
    • Tchorz, J.1    Kollmeier, B.2
  • 25
    • 84892233308 scopus 로고    scopus 로고
    • On ideal binary mask as the computational goal of auditory scene analysis
    • edited by P. Divenyi (Kluwer Academic, Dordrecht)
    • Wang, D. L. (2005). On ideal binary mask as the computational goal of auditory scene analysis., in Speech Separation by Humans and Machines, edited by, P. Divenyi, (Kluwer Academic, Dordrecht), pp. 181-197.
    • (2005) Speech Separation by Humans and Machines , pp. 181-197
    • Wang, D.L.1
  • 26
    • 82255178542 scopus 로고    scopus 로고
    • Fundamentals of computational auditory scene analysis
    • edited by D. L. Wang and G. J. Brown (Wiley and Sons, Hoboken, NJ), Cha
    • Wang, D. L., and Brown, G. J. (2006). Fundamentals of computational auditory scene analysis., in Computational Auditory Scene Analysis: Principles, Algorithms and Applications, edited by, D. L. Wang, and, G. J. Brown, (Wiley and Sons, Hoboken, NJ), Chap., pp. 1-37.
    • (2006) Computational Auditory Scene Analysis: Principles, Algorithms and Applications , pp. 1-37
    • Wang, D.L.1    Brown, G.J.2
  • 27
    • 64649103540 scopus 로고    scopus 로고
    • Speech intelligibility in background noise with ideal binary time-frequency masking
    • 10.1121/1.3083233
    • Wang, D. L., Kjems, U., Pedersen, M. S., Boldt, J. B., and Lunner, T. (2009). Speech intelligibility in background noise with ideal binary time-frequency masking., J. Acoust. Soc. Am. 125, 2336-2347. 10.1121/1.3083233
    • (2009) J. Acoust. Soc. Am. , vol.125 , pp. 2336-2347
    • Wang, D.L.1    Kjems, U.2    Pedersen, M.S.3    Boldt, J.B.4    Lunner, T.5
  • 28
    • 20844441675 scopus 로고    scopus 로고
    • KBA: Kernel boundary alignment considering imbalanced data distribution
    • 10.1109/TKDE.2005.95
    • Wu, G., and Chang, E. (2005). KBA: Kernel boundary alignment considering imbalanced data distribution., IEEE Trans. Knowl. Data Eng. 17, 786-795. 10.1109/TKDE.2005.95
    • (2005) IEEE Trans. Knowl. Data Eng. , vol.17 , pp. 786-795
    • Wu, G.1    Chang, E.2
  • 29
    • 0026041732 scopus 로고
    • Gender recognition from speech. Part I: Coarse analysis
    • 10.1121/1.401663
    • Wu, K., and Childers, D. G. (1991). Gender recognition from speech. Part I: Coarse analysis., J. Acoust. Soc. Am. 90, 1828-1840. 10.1121/1.401663
    • (1991) J. Acoust. Soc. Am. , vol.90 , pp. 1828-1840
    • Wu, K.1    Childers, D.G.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.