메뉴 건너뛰기




Volumn , Issue , 2013, Pages 7472-7476

Feature denoising for speech separation in unknown noisy environments

Author keywords

deep neural networks; feature denoising; generalization; Speech separation

Indexed keywords

ACOUSTIC FEATURES; DE-NOISING; DEEP NEURAL NETWORKS; GENERALIZATION; NOISY ENVIRONMENT; SEPARATION SYSTEMS; SPEECH SEPARATION; TRAINING AND TESTING;

EID: 84890523904     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2013.6639115     Document Type: Conference Paper
Times cited : (8)

References (17)
  • 1
    • 70349093614 scopus 로고    scopus 로고
    • An algorithm that improves speech intelligibility in noise for normalhearing listeners
    • G. Kim, Y. Lu, Y. Hu, and P.C. Loizou, "An algorithm that improves speech intelligibility in noise for normalhearing listeners," Journal of the Acoustical Society of America, vol. 126, pp. 1486-1494, 2009.
    • (2009) Journal of the Acoustical Society of America , vol.126 , pp. 1486-1494
    • Kim, G.1    Lu, Y.2    Hu, Y.3    Loizou, P.C.4
  • 2
    • 65249103478 scopus 로고    scopus 로고
    • A supervised learning approach to monaural segregation of reverberant speech
    • Z. Jin and D. Wang, "A supervised learning approach to monaural segregation of reverberant speech," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, pp. 625-638, 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , pp. 625-638
    • Jin, Z.1    Wang, D.2
  • 3
    • 84892233308 scopus 로고    scopus 로고
    • On ideal binary mask as the computational goal of auditory scene analysis
    • Divenyi P., Ed.Kluwer Academic, Norwell MA
    • D. Wang, "On ideal binary mask as the computational goal of auditory scene analysis," in Speech Separation by Humans andMachines, Divenyi P., Ed., pp. 181-197. Kluwer Academic, Norwell MA., 2005.
    • (2005) Speech Separation by Humans AndMachines , pp. 181-197
    • Wang, D.1
  • 4
    • 56449089103 scopus 로고    scopus 로고
    • Extracting and composing robust features with denoising autoencoders
    • P. Vincent, H. Larochelle, Y. Bengio, and P.A. Manzagol, "Extracting and composing robust features with denoising autoencoders," in Proc. ICML, 2008, pp. 1096-1103.
    • (2008) Proc. ICML , pp. 1096-1103
    • Vincent, P.1    Larochelle, H.2    Bengio, Y.3    Manzagol, P.A.4
  • 7
    • 0034855352 scopus 로고    scopus 로고
    • High-performance robust speech recognition using stereo training data
    • L. Deng, A. Acero, L. Jiang, J. Droppo, and X. Huang, "High-performance robust speech recognition using stereo training data," in Proc. ICASSP, 2001, pp. 301-304.
    • (2001) Proc. ICASSP , pp. 301-304
    • Deng, L.1    Acero, A.2    Jiang, L.3    Droppo, J.4    Huang, X.5
  • 8
    • 33845354768 scopus 로고    scopus 로고
    • Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation
    • D.S. Brungart, P.S. Chang, B.D. Simpson, and D.Wang, "Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation," Journal of the Acoustical Society of America, vol. 120, pp. 4007-4018, 2006.
    • (2006) Journal of the Acoustical Society of America , vol.120 , pp. 4007-4018
    • Brungart, D.S.1    Chang, P.S.2    Simpson, B.D.3    Wang, D.4
  • 10
    • 84870477511 scopus 로고    scopus 로고
    • Exploring monaural features for classification-based speech segregation
    • Y. Wang, K. Han, and D. Wang, "Exploring monaural features for classification-based speech segregation," IEEE Trans. Audio, Speech, Lang. Process., pp. 270-279, 2013.
    • (2013) IEEE Trans. Audio, Speech, Lang. Process. , pp. 270-279
    • Wang, Y.1    Han, K.2    Wang, D.3
  • 11
    • 84875678689 scopus 로고    scopus 로고
    • Towards scaling up classification-based speech separation
    • Lang. Process., in press
    • Y. Wang and D. Wang, "Towards scaling up classification-based speech separation," IEEE Trans. Audio, Speech, Lang. Process., in press, 2013.
    • (2013) IEEE Trans. Audio, Speech
    • Wang, Y.1    Wang, D.2
  • 12
    • 0014568991 scopus 로고
    • IEEE recommended practice for speech quality measurements
    • IEEE, "IEEE recommended practice for speech quality measurements," IEEE Trans. Audio Electroacoust., vol. 17, pp. 225-246, 1969.
    • (1969) IEEE Trans. Audio Electroacoust. , vol.17 , pp. 225-246
  • 13
    • 85161980001 scopus 로고    scopus 로고
    • Sparse deep belief net model for visual area V2
    • H. Lee, C. Ekanadham, and A. Ng, "Sparse deep belief net model for visual area V2," in NIPS, 2008.
    • (2008) NIPS
    • Lee, H.1    Ekanadham, C.2    Ng, A.3
  • 14
    • 0034133184 scopus 로고    scopus 로고
    • Learning overcomplete representations
    • M.S. Lewicki and T.J. Sejnowski, "Learning overcomplete representations," Neural computation, vol. 12, pp. 337-365, 2000.
    • (2000) Neural Computation , vol.12 , pp. 337-365
    • Lewicki, M.S.1    Sejnowski, T.J.2
  • 17
    • 78049364397 scopus 로고    scopus 로고
    • MMSE based noise PSD trackingwith low complexity
    • R.C. Hendriks, R. Heusdens, and J. Jensen, "MMSE based noise PSD trackingwith low complexity," in Proc. ICASSP, 2010, pp. 4266-4269.
    • (2010) Proc. ICASSP , pp. 4266-4269
    • Hendriks, R.C.1    Heusdens, R.2    Jensen, J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.