



Volume 21, Issue 1, 2014, Pages 65-68

An experimental study on speech enhancement based on deep neural networks

Author keywords

Deep neural networks; noise reduction; regression model; speech enhancement

Indexed keywords

CONVENTIONAL TECHNIQUES; DEEP NEURAL NETWORKS; GENERALIZATION CAPABILITY; MINIMUM MEAN SQUARE ERRORS; MULTI-CONDITION TRAININGS; OBJECTIVE QUALITY MEASURES; REGRESSION MODEL; SPEECH ENHANCEMENT ALGORITHM;

EID: 84889257121     PISSN: 10709908     EISSN: None     Source Type: Journal    
DOI: 10.1109/LSP.2013.2291240     Document Type: Article
Times cited : (965)

References (22)
  • 2
    • 85075926376
    • Spectral enhancement methods
    • J. Benesty, M. M. Sondhi, and Y. Huang, Eds. Berlin, Germany: Springer
    • I. Cohen and S. Gannot, "Spectral enhancement methods," in Springer Handbook of Speech Processing, J. Benesty, M. M. Sondhi, and Y. Huang, Eds. Berlin, Germany: Springer, 2008, pp. 873-901.
    • (2008) Springer Handbook of Speech Processing , pp. 873-901
    • Cohen, I.1    Gannot, S.2
  • 3
    • 0021892216
    • Speech enhancement using minimum mean square log spectral amplitude estimator
    • Y. Ephraim and D. Malah, "Speech enhancement using minimum mean square log spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol. 33, no. 2, pp. 443-445, 1985.
    • (1985) IEEE Trans. Acoust., Speech, Signal Process. , vol.33 , Issue.2 , pp. 443-445
    • Ephraim, Y.1    Malah, D.2
  • 5
    • 0024876950
    • An analysis of a noise reduction neural network
    • S. I. Tamura, "An analysis of a noise reduction neural network," in Proc. ICASSP, 1989, pp. 2001-2004.
    • (1989) Proc. ICASSP , pp. 2001-2004
    • Tamura, S.I.1
  • 6
    • 85079214161
    • A family of MLP based nonlinear spectral estimators for noise reduction
    • F. Xie and D. V. Compernolle, "A family of MLP based nonlinear spectral estimators for noise reduction," in Proc. ICASSP, 1994, pp. 53-56.
    • (1994) Proc. ICASSP , pp. 53-56
    • Xie, F.1    Compernolle, D.V.2
  • 8
    • 69349090197
    • Learning deep architectures for AI
    • Y. Bengio, "Learning deep architectures for AI," Found. Trends Mach. Learn., vol. 2, no. 1, pp. 1-127, 2009.
    • (2009) Found. Trends Mach. Learn. , vol.2 , Issue.1 , pp. 1-127
    • Bengio, Y.1
  • 9
    • 33746600649
    • Reducing the dimensionality of data with neural networks
    • G. E. Hinton and R. R. Salakhutdinov, "Reducing the dimensionality of data with neural networks," Science, vol. 313, no. 5786, pp. 504-507, 2006.
    • (2006) Science , vol.313 , Issue.5786 , pp. 504-507
    • Hinton, G.E.1    Salakhutdinov, R.R.2
  • 10
    • 85032751458
    • Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
    • G. E. Hinton, L. Deng, D. Yu, and G. E. Dahl, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups," IEEE Signal Process. Mag., vol. 29, no. 6, pp. 82-97, 2012.
    • (2012) IEEE Signal Process. Mag. , vol.29 , Issue.6 , pp. 82-97
    • Hinton, G.E.1    Deng, L.2    Yu, D.3    Dahl, G.E.4
  • 11
    • 84889263385
    • Denoising deep neural networks based voice activity detection
    • X. L. Zhang and J. Wu, "Denoising deep neural networks based voice activity detection," in Proc. ICASSP, 2013, pp. 853-857.
    • (2013) Proc. ICASSP , pp. 853-857
    • Zhang, X.L.1    Wu, J.2
  • 12
    • 84867202951
    • A speech enhancement approach using piecewise linear approximation of an explicit model of environmental distortions
    • J. Du and Q. Huo, "A speech enhancement approach using piecewise linear approximation of an explicit model of environmental distortions," in Proc. Interspeech, 2008, pp. 569-572.
    • (2008) Proc. Interspeech , pp. 569-572
    • Du, J.1    Huo, Q.2
  • 13
    • 84875678689
    • Towards scaling up classification-based speech separation
    • Y. X. Wang and D. L. Wang, "Towards scaling up classification-based speech separation," IEEE Trans. Audio, Speech Lang. Process., vol. 21, no. 7, pp. 1381-1390, 2013.
    • (2013) IEEE Trans. Audio, Speech Lang. Process. , vol.21 , Issue.7 , pp. 1381-1390
    • Wang, Y.X.1    Wang, D.L.2
  • 14
    • 84890493989
    • Ideal ratio mask estimation using deep neural networks for robust speech recognition
    • A. Narayanan and D. L. Wang, "Ideal ratio mask estimation using deep neural networks for robust speech recognition," in Proc. ICASSP, 2013, pp. 1520-6149.
    • (2013) Proc. ICASSP , pp. 1520-6149
    • Narayanan, A.1    Wang, D.L.2
  • 15
    • 0035500783
    • Speech enhancement for non-stationary noise environments
    • I. Cohen and B. Berdugo, "Speech enhancement for non-stationary noise environments," Signal Process., vol. 81, no. 11, pp. 2403-2418, 2001.
    • (2001) Signal Process. , vol.81 , Issue.11 , pp. 2403-2418
    • Cohen, I.1    Berdugo, B.2
  • 16
    • 0041360463
    • Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging
    • I. Cohen, "Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging," IEEE Trans. Speech Audio Process., vol. 11, no. 5, pp. 466-475, 2003.
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.5 , pp. 466-475
    • Cohen, I.1
  • 17
    • 79959842828
    • Binary coding of speech spectrograms using a deep auto-encoder
    • L. Deng, M. L. Seltzer, and D. Yu et al., "Binary coding of speech spectrograms using a deep auto-encoder," in Proc. Interspeech, 2010, pp. 1692-1695.
    • (2010) Proc. Interspeech , pp. 1692-1695
    • Deng, L.1    Seltzer, M.L.2    Yu, D.3
  • 18
    • 0038669544
    • The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy conditions
    • H. G. Hirsch and D. Pearce, "The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy conditions," in Proc. ISCA ITRW ASR, 2000, pp. 181-188.
    • (2000) Proc. ISCA ITRW ASR , pp. 181-188
    • Hirsch, H.G.1    Pearce, D.2
  • 20
    • 0003639435
    • Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs
    • Int. Telecommun. Union-Telecommun. Stand. Sector
    • ITU-T, Rec. P.862, "Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs," Int. Telecommun. Union-Telecommun. Stand. Sector, 2001.
    • (2001) ITU-T Rec. P.862
  • 21
    • 70349227623
    • Efficient musical noise suppression for speech enhancement system
    • T. Esch and P. Vary, "Efficient musical noise suppression for speech enhancement system," in Proc. ICASSP, 2009, pp. 4409-4412.
    • (2009) Proc. ICASSP , pp. 4409-4412
    • Esch, T.1    Vary, P.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.