메뉴 건너뛰기




Volumn , Issue , 2014, Pages 2670-2674

Dynamic noise aware training for speech enhancement based on deep neural networks

Author keywords

Deep neural networks; Ideal binary mask; Noise aware training; Non stationary noise; Speech enhancement

Indexed keywords

ALGORITHMS; MEAN SQUARE ERROR; QUALITY CONTROL; SIGNAL TO NOISE RATIO; SPEECH; SPEECH COMMUNICATION;

EID: 84910038203     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (100)

References (33)
  • 2
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • S. Boll, "Suppression of acoustic noise in speech using spectral subtraction, " IEEE Trans. on Acoustic, Speech and Signal Processing, Vol. 27, No. 2, pp. 113-120, 1979.
    • (1979) IEEE Trans. on Acoustic, Speech and Signal Processing , vol.27 , Issue.2 , pp. 113-120
    • Boll, S.1
  • 3
    • 0018642851 scopus 로고
    • Enhancement and bandwidth compression of noisy speech
    • J. S. Lim and A. V. Oppenheim, "Enhancement and bandwidth compression of noisy speech, " in Proc. IEEE, Vol. 67, No. 12, pp. 1586-1604, 1979.
    • (1979) Proc. IEEE , vol.67 , Issue.12 , pp. 1586-1604
    • Lim, J.S.1    Oppenheim, A.V.2
  • 4
    • 0021645331 scopus 로고
    • Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator, " IEEE Trans. on Acoustics, Speech and Signal Processing, Vol. 32, No.6, pp. 1109-1121, 1984.
    • (1984) IEEE Trans. on Acoustics, Speech and Signal Processing , vol.32 , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 5
    • 0021892216 scopus 로고
    • Speech enhancement using minimu mean square log spectral amplitude estimator
    • Y. Ephraim and D. Malah, "Speech enhancement using minimu mean square log spectral amplitude estimator, " IEEE Trans. on Acoustics, Speech and Signal Processing, Vol. 33, No. 2, pp. 443- 445, 1985.
    • (1985) IEEE Trans. on Acoustics, Speech and Signal Processing , vol.33 , Issue.2 , pp. 443-445
    • Ephraim, Y.1    Malah, D.2
  • 6
    • 0035500783 scopus 로고    scopus 로고
    • Speech enhancement for non- stationary noise environments
    • I. Cohen and B. Berdugo, "Speech enhancement for non- stationary noise environments, " Signal Processing, Vol. 81, No. 11, pp. 2403-2418, 2001.
    • (2001) Signal Processing , vol.81 , Issue.11 , pp. 2403-2418
    • Cohen, I.1    Berdugo, B.2
  • 7
    • 0041360463 scopus 로고    scopus 로고
    • Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging
    • I. Cohen, "Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging, " IEEE Trans. on Speech and Audio Processing, Vol. 11, No. 5, pp. 466-475, 2003.
    • (2003) IEEE Trans. on Speech and Audio Processing , vol.11 , Issue.5 , pp. 466-475
    • Cohen, I.1
  • 10
    • 33746600649 scopus 로고    scopus 로고
    • Reducing the dimensionality of data with neural networks
    • G. E. Hinton and R. R. Salakhutdinov, "Reducing the dimensionality of data with neural networks, " Science, Vol. 313, No. 5786, pp. 504-507, 2006.
    • (2006) Science , vol.313 , Issue.5786 , pp. 504-507
    • Hinton, G.E.1    Salakhutdinov, R.R.2
  • 13
    • 84889257121 scopus 로고    scopus 로고
    • An experimental study on speech enhancement based on deep neural networks
    • Y. Xu, J. Du, L.-R. Dai and C.-H. Lee, "An experimental study on speech enhancement based on deep neural networks, " IEEE Signal Processing Letters, Vol. 21, No. 1, pp. 65-68, 2014.
    • (2014) IEEE Signal Processing Letters , vol.21 , Issue.1 , pp. 65-68
    • Xu, Y.1    Du, J.2    Dai, L.-R.3    Lee, C.-H.4
  • 14
    • 84896537574 scopus 로고    scopus 로고
    • Wiener filtering based speech enhancement with weighted denoising auto-encoder and noise classification
    • B.-Y. Xia and C.-C. Bao, "Wiener filtering based speech enhancement with weighted denoising auto-encoder and noise classification, " Speech Communication, Vol. 60, pp. 13-29, 2014.
    • (2014) Speech Communication , vol.60 , pp. 13-29
    • Xia, B.-Y.1    Bao, C.-C.2
  • 15
    • 84906279378 scopus 로고    scopus 로고
    • Speech enhancement with weighted de- noising auto-encoder
    • B.-Y. Xia and C.-C. Bao, "Speech enhancement with weighted de- noising Auto-Encoder, " Proc. Interspeech, pp. 3444-3448, 2013.
    • (2013) Proc. Interspeech , pp. 3444-3448
    • Xia, B.-Y.1    Bao, C.-C.2
  • 16
    • 84906262433 scopus 로고    scopus 로고
    • Speech enhancement based on deep denoising auto-encoder
    • X.-G. Lu and Y. Tsao and S. Matsuda and C. Hori, "Speech enhancement based on deep denoising Auto-Encoder, " Proc. Inter- speech, pp. 436-440, 2013.
    • (2013) Proc. Inter- Speech , pp. 436-440
    • Lu, X.-G.1    Tsao, Y.2    Matsuda, S.3    Hori, C.4
  • 17
    • 84875678689 scopus 로고    scopus 로고
    • Towards scaling up classification- based speech separation
    • Y. X. Wang and D. L. Wang, "Towards scaling up classification- based speech separation, " IEEE Trans. on Audio, Speech and Lan- guage Processing, Vol. 21, No. 7, pp. 1381-1390, 2013.
    • (2013) IEEE Trans. on Audio, Speech and Language Processing , vol.21 , Issue.7 , pp. 1381-1390
    • Wang, Y.X.1    Wang, D.L.2
  • 19
    • 29444448046 scopus 로고    scopus 로고
    • A noise-estimation algorithm for highly non-stationary environments
    • S. Rangachari and P. C. Loizou, "A noise-estimation algorithm for highly non-stationary environments, " Speech Communication, Vol. 48, No. 2, pp. 220-231, 2006.
    • (2006) Speech Communication , vol.48 , Issue.2 , pp. 220-231
    • Rangachari, S.1    Loizou, P.C.2
  • 20
    • 84890492030 scopus 로고    scopus 로고
    • An investigation of deep neural networks for noise robust speech recognition
    • M. Seltzer, D. Yu and Y. Wang, "An investigation of deep neural networks for noise robust speech recognition, " Proc. ICASSP, pp. 7398-7402, 2013.
    • (2013) Proc. ICASSP , pp. 7398-7402
    • Seltzer, M.1    Yu, D.2    Wang, Y.3
  • 21
    • 84890452886 scopus 로고    scopus 로고
    • Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code
    • O. Abdel-Hamid and H. Jiang, "Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code, " Proc. ICASSP, pp. 7942-7946, 2013.
    • (2013) Proc. ICASSP , pp. 7942-7946
    • Abdel-Hamid, O.1    Jiang, H.2
  • 22
    • 84857498666 scopus 로고    scopus 로고
    • Unbiased MMSE-based noise power estimation with low complexity and low tracking delay
    • T. Gerkmann, R. C. Hendriks, "Unbiased MMSE-based noise power estimation with low complexity and low tracking delay, " IEEE Trans. on Audio, Speech, and Language Processing, Vol. 20, No. 4, pp. 1383-1393, 2012.
    • (2012) IEEE Trans. on Audio, Speech, and Language Processing , vol.20 , Issue.4 , pp. 1383-1393
    • Gerkmann, T.1    Hendriks, R.C.2
  • 23
    • 78049364397 scopus 로고    scopus 로고
    • MMSE based noise PSD tracking with low complexity
    • R. C. Hendriks, R. Heusdens, and J. Jensen, "MMSE based noise PSD tracking with low complexity, " Proc. ICASSP, pp. 4266- 4269, 2010.
    • (2010) Proc. ICASSP , pp. 4266-4269
    • Hendriks, R.C.1    Heusdens, R.2    Jensen, J.3
  • 24
    • 47949104834 scopus 로고    scopus 로고
    • Speech enhancement based on generalized minimum mean square error estimators and masking properties of the auditory system
    • J. H. Hansen, V. Radhakrishnan and K. H. Arehart, "Speech enhancement based on generalized minimum mean square error estimators and masking properties of the auditory system, " IEEE trans. on Audio, Speech and Language Processing, Vol. 14, No. 6, pp. 2049-2063, 2006.
    • (2006) IEEE Trans. on Audio, Speech and Language Processing , vol.14 , Issue.6 , pp. 2049-2063
    • Hansen, J.H.1    Radhakrishnan, V.2    Arehart, K.H.3
  • 25
    • 0027623210 scopus 로고
    • Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems
    • A. Varga and H. J. M. Steeneken, "Assessment for automatic speech recognition: II. NOISEX-92: A database and an experi- ment to study the effect of additive noise on speech recognition systems, " Speech Communication, Vol. 12, No. 3, pp. 247-251, 1993.
    • (1993) Speech Communication , vol.12 , Issue.3 , pp. 247-251
    • Varga, A.1    Steeneken, H.J.M.2
  • 26
    • 84867202951 scopus 로고    scopus 로고
    • A speech enhancement approach using piece- wise linear approximation of an explicit model of environmental distortions
    • J. Du and Q. Huo, "A speech enhancement approach using piece- wise linear approximation of an explicit model of environmental distortions, " Proc. Interspeech, pp. 569-572, 2008.
    • (2008) Proc. Interspeech , pp. 569-572
    • Du, J.1    Huo, Q.2
  • 29
    • 84870477511 scopus 로고    scopus 로고
    • Exploring monaural features for classification-based speech segregation
    • Y. X. Wang, K. Han and D. L. Wang, "Exploring monaural features for classification-based speech segregation, " IEEE Trans. on Audio, Speech, and Language Processing, Vo. 21, No. 2, pp. 270- 279, 2013.
    • (2013) IEEE Trans. on Audio, Speech, and Language Processing , vol.21 , Issue.2 , pp. 270-279
    • Wang, Y.X.1    Han, K.2    Wang, D.L.3
  • 30
    • 0038669544 scopus 로고    scopus 로고
    • The AURORA experimental frame- work for the preformance evaluations of speech recognition systems under noisy conditions
    • H. G. Hirsch and D. Pearce, "The AURORA experimental frame- work for the preformance evaluations of speech recognition systems under noisy conditions, " Proc. ISCA ITRW ASR, pp. 181- 188, 2000.
    • (2000) Proc. ISCA ITRW ASR , pp. 181-188
    • Hirsch, H.G.1    Pearce, D.2
  • 31
    • 0003419545 scopus 로고
    • Getting started with the DARPA TIMIT CD-ROM: An acoustic phonetic continuous speech database
    • J. S. Garofolo, Getting started with the DARPA TIMIT CD-ROM: An acoustic phonetic continuous speech database, NIST Tech Report, 1988.
    • (1988) NIST Tech Report
    • Garofolo, J.S.1
  • 32
    • 85014384841 scopus 로고    scopus 로고
    • Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone network- s and speech codecs
    • Recommendation
    • ITU-T, Recommendation P.862, "Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone network- s and speech codecs, " International Telecommunication Union- Telecommunication Standardisation Sector, 2001.
    • (2001) International Telecommunication Union- Telecommunication Standardisation Sector
    • ITU-T1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.