



Volume , Issue , 2013, Pages 3002-3006

An investigation of spectral restoration algorithms for deep neural networks based noise robust speech recognition

Author keywords

Deep neural networks; Spectral restoration; Speech enhancement

Indexed keywords

RESTORATION; SPEECH ENHANCEMENT; SPEECH RECOGNITION

EID: 84906272122     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (42)

References (25)
  • 2
    • 79959840616
    • Investigation of full-sequence training of deep belief networks for speech recognition
    • A. Mohamed, D. Yu, and L. Deng, "Investigation of full-sequence training of deep belief networks for speech recognition," in Proc. Interspeech. ISCA, 2010, pp. 2846-2849.
    • (2010) Proc. Interspeech. ISCA , pp. 2846-2849
    • Mohamed, A.1    Yu, D.2    Deng, L.3
  • 3
    • 84055222005
    • Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
    • G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 20, no. 1, pp. 30-42, 2012.
    • (2012) Audio, Speech, and Language Processing, IEEE Transactions on , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.E.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 4
    • 84865801985
    • Conversational speech transcription using context-dependent deep neural networks
    • F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Proc. Interspeech. ISCA, 2011, pp. 437-440.
    • (2011) Proc. Interspeech. ISCA , pp. 437-440
    • Seide, F.1    Li, G.2    Yu, D.3
  • 5
    • 84867626068
    • Revisiting recurrent neural networks for robust ASR
    • O. Vinyals, S. V. Ravuri, and D. Povey, "Revisiting recurrent neural networks for robust ASR," in Proc. ICASSP. IEEE, 2012, pp. 4085-4088.
    • (2012) Proc. ICASSP. IEEE , pp. 4085-4088
    • Vinyals, O.1    Ravuri, S.V.2    Povey, D.3
  • 8
    • 0034855352
    • High-performance robust speech recognition using stereo training data
    • IEEE
    • L. Deng, A. Acero, L. Jiang, J. Droppo, and X. Huang, "High-performance robust speech recognition using stereo training data," in Proc. ICASSP, vol. 1. IEEE, 2001, pp. 301-304.
    • (2001) Proc. ICASSP , vol.1 , pp. 301-304
    • Deng, L.1    Acero, A.2    Jiang, L.3    Droppo, J.4    Huang, X.5
  • 9
    • 77955815755
    • Advanced front-end feature extraction algorithm
    • ETSI
    • ETSI, "Advanced front-end feature extraction algorithm," in Technical Report. ETSI ES 202 050, 2007.
    • (2007) Technical Report, ETSI ES 202 050
  • 10
    • 84890532503
    • Noise adaptive front-end normalization based on vector Taylor series for deep neural networks in robust speech recognition
    • IEEE
    • B. Li and K. C. Sim, "Noise adaptive front-end normalization based on vector Taylor series for deep neural networks in robust speech recognition," in Proc. ICASSP. IEEE, 2013.
    • (2013) Proc. ICASSP
    • Li, B.1    Sim, K.C.2
  • 13
    • 0029726517
    • Speech enhancement based on a priori signal to noise estimation
    • IEEE
    • P. Scalart and J. Vieira Filho, "Speech enhancement based on a priori signal to noise estimation," in Proc. ICASSP, vol. 2. IEEE, 1996, pp. 629-632.
    • (1996) Proc. ICASSP , vol.2 , pp. 629-632
    • Scalart, P.1    Vieira Filho, J.2
  • 14
    • 0021645331
    • Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator," Acoustics, Speech and Signal Processing, IEEE Transactions on, vol. 32, no. 6, pp. 1109-1121, 1984.
    • (1984) Acoustics, Speech and Signal Processing, IEEE Transactions on , vol.32 , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 15
    • 27644556974
    • Speech enhancement based on minimum mean-square error estimation and supergaussian priors
    • R. Martin, "Speech enhancement based on minimum mean-square error estimation and supergaussian priors," Speech and Audio Processing, IEEE Transactions on, vol. 13, no. 5, pp. 845-856, 2005.
    • (2005) Speech and Audio Processing, IEEE Transactions on , vol.13 , Issue.5 , pp. 845-856
    • Martin, R.1
  • 16
    • 47949104834
    • Speech enhancement based on generalized minimum mean square error estimators and masking properties of the auditory system
    • J. H. Hansen, V. Radhakrishnan, and K. H. Arehart, "Speech enhancement based on generalized minimum mean square error estimators and masking properties of the auditory system," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 14, no. 6, pp. 2049-2063, 2006.
    • (2006) Audio, Speech, and Language Processing, IEEE Transactions on , vol.14 , Issue.6 , pp. 2049-2063
    • Hansen, J.H.1    Radhakrishnan, V.2    Arehart, K.H.3
  • 17
    • 22944438092
    • Speech enhancement by MAP spectral amplitude estimation using a super-Gaussian speech model
    • T. Lotter and P. Vary, "Speech enhancement by MAP spectral amplitude estimation using a super-Gaussian speech model," EURASIP Journal on Applied Signal Processing, vol. 2005, pp. 1110-1126, 2005.
    • (2005) EURASIP Journal on Applied Signal Processing , vol.2005 , pp. 1110-1126
    • Lotter, T.1    Vary, P.2
  • 20
    • 84890540088
    • Maximum likelihood based noise covariance matrix estimation for multi-microphone speech enhancement
    • U. Kjems and J. Jensen, "Maximum likelihood based noise covariance matrix estimation for multi-microphone speech enhancement," Proc. EUSIPCO, 2012.
    • (2012) Proc. EUSIPCO
    • Kjems, U.1    Jensen, J.2
  • 21
    • 84890461970
    • Speech enhancement using generalized maximum a posteriori spectral amplitude estimator
    • IEEE
    • Y. C. Su, Y. Tsao, J. E. Wu, and F. R. Jean, "Speech enhancement using generalized maximum a posteriori spectral amplitude estimator," in Proc. ICASSP. IEEE, 2013.
    • (2013) Proc. ICASSP
    • Su, Y.C.1    Tsao, Y.2    Wu, J.E.3    Jean, F.R.4
  • 22
    • 0036226165
    • Noise estimation by minima controlled recursive averaging for robust speech enhancement
    • IEEE
    • I. Cohen and B. Berdugo, "Noise estimation by minima controlled recursive averaging for robust speech enhancement," Signal Processing Letters, IEEE, vol. 9, no. 1, pp. 12-15, 2002.
    • (2002) Signal Processing Letters , vol.9 , Issue.1 , pp. 12-15
    • Cohen, I.1    Berdugo, B.2
  • 23
    • 0041360463
    • Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging
    • I. Cohen, "Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging," Speech and Audio Processing, IEEE Transactions on, vol. 11, no. 5, pp. 466-475, 2003.
    • (2003) Speech and Audio Processing, IEEE Transactions on , vol.11 , Issue.5 , pp. 466-475
    • Cohen, I.1
  • 24
    • 0021892216
    • Speech enhancement using a minimum mean-square error log-spectral amplitude estimator
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator," Acoustics, Speech and Signal Processing, IEEE Transactions on, vol. 33, no. 2, pp. 443-445, 1985.
    • (1985) Acoustics, Speech and Signal Processing, IEEE Transactions on , vol.33 , Issue.2 , pp. 443-445
    • Ephraim, Y.1    Malah, D.2
  • 25
    • 57849152033
    • Aurora 2.0 speech recognition in noise: Update 2
    • D. Pierce and A. Gunawardana, "Aurora 2.0 speech recognition in noise: Update 2," in Proc. ICSLP, 2002.
    • (2002) Proc. ICSLP
    • Pierce, D.1    Gunawardana, A.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.