SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 19, Issue 5, 2011, Pages 1206-1220

Efficient MMSE Estimation and Uncertainty Processing for Multienvironment Robust Speech Recognition

(4) González, José A a Peinado, Antonio M a Gómez, Angel M a Carmona, José L a

a UNIVERSITY OF GRANADA (Spain)

Author keywords

Feature vector compensation; minimum mean square error (MMSE) estimation; robust speech recognition; stereo data

Indexed keywords

EID: 85008009592 PISSN: 15587916 EISSN: 15587924 Source Type: Journal
DOI: 10.1109/TASL.2010.2087753 Document Type: Article

Times cited : (10)

References (40)

1
- 44849089531
- New York: Wiley
- A. M. Peinado and J. C. Segura, Speech Recognition over Digital Channels: Robustness and Standars. New York: Wiley, 2006.
- (2006) Speech Recognition over Digital Channels: Robustness and Standars
- Peinado, A.M.¹ Segura, J.C.²

2
- 0029288202
- Speech recognition in noisy environments: A survey
- Apr.
- Y. Gong “Speech recognition in noisy environments: A survey,” Speech Commun., vol. 16, no. 3, pp. 261–291, Apr. 1995.
- (1995) Speech Commun. , vol.16 , Issue.3 , pp. 261-291
- Gong, Y.¹

3
- 34547941599
- Automatic speech recognition and speech variability: A review
- 11 Nov.
- M. Benzeghiba, R. D. Mori, O. Deroo, S. Dupont, T. Erbes, D. Jouvet, L. Fissore, P. Laface, A. Mertins, C. Ris, R. Rose, V. Tyagi, and C. Wellekens “Automatic speech recognition and speech variability: A review,” Speech Commun., vol. 49, no. 10–11, pp. 763–786, Nov. 2007.
- (2007) Speech Commun. , vol.49 , Issue.10 , pp. 763-786
- Benzeghiba, M.¹ Mori, R.D.² Deroo, O.³ Dupont, S.⁴ Erbes, T.⁵ Jouvet, D.⁶ Fissore, L.⁷ Laface, P.⁸ Mertins, A.⁹ Ris, C.¹⁰ Rose, R.¹¹ Tyagi, V.¹² Wellekens, C.¹³

4
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- Apr.
- J.-L. Gauvain and C.-H. Lee “Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains,” IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291–298, Apr. 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-298
- Gauvain, J.-L.¹ Lee, C.-H.²

5
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- Apr.
- C. J. Leggetter and P. C. Woodland “Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models,” Comput. Speech Lang., vol. 9, no. 2, pp. 171–185, Apr. 1995.
- (1995) Comput. Speech Lang. , vol.9 , Issue.2 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

6
- 0030245128
- Robust continuous speech recognition using parallel model combination
- Sep.
- M. J. F. Gales and S. J. Young “Robust continuous speech recognition using parallel model combination,” IEEE Trans. Speech Audio Process., vol. 4, no. 5, pp. 352–359, Sep. 1996.
- (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.5 , pp. 352-359
- Gales, M.J.F.¹ Young, S.J.²

7
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- Apr.
- S. Boll “Suppression of acoustic noise in speech using spectral subtraction,” IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 2, pp. 113–120, Apr. 1979.
- (1979) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-27 , Issue.2 , pp. 113-120
- Boll, S.¹

8
- 34447100796
- Boca Raton, FL: CRC
- P. Loizou, Speech enhancement: Theory and Practice. Boca Raton, FL: CRC, 2007.
- (2007) Speech enhancement: Theory and Practice
- Loizou, P.¹

9
- 0021645331
- Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator
- Dec.
- Y. Ephraim and D. Malah “Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator,” IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-32, no. 6, pp. 1109–1121, Dec. 1984.
- (1984) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-32 , Issue.6 , pp. 1109-1121
- Ephraim, Y.¹ Malah, D.²

10
- 0016067897
- Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
- Jun.
- B. Atal “Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification,” J. Acoust. Soc. Amer., vol. 55, pp. 1304–1312, Jun. 1974.
- (1974) J. Acoust. Soc. Amer. , vol.55 , pp. 1304-1312
- Atal, B.¹

11
- 18744371585
- Histogram equalization of speech representation for robust speech recognition
- May
- A. de la Torre, A. M. Peinado, J. C. Segura, J. L. Perez-Cordoba, M. C. Benitez, and A. J. Rubio “Histogram equalization of speech representation for robust speech recognition,” IEEE Trans. Speech Audio Process., vol. 13, no. 3, pp. 355–366, May 2005.
- (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.3 , pp. 355-366
- de la Torre, A.¹ Peinado, A.M.² Segura, J.C.³ Perez-Cordoba, J.L.⁴ Benitez, M.C.⁵ Rubio, A.J.⁶

12
- 2442509974
- Cepstral domain segmental nonlinear feature transformations for robust speech recognition
- May
- J. C. Segura, M. C. Benitez, A. de la Torre, A. J. Rubio, and J. Ramirez, “Cepstral domain segmental nonlinear feature transformations for robust speech recognition,” IEEE Signal Process. Lett., vol. 11, no. 5, pp. 517–520, May 2004.
- (2004) IEEE Signal Process. Lett. , vol.11 , Issue.5 , pp. 517-520
- Segura, J.C.¹ Benitez, M.C.² de la Torre, A.³ Rubio, A.J.⁴ Ramirez, J.⁵

13
- 65549153550
- Ph.D. dissertation, Dept. of Elect. Comput. Eng., Carnegie Mellon Univ.
- P. Moreno, “Speech Recognition in Noisy Environments,” Ph.D. dissertation, Dept. of Elect. Comput. Eng., Carnegie Mellon Univ., 1996.
- (1996) Speech Recognition in Noisy Environments
- Moreno, P.¹

14
- 0032048385
- Speech recognition in noisy environments using first-order vector Taylor series
- Apr.
- D. Y. Kim, C. K. Un, and N. S. Kim “Speech recognition in noisy environments using first-order vector Taylor series,” Speech Commun., vol. 24, no. 1, pp. 39–49, Apr. 1998.
- (1998) Speech Commun. , vol.24 , Issue.1 , pp. 39-49
- Kim, D.Y.¹ Un, C.K.² Kim, N.S.³

15
- 66149101303
- Robust speech recognition using a cepstral minimum-mean-square-error-motivated noise suppressor
- Jul.
- D. Yu, L. Deng, J. Droppo, J. Wu, Y. Gong, and A. Acero “Robust speech recognition using a cepstral minimum-mean-square-error-motivated noise suppressor,” IEEE Trans. Audio Speech Lang. Process., vol. 16, no. 5, pp. 1061–1070, Jul. 2008.
- (2008) IEEE Trans. Audio Speech Lang. Process. , vol.16 , Issue.5 , pp. 1061-1070
- Yu, D.¹ Deng, L.² Droppo, J.³ Wu, J.⁴ Gong, Y.⁵ Acero, A.⁶

16
- 0347968277
- Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition
- Nov.
- L. Deng, J. Droppo, and A. Acero, “Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition,” IEEE Trans. Speech Audio Process., vol. 11, no. 6, pp. 568–580, Nov. 2003.
- (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.6 , pp. 568-580
- Deng, L.¹ Droppo, J.² Acero, A.³

17
- 68549125183
- Stereo-based stochastic mapping for robust speech recognition
- Sep.
- M. Afify, X. Cui, and Y. Gao, “Stereo-based stochastic mapping for robust speech recognition,” IEEE Trans. Audio Speech Lang. Process., vol. 17, no. 7, pp. 1325–1334, Sep. 2009.
- (2009) IEEE Trans. Audio Speech Lang. Process. , vol.17 , Issue.7 , pp. 1325-1334
- Afify, M.¹ Cui, X.² Gao, Y.³

18
- 84987702417
- The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
- H. Hirsch and D. Pearce, “The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions,” in Proc. ICSLP, 2000, pp. 29–32.
- (2000) Proc. ICSLP , pp. 29-32
- Hirsch, H.¹ Pearce, D.²

19
- 0004319970
- Norwell, MA: Kluwer
- A. Acero, Acoustical and Environmental Robustness in Automatic Speech Recognition. Norwell, MA: Kluwer, 1993.
- (1993) Acoustical and Environmental Robustness in Automatic Speech Recognition
- Acero, A.¹

20
- 85006734596
- Evaluation of the SPLICE algorithm on the Aurora2 database
- Aalborg, Denmark
- J. Droppo, L. Deng, and A. Acero, “Evaluation of the SPLICE algorithm on the Aurora2 database,” in Proc. Eurospeech '01, Aalborg, Denmark, 2001, pp. 217–220.
- (2001) Proc. Eurospeech '01 , pp. 217-220
- Droppo, J.¹ Deng, L.² Acero, A.³

21
- 44849120851
- Cepstral vector normalization based on stereo data for robust speech recognition
- Mar.
- L. Buera, E. Lleida, A. Miguel, A. Ortega, and O. Saz “Cepstral vector normalization based on stereo data for robust speech recognition,” IEEE Trans. Audio Speech Lang. Process., vol. 15, no. 3, pp. 1098–1113, Mar. 2007.
- (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , Issue.3 , pp. 1098-1113
- Buera, L.¹ Lleida, E.² Miguel, A.³ Ortega, A.⁴ Saz, O.⁵

22
- 40249103761
- Issues with uncertainty decoding for noise robust automatic speech recognition
- H. Liao and M. J. F. Gales “Issues with uncertainty decoding for noise robust automatic speech recognition,” Speech Commun., vol. 50, no. 4, pp. 265–277, 2008.
- (2008) Speech Commun. , vol.50 , Issue.4 , pp. 265-277
- Liao, H.¹ Gales, M.J.F.²

23
- 51449114531
- MMSE-based stereo feature stochastic mapping for noise robust speech recognition
- Apr.
- X. Cui, M. Afify, and Y. Gao, “MMSE-based stereo feature stochastic mapping for noise robust speech recognition,” in Proc. ICASSP'08, Apr. 2008, pp. 4077–4080.
- (2008) Proc. ICASSP'08 , pp. 4077-4080
- Cui, X.¹ Afify, M.² Gao, Y.³

24
- 78049396097
- Efficient VQ-based MMSE estimation for robust speech recognition
- Mar.
- J. A. Gonzalez, A. M. Peinado, A. M. Gomez, J. L. Carmona, and J. A. Morales-Cordovilla, “Efficient VQ-based MMSE estimation for robust speech recognition,” in Proc. ICASSP'10, Mar. 2010, pp. 4558–4561.
- (2010) Proc. ICASSP'10 , pp. 4558-4561
- Gonzalez, J.A.¹ Peinado, A.M.² Gomez, A.M.³ Carmona, J.L.⁴ Morales-Cordovilla, J.A.⁵

25
- 0242721421
- HMM-based channel error mitigation and its application to distributed speech recognition
- Nov.
- A. M. Peinado, V. Sanchez, J. L. Perez-Cordoba, and A. de la Torre “HMM-based channel error mitigation and its application to distributed speech recognition,” Speech Commun., vol. 41, no. 4, pp. 549–561, Nov. 2003.
- (2003) Speech Commun. , vol.41 , Issue.4 , pp. 549-561
- Peinado, A.M.¹ Sanchez, V.² Perez-Cordoba, J.L.³ de la Torre, A.⁴

26
- 64349084660
- Noise condition-dependent training based on noise classification and SNR estimation
- Nov.
- H. Xu, P. Dalsgaard, Z.-H. Tan, and B. Lindberg, “Noise condition-dependent training based on noise classification and SNR estimation,” IEEE Trans. Audio Speech Lang. Process., vol. 15, no. 8, pp. 2431–2443, Nov. 2007.
- (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , Issue.8 , pp. 2431-2443
- Xu, H.¹ Dalsgaard, P.² Tan, Z.-H.³ Lindberg, B.⁴

27
- 56949089751
- Feature compensation in the cepstral domain employing model combination
- Feb.
- W. Kim and J. H. L. Hansen “Feature compensation in the cepstral domain employing model combination,” Speech Commun., vol. 51, no. 2, pp. 83–96, Feb. 2009.
- (2009) Speech Commun. , vol.51 , Issue.2 , pp. 83-96
- Kim, W.¹ Hansen, J.H.L.²

28
- 19944385270
- Efficient MMSE-based channel error mitigation techniques. Application to distributed speech recognition over wireless channels
- Jan.
- A. M. Peinado, V. Sanchez, J. L. Perez-Cordoba, and A. J. Rubio “Efficient MMSE-based channel error mitigation techniques. Application to distributed speech recognition over wireless channels,” IEEE Trans. Wireless Commun., vol. 4, no. 1, pp. 14–19, Jan. 2005.
- (2005) IEEE Trans. Wireless Commun. , vol.4 , Issue.1 , pp. 14-19
- Peinado, A.M.¹ Sanchez, V.² Perez-Cordoba, J.L.³ Rubio, A.J.⁴

29
- 34547553730
- Ph.D. dissertation, Univ. of Cambridge, Cambridge, U.K.
- H. Liao, “Uncertainty decoding for noise robust speech recognition,” Ph.D. dissertation, Univ. of Cambridge, Cambridge, U.K., 2007.
- (2007) Uncertainty decoding for noise robust speech recognition
- Liao, H.¹

30
- 0013251345
- ETSI ES 201 108
- Distributed Speech Recognition; Front-End Feature Extraction Algorithm; Compression Algorithms, Std, ETSI ES 201 108, 2000.
- (2000) Distributed Speech Recognition; Front-End Feature Extraction Algorithm; Compression Algorithms, Std

31
- 0003822743
- 3.4 ed. Cambridge, U.K.: Cambridge Univ. Eng. Dept.
- S. Young, G. Everman, M. J. F. Gales, T. Hain, D. Kershaw, D. Liu, G. Moore, J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. Woodland, The HTK Book, 3.4 ed. Cambridge, U.K.: Cambridge Univ. Eng. Dept., 2006.
- (2006) The HTK Book
- Young, S.¹ Everman, G.² Gales, M.J.F.³ Hain, T.⁴ Kershaw, D.⁵ Liu, D.⁶ Moore, G.⁷ Odell, J.⁸ Ollason, D.⁹ Povey, D.¹⁰ Valtchev, V.¹¹ Woodland, P.¹²

32
- 84867196386
- HMM-based estimation of unreliable spectral components for noise robust speech recognition
- Brisbane, Australia, Sep.
- B. J. Borgstrom and A. Alwan, “HMM-based estimation of unreliable spectral components for noise robust speech recognition,” in Proc. Interspeech, Brisbane, Australia, Sep. 2008, pp. 1769–1772.
- (2008) Proc. Interspeech , pp. 1769-1772
- Borgstrom, B.J.¹ Alwan, A.²

33
- 33750376174
- Model-based feature enhancement with uncertainty decoding for noise robust ASR
- Nov.
- V. Stouten, H. V. Hamme, and P. Wambacq “Model-based feature enhancement with uncertainty decoding for noise robust ASR,” Speech Commun., vol. 48, no. 11, pp. 1502–1514, Nov. 2006.
- (2006) Speech Commun. , vol.48 , Issue.11 , pp. 1502-1514
- Stouten, V.¹ Hamme, H.V.² Wambacq, P.³

34
- 51449120334
- An efficient approximation of the forward-backward algorithm to deal with packet loss, with applications to remote speech recognition
- Apr.
- B. J. Borgstrom and A. Alwan, “An efficient approximation of the forward-backward algorithm to deal with packet loss, with applications to remote speech recognition,” in Proc. ICASSP, Apr. 2008, pp. 4425–4428.
- (2008) Proc. ICASSP , pp. 4425-4428
- Borgstrom, B.J.¹ Alwan, A.²

35
- 33744969796
- Hidden Markov model-based packet loss concealment for voice over IP
- Sep.
- C. A. Rodbro, M. N. Murthi, S. V. Andersen, and S. H. Jensen “Hidden Markov model-based packet loss concealment for voice over IP,” IEEE Trans. Audio Speech Lang. Process., vol. 14, no. 5, pp. 1609–1623, Sep. 2006.
- (2006) IEEE Trans. Audio Speech Lang. Process. , vol.14 , Issue.5 , pp. 1609-1623
- Rodbro, C.A.¹ Murthi, M.N.² Andersen, S.V.³ Jensen, S.H.⁴

36
- 0035273996
- Softbit speech decoding: A new approach to error concealment
- Mar.
- T. Fingscheidt and P. Vary “Softbit speech decoding: A new approach to error concealment,” IEEE Trans. Speech Audio Process., vol. 9, no. 3, pp. 240–251, Mar. 2001.
- (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.3 , pp. 240-251
- Fingscheidt, T.¹ Vary, P.²

37
- 77955829273
- MMSE-based packet loss concealment for CELP-coded speech recognition
- Aug.
- J. L. Carmona, A. M. Peinado, J. L. Perez-Cordoba, and A. M. Gomez “MMSE-based packet loss concealment for CELP-coded speech recognition,” IEEE Trans. Audio Speech Lang. Process., vol. 18, no. 6, pp. 1341–1353, Aug. 2010.
- (2010) IEEE Trans. Audio Speech Lang. Process. , vol.18 , Issue.6 , pp. 1341-1353
- Carmona, J.L.¹ Peinado, A.M.² Perez-Cordoba, J.L.³ Gomez, A.M.⁴

38
- 0024610919
- A tutorial on hidden Markov models and selected applications in speech recognition
- Feb.
- L. Rabiner, “A tutorial on hidden Markov models and selected applications in speech recognition,” Proc. IEEE, vol. 77, no. 2, pp. 257–286, Feb. 1989.
- (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
- Rabiner, L.¹

39
- 84892174007
- Weighted Viterbi algorithm and state duration modelling for speech recognition in noise
- May
- N. B. Yoma, F. R. McInnes, and M. A. Jack, “Weighted Viterbi algorithm and state duration modelling for speech recognition in noise,” in Proc. ICASSP, May 1998, vol. 2, pp. 709–712.
- (1998) Proc. ICASSP , vol.2 , pp. 709-712
- Yoma, N.B.¹ McInnes, F.R.² Jack, M.A.³

40
- 33845666211
- Combining media-specific FEC and error concealment for robust distributed speech recognition over loss-prone packet channels
- Dec.
- A. M. Gomez, A. M. Peinado, V. Sanchez, and A. J. Rubio “Combining media-specific FEC and error concealment for robust distributed speech recognition over loss-prone packet channels,” IEEE Trans. Multimedia, vol. 8, pp. 1228–1238, Dec. 2006.
- (2006) IEEE Trans. Multimedia , vol.8 , pp. 1228-1238
- Gomez, A.M.¹ Peinado, A.M.² Sanchez, V.³ Rubio, A.J.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.