SCOPUS 정보 검색 플랫폼

IEEE Journal on Selected Topics in Signal Processing

Volumn 4, Issue 5, 2010, Pages 824-833

An uncertainty propagation approach to robust ASR using the ETSI advanced front-end

(4) Astudillo, Ramón Fernández a Kolossa, Dorothea a Mandelartz, Philipp a Orglmeister, Reinhold a

a TECHNISCHE UNIVERSITÄT BERLIN (Germany)

Author keywords

Advanced front end (AFE); AURORA5; European Telecommunications Standards Institute (ETSI) distributed recognition (DSR); uncertainty decoding; uncertainty propagation

Indexed keywords

AURORA5; EUROPEAN TELECOMMUNICATIONS STANDARDS INSTITUTES; FRONT END; UNCERTAINTY DECODING; UNCERTAINTY PROPAGATION;

COMPUTATIONAL COMPLEXITY; DECODING; FEATURE EXTRACTION; PARAMETER ESTIMATION; SPEECH ENHANCEMENT; STANDARDS; TELECOMMUNICATION; TIME DOMAIN ANALYSIS;

SPEECH RECOGNITION;

EID: 77956717352 PISSN: 19324553 EISSN: None Source Type: Journal
DOI: 10.1109/JSTSP.2010.2057194 Document Type: Article

Times cited : (18)

References (30)

1
- 84893214804
- Speech Processing Transmission and Quality Aspects (STQ); ETSI ES 202 050 V1.1.5 (2007-2101) Jan
- Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech recognition; Front-End Feature Extraction Algorithm; Compression Algorithms, ETSI ES 202 050 V1.1.5 (2007-2101), Jan. 2007.
- (2007) Distributed Speech Recognition; Front-End Feature Extraction Algorithm; Compression Algorithms

2
- 0038669544
- The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
- H. G. Hirsch and D. Pearce, "The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions," in Proc. Automat. Speech Recognition: Challenges for the New Millenium, 2000.
- (2000) Proc. Automat. Speech Recognition: Challenges for the New Millenium
- Hirsch, H.G.¹ Pearce, D.²

3
- 70450167188
- Niederrhein Univ. of Applied Sciences
- H. G. Hirsch, "AURORA-5 Experimental framework for the performance evaluation of speech recognition in case of a hands-free speech input in noisy environments," Niederrhein Univ. of Applied Sciences, 2007.
- (2007) AURORA-5 Experimental Framework for the Performance Evaluation of Speech Recognition in Case of A Hands-free Speech Input in Noisy Environments
- Hirsch, H.G.¹

4
- 0036291376
- Uncertainty decoding with splice for noise robust speech recognition
- J. Droppo, A. Acero, and L. Deng, "Uncertainty decoding with splice for noise robust speech recognition," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2002, vol.I, pp. 57-60.
- (2002) Proc. Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 57-60
- Droppo, J.¹ Acero, A.² Deng, L.³

5
- 33750376174
- Model based feature enhancement with uncertainty decoding for noise robust ASR
- V. Stouten, H. Van hamme, and W. Wambacq, "Model based feature enhancement with uncertainty decoding for noise robust ASR," Speech Commun., vol.48, no.11, pp. 1502-1514, 2006.
- (2006) Speech Commun , vol.48 , Issue.11 , pp. 1502-1514
- Stouten, V.¹ Van Hamme, H.² Wambacq, W.³

6
- 0032205798
- Improving performance of spectral subtraction in speech recognition using a model for additive noise
- Nov
- N. Yoma, F. McInnes, and M. Jack, "Improving performance of spectral subtraction in speech recognition using a model for additive noise," IEEE Trans. Speech Audio Process., vol.6, no.6, pp. 579-582, Nov. 1998.
- (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.6 , pp. 579-582
- Yoma, N.¹ McInnes, F.² Jack, M.³

7
- 0036508276
- Speaker verification in noise using a stochastic version of the weighted Viterbi algorithm
- Mar
- N.Yoma and M.Villar, "Speaker verification in noise using a stochastic version of the weighted Viterbi algorithm," IEEE Trans. Speech Audio Process., vol.10, no.3, pp. 158-166, Mar. 2002.
- (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.3 , pp. 158-166
- Yoma, N.¹ Villar, M.²

8
- 85009275141
- Exploiting variances in robust feature extraction based on a parametric model of speech distortion
- L. Deng, J. Droppo, and A. Acero, "Exploiting variances in robust feature extraction based on a parametric model of speech distortion," in Proc. Int. Conf. Spoken Lang. Process., 2002.
- (2002) Proc. Int. Conf. Spoken Lang. Process
- Deng, L.¹ Droppo, J.² Acero, A.³

9
- 85009154399
- Including uncertainty of speech observations in robust speech recognition
- M. C. Benítez, J. C. Segura, A. Torre, J. Ramírez, and A. Rubio, "Including uncertainty of speech observations in robust speech recognition," in Proc. Int. Conf. Acoust. Speech, Signal Process., 2004, pp. 137-140.
- (2004) Proc. Int. Conf. Acoust. Speech, Signal Process , pp. 137-140
- Benítez, M.C.¹ Segura, J.C.² Torre, A.³ Ramírez, J.⁴ Rubio, A.⁵

10
- 44949190747
- Improved source modeling and predictive classification for channel robust speech recognition
- V. Ion and R. Haeb-Umbach, "Improved source modeling and predictive classification for channel robust speech recognition," in Proc. Interspeech, 2006.
- (2006) Proc. Interspeech
- Ion, V.¹ Haeb-Umbach, R.²

11
- 33749058582
- Separation and robust recognition of noisy, convolutive speech mixtures using time-frequency masking and missing data techniques
- Oct
- D. Kolossa, A. Klimas, and R. Orglmeister, "Separation and robust recognition of noisy, convolutive speech mixtures using time-frequency masking and missing data techniques," in Proc. Workshop Applicat. Signal Process. Audio Acoust. (WASPAA), Oct. 2005, pp. 82-85.
- (2005) Proc. Workshop Applicat. Signal Process. Audio Acoust. (WASPAA) , pp. 82-85
- Kolossa, D.¹ Klimas, A.² Orglmeister, R.³

12
- 47049124615
- Recognition of convolutive speech mixtures by missing feature techniques for ICA
- D. Kolossa, H. Sawada, R. Astudillo, R. Orglmeister, and S. Makino, "Recognition of convolutive speech mixtures by missing feature techniques for ICA," in Proc. Asilomar Conf. Signals, Syst., Comput., 2006, pp. 1397-1401.
- (2006) Proc. Asilomar Conf. Signals, Syst., Comput , pp. 1397-1401
- Kolossa, D.¹ Sawada, H.² Astudillo, R.³ Orglmeister, R.⁴ Makino, S.⁵

13
- 77956751887
- Propagation of statistical information through non-linear feature extractions for robust speech recognition
- R. F. Astudillo, D. Kolossa, and R. Orglmeister, "Propagation of statistical information through non-linear feature extractions for robust speech recognition," in Proc. Int. Workshop Bayesian Inference Maximum Entropy Methods Sci. Eng., 2007.
- (2007) Proc. Int. Workshop Bayesian Inference Maximum Entropy Methods Sci. Eng
- Astudillo, R.F.¹ Kolossa, D.² Orglmeister, R.³

14
- 77954608105
- Ph.D. dissertation, Technische Univ. Berlin, Berlin, Germany
- D. Kolossa, "Independent component analysis for environmentally robust speech recognition," Ph.D. dissertation, Technische Univ. Berlin, Berlin, Germany, 2008.
- (2008) Independent Component Analysis for Environmentally Robust Speech Recognition
- Kolossa, D.¹

15
- 84872036128
- Uncertainty propagation for speech recognition using RASTA features in highly nonstationary noisy environments
- R. F. Astudillo, D. Kolossa, and R. Orglmeister, "Uncertainty propagation for speech recognition using RASTA features in highly nonstationary noisy environments," in Proc. ITGWorkshop Speech Commun., 2008.
- (2008) Proc. ITGWorkshop Speech Commun
- Astudillo, R.F.¹ Kolossa, D.² Orglmeister, R.³

16
- 70450180510
- Accounting for the uncertainty of speech estimates in the complex domain for minimum mean square error speech enhancement
- R. F. Astudillo, D. Kolossa, and R. Orglmeister, "Accounting for the uncertainty of speech estimates in the complex domain for minimum mean square error speech enhancement," in Proc. Interspeech, 2009.
- (2009) Proc. Interspeech
- Astudillo, R.F.¹ Kolossa, D.² Orglmeister, R.³

17
- 4544245839
- Two-stage mel-warped wiener filter for robust speech recognition
- A. Agarwal and Y. M. Cheng, "Two-stage mel-warped wiener filter for robust speech recognition," in Proc. IEEE Workshop Autom. Speech Recognition Understanding, 1999, pp. 12-15.
- (1999) Proc. IEEE Workshop Autom. Speech Recognition Understanding , pp. 12-15
- Agarwal, A.¹ Cheng, Y.M.²

18
- 0034848706
- SNR-dependent waveform processing for improving the robustness of ASR front-end
- D. Macho and Y. M. Cheng, "SNR-dependent waveform processing for improving the robustness of ASR front-end," in Proc. Int. Conf. Acoust. Speech Signal Process., 2001, vol.1, pp. 305-308.
- (2001) Proc. Int. Conf. Acoust. Speech Signal Process , vol.1 , pp. 305-308
- MacHo, D.¹ Cheng, Y.M.²

19
- 85009242725
- Evaluation of a noise-robust DSR front-end on aurora databases
- Sep
- D. Macho, L. Mauuary, B. No, Y. M. Cheng, D. Ealey, D. Jouvet, H. Kelleher, D. Pearce, and F. Saadoun, "Evaluation of a noise-robust DSR front-end on aurora databases," in Proc. Int. Conf. Spoken Lang. Process. (ICSLP), Sep. 2002, pp. 17-20.
- (2002) Proc. Int. Conf. Spoken Lang. Process. (ICSLP) , pp. 17-20
- MacHo, D.¹ Mauuary, L.² No, B.³ Cheng, Y.M.⁴ Ealey, D.⁵ Jouvet, D.⁶ Kelleher, H.⁷ Pearce, D.⁸ Saadoun, F.⁹

20
- 77956783992
- Blind equalization via minimization of VQ distortion for ETSI standard DSR front-end
- Oct
- S. Kuroiwa, S. Tsuge, and F. Ren, "Blind equalization via minimization of VQ distortion for ETSI standard DSR front-end," in Proc. Int. Conf. Natural Lang. Process. Knowledge Eng., Oct. 2003, pp. 585-590.
- (2003) Proc. Int. Conf. Natural Lang. Process. Knowledge Eng. , pp. 585-590
- Kuroiwa, S.¹ Tsuge, S.² Ren, F.³

21
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- Apr
- S. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Process., vol.ASSP-28, no.2, pp. 357-366, Apr. 1980.
- (1980) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-28 , Issue.2 , pp. 357-366
- Davis, S.¹ Mermelstein, P.²

22
- 84910023856
- Speech Processing Transmission and Quality Aspects (STQ); Compression Algorithms ETSI ES 201 108 V1.1.3 (2003-2009), Sep.
- Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Front-End Feature Extraction Algorithm; Compression Algorithms, ETSI ES 201 108 V1.1.3 (2003-2009), Sep. 2003.
- (2003) Distributed Speech Recognition; Front-End Feature Extraction Algorithm

23
- 51449123884
- Boca Raton, FL: CRC
- Y. Ephraim and I. Cohen, Recent Advancements in Speech Enhancement. Boca Raton, FL: CRC, 2004, pp. 1-22.
- (2004) Recent Advancements in Speech Enhancement , pp. 1-22
- Ephraim, Y.¹ Cohen, I.²

24
- 0019009880
- Speech enhancement using a soft-decision noise suppression filter
- Apr
- R. McAulay and M. Malpass, "Speech enhancement using a soft-decision noise suppression filter," IEEE Trans. Acoust., Speech, Signal Process., vol.ASSP-28, no.2, pp. 137-145, Apr. 1980.
- (1980) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-28 , Issue.2 , pp. 137-145
- McAulay, R.¹ Malpass, M.²

25
- 0003498504
- A. Jeffrey and D. Zwillinger, Eds. Amsterdam, The Netherlands: Elsevier
- I. S. Gradshteyn and I. Ryzhik, Table of Integrals, Series and Products, A. Jeffrey and D. Zwillinger, Eds. Amsterdam, The Netherlands: Elsevier, 2007.
- (2007) Table of Integrals, Series and Products
- Gradshteyn, I.S.¹ Ryzhik, I.²

26
- 33947674784
- Application of minimum statistics and minima controlled recursive averaging methods to estimate a cepstral noise model for robust ASR
- May
- V. Stouten, H. Van Hamme, and P. Wambacq, "Application of minimum statistics and minima controlled recursive averaging methods to estimate a cepstral noise model for robust ASR," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., May 2006, vol.1, pp. 765-768.
- (2006) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , vol.1 , pp. 765-768
- Stouten, V.¹ Van Hamme, H.² Wambacq, P.³

27
- 0003671941
- Ph.D. dissertation, Gonville and Caius College, Cambridge, U.K
- M. J. F. Gales, "Model-based technique for noise robust speech recognition," Ph.D. dissertation, Gonville and Caius College, Cambridge, U.K., 1995.
- (1995) Model-based Technique for Noise Robust Speech Recognition
- Gales, M.J.F.¹

28
- 0003822743
- (for HTK Version 3.4). Cambridge, U.K.: Cambridge Univ. Eng. Dept.
- S. Young, The HTK Book (for HTK Version 3.4). Cambridge, U.K.: Cambridge Univ. Eng. Dept..
- The HTK Book
- Young, S.¹

29
- 40249103761
- Issues with uncertainty decoding for noise robust automatic speech recognition
- H. Liao and M. Gales, "Issues with uncertainty decoding for noise robust automatic speech recognition," Speech Commun., vol.50, no.4, pp. 265-277, 2008.
- (2008) Speech Commun , vol.50 , Issue.4 , pp. 265-277
- Liao, H.¹ Gales, M.²

30
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Commun., vol.34, no.3, pp. 267-285, 2001.
- (2001) Speech Commun , vol.34 , Issue.3 , pp. 267-285
- Cooke, M.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.