



Volume 4, Issue 5, 2010, Pages 824-833

An uncertainty propagation approach to robust ASR using the ETSI advanced front-end

Author keywords

Advanced front-end (AFE); AURORA5; European Telecommunications Standards Institute (ETSI) distributed speech recognition (DSR); uncertainty decoding; uncertainty propagation

Indexed keywords

AURORA5; EUROPEAN TELECOMMUNICATIONS STANDARDS INSTITUTES; FRONT END; UNCERTAINTY DECODING; UNCERTAINTY PROPAGATION

EID: 77956717352     PISSN: 19324553     EISSN: None     Source Type: Journal    
DOI: 10.1109/JSTSP.2010.2057194     Document Type: Article
Times cited : (18)

References (30)
  • 2
    • 0038669544
    • The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
    • H. G. Hirsch and D. Pearce, "The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions," in Proc. Automat. Speech Recognition: Challenges for the New Millenium, 2000.
    • (2000) Proc. Automat. Speech Recognition: Challenges for the New Millenium
    • Hirsch, H.G.1    Pearce, D.2
  • 5
    • 33750376174
    • Model based feature enhancement with uncertainty decoding for noise robust ASR
    • V. Stouten, H. Van hamme, and P. Wambacq, "Model based feature enhancement with uncertainty decoding for noise robust ASR," Speech Commun., vol.48, no.11, pp. 1502-1514, 2006.
    • (2006) Speech Commun , vol.48 , Issue.11 , pp. 1502-1514
    • Stouten, V.1    Van Hamme, H.2    Wambacq, P.3
  • 6
    • 0032205798
    • Improving performance of spectral subtraction in speech recognition using a model for additive noise
    • Nov
    • N. Yoma, F. McInnes, and M. Jack, "Improving performance of spectral subtraction in speech recognition using a model for additive noise," IEEE Trans. Speech Audio Process., vol.6, no.6, pp. 579-582, Nov. 1998.
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.6 , pp. 579-582
    • Yoma, N.1    McInnes, F.2    Jack, M.3
  • 7
    • 0036508276
    • Speaker verification in noise using a stochastic version of the weighted Viterbi algorithm
    • Mar
    • N. Yoma and M. Villar, "Speaker verification in noise using a stochastic version of the weighted Viterbi algorithm," IEEE Trans. Speech Audio Process., vol.10, no.3, pp. 158-166, Mar. 2002.
    • (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.3 , pp. 158-166
    • Yoma, N.1    Villar, M.2
  • 8
    • 85009275141
    • Exploiting variances in robust feature extraction based on a parametric model of speech distortion
    • L. Deng, J. Droppo, and A. Acero, "Exploiting variances in robust feature extraction based on a parametric model of speech distortion," in Proc. Int. Conf. Spoken Lang. Process., 2002.
    • (2002) Proc. Int. Conf. Spoken Lang. Process
    • Deng, L.1    Droppo, J.2    Acero, A.3
  • 10
    • 44949190747
    • Improved source modeling and predictive classification for channel robust speech recognition
    • V. Ion and R. Haeb-Umbach, "Improved source modeling and predictive classification for channel robust speech recognition," in Proc. Interspeech, 2006.
    • (2006) Proc. Interspeech
    • Ion, V.1    Haeb-Umbach, R.2
  • 11
    • 33749058582
    • Separation and robust recognition of noisy, convolutive speech mixtures using time-frequency masking and missing data techniques
    • Oct
    • D. Kolossa, A. Klimas, and R. Orglmeister, "Separation and robust recognition of noisy, convolutive speech mixtures using time-frequency masking and missing data techniques," in Proc. Workshop Applicat. Signal Process. Audio Acoust. (WASPAA), Oct. 2005, pp. 82-85.
    • (2005) Proc. Workshop Applicat. Signal Process. Audio Acoust. (WASPAA) , pp. 82-85
    • Kolossa, D.1    Klimas, A.2    Orglmeister, R.3
  • 15
    • 84872036128
    • Uncertainty propagation for speech recognition using RASTA features in highly nonstationary noisy environments
    • R. F. Astudillo, D. Kolossa, and R. Orglmeister, "Uncertainty propagation for speech recognition using RASTA features in highly nonstationary noisy environments," in Proc. ITG Workshop Speech Commun., 2008.
    • (2008) Proc. ITG Workshop Speech Commun
    • Astudillo, R.F.1    Kolossa, D.2    Orglmeister, R.3
  • 16
    • 70450180510
    • Accounting for the uncertainty of speech estimates in the complex domain for minimum mean square error speech enhancement
    • R. F. Astudillo, D. Kolossa, and R. Orglmeister, "Accounting for the uncertainty of speech estimates in the complex domain for minimum mean square error speech enhancement," in Proc. Interspeech, 2009.
    • (2009) Proc. Interspeech
    • Astudillo, R.F.1    Kolossa, D.2    Orglmeister, R.3
  • 18
    • 0034848706
    • SNR-dependent waveform processing for improving the robustness of ASR front-end
    • D. Macho and Y. M. Cheng, "SNR-dependent waveform processing for improving the robustness of ASR front-end," in Proc. Int. Conf. Acoust. Speech Signal Process., 2001, vol.1, pp. 305-308.
    • (2001) Proc. Int. Conf. Acoust. Speech Signal Process , vol.1 , pp. 305-308
    • Macho, D.1    Cheng, Y.M.2
  • 20
    • 77956783992
    • Blind equalization via minimization of VQ distortion for ETSI standard DSR front-end
    • Oct
    • S. Kuroiwa, S. Tsuge, and F. Ren, "Blind equalization via minimization of VQ distortion for ETSI standard DSR front-end," in Proc. Int. Conf. Natural Lang. Process. Knowledge Eng., Oct. 2003, pp. 585-590.
    • (2003) Proc. Int. Conf. Natural Lang. Process. Knowledge Eng. , pp. 585-590
    • Kuroiwa, S.1    Tsuge, S.2    Ren, F.3
  • 21
    • 0019053271
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Apr
    • S. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Process., vol.ASSP-28, no.2, pp. 357-366, Apr. 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-28 , Issue.2 , pp. 357-366
    • Davis, S.1    Mermelstein, P.2
  • 22
    • 84910023856
    • Speech Processing, Transmission and Quality Aspects (STQ); Compression Algorithms, ETSI ES 201 108 V1.1.3 (2003-09), Sep.
    • Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Front-End Feature Extraction Algorithm; Compression Algorithms, ETSI ES 201 108 V1.1.3 (2003-09), Sep. 2003.
    • (2003) Distributed Speech Recognition; Front-End Feature Extraction Algorithm
  • 24
    • 0019009880
    • Speech enhancement using a soft-decision noise suppression filter
    • Apr
    • R. McAulay and M. Malpass, "Speech enhancement using a soft-decision noise suppression filter," IEEE Trans. Acoust., Speech, Signal Process., vol.ASSP-28, no.2, pp. 137-145, Apr. 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-28 , Issue.2 , pp. 137-145
    • McAulay, R.1    Malpass, M.2
  • 26
    • 33947674784
    • Application of minimum statistics and minima controlled recursive averaging methods to estimate a cepstral noise model for robust ASR
    • May
    • V. Stouten, H. Van Hamme, and P. Wambacq, "Application of minimum statistics and minima controlled recursive averaging methods to estimate a cepstral noise model for robust ASR," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., May 2006, vol.1, pp. 765-768.
    • (2006) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , vol.1 , pp. 765-768
    • Stouten, V.1    Van Hamme, H.2    Wambacq, P.3
  • 28
    • 0003822743
    • (for HTK Version 3.4). Cambridge, U.K.: Cambridge Univ. Eng. Dept.
    • S. Young, The HTK Book (for HTK Version 3.4). Cambridge, U.K.: Cambridge Univ. Eng. Dept.
    • The HTK Book
    • Young, S.1
  • 29
    • 40249103761
    • Issues with uncertainty decoding for noise robust automatic speech recognition
    • H. Liao and M. Gales, "Issues with uncertainty decoding for noise robust automatic speech recognition," Speech Commun., vol.50, no.4, pp. 265-277, 2008.
    • (2008) Speech Commun , vol.50 , Issue.4 , pp. 265-277
    • Liao, H.1    Gales, M.2
  • 30
    • 0035342414
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Commun., vol.34, no.3, pp. 267-285, 2001.
    • (2001) Speech Commun , vol.34 , Issue.3 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.