SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2014, Pages 5507-5511

Extension of uncertainty propagation to dynamic MFCCS for noise robust ASR

(3) Tran, Dung T a,b,c Vincent, Emmanuel a,b,c Jouvet, Denis a,b,c

a INRIA (France)

b LORIA (France)

c UNIVERSITÉ DE LORRAINE (France)

Author keywords

Automatic speech recognition; noise robustness; uncertainty handling

Indexed keywords

ACOUSTIC NOISE; COVARIANCE MATRIX; SIGNAL PROCESSING; SPEECH RECOGNITION;

AUTOMATIC SPEECH RECOGNITION; DYNAMIC FEATURES; NOISE ROBUSTNESS; NONSTATIONARY NOISE; RELATIVE ERROR RATES; SCALING COEFFICIENTS; UNCERTAINTY HANDLING; UNCERTAINTY PROPAGATION;

UNCERTAINTY ANALYSIS;

EID: 84905216197 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2014.6854656 Document Type: Conference Paper

Times cited : (10)

References (25)

1
- 85032751593
- Research developments and directions in speech recognition and under-standing, part 1
- May
- J.M. Baker, L. Deng, J. Glass, S. Khudanpur, C.-H. Lee, N. Morgan, and D. O'Shaughnessy, "Research developments and directions in speech recognition and under-standing, part 1," IEEE Signal Processing Magazine, vol. 26, no. 3, pp. 75-80, May 2009.
- (2009) IEEE Signal Processing Magazine , vol.26 , Issue.3 , pp. 75-80
- Baker, J.M.¹ Deng, L.² Glass, J.³ Khudanpur, S.⁴ Lee, C.-H.⁵ Morgan, N.⁶ Oshaughnessy, D.⁷

2
- 50449083999
- Wiley
- M. Wolfel and J. McDonough, Distant Speech Recognition, Wiley, 2009.
- (2009) Distant Speech Recognition
- Wolfel, M.¹ McDonough, J.²

3
- 84891583985
- Wiley
- T. Virtanen, R. Singh, and B. Raj, Eds., Techniques for Noise Robustness in Automatic Speech Recognition, Wiley, 2012.
- (2012) Techniques for Noise Robustness in Automatic Speech Recognition
- Virtanen, T.¹ Singh, R.² Raj, B.³

4
- 84893704157
- The second 'CHiME' speech separation and recognition challenge: An overview of challenge systems and outcomes
- E. Vincent, J. Barker, S.Watanabe, J. Le Roux, F. Nesta, and M. Matassoni, "The second 'CHiME' speech separation and recognition challenge: An overview of challenge systems and outcomes," in Proc. ASRU, 2013.
- (2013) Proc. ASRU
- Vincent, E.¹ Barker, J.² Watanabe, S.³ Le Roux, J.⁴ Nesta, F.⁵ Matassoni, M.⁶

5
- 0003671941
- Ph.D. thesis, Cambridge University
- M. Gales, Model Based Techniques for Noise Robust Speech Regcognition, Ph.D. thesis, Cambridge University, 1995.
- (1995) Model Based Techniques for Noise Robust Speech Regcognition
- Gales, M.¹

6
- 84867608537
- Power-normalized cepstral coefficients (PNCC) for robust speech recognition
- C. Kim and R. Stern, "Power-normalized cepstral coefficients (PNCC) for robust speech recognition," in Proc. ICASSP, 2012, pp. 4101-4104.
- (2012) Proc. ICASSP , pp. 4101-4104
- Kim, C.¹ Stern, R.²

7
- 85009070292
- Large vocabulary speech recognition under adverse acoustic environments
- L. Deng, A. Acero, M. Plumpe, and X. D. Huang, "Large vocabulary speech recognition under adverse acoustic environments," in Proc. ICSLP, 2000, pp. 806-809.
- (2000) Proc. ICSLP , pp. 806-809
- Deng, L.¹ Acero, A.² Plumpe, M.³ Huang, X.D.⁴

8
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- June
- M. Cooke, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Communication, vol. 34, no. 3, pp. 267-285, June 2001.
- (2001) Speech Communication , vol.34 , Issue.3 , pp. 267-285
- Cooke, M.¹

9
- 34547528168
- Adaptive training with joint uncertainty decoding for robust recognition of noisy data
- H. Liao and M. J. F. Gales, "Adaptive training with joint uncertainty decoding for robust recognition of noisy data," in Proc. ICASSP, 2007, vol. 4, pp. 389-392.
- (2007) Proc. ICASSP , vol.4 , pp. 389-392
- Liao, H.¹ Gales, M.J.F.²

10
- 70350450398
- Static and dynamic variance compensation for recognition of reverberant speech with dereverberation preprocessing
- Jan
- M. Delcroix, T. Nakatani, and S. Watanabe, "Static and dynamic variance compensation for recognition of reverberant speech with dereverberation preprocessing," IEEE Transactions on Audio, Speech, and Language Processing, vol. 17, no. 2, pp. 324-334, Jan 2009.
- (2009) IEEE Transactions on Audio, Speech, and Language Processing , vol.17 , Issue.2 , pp. 324-334
- Delcroix, M.¹ Nakatani, T.² Watanabe, S.³

11
- 84867617677
- Front-end, back-end, and hybrid techniques for noise-robust speech recognition
- Springer
- L. Deng, "Front-end, back-end, and hybrid techniques for noise-robust speech recognition," in Robust Speech Recognition of Uncertain or Missing Data-Theory and Applications, pp. 67-99. Springer, 2011.
- (2011) Robust Speech Recognition of Uncertain or Missing Data-Theory and Applications , pp. 67-99
- Deng, L.¹

12
- 77954583785
- Independent component analysis and timefrequency masking for multi speaker recognition
- Article ID 651420
- D. Kolossa, R. Astudillo, E. Hoffmann, and R. Orglmeister, "Independent component analysis and timefrequency masking for multi speaker recognition," in EURASIP Journal on Audio, Speech, and Music Processing, 2010, vol. 2010, Article ID 651420.
- (2010) EURASIP Journal on Audio, Speech, and Music Processing , vol.2010
- Kolossa, D.¹ Astudillo, R.² Hoffmann, E.³ Orglmeister, R.⁴

13
- 84893709985
- Uncertainty propagation
- Springer, D. Kolossa and R. Haeb-Umbach, Eds
- R. Astudillo and D. Kolossa, "Uncertainty propagation," in Robust Speech Recognition of Uncertain or Missing Data-Theory and Applications, D. Kolossa and R. Haeb-Umbach, Eds., pp. 35-62. Springer, 2011.
- (2011) Robust Speech Recognition of Uncertain or Missing Data-Theory and Applications , pp. 35-62
- Astudillo, R.¹ Kolossa, D.²

14
- 84890541336
- Mask estimation and sparse imputation for missing data speech recognition in multisource reverberant environments
- H. Kallasjoki, S. Keronen, G. J. Brown, J. F. Gemmeke, U. Remes, and K. J. Palomaki, "Mask estimation and sparse imputation for missing data speech recognition in multisource reverberant environments," in Proc. CHiME, 2011, pp. 58-63.
- (2011) Proc. CHiME , pp. 58-63
- Kallasjoki, H.¹ Keronen, S.² Brown, G.J.³ Gemmeke, J.F.⁴ Remes, U.⁵ Palomaki, K.J.⁶

15
- 84893685019
- A flexible spatial blind source extraction framework for robust speech recognition in noisy environments
- F. Nesta, M. Matassoni, and R. Astudillo, "A flexible spatial blind source extraction framework for robust speech recognition in noisy environments," in Proc. CHiME, 2013, pp. 33-40.
- (2013) Proc. CHiME , pp. 33-40
- Nesta, F.¹ Matassoni, M.² Astudillo, R.³

16
- 18744401086
- Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion
- May
- L. Deng, J. Wu, J. Droppo, and A. Acero, "Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion," IEEE Transactions on Audio, Speech, and Language Processing, vol. 13, no. 3, pp. 412-421, May 2005.
- (2005) IEEE Transactions on Audio, Speech, and Language Processing , vol.13 , Issue.3 , pp. 412-421
- Deng, L.¹ Wu, J.² Droppo, J.³ Acero, A.⁴

17
- 84905275072
- Uncertaintybased learning of acoustic models from noisy data
- Feb
- A. Ozerov, M. Lagrange, and E. Vincent, "Uncertaintybased learning of acoustic models from noisy data," Computer Speech and Language, vol. 27, no. 3, pp. 874-894, Feb. 2013.
- (2013) Computer Speech and Language , vol.27 , Issue.3 , pp. 874-894
- Ozerov, A.¹ Lagrange, M.² Vincent, E.³

18
- 84897584695
- A general flexible framework for the handling of prior information in audio source separation
- May
- A. Ozerov, E. Vincent, and F. Bimbot, "A general flexible framework for the handling of prior information in audio source separation," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 4, pp. 1118-1133, May 2012.
- (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.4 , pp. 1118-1133
- Ozerov, A.¹ Vincent, E.² Bimbot, F.³

19
- 79959819066
- Ph.D. thesis, TU Berlin
- R. Astudillo, Integration of Short-Time Fourier Domain Speech Enhancement and Observation Uncertainty Techniques for Robust Automatic Speech Recognition, Ph.D. thesis, TU Berlin, 2010.
- (2010) Integration of Short-Time Fourier Domain Speech Enhancement and Observation Uncertainty Techniques for Robust Automatic Speech Recognition
- Astudillo, R.¹

20
- 84905260733
- Series and Product
- I. Gradshteyn and I. Ryzhik, Table of Intergral, Series and Product, 1995.
- (1995) Table of Intergral
- Gradshteyn, I.¹ Ryzhik, I.²

21
- 84939730902
- Mathematical analysis of random noise
- S. Rice, "Mathematical analysis of random noise," Bell System Technical Journal, vol. 23, 1944.
- (1944) Bell System Technical Journal , vol.23
- Rice, S.¹

22
- 0029725301
- A vector Taylor series approach for environment-independent speech recognition
- P. J. Moreno, B. Raj, and R. M. Stern, "A vector Taylor series approach for environment-independent speech recognition," in Proc. ICASSP, 1996, vol. 2, pp. 733-736.
- (1996) Proc. ICASSP , vol.2 , pp. 733-736
- Moreno, P.J.¹ Raj, B.² Stern, R.M.³

23
- 84905278580
- S. Young, G. Evermann, D. Kershaw, G. Moore, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK book, 2002
- S. Young, G. Evermann, D. Kershaw, G. Moore, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK book, 2002.

24
- 33847655586
- A generalized divergence measure fon nonnegative matrix factorization
- Mar
- R. Kompass, "A generalized divergence measure fon nonnegative matrix factorization," Neural Computation, vol. 19, no. 3, pp. 780-791, Mar. 2007.
- (2007) Neural Computation , vol.19 , Issue.3 , pp. 780-791
- Kompass, R.¹

25
- 84893336307
- Tech. Rep. RT-0428, Inria, Aug
- K. Adiloǧlu and E. Vincent, "Variational Bayesian inference for source separation and robust feature extraction," Tech. Rep. RT-0428, Inria, Aug. 2012.
- (2012) Variational Bayesian Inference for Source Separation and Robust Feature Extraction
- Adiloǧlu, K.¹ Vincent, E.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.