SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 23, Issue 7, 2015, Pages 1198-1208

Bounded conditional mean imputation with observation uncertainties and acoustic model adaptation

(4) Remes, Ulpu a López, Ana Ramírez a Palomäki, Kalle a Kurimo, Mikko a

a AALTO UNIVERSITY (Finland)

Author keywords

Acoustic model adaptation; missing data; noise robust speech recognition; observation uncertainties

Indexed keywords

ACOUSTIC NOISE; ACOUSTIC NOISE MEASUREMENT; SPEECH; UNCERTAINTY ANALYSIS;

ACOUSTIC MODEL ADAPTATION; AUTOMATIC SPEECH RECOGNITION SYSTEM; ENVIRONMENTAL VARIATIONS; MISSING DATA; MISSING DATA METHODS; NOISE ROBUST SPEECH RECOGNITION; OBSERVATION UNCERTAINTIES; POSTERIOR DISTRIBUTIONS;

SPEECH RECOGNITION;

EID: 84929376602 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASLP.2015.2424322 Document Type: Article

Times cited : (6)

References (44)

1
- 84929381006
- Environmental robustness in automatic speech recognition
- R. Rose, "Environmental robustness in automatic speech recognition," in Proc. ISCA Workshop Robustness Conversat. Interact., 2004.
- (2004) Proc. ISCA Workshop Robustness Conversat. Interact.
- Rose, R.¹

2
- 0022883703
- Noise compensation for speech recognition using probabilistic models
- J. N. Holmes and N. C. Sedgwick, "Noise compensation for speech recognition using probabilistic models," in Proc. ICASSP, 1986, pp. 741-744.
- (1986) Proc. ICASSP , pp. 741-744
- Holmes, J.N.¹ Sedgwick, N.C.²

3
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Commun., vol. 34, pp. 267-285, 2001.
- (2001) Speech Commun. , vol.34 , pp. 267-285
- Cooke, M.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

4
- 4644336054
- Reconstruction of missing features for robust speech recognition
- B. Raj, M. L. Seltzer, and R. M. Stern, "Reconstruction of missing features for robust speech recognition," Speech Commun., vol. 43, pp. 275-296, 2004.
- (2004) Speech Commun. , vol.43 , pp. 275-296
- Raj, B.¹ Seltzer, M.L.² Stern, R.M.³

5
- 33846190246
- Reconstructing spectral vectors with uncertain spectrographic masks for robust speech recognition
- B. Raj and R. Singh, "Reconstructing spectral vectors with uncertain spectrographic masks for robust speech recognition," in Proc. ASRU, 2005.
- (2005) Proc. ASRU
- Raj, B.¹ Singh, R.²

6
- 70349226857
- Bounded conditional mean imputation with Gaussian mixture models: A reconstruction approach to partly occluded features
- F. Faubel, J. McDonough, and D. Klakow, "Bounded conditional mean imputation with Gaussian mixture models: A reconstruction approach to partly occluded features," in Proc. ICASSP, 2009, pp. 3869-3872.
- (2009) Proc. ICASSP , pp. 3869-3872
- Faubel, F.¹ McDonough, J.² Klakow, D.³

7
- 77949695902
- Compres-sive sensing for missing data imputation in noise robust speech recognition
- Apr.
- J. F. Gemmeke, H. Van hamme, B. Cranen, and L. Boves, "Compres-sive sensing for missing data imputation in noise robust speech recognition," IEEE J. Sel. Topics Signal Process., vol. 4, no. 2, pp. 272-287, Apr. 2010.
- (2010) IEEE J. Sel. Topics Signal Process , vol.4 , Issue.2 , pp. 272-287
- Gemmeke, J.F.¹ Vanhamme, H.² Cranen, B.³ Boves, L.⁴

8
- 84929381007
- A comparative study of missing feature imputation techniques
- M. Braun, F. Faubel, and D. Klakow, "A comparative study of missing feature imputation techniques," in Proc. 10. ITG Symp. Speech Commun., 2012.
- (2012) Proc. 10. ITG Symp. Speech Commun.
- Braun, M.¹ Faubel, F.² Klakow, D.³

9
- 84867612282
- Combining missing-data reconstruction and uncertainty decoding for robust speech recognition
- J. A. González, A. M. Peinado, A. M. Gómez, N. Ma, and J. Barker, "Combining missing-data reconstruction and uncertainty decoding for robust speech recognition," in Proc. ICASSP, 2012, pp. 4693-4696.
- (2012) Proc. ICASSP , pp. 4693-4696
- González, J.A.¹ Peinado, A.M.² Gómez, A.M.³ Ma, N.⁴ Barker, J.⁵

10
- 84872188748
- MMSE-based missing-feature reconstruction with temporal modeling for robust speech recognition
- Mar.
- J. A. González, A. M. Peinado, N. Ma, A. M. Gómez, and J. Barker, "MMSE-based missing-feature reconstruction with temporal modeling for robust speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 3, pp. 624-635, Mar. 2013.
- (2013) IEEE Trans. Audio, Speech, Lang. Process , vol.21 , Issue.3 , pp. 624-635
- González, J.A.¹ Peinado, A.M.² Ma, N.³ Gómez, A.M.⁴ Barker, J.⁵

11
- 84906258664
- Bounded conditional mean imputation with an approximate posterior
- U. Remes, "Bounded conditional mean imputation with an approximate posterior," in Proc. Interspeech, 2013.
- (2013) Proc. Interspeech
- Remes, U.¹

12
- 84865710179
- GMM-based missing-feature reconstruction on multi-frame windows
- U. Remes, Y. Nankaku, and K. Tokuda, "GMM-based missing-feature reconstruction on multi-frame windows," in Proc. Interspeech, 2011.
- (2011) Proc. Interspeech
- Remes, U.¹ Nankaku, Y.² Tokuda, K.³

13
- 84929381009
- Robust automatic speech recognition using acoustic model adaptation prior to missing feature reconstruction
- U. Remes, K. J. Palomäki, and M. Kurimo, "Robust automatic speech recognition using acoustic model adaptation prior to missing feature reconstruction," in Proc. EUSIPCO, 2009.
- (2009) Proc. EUSIPCO
- Remes, U.¹ Palomäki, K.J.² Kurimo, M.³

14
- 56249136428
- Transforming binary uncertainties for robust speech recognition
- Sep.
- S. Srinivasan and D. L. Wang, "Transforming binary uncertainties for robust speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 7, pp. 2130-2140, Sep. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.7 , pp. 2130-2140
- Srinivasan, S.¹ Wang, D.L.²

15
- 79959814198
- Observation uncertainty measures for sparse imputation
- J. F. Gemmeke, U. Remes, and K. J. Palomäki, "Observation uncertainty measures for sparse imputation," in Proc. Interspeech, 2010.
- (2010) Proc. Interspeech
- Gemmeke, J.F.¹ Remes, U.² Palomäki, K.J.³

16
- 77956717352
- An uncertainty propagation approach to robust ASR using the ETSI advanced front-end
- Oct.
- R. F. Astudillo, D. Kolossa, P. Mandelartz, and R. Orglmeister, "An uncertainty propagation approach to robust ASR using the ETSI advanced front-end," IEEE J. Sel. Topics Signal Process., vol. 4, no. 5, pp. 824-833, Oct. 2010.
- (2010) IEEE J. Sel. Topics Signal Process , vol.4 , Issue.5 , pp. 824-833
- Astudillo, R.F.¹ Kolossa, D.² Mandelartz, P.³ Orglmeister, R.⁴

17
- 84893709985
- Uncertainty propagation
- D. Kolossa and R. Haeb-Umbach, Eds. New York, NY, USA: Springer Verlag
- R. F. Astudillo and D. Kolossa, "Uncertainty propagation," in Robust Speech Recognition of Uncertain and Missing Data, D. Kolossa and R. Haeb-Umbach, Eds. New York, NY, USA: Springer Verlag, 2011, pp. 35-64.
- (2011) Robust Speech Recognition of Uncertain and Missing Data , pp. 35-64
- Astudillo, R.F.¹ Kolossa, D.²

18
- 85009067687
- Using observation uncertainty in HMM decoding
- J. A. Arrowood and M. A. Clements, "Using observation uncertainty in HMM decoding," in Proc. ICSLP, 2002, pp. 1561-1564.
- (2002) Proc. ICSLP , pp. 1561-1564
- Arrowood, J.A.¹ Clements, M.A.²

19
- 18744401086
- Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion
- May
- L. Deng, J. Droppo, and A. Acero, "Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion," IEEE Trans. Speech Audio Process., vol. 13, no. 3, pp. 412-421, May 2005.
- (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.3 , pp. 412-421
- Deng, L.¹ Droppo, J.² Acero, A.³

20
- 40249103761
- Issues with uncertainty decoding for noise robust automatic speech recognition
- H. Liao and M. J. F. Gales, "Issues with uncertainty decoding for noise robust automatic speech recognition," Speech Commun., vol. 50, pp. 265-277, 2008.
- (2008) Speech Commun. , vol.50 , pp. 265-277
- Liao, H.¹ Gales, M.J.F.²

21
- 33749058582
- Separation and robust recognition of noisy, convolutive speech mixtures using time-frequency masking and missing data techniques
- D. Kolossa, A. Klimas, and R. Orglmeister, "Separation and robust recognition of noisy, convolutive speech mixtures using time-frequency masking and missing data techniques," in Proc. ASPAA, 2005.
- (2005) Proc. ASPAA
- Kolossa, D.¹ Klimas, A.² Orglmeister, R.³

22
- 84890521674
- GMM-based significance decoding
- A. H. Abdelaziz, S. Zeiler, D. Kolossa, V. Leutnant, and R. Haeb-Um-bach, "GMM-based significance decoding," in Proc. ICASSP, 2013, pp. 6827-6831.
- (2013) Proc. ICASSP , pp. 6827-6831
- Abdelaziz, A.H.¹ Zeiler, S.² Kolossa, D.³ Leutnant, V.⁴ Haeb-Um-Bach, R.⁵

23
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., vol. 12, pp. 75-98, 1998.
- (1998) Comput. Speech Lang. , vol.12 , pp. 75-98
- Gales, M.J.F.¹

24
- 69849103259
- Adaptive multimodal fusion by uncertainty compensation with application to audiovisual speech recognition
- Mar.
- G. Papandreou, A. Katsamanis, V. Pitsikalis, and P. Maragos, "Adaptive multimodal fusion by uncertainty compensation with application to audiovisual speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 3, pp. 423-435, Mar. 2009.
- (2009) IEEE Trans. Audio, Speech, Lang. Process , vol.17 , Issue.3 , pp. 423-435
- Papandreou, G.¹ Katsamanis, A.² Pitsikalis, V.³ Maragos, P.⁴

25
- 84905275072
- Uncertainty-based learning of acoustic models from noisy data
- A. Ozerov, M. Lagrange, and E. Vincent, "Uncertainty-based learning of acoustic models from noisy data," Comput. Speech Lang., vol. 27, pp. 874-894, 2013.
- (2013) Comput. Speech Lang. , vol.27 , pp. 874-894
- Ozerov, A.¹ Lagrange, M.² Vincent, E.³

26
- 80051604053
- MAP-based estimation of the parameters of non-stationary Gaussian processes from noisy observations
- A. Krueger and R. Haeb-Umbach, "MAP-based estimation of the parameters of non-stationary Gaussian processes from noisy observations," in Proc. ICASSP, 2011, pp. 3596-3599.
- (2011) Proc. ICASSP , pp. 3596-3599
- Krueger, A.¹ Haeb-Umbach, R.²

27
- 84890514458
- MAP-based estimation of the parameters of a Gaussian mixture model in the presence of noisy observations
- A. Chinaev and R. Haeb-Umbach, "MAP-based estimation of the parameters of a Gaussian mixture model in the presence of noisy observations," in Proc. ICASSP, 2013, pp. 3352-3356.
- (2013) Proc. ICASSP , pp. 3352-3356
- Chinaev, A.¹ Haeb-Umbach, R.²

28
- 0029375590
- Speaker adaptation using constrained estimation of Gaussian mixtures
- Sep.
- V. V. Digalakis, D. Rtischev, and L. G. Neumeyer, "Speaker adaptation using constrained estimation of Gaussian mixtures," IEEE Trans. Speech Audio Process., vol. 3, no. 5, pp. 357-366, Sep. 1995.
- (1995) IEEE Trans. Speech Audio Process , vol.3 , Issue.5 , pp. 357-366
- Digalakis, V.V.¹ Rtischev, D.² Neumeyer, L.G.³

29
- 84865783757
- Separating speaker and environmental variability using factored transforms
- M. L. Seltzer and A. Acero, "Separating speaker and environmental variability using factored transforms," in Proc. Interspeech, 2011.
- (2011) Proc. Interspeech
- Seltzer, M.L.¹ Acero, A.²

30
- 84862293102
- Speaker and noise factorization for robust speech recognition
- Sep.
- Y. Wang and M. J. F. Gales, "Speaker and noise factorization for robust speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 7, pp. 2149-2158, Sep. 2012.
- (2012) IEEE Trans. Audio, Speech, Lang. Process , vol.20 , Issue.7 , pp. 2149-2158
- Wang, Y.¹ Gales, M.J.F.²

31
- 78049302682
- Noisy constrained maximum-likelihood linear regression for noise-robust speech recognition
- Feb.
- D. K. Kim and M. J. F. Gales, "Noisy constrained maximum-likelihood linear regression for noise-robust speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 2, pp. 315-325, Feb. 2011.
- (2011) IEEE Trans. Audio, Speech, Lang. Process , vol.19 , Issue.2 , pp. 315-325
- Kim, D.K.¹ Gales, M.J.F.²

32
- 84910032186
- SPEECON-speech databases for consumer devices: Database specification and validation
- D. Iskra, B. Grosskopf, K. Marasek, H. van den Heuvel, F. Diehl, and A. Kiessling, "SPEECON-speech databases for consumer devices: Database specification and validation," in Proc. LREC, 2002.
- (2002) Proc. LREC
- Iskra, D.¹ Grosskopf, B.² Marasek, K.³ Heuvel Den H.Van⁴ Diehl, F.⁵ Kiessling, A.⁶

33
- 65349113250
- Importance of high-order n-gram models in morph-based speech recognition
- May
- T. Hirsimäki, J. Pylkkönen, and M. Kurimo, "Importance of high-order n-gram models in morph-based speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 4, pp. 724-732, May 2009.
- (2009) IEEE Trans. Audio, Speech, Lang. Process , vol.17 , Issue.4 , pp. 724-732
- Hirsimäki, T.¹ Pylkkönen, J.² Kurimo, M.³

34
- 78049401459
- Duration modeling techniques for continuous speech recognition
- J. Pylkkönen and M. Kurimo, "Duration modeling techniques for continuous speech recognition," in Proc. Interspeech, 2004.
- (2004) Proc. Interspeech
- Pylkkönen, J.¹ Kurimo, M.²

35
- 84855263089
- A. Willis, QPC-Quadratic Programming in C V2.0. 2010.
- (2010) QPC-Quadratic Programming in C V2.0.
- Willis, A.¹

36
- 84906270822
- J. Kämäräinen and P. Paalanen, GMMBAYES-Bayesian classifier and Gaussian mixture model toolbox V1.0. 2005.
- (2005) GMMBAYES-Bayesian Classifier and Gaussian Mixture Model Toolbox V1.0.
- Kämäräinen, J.¹ Paalanen, P.²

37
- 84890449972
- A practical, self-adaptive voice activity detector for speaker verification with noisy telephone and microphone data
- T. Kinnunen and P. Rajan, "A practical, self-adaptive voice activity detector for speaker verification with noisy telephone and microphone data," in Proc. ICASSP, 2013, pp. 7229-7231.
- (2013) Proc. ICASSP , pp. 7229-7231
- Kinnunen, T.¹ Rajan, P.²

38
- 33644661135
- A glimpsing model of speech perception in noise
- M. Cooke, "A glimpsing model of speech perception in noise," J. Acoust. Soc. Amer., vol. 119, pp. 1562-1573, 2006.
- (2006) J. Acoust. Soc. Amer. , vol.119 , pp. 1562-1573
- Cooke, M.¹

39
- 84897933562
- Estimating uncertainty to improve exemplar-based feature enhancement for noise robust speech recognition
- Feb.
- H. Kallasjoki, J. F. Gemmeke, and K. J. Palomäki, "Estimating uncertainty to improve exemplar-based feature enhancement for noise robust speech recognition," IEEE/ACM Trans. Audio, Speech, Lang. Process., vol. 22, no. 2, pp. 368-380, Feb. 2014.
- (2014) IEEE/ACM Trans. Audio, Speech, Lang. Process , vol.22 , Issue.2 , pp. 368-380
- Kallasjoki, H.¹ Gemmeke, J.F.² Palomäki, K.J.³

40
- 84929381011
- Noise robust missing data mask estimation based on automatically learned features
- S. Keronen, U. Remes, H. Kallasjoki, and K. Palomäki, "Noise robust missing data mask estimation based on automatically learned features," in Proc. CHIME, 2013.
- (2013) Proc. CHIME
- Keronen, S.¹ Remes, U.² Kallasjoki, H.³ Palomäki, K.⁴

41
- 84946061185
- Recognition of reverberant speech by missing data imputation and NMF feature enhancement
- H. Kallasjoki, J. F. Gemmeke, K. J. Palomäki, A. V. Beeston, and G. J. Brown, "Recognition of reverberant speech by missing data imputation and NMF feature enhancement," in Proc. REVERB, 2014.
- (2014) Proc. REVERB
- Kallasjoki, H.¹ Gemmeke, J.F.² Palomäki, K.J.³ Beeston, A.V.⁴ Brown, G.J.⁵

42
- 33646762213
- Accounting for the uncertainty of speech estimates in the context of model-based feature enhancement
- V. Stouten, H. Van hamme, and P. Wambacq, "Accounting for the uncertainty of speech estimates in the context of model-based feature enhancement," in Proc. Interspeech, 2004.
- (2004) Proc. Interspeech
- Stouten, V.¹ Van Hamme, H.² Wambacq, P.³

43
- 70350450398
- Static and dynamic variance compensation for recognition of reverberant speech with derever-beration preprocessing
- Feb.
- M. Delcroix, T. Nakatani, and S. Watanabe, "Static and dynamic variance compensation for recognition of reverberant speech with derever-beration preprocessing," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 2, pp. 324-334, Feb. 2009.
- (2009) IEEE Trans. Audio, Speech, Lang. Process , vol.17 , Issue.2 , pp. 324-334
- Delcroix, M.¹ Nakatani, T.² Watanabe, S.³

44
- 84905216197
- Extension of uncertainty propagation to dynamic MFCCs for noise robust ASR
- D. T. Tran, E. Vincent, and D. Jouvet, "Extension of uncertainty propagation to dynamic MFCCs for noise robust ASR," in Proc. ICASSP, 2014, pp. 5507-5511.
- (2014) Proc. ICASSP , pp. 5507-5511
- Tran, D.T.¹ Vincent, E.² Jouvet, D.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.