메뉴 건너뛰기




Volumn 23, Issue 7, 2015, Pages 1198-1208

Bounded conditional mean imputation with observation uncertainties and acoustic model adaptation

Author keywords

Acoustic model adaptation; missing data; noise robust speech recognition; observation uncertainties

Indexed keywords

ACOUSTIC NOISE; ACOUSTIC NOISE MEASUREMENT; SPEECH; UNCERTAINTY ANALYSIS;

EID: 84929376602     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASLP.2015.2424322     Document Type: Article
Times cited : (6)

References (44)
  • 2
    • 0022883703 scopus 로고
    • Noise compensation for speech recognition using probabilistic models
    • J. N. Holmes and N. C. Sedgwick, "Noise compensation for speech recognition using probabilistic models," in Proc. ICASSP, 1986, pp. 741-744.
    • (1986) Proc. ICASSP , pp. 741-744
    • Holmes, J.N.1    Sedgwick, N.C.2
  • 3
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Commun., vol. 34, pp. 267-285, 2001.
    • (2001) Speech Commun. , vol.34 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 4
    • 4644336054 scopus 로고    scopus 로고
    • Reconstruction of missing features for robust speech recognition
    • B. Raj, M. L. Seltzer, and R. M. Stern, "Reconstruction of missing features for robust speech recognition," Speech Commun., vol. 43, pp. 275-296, 2004.
    • (2004) Speech Commun. , vol.43 , pp. 275-296
    • Raj, B.1    Seltzer, M.L.2    Stern, R.M.3
  • 5
    • 33846190246 scopus 로고    scopus 로고
    • Reconstructing spectral vectors with uncertain spectrographic masks for robust speech recognition
    • B. Raj and R. Singh, "Reconstructing spectral vectors with uncertain spectrographic masks for robust speech recognition," in Proc. ASRU, 2005.
    • (2005) Proc. ASRU
    • Raj, B.1    Singh, R.2
  • 6
    • 70349226857 scopus 로고    scopus 로고
    • Bounded conditional mean imputation with Gaussian mixture models: A reconstruction approach to partly occluded features
    • F. Faubel, J. McDonough, and D. Klakow, "Bounded conditional mean imputation with Gaussian mixture models: A reconstruction approach to partly occluded features," in Proc. ICASSP, 2009, pp. 3869-3872.
    • (2009) Proc. ICASSP , pp. 3869-3872
    • Faubel, F.1    McDonough, J.2    Klakow, D.3
  • 7
    • 77949695902 scopus 로고    scopus 로고
    • Compres-sive sensing for missing data imputation in noise robust speech recognition
    • Apr.
    • J. F. Gemmeke, H. Van hamme, B. Cranen, and L. Boves, "Compres-sive sensing for missing data imputation in noise robust speech recognition," IEEE J. Sel. Topics Signal Process., vol. 4, no. 2, pp. 272-287, Apr. 2010.
    • (2010) IEEE J. Sel. Topics Signal Process , vol.4 , Issue.2 , pp. 272-287
    • Gemmeke, J.F.1    Vanhamme, H.2    Cranen, B.3    Boves, L.4
  • 9
    • 84867612282 scopus 로고    scopus 로고
    • Combining missing-data reconstruction and uncertainty decoding for robust speech recognition
    • J. A. González, A. M. Peinado, A. M. Gómez, N. Ma, and J. Barker, "Combining missing-data reconstruction and uncertainty decoding for robust speech recognition," in Proc. ICASSP, 2012, pp. 4693-4696.
    • (2012) Proc. ICASSP , pp. 4693-4696
    • González, J.A.1    Peinado, A.M.2    Gómez, A.M.3    Ma, N.4    Barker, J.5
  • 10
    • 84872188748 scopus 로고    scopus 로고
    • MMSE-based missing-feature reconstruction with temporal modeling for robust speech recognition
    • Mar.
    • J. A. González, A. M. Peinado, N. Ma, A. M. Gómez, and J. Barker, "MMSE-based missing-feature reconstruction with temporal modeling for robust speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 3, pp. 624-635, Mar. 2013.
    • (2013) IEEE Trans. Audio, Speech, Lang. Process , vol.21 , Issue.3 , pp. 624-635
    • González, J.A.1    Peinado, A.M.2    Ma, N.3    Gómez, A.M.4    Barker, J.5
  • 11
    • 84906258664 scopus 로고    scopus 로고
    • Bounded conditional mean imputation with an approximate posterior
    • U. Remes, "Bounded conditional mean imputation with an approximate posterior," in Proc. Interspeech, 2013.
    • (2013) Proc. Interspeech
    • Remes, U.1
  • 12
    • 84865710179 scopus 로고    scopus 로고
    • GMM-based missing-feature reconstruction on multi-frame windows
    • U. Remes, Y. Nankaku, and K. Tokuda, "GMM-based missing-feature reconstruction on multi-frame windows," in Proc. Interspeech, 2011.
    • (2011) Proc. Interspeech
    • Remes, U.1    Nankaku, Y.2    Tokuda, K.3
  • 13
    • 84929381009 scopus 로고    scopus 로고
    • Robust automatic speech recognition using acoustic model adaptation prior to missing feature reconstruction
    • U. Remes, K. J. Palomäki, and M. Kurimo, "Robust automatic speech recognition using acoustic model adaptation prior to missing feature reconstruction," in Proc. EUSIPCO, 2009.
    • (2009) Proc. EUSIPCO
    • Remes, U.1    Palomäki, K.J.2    Kurimo, M.3
  • 14
    • 56249136428 scopus 로고    scopus 로고
    • Transforming binary uncertainties for robust speech recognition
    • Sep.
    • S. Srinivasan and D. L. Wang, "Transforming binary uncertainties for robust speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 7, pp. 2130-2140, Sep. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.7 , pp. 2130-2140
    • Srinivasan, S.1    Wang, D.L.2
  • 16
    • 77956717352 scopus 로고    scopus 로고
    • An uncertainty propagation approach to robust ASR using the ETSI advanced front-end
    • Oct.
    • R. F. Astudillo, D. Kolossa, P. Mandelartz, and R. Orglmeister, "An uncertainty propagation approach to robust ASR using the ETSI advanced front-end," IEEE J. Sel. Topics Signal Process., vol. 4, no. 5, pp. 824-833, Oct. 2010.
    • (2010) IEEE J. Sel. Topics Signal Process , vol.4 , Issue.5 , pp. 824-833
    • Astudillo, R.F.1    Kolossa, D.2    Mandelartz, P.3    Orglmeister, R.4
  • 17
    • 84893709985 scopus 로고    scopus 로고
    • Uncertainty propagation
    • D. Kolossa and R. Haeb-Umbach, Eds. New York, NY, USA: Springer Verlag
    • R. F. Astudillo and D. Kolossa, "Uncertainty propagation," in Robust Speech Recognition of Uncertain and Missing Data, D. Kolossa and R. Haeb-Umbach, Eds. New York, NY, USA: Springer Verlag, 2011, pp. 35-64.
    • (2011) Robust Speech Recognition of Uncertain and Missing Data , pp. 35-64
    • Astudillo, R.F.1    Kolossa, D.2
  • 18
    • 85009067687 scopus 로고    scopus 로고
    • Using observation uncertainty in HMM decoding
    • J. A. Arrowood and M. A. Clements, "Using observation uncertainty in HMM decoding," in Proc. ICSLP, 2002, pp. 1561-1564.
    • (2002) Proc. ICSLP , pp. 1561-1564
    • Arrowood, J.A.1    Clements, M.A.2
  • 19
    • 18744401086 scopus 로고    scopus 로고
    • Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion
    • May
    • L. Deng, J. Droppo, and A. Acero, "Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion," IEEE Trans. Speech Audio Process., vol. 13, no. 3, pp. 412-421, May 2005.
    • (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.3 , pp. 412-421
    • Deng, L.1    Droppo, J.2    Acero, A.3
  • 20
    • 40249103761 scopus 로고    scopus 로고
    • Issues with uncertainty decoding for noise robust automatic speech recognition
    • H. Liao and M. J. F. Gales, "Issues with uncertainty decoding for noise robust automatic speech recognition," Speech Commun., vol. 50, pp. 265-277, 2008.
    • (2008) Speech Commun. , vol.50 , pp. 265-277
    • Liao, H.1    Gales, M.J.F.2
  • 21
    • 33749058582 scopus 로고    scopus 로고
    • Separation and robust recognition of noisy, convolutive speech mixtures using time-frequency masking and missing data techniques
    • D. Kolossa, A. Klimas, and R. Orglmeister, "Separation and robust recognition of noisy, convolutive speech mixtures using time-frequency masking and missing data techniques," in Proc. ASPAA, 2005.
    • (2005) Proc. ASPAA
    • Kolossa, D.1    Klimas, A.2    Orglmeister, R.3
  • 23
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., vol. 12, pp. 75-98, 1998.
    • (1998) Comput. Speech Lang. , vol.12 , pp. 75-98
    • Gales, M.J.F.1
  • 24
    • 69849103259 scopus 로고    scopus 로고
    • Adaptive multimodal fusion by uncertainty compensation with application to audiovisual speech recognition
    • Mar.
    • G. Papandreou, A. Katsamanis, V. Pitsikalis, and P. Maragos, "Adaptive multimodal fusion by uncertainty compensation with application to audiovisual speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 3, pp. 423-435, Mar. 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process , vol.17 , Issue.3 , pp. 423-435
    • Papandreou, G.1    Katsamanis, A.2    Pitsikalis, V.3    Maragos, P.4
  • 25
    • 84905275072 scopus 로고    scopus 로고
    • Uncertainty-based learning of acoustic models from noisy data
    • A. Ozerov, M. Lagrange, and E. Vincent, "Uncertainty-based learning of acoustic models from noisy data," Comput. Speech Lang., vol. 27, pp. 874-894, 2013.
    • (2013) Comput. Speech Lang. , vol.27 , pp. 874-894
    • Ozerov, A.1    Lagrange, M.2    Vincent, E.3
  • 26
    • 80051604053 scopus 로고    scopus 로고
    • MAP-based estimation of the parameters of non-stationary Gaussian processes from noisy observations
    • A. Krueger and R. Haeb-Umbach, "MAP-based estimation of the parameters of non-stationary Gaussian processes from noisy observations," in Proc. ICASSP, 2011, pp. 3596-3599.
    • (2011) Proc. ICASSP , pp. 3596-3599
    • Krueger, A.1    Haeb-Umbach, R.2
  • 27
    • 84890514458 scopus 로고    scopus 로고
    • MAP-based estimation of the parameters of a Gaussian mixture model in the presence of noisy observations
    • A. Chinaev and R. Haeb-Umbach, "MAP-based estimation of the parameters of a Gaussian mixture model in the presence of noisy observations," in Proc. ICASSP, 2013, pp. 3352-3356.
    • (2013) Proc. ICASSP , pp. 3352-3356
    • Chinaev, A.1    Haeb-Umbach, R.2
  • 28
    • 0029375590 scopus 로고
    • Speaker adaptation using constrained estimation of Gaussian mixtures
    • Sep.
    • V. V. Digalakis, D. Rtischev, and L. G. Neumeyer, "Speaker adaptation using constrained estimation of Gaussian mixtures," IEEE Trans. Speech Audio Process., vol. 3, no. 5, pp. 357-366, Sep. 1995.
    • (1995) IEEE Trans. Speech Audio Process , vol.3 , Issue.5 , pp. 357-366
    • Digalakis, V.V.1    Rtischev, D.2    Neumeyer, L.G.3
  • 29
    • 84865783757 scopus 로고    scopus 로고
    • Separating speaker and environmental variability using factored transforms
    • M. L. Seltzer and A. Acero, "Separating speaker and environmental variability using factored transforms," in Proc. Interspeech, 2011.
    • (2011) Proc. Interspeech
    • Seltzer, M.L.1    Acero, A.2
  • 30
    • 84862293102 scopus 로고    scopus 로고
    • Speaker and noise factorization for robust speech recognition
    • Sep.
    • Y. Wang and M. J. F. Gales, "Speaker and noise factorization for robust speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 7, pp. 2149-2158, Sep. 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process , vol.20 , Issue.7 , pp. 2149-2158
    • Wang, Y.1    Gales, M.J.F.2
  • 31
    • 78049302682 scopus 로고    scopus 로고
    • Noisy constrained maximum-likelihood linear regression for noise-robust speech recognition
    • Feb.
    • D. K. Kim and M. J. F. Gales, "Noisy constrained maximum-likelihood linear regression for noise-robust speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 2, pp. 315-325, Feb. 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process , vol.19 , Issue.2 , pp. 315-325
    • Kim, D.K.1    Gales, M.J.F.2
  • 33
    • 65349113250 scopus 로고    scopus 로고
    • Importance of high-order n-gram models in morph-based speech recognition
    • May
    • T. Hirsimäki, J. Pylkkönen, and M. Kurimo, "Importance of high-order n-gram models in morph-based speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 4, pp. 724-732, May 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process , vol.17 , Issue.4 , pp. 724-732
    • Hirsimäki, T.1    Pylkkönen, J.2    Kurimo, M.3
  • 34
    • 78049401459 scopus 로고    scopus 로고
    • Duration modeling techniques for continuous speech recognition
    • J. Pylkkönen and M. Kurimo, "Duration modeling techniques for continuous speech recognition," in Proc. Interspeech, 2004.
    • (2004) Proc. Interspeech
    • Pylkkönen, J.1    Kurimo, M.2
  • 37
    • 84890449972 scopus 로고    scopus 로고
    • A practical, self-adaptive voice activity detector for speaker verification with noisy telephone and microphone data
    • T. Kinnunen and P. Rajan, "A practical, self-adaptive voice activity detector for speaker verification with noisy telephone and microphone data," in Proc. ICASSP, 2013, pp. 7229-7231.
    • (2013) Proc. ICASSP , pp. 7229-7231
    • Kinnunen, T.1    Rajan, P.2
  • 38
    • 33644661135 scopus 로고    scopus 로고
    • A glimpsing model of speech perception in noise
    • M. Cooke, "A glimpsing model of speech perception in noise," J. Acoust. Soc. Amer., vol. 119, pp. 1562-1573, 2006.
    • (2006) J. Acoust. Soc. Amer. , vol.119 , pp. 1562-1573
    • Cooke, M.1
  • 39
    • 84897933562 scopus 로고    scopus 로고
    • Estimating uncertainty to improve exemplar-based feature enhancement for noise robust speech recognition
    • Feb.
    • H. Kallasjoki, J. F. Gemmeke, and K. J. Palomäki, "Estimating uncertainty to improve exemplar-based feature enhancement for noise robust speech recognition," IEEE/ACM Trans. Audio, Speech, Lang. Process., vol. 22, no. 2, pp. 368-380, Feb. 2014.
    • (2014) IEEE/ACM Trans. Audio, Speech, Lang. Process , vol.22 , Issue.2 , pp. 368-380
    • Kallasjoki, H.1    Gemmeke, J.F.2    Palomäki, K.J.3
  • 40
    • 84929381011 scopus 로고    scopus 로고
    • Noise robust missing data mask estimation based on automatically learned features
    • S. Keronen, U. Remes, H. Kallasjoki, and K. Palomäki, "Noise robust missing data mask estimation based on automatically learned features," in Proc. CHIME, 2013.
    • (2013) Proc. CHIME
    • Keronen, S.1    Remes, U.2    Kallasjoki, H.3    Palomäki, K.4
  • 42
    • 33646762213 scopus 로고    scopus 로고
    • Accounting for the uncertainty of speech estimates in the context of model-based feature enhancement
    • V. Stouten, H. Van hamme, and P. Wambacq, "Accounting for the uncertainty of speech estimates in the context of model-based feature enhancement," in Proc. Interspeech, 2004.
    • (2004) Proc. Interspeech
    • Stouten, V.1    Van Hamme, H.2    Wambacq, P.3
  • 43
    • 70350450398 scopus 로고    scopus 로고
    • Static and dynamic variance compensation for recognition of reverberant speech with derever-beration preprocessing
    • Feb.
    • M. Delcroix, T. Nakatani, and S. Watanabe, "Static and dynamic variance compensation for recognition of reverberant speech with derever-beration preprocessing," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 2, pp. 324-334, Feb. 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process , vol.17 , Issue.2 , pp. 324-334
    • Delcroix, M.1    Nakatani, T.2    Watanabe, S.3
  • 44
    • 84905216197 scopus 로고    scopus 로고
    • Extension of uncertainty propagation to dynamic MFCCs for noise robust ASR
    • D. T. Tran, E. Vincent, and D. Jouvet, "Extension of uncertainty propagation to dynamic MFCCs for noise robust ASR," in Proc. ICASSP, 2014, pp. 5507-5511.
    • (2014) Proc. ICASSP , pp. 5507-5511
    • Tran, D.T.1    Vincent, E.2    Jouvet, D.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.