메뉴 건너뛰기




Volumn 17, Issue 7, 2009, Pages 1325-1334

Stereo-based stochastic mapping for robust speech recognition

Author keywords

Noise robustness; Nonlinear mapping; Speech recognition; Stereo data

Indexed keywords

CLEAN SPEECH; DIGIT RECOGNITION; FIELD DATA; GAUSSIAN MIXTURE MODEL; JOINT DISTRIBUTIONS; LARGE VOCABULARY; LINEAR TRANSFORM; LINEAR TRANSFORMATION; MAXIMUM A POSTERIORI CRITERIONS; MINIMUM MEAN SQUARE ERROR CRITERION; NOISE ROBUSTNESS; NONLINEAR MAPPING; REAL ENVIRONMENTS; ROBUST SPEECH RECOGNITION; STEREO-BASED; STEREO-DATA; STOCHASTIC MAPPING; WORD ERROR RATE;

EID: 68549125183     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2009.2018017     Document Type: Article
Times cited : (34)

References (37)
  • 1
    • 0004319970 scopus 로고
    • Acoustical and environmental robustness for automatic speech recognition,
    • Ph.D. dissertation, Elect. Comput. Eng. Dept, Carnegie Mellon Univ, Pittsburgh, PA, Sep
    • A. Acero, "Acoustical and environmental robustness for automatic speech recognition," Ph.D. dissertation, Elect. Comput. Eng. Dept., Carnegie Mellon Univ., Pittsburgh, PA, Sep. 1990.
    • (1990)
    • Acero, A.1
  • 2
    • 34547550766 scopus 로고    scopus 로고
    • Stereo-based stochastic mapping for robust speech recognition
    • Honolulu, hi, Apr
    • M. Afify, X. Cui, and Y. Gao, "Stereo-based stochastic mapping for robust speech recognition," in Proc. ICASSP'07, Honolulu, hi, Apr. 2007, pp. 377-380.
    • (2007) Proc. ICASSP'07 , pp. 377-380
    • Afify, M.1    Cui, X.2    Gao, Y.3
  • 3
    • 18744396687 scopus 로고    scopus 로고
    • Accurate compensation in the log-spectral domain for noisy speech recognition
    • May
    • M. Afify, "Accurate compensation in the log-spectral domain for noisy speech recognition," IEEE Trans. Speech Audio Process., vol. 13, no. 3, pp. 388-398, May 2005.
    • (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.3 , pp. 388-398
    • Afify, M.1
  • 4
    • 0030643240 scopus 로고    scopus 로고
    • Subband-based speech recognition
    • Munich, Germany, Apr
    • H. Bourlard and S. Dupont, "Subband-based speech recognition," in Proc. ICASSP'97, Munich, Germany, Apr. 1997, pp. 1251-1254.
    • (1997) Proc. ICASSP'97 , pp. 1251-1254
    • Bourlard, H.1    Dupont, S.2
  • 5
    • 84867220211 scopus 로고    scopus 로고
    • N-best based stochastic mapping on stereo HMM for noise robust speech recognition
    • Brisbane, Australia, Sep
    • X. Cui, M. Afify, and Y. Gao, "N-best based stochastic mapping on stereo HMM for noise robust speech recognition," in Proc. Interspeech'08, Brisbane, Australia, Sep. 2008.
    • (2008) Proc. Interspeech'08
    • Cui, X.1    Afify, M.2    Gao, Y.3
  • 6
    • 0029375590 scopus 로고
    • Speaker adaptation by constrained estimation of Gaussian mixtures
    • Sep
    • V. Digalakis, D. Rtischev, and L. Neumeyer, "Speaker adaptation by constrained estimation of Gaussian mixtures," IEEE Trans. Speech Audio Process., vol. 3, no. 5, pp. 357-366, Sep. 1995.
    • (1995) IEEE Trans. Speech Audio Process , vol.3 , Issue.5 , pp. 357-366
    • Digalakis, V.1    Rtischev, D.2    Neumeyer, L.3
  • 7
    • 85006734596 scopus 로고    scopus 로고
    • Evaluation of the SPLICE algorithm on the AURORA 2 database
    • Aalborg, Denmark, Sep
    • J. Droppo, L. Deng, and A. Acero, "Evaluation of the SPLICE algorithm on the AURORA 2 database," in Proc. Eurospeech'01, Aalborg, Denmark, Sep. 2001.
    • (2001) Proc. Eurospeech'01
    • Droppo, J.1    Deng, L.2    Acero, A.3
  • 8
    • 0036291376 scopus 로고    scopus 로고
    • Uncertainty decoding with splice for noise robust speech recognition
    • Orlando, FL, May
    • J. Droppo, L. Deng, and A. Acero, "Uncertainty decoding with splice for noise robust speech recognition," in Proc. ICASSP'02, Orlando, FL, May 2002, pp. 57-80.
    • (2002) Proc. ICASSP'02 , pp. 57-80
    • Droppo, J.1    Deng, L.2    Acero, A.3
  • 9
    • 85009074657 scopus 로고    scopus 로고
    • ALGONQUIN: Iterating laplace's method to remove multiple types of acoustic distortion for robust speech recognition
    • Aalborg, Denmark, Sep
    • B. Frey, L. Deng, A. Acero, and T. Kristjanson, "ALGONQUIN: Iterating laplace's method to remove multiple types of acoustic distortion for robust speech recognition," in Proc. Eurospeech'01, Aalborg, Denmark, Sep. 2001.
    • (2001) Proc. Eurospeech'01
    • Frey, B.1    Deng, L.2    Acero, A.3    Kristjanson, T.4
  • 10
    • 0030245128 scopus 로고    scopus 로고
    • Robust continuous speech recognition using parallel model combination
    • Sep
    • M. Gales and S. Young, "Robust continuous speech recognition using parallel model combination," IEEE Trans. Speech Audio Process., vol. 4, no. 5, pp. 352-359, Sep. 1996.
    • (1996) IEEE Trans. Speech Audio Process , vol.4 , Issue.5 , pp. 352-359
    • Gales, M.1    Young, S.2
  • 11
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., pp. 193-228, 1998.
    • (1998) Comput. Speech Lang , pp. 193-228
    • Gales, M.1
  • 12
    • 0032638856 scopus 로고    scopus 로고
    • Semi-tied covariance matrices for hidden Markov models
    • May
    • M. Gales, "Semi-tied covariance matrices for hidden Markov models," IEEE Trans. Speech Audio Process., vol. 7, no. 3, pp. 272-281, May 1999.
    • (1999) IEEE Trans. Speech Audio Process , vol.7 , Issue.3 , pp. 272-281
    • Gales, M.1
  • 14
    • 0029288202 scopus 로고
    • Speech recognition in noisy environments: A survey
    • Apr
    • Y. Gong, "Speech recognition in noisy environments: A survey," Speech Commun., vol. 16, pp. 261-291, Apr. 1995.
    • (1995) Speech Commun , vol.16 , pp. 261-291
    • Gong, Y.1
  • 17
    • 34547526633 scopus 로고    scopus 로고
    • A maximum likelihood training approach to irrelevant variability compensation based on piecewise linear transformations
    • Pittsburgh, PA, Sep
    • Q. Huo and D. Zhu, "A maximum likelihood training approach to irrelevant variability compensation based on piecewise linear transformations," in Proc. Interspeech'06, Pittsburgh, PA, Sep. 2006.
    • (2006) Proc. Interspeech'06
    • Huo, Q.1    Zhu, D.2
  • 18
    • 0023167174 scopus 로고
    • Signal restoration by spectral mapping
    • Apr
    • B. H. Juang and L. R. Rabiner, "Signal restoration by spectral mapping," in Proc. ICASSP'87, Apr. 1987, pp. 2368-2372.
    • (1987) Proc. ICASSP'87 , pp. 2368-2372
    • Juang, B.H.1    Rabiner, L.R.2
  • 19
    • 33947669945 scopus 로고    scopus 로고
    • Feature adaptation based on Gaussian posteriors
    • Tolouse, France, Apr
    • S. Kozat, K. Visweswariah, andR. Gopinath, "Feature adaptation based on Gaussian posteriors," in Proc. ICASSP'06, Tolouse, France, Apr. 2006, pp. 221-224.
    • (2006) Proc. ICASSP'06 , pp. 221-224
    • Kozat, S.1    Visweswariah, K.2    andR3    Gopinath4
  • 20
    • 0036293930 scopus 로고    scopus 로고
    • Accounting for uncertainity in observations: A new paradigm for robust speech recognition
    • Orlando, FL, May
    • T. Kristjansson and B. Frey, "Accounting for uncertainity in observations: A new paradigm for robust speech recognition," in Proc. ICASSP'02, Orlando, FL, May 2002.
    • (2002) Proc. ICASSP'02
    • Kristjansson, T.1    Frey, B.2
  • 21
    • 0032140546 scopus 로고    scopus 로고
    • On stochastic feature and model compensation approaches to robust speech recognition
    • C. H. Lee, "On stochastic feature and model compensation approaches to robust speech recognition," Speech Commun., vol. 25, pp. 29-47, 1998.
    • (1998) Speech Commun , vol.25 , pp. 29-47
    • Lee, C.H.1
  • 22
    • 0000159105 scopus 로고    scopus 로고
    • On adaptive decision rules and decision parameter adaptation for automatic speech recognition
    • Aug
    • C. H. Lee and Q. Huo, "On adaptive decision rules and decision parameter adaptation for automatic speech recognition," Proc. IEEE, vol. 88, no. 8, pp. 1241-1269, Aug. 2000.
    • (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1241-1269
    • Lee, C.H.1    Huo, Q.2
  • 23
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. Leggetter and P. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, pp. 171-185, 1995.
    • (1995) Comput. Speech Lang , vol.9 , pp. 171-185
    • Leggetter, C.1    Woodland, P.2
  • 24
    • 68549095140 scopus 로고    scopus 로고
    • High performance HMM adaptation with joint compensation of additive and convolutive distortions via vector Taylor series
    • Kyoto, Japan
    • J. Li, L. Deng, Y.Gong, and A. Acero, "High performance HMM adaptation with joint compensation of additive and convolutive distortions via vector Taylor series," in Proc. ASRU'07, Kyoto, Japan, 2007.
    • (2007) Proc. ASRU'07
    • Li, J.1    Deng, L.2    Gong, Y.3    Acero, A.4
  • 25
    • 34547528168 scopus 로고    scopus 로고
    • Adaptive training with joint uncertainty decoding for robust recognition of noisy data
    • Honolulu, HI, Apr
    • H. Liao and M. Gales, "Adaptive training with joint uncertainty decoding for robust recognition of noisy data," in Proc. ICASSP'07, Honolulu, HI, Apr. 2007, pp. 389-392.
    • (2007) Proc. ICASSP'07 , pp. 389-392
    • Liao, H.1    Gales, M.2
  • 26
    • 33947630738 scopus 로고    scopus 로고
    • Joint uncertainity decoding for noise robust speech recognition
    • Lisbone, Portugal, Sep
    • H. Liao and M. Gales, "Joint uncertainity decoding for noise robust speech recognition," in Proc. Eurospeech'05, Lisbone, Portugal, Sep. 2005.
    • (2005) Proc. Eurospeech'05
    • Liao, H.1    Gales, M.2
  • 28
    • 0026370318 scopus 로고
    • Word recognition in the car: Speech enhancement/spectral transformations
    • Toronto
    • C. Mokbel and G. Chollet, "Word recognition in the car: Speech enhancement/spectral transformations," in Proc. ICASSP'91, Toronto, 1991, pp. 925-928.
    • (1991) Proc. ICASSP'91 , pp. 925-928
    • Mokbel, C.1    Chollet, G.2
  • 29
    • 0029725301 scopus 로고    scopus 로고
    • A vector taylor series approach for environment-independent speech recognition
    • Atlanta, GA, May
    • P. J. Moreno, B. Raj, and R. M. Stern, "A vector taylor series approach for environment-independent speech recognition," in Proc. ICASSP, Atlanta, GA, May 1996, pp. 733-736.
    • (1996) Proc. ICASSP , pp. 733-736
    • Moreno, P.J.1    Raj, B.2    Stern, R.M.3
  • 30
    • 0002127129 scopus 로고
    • Probabilistic optimal filtering for robust speech recognition
    • Adelaide, Australia, Apr
    • L. Neumeyer and M. Weintraub, "Probabilistic optimal filtering for robust speech recognition," in Proc. ICASSP'94, Adelaide, Australia, Apr. 1994, pp. 417-420.
    • (1994) Proc. ICASSP'94 , pp. 417-420
    • Neumeyer, L.1    Weintraub, M.2
  • 31
    • 68549086009 scopus 로고    scopus 로고
    • personal communication, May
    • M. K. Omar, personal communication, May 2004.
    • (2004)
    • Omar, M.K.1
  • 32
    • 0035278964 scopus 로고    scopus 로고
    • Time-frequency distributions for automatic speech recognition
    • Mar
    • A. Potamianos and P. Maragos, "Time-frequency distributions for automatic speech recognition," IEEE Trans. Speech Audio Process., vol. 9, pp. 196-200, Mar. 2001.
    • (2001) IEEE Trans. Speech Audio Process , vol.9 , pp. 196-200
    • Potamianos, A.1    Maragos, P.2
  • 33
    • 0034841234 scopus 로고    scopus 로고
    • Linear feature space projections for speaker adaptation
    • Salt lake City, UT, Apr
    • G. Saon, G. Zweig, and M. Padmanabhan, "Linear feature space projections for speaker adaptation," in Proc. ICASSP'01, Salt lake City, UT, Apr. 2001, pp. 325-328.
    • (2001) Proc. ICASSP'01 , pp. 325-328
    • Saon, G.1    Zweig, G.2    Padmanabhan, M.3
  • 34
    • 85009154856 scopus 로고    scopus 로고
    • Accounting for the uncertainity of speech estimates in the context of model-based feature enhancement
    • Jeju, Korea, Sep
    • V. Stouten, H. Van Hamme, and P. Wambacq, "Accounting for the uncertainity of speech estimates in the context of model-based feature enhancement," in Proc. ICSLP'04, Jeju, Korea, Sep. 2004.
    • (2004) Proc. ICSLP'04
    • Stouten, V.1    Van Hamme, H.2    Wambacq, P.3
  • 35
    • 0032026483 scopus 로고    scopus 로고
    • Continuous probabilistic transform for voice conversion
    • Jan
    • Y. Stylianou, O. Cappe, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. Speech Audio Process., vol. 6, pp. 131-142, Jan. 1998.
    • (1998) IEEE Trans. Speech Audio Process , vol.6 , pp. 131-142
    • Stylianou, Y.1    Cappe, O.2    Moulines, E.3
  • 36
    • 33947637126 scopus 로고    scopus 로고
    • Feature adaptation using projection of Gaussian posteriors
    • Lisbone, Portugal, Sep
    • K. Visweswariah and P. Olsen, "Feature adaptation using projection of Gaussian posteriors," in Proc. Interspeech'05, Lisbone, Portugal, Sep. 2005.
    • (2005) Proc. Interspeech'05
    • Visweswariah, K.1    Olsen, P.2
  • 37
    • 68549086008 scopus 로고    scopus 로고
    • S. Young, G. Evermann, D. Kershaw, G. Moore, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK book for HTK Version 3.1, Dec. 2001
    • S. Young, G. Evermann, D. Kershaw, G. Moore, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK book (for HTK Version 3.1). Dec. 2001.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.