메뉴 건너뛰기




Volumn 42, Issue 1, 2004, Pages 25-41

α-Jacobian environmental adaptation

Author keywords

Automatic speech recognition; Fast environmental adaptation; Jacobian adaptation; Model compensation; Noise robustness; PMC

Indexed keywords

ACOUSTIC NOISE; ALGORITHMS; APPROXIMATION THEORY; AUTOMATION; COST EFFECTIVENESS; FUNCTIONS; MICROPHONES; REVERBERATION; ROBUSTNESS (CONTROL SYSTEMS);

EID: 0347899510     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2003.08.003     Document Type: Conference Paper
Times cited : (12)

References (37)
  • 1
    • 85009214271 scopus 로고
    • Discriminative analysis for feature reduction in automatic speech recognition
    • Bocchieri, E., Wilpon, J., 1992. Discriminative analysis for feature reduction in automatic speech recognition. In: ICASSP'92, Vol. 1. pp. 501-504.
    • (1992) ICASSP'92 , vol.1 , pp. 501-504
    • Bocchieri, E.1    Wilpon, J.2
  • 2
    • 0030355935 scopus 로고    scopus 로고
    • A new ASR approach based on independent processing and combination of partial frequency bands
    • Bourlard, H., Dupont, S., 1996. A new ASR approach based on independent processing and combination of partial frequency bands. In: ICSLP'96. pp. 422-425.
    • (1996) ICSLP'96 , pp. 422-425
    • Bourlard, H.1    Dupont, S.2
  • 3
    • 0346699979 scopus 로고    scopus 로고
    • Towards a global optimization scheme for multi-band speech recognition
    • Prague, September 1999
    • Cerisara, C., Haton, J.-P., Fohr, D., 1999. Towards a global optimization scheme for multi-band speech recognition. In: EUROSPEECH'99. Prague, September 1999.
    • (1999) EUROSPEECH'99
    • Cerisara, C.1    Haton, J.-P.2    Fohr, D.3
  • 4
    • 0003771595 scopus 로고    scopus 로고
    • Transformation of Jacobian matrices for noisy speech recognition
    • Beijing, China, October 2000
    • Cerisara, C., Rigazio, L., Boman, R., Junqua, J.-C., 2000. Transformation of Jacobian matrices for noisy speech recognition. In: ICSLP'2000, Vol. 1. Beijing, China, October 2000, pp. 369-372.
    • (2000) ICSLP'2000 , vol.1 , pp. 369-372
    • Cerisara, C.1    Rigazio, L.2    Boman, R.3    Junqua, J.-C.4
  • 5
    • 0034846894 scopus 로고    scopus 로고
    • Environmental adaptation based on first order approximation
    • Salt Lake City, USA
    • Cerisara, C., Rigazio, L., Boman, R., Junqua, J.-C., 2001. Environmental adaptation based on first order approximation. In: ICASSP'2001. Salt Lake City, USA.
    • (2001) ICASSP'2001
    • Cerisara, C.1    Rigazio, L.2    Boman, R.3    Junqua, J.-C.4
  • 6
    • 0036296961 scopus 로고    scopus 로고
    • Dynamic estimation of a noise overestimation factor for Jacobian-based adaptation
    • Orlando, USA, May 2002
    • Cerisara, C., Junqua, J.-C., Rigazio, L., 2002. Dynamic estimation of a noise overestimation factor for Jacobian-based adaptation. In: ICASSP'2002. Orlando, USA, May 2002.
    • (2002) ICASSP'2002
    • Cerisara, C.1    Junqua, J.-C.2    Rigazio, L.3
  • 7
    • 84875738293 scopus 로고    scopus 로고
    • A new approach for Multi-Band speech recognition based on probabilistic graphical models
    • Beijing, China, October 2000
    • Daoudi, K., Fohr, D., Antoine, C., 2000. A new approach for Multi-Band speech recognition based on probabilistic graphical models. In: ICSLP'2000. Beijing, China, October 2000.
    • (2000) ICSLP'2000
    • Daoudi, K.1    Fohr, D.2    Antoine, C.3
  • 8
    • 0022352370 scopus 로고
    • Computer-steered microphone arrays for sound transduction in large rooms
    • Flanagan J.L., Johnston J.D., Zahn R., Elko G.W. Computer-steered microphone arrays for sound transduction in large rooms. J. Acoust. Soc. Amer. 78(5):1985;1508-1518.
    • (1985) J. Acoust. Soc. Amer. , vol.78 , Issue.5 , pp. 1508-1518
    • Flanagan, J.L.1    Johnston, J.D.2    Zahn, R.3    Elko, G.W.4
  • 9
    • 0346699975 scopus 로고    scopus 로고
    • Robust speech recognition
    • Computational Models of Speech Pattern Processing. Springer-Verlag
    • Furui S. Robust speech recognition. Computational Models of Speech Pattern Processing. NATO ASI Series F. Vol. 169:1999;102-111 Springer-Verlag.
    • (1999) NATO ASI Series F , vol.169 , pp. 102-111
    • Furui, S.1
  • 11
    • 0032139556 scopus 로고    scopus 로고
    • Predictive model-based compensation schemes for robust speech recognition
    • Gales M. Predictive model-based compensation schemes for robust speech recognition. Speech Commun. 25:1998;49-74.
    • (1998) Speech Commun. , vol.25 , pp. 49-74
    • Gales, M.1
  • 12
    • 0028996863 scopus 로고
    • A fast and flexible implementation of parallel model combination
    • Gales, M., Young, S., 1995a. A fast and flexible implementation of parallel model combination. In: ICASSP'95. pp. 133-136.
    • (1995) ICASSP'95 , pp. 133-136
    • Gales, M.1    Young, S.2
  • 13
    • 0029390135 scopus 로고
    • Robust speech recognition in additive and convolutional noise using parallel model combination
    • Gales M., Young S. Robust speech recognition in additive and convolutional noise using parallel model combination. Comput. Speech Lang. 9:1995;289-307.
    • (1995) Comput. Speech Lang. , vol.9 , pp. 289-307
    • Gales, M.1    Young, S.2
  • 14
    • 0346699974 scopus 로고    scopus 로고
    • Reconnaissance de la parole dans une voiture: Spécification, réalisation et validation d'un corpus oral
    • Mart̀gny, Suisse
    • Gassert, C., Mari, J.-F., 1998. Reconnaissance de la parole dans une voiture: spécification, réalisation et validation d'un corpus oral. In: XXIIèmes Journées d'Etude sur la Parole, pp. 171-174, Mart̀gny, Suisse.
    • (1998) XXIIèmes Journées d'Etude sur la Parole , pp. 171-174
    • Gassert, C.1    Mari, J.-F.2
  • 15
    • 0029288202 scopus 로고
    • Speech recognition in noisy environments: A survey
    • Gong Y. Speech recognition in noisy environments: a survey. Speech Commun. 16:1995;261-291.
    • (1995) Speech Commun. , vol.16 , pp. 261-291
    • Gong, Y.1
  • 16
    • 0346069170 scopus 로고
    • Robust feature-estimation and objective quality assessment for noisy speech recognition using the credit card corpus
    • Hansen J., Arslan L. Robust feature-estimation and objective quality assessment for noisy speech recognition using the credit card corpus. IEEE Trans., ASSP. 33:1995;1404-1413.
    • (1995) IEEE Trans., ASSP , vol.33 , pp. 1404-1413
    • Hansen, J.1    Arslan, L.2
  • 17
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • Hermansky H. Perceptual linear predictive (PLP) analysis of speech. J. Acoust. Soc. Amer. 87(4):1990;1738-1752.
    • (1990) J. Acoust. Soc. Amer. , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 18
    • 0027166410 scopus 로고
    • Recognition of speech in additive and convolutional noise based on RASTA spectral processing
    • Minneapolis, MN, April 1993
    • Hermansky, H., Morgan, N., Hirsch, H., 1993. Recognition of speech in additive and convolutional noise based on RASTA spectral processing. In: ICASSP'93, Vol. 2. Minneapolis, MN, April 1993, pp. 83-86.
    • (1993) ICASSP'93 , vol.2 , pp. 83-86
    • Hermansky, H.1    Morgan, N.2    Hirsch, H.3
  • 19
    • 0038133932 scopus 로고
    • A statistical approach to metrics for word and syllable recognition
    • Hunt M.J. A statistical approach to metrics for word and syllable recognition. J. Acoust. Soc. Amer. 66:1979;S535-536.
    • (1979) J. Acoust. Soc. Amer. , vol.66
    • Hunt, M.J.1
  • 20
    • 0026189808 scopus 로고
    • Speech recognition in adverse environments
    • Juang B.-H. Speech recognition in adverse environments. Comput. Speech Lang. 5:1991;275-294.
    • (1991) Comput. Speech Lang. , vol.5 , pp. 275-294
    • Juang, B.-H.1
  • 23
    • 0029288633 scopus 로고
    • Maximum Likelihood Linear Regression for speaker adaptation of continuous density hidden Markov models
    • Leggetter C., Woodland P. Maximum Likelihood Linear Regression for speaker adaptation of continuous density hidden Markov models. Comput. Speech Lang. 9:1995;171-185.
    • (1995) Comput. Speech Lang. , vol.9 , pp. 171-185
    • Leggetter, C.1    Woodland, P.2
  • 24
    • 0026882842 scopus 로고
    • Experiments with a nonlinear spectral subtractor (NSS), hidden Markov models and the projection, for robust speech recognition in cars
    • Lockwood P., Boudy J. Experiments with a nonlinear spectral subtractor (NSS), hidden Markov models and the projection, for robust speech recognition in cars. Speech Commun. 11:1992;215-228.
    • (1992) Speech Commun. , vol.11 , pp. 215-228
    • Lockwood, P.1    Boudy, J.2
  • 25
    • 0029725301 scopus 로고    scopus 로고
    • A vector Taylor series approach for environment-independent speech recognition
    • Moreno, P.J., Raj, B., Stern, R.M., 1996. A vector Taylor series approach for environment-independent speech recognition. In: ICASSP'96. pp: 733-736.
    • (1996) ICASSP'96 , pp. 733-736
    • Moreno, P.J.1    Raj, B.2    Stern, R.M.3
  • 26
    • 0347605871 scopus 로고    scopus 로고
    • Channel adaptation
    • Computational Models of Speech Pattern Processing. Springer-Verlag
    • Ponting K. Channel adaptation. Computational Models of Speech Pattern Processing. NATO ASI Series F. Vol. 169:1999;112-121 Springer-Verlag.
    • (1999) NATO ASI Series F , vol.169 , pp. 112-121
    • Ponting, K.1
  • 27
    • 0029769867 scopus 로고    scopus 로고
    • Signal bias removal by Maximum likelihood estimation for robust telephone speech recognition
    • Rahim M.G., Juang B.-J. Signal bias removal by Maximum likelihood estimation for robust telephone speech recognition. IEEE Trans. Speech Audio Process. 4(January):1996;19-30.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.JANUARY , pp. 19-30
    • Rahim, M.G.1    Juang, B.-J.2
  • 28
    • 0030365580 scopus 로고    scopus 로고
    • Cepstral compensation by polynomial approximation for environment-independent speech recognition
    • Philadelphia, PA
    • Raj, B., Gouvea, E., Moreno, P.J., Stern, R.M., 1996. Cepstral compensation by polynomial approximation for environment-independent speech recognition. In: ICSLP'96. Philadelphia, PA, pp. 2340-2343.
    • (1996) ICSLP'96 , pp. 2340-2343
    • Raj, B.1    Gouvea, E.2    Moreno, P.J.3    Stern, R.M.4
  • 30
    • 0347960645 scopus 로고    scopus 로고
    • Separating speaker and environment variabilities for improved recognition in non-stationary conditions
    • Scandinavia
    • Rigazio, L., Nguyen, P., Kryze, D., Junqua, J.-C., 2001. Separating speaker and environment variabilities for improved recognition in non-stationary conditions. In: EUROSPEECH'2001. Scandinavia.
    • (2001) EUROSPEECH'2001
    • Rigazio, L.1    Nguyen, P.2    Kryze, D.3    Junqua, J.-C.4
  • 31
    • 0030649027 scopus 로고    scopus 로고
    • Jacobian approach to fast acoustic model adaptation
    • Munich, Germany
    • Sagayama, S., Yamaguchi, Y., Takahashi, S., Takahashi, J., 1997. Jacobian approach to fast acoustic model adaptation. In: ICASSP'97. Munich, Germany, pp. 835-838.
    • (1997) ICASSP'97 , pp. 835-838
    • Sagayama, S.1    Yamaguchi, Y.2    Takahashi, S.3    Takahashi, J.4
  • 32
    • 0346069169 scopus 로고    scopus 로고
    • Jacobian approach to joint adaptation to noise, channel and vocal tract length
    • Sophia Antipolis, France
    • Sagayama, S., Kato, Y., Nakai, M., Shimodaira, H., 2001. Jacobian approach to joint adaptation to noise, channel and vocal tract length. In: ISCA Workshop on Adaptation Methods. Sophia Antipolis, France, pp. 117-120.
    • (2001) ISCA Workshop on Adaptation Methods , pp. 117-120
    • Sagayama, S.1    Kato, Y.2    Nakai, M.3    Shimodaira, H.4
  • 33
    • 85009061070 scopus 로고    scopus 로고
    • Improved Jacobian adaptation for fast acoustic model adaptation in noisy speech recognition
    • Beijing, China, October 2000
    • Sarikaya, R., Hansen, J.H.L., 2000. Improved Jacobian adaptation for fast acoustic model adaptation in noisy speech recognition. In: ICSLP'2000, Vol. 3. Beijing, China, October 2000, pp. 702-705.
    • (2000) ICSLP'2000 , vol.3 , pp. 702-705
    • Sarikaya, R.1    Hansen, J.H.L.2
  • 35
    • 0003078259 scopus 로고
    • The HTK continuous speech recogniser
    • Berlin, September 1993
    • Woodland, P., Young, S., 1993. The HTK continuous speech recogniser. In: Eurospeech'93. Berlin, September 1993, pp. 2207-2219.
    • (1993) Eurospeech'93 , pp. 2207-2219
    • Woodland, P.1    Young, S.2
  • 36
    • 0029726509 scopus 로고    scopus 로고
    • Improving environmental robustness in large vocabulary speech recognition
    • Atlanta, GA, May 1996
    • Woodland, P.C., Gales, M.J.F., Pye, D., 1996. Improving environmental robustness in large vocabulary speech recognition. In: ICASSP'96. Atlanta, GA, May 1996. pp. 65-68.
    • (1996) ICASSP'96 , pp. 65-68
    • Woodland, P.C.1    Gales, M.J.F.2    Pye, D.3
  • 37
    • 0001459635 scopus 로고    scopus 로고
    • Frequency-domain maximum likelihood estimation for automatic speech recognition in additive and convolutional noises
    • Zhao Y. Frequency-domain maximum likelihood estimation for automatic speech recognition in additive and convolutional noises. IEEE Trans. Speech Audio Process. 8(3):2000;255-266.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.3 , pp. 255-266
    • Zhao, Y.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.