Volume 14, Issue 1, 2006, Pages 246-254

Modeling, estimating, and compensating low-bit rate coding distortion in speech recognition

Author keywords

Coding distortion; EM estimation algorithm; HMM compensation; Low bit rate coders; Speech recognition

Indexed keywords

CODING DISTORTION; EM ESTIMATION ALGORITHM; HMM COMPENSATION; LOW-BIT RATE CODERS;

EID: 33744969526     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2005.852994     Document Type: Conference Paper
Times cited : (11)

References (28)
  • 1
    • 0029288202
    • Speech recognition in noisy environments: A survey
    • Y. Gong, "Speech recognition in noisy environments: a survey," Speech Commun., vol. 16, pp. 261-291, 1995.
    • (1995) Speech Commun. , vol.16 , pp. 261-291
    • Gong, Y.1
  • 2
    • 0018320733
    • Enhancement of speech corrupted by acoustic noise
    • M. Berouti et al., "Enhancement of speech corrupted by acoustic noise," in Proc. ICASSP, 1979.
    • (1979) Proc. ICASSP
    • Berouti, M.1
  • 3
    • 0019555090
    • Cepstral analysis technique for automatic speaker verification
    • S. Furui, "Cepstral analysis technique for automatic speaker verification," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-29, no. 2, pp. 254-272, 1981.
    • (1981) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-29 , Issue.2 , pp. 254-272
    • Furui, S.1
  • 4
    • 85135377175
    • Compensation for the effect of the communication channel in auditory-like analysis of speech (RASTA-PLP)
    • H. Hermansky et al., "Compensation for the effect of the communication channel in auditory-like analysis of speech (RASTA-PLP)," in Proc. Eurospeech 91, 1991, pp. 1367-1370.
    • (1991) Proc. Eurospeech 91 , pp. 1367-1370
    • Hermansky, H.1
  • 5
    • 0013458095
    • Ph.D dissertation, Dept. Elect. Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, Apr.
    • J. M. Huerta, "Speech recognition in mobile environments," Ph.D dissertation, Dept. Elect. Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, Apr. 2000.
    • (2000) Speech Recognition in Mobile Environments
    • Huerta, J.M.1
  • 6
    • 0031625058
    • Robust speech recognition for multiple topological scenarios of the GSM mobile phone system
    • T. Salonidis and V. Digalakis, "Robust speech recognition for multiple topological scenarios of the GSM mobile phone system," in Proc. ICASSP 98, 1998.
    • (1998) Proc. ICASSP 98
    • Salonidis, T.1    Digalakis, V.2
  • 7
    • 0030355780
    • Effect of speech coders on speech recognition performance
    • B. T. Lilly and K. K. Paliwal, "Effect of speech coders on speech recognition performance," in Proc. ICSLP 96, 1996.
    • (1996) Proc. ICSLP 96
    • Lilly, B.T.1    Paliwal, K.K.2
  • 8
    • 85009255371
    • The influence of speech coding algorithms on automatic speech recognition
    • S. Euler and J. Zinke, "The influence of speech coding algorithms on automatic speech recognition," in Proc. ICASSP 94, vol. I, 1994, pp. 621-624.
    • (1994) Proc. ICASSP 94 , vol.1 , pp. 621-624
    • Euler, S.1    Zinke, J.2
  • 10
    • 0035365050
    • Recognizing voice over IP: A robust front-end for speech recognition on the world wide web
    • C. Peláez, A. Gallardo, and F. Díaz-de-María, "Recognizing voice over IP: a robust front-end for speech recognition on the world wide web," IEEE Trans. Multimedia, vol. 3, no. 2, pp. 209-218, 2001.
    • (2001) IEEE Trans. Multimedia , vol.3 , Issue.2 , pp. 209-218
    • Peláez, C.1    Gallardo, A.2    Díaz-De-María, F.3
  • 12
    • 0003944029
    • RPE-LTP (Regular Pulse Excitation, Long Term Predictor), ETSI, France, Oct.
    • ETSI, GSM-06.10 Full Rate Speech Transcoding. RPE-LTP (Regular Pulse Excitation, Long Term Predictor), ETSI, France, Oct. 1992.
    • (1992) GSM-06.10 Full Rate Speech Transcoding
  • 14
    • 44949275238
    • The federal standard 1016 4800 bps CELP voice coder
    • J. P. Campbell, T. E. Tremain, and V. C. Welch, "The federal standard 1016 4800 bps CELP voice coder," Digital Signal Process., vol. 1, no. 3, pp. 145-155, 1991.
    • (1991) Digital Signal Process. , vol.1 , Issue.3 , pp. 145-155
    • Campbell, J.P.1    Tremain, T.E.2    Welch, V.C.3
  • 15
    • 85033001107
    • 40-,32-,24-, and 16-Kb/s adaptive differential pulse code modulation
    • Dec.
    • ITU-T, Recommendation G.726, "40-,32-,24-, and 16-Kb/s adaptive differential pulse code modulation", Dec. 1990.
    • (1990) ITU-T, Recommendation G.726
  • 16
    • 0036508276
    • Speaker Verification in noise using a stochastic version of the weighted Viterbi algorithm
    • Mar.
    • N. B. Yoma and M. Villar, "Speaker Verification in noise using a stochastic version of the weighted Viterbi algorithm," IEEE Trans. Speech Audio Processing, vol. 10, no. 3, pp. 158-166, Mar. 2002.
    • (2002) IEEE Trans. Speech Audio Processing , vol.10 , Issue.3 , pp. 158-166
    • Yoma, N.B.1    Villar, M.2
  • 17
  • 18
    • 0032203405
    • A general joint additive and convolutive bias compensation approach applied to noisy Lombard speech recognition
    • Nov.
    • M. Afify, Y. Gong, and J. Haton, "A general joint additive and convolutive bias compensation approach applied to noisy Lombard speech recognition," IEEE Trans. Speech Audio Processing, vol. 6, no. 6, pp. 524-538, Nov. 1998.
    • (1998) IEEE Trans. Speech Audio Processing , vol.6 , Issue.6 , pp. 524-538
    • Afify, M.1    Gong, Y.2    Haton, J.3
  • 19
    • 0028419019
    • Maximum a posteriori estimation for multivariate Gaussian mixture observation chains
    • Apr.
    • J. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observation chains," IEEE Trans. Speech Audio Processing, vol. 2, no. 2, pp. 291-298, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.1    Lee, C.-H.2
  • 20
    • 0030287048
    • The expectation-maximization algorithm
    • T. K. Moon, "The expectation-maximization algorithm," IEEE Signal Processing Mag., vol. 13, no. 6, pp. 47-60, 1996.
    • (1996) IEEE Signal Processing Mag. , vol.13 , Issue.6 , pp. 47-60
    • Moon, T.K.1
  • 21
    • 0032050110
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., vol. 12, no. 2, pp. 75-98, 1998.
    • (1998) Comput. Speech Lang. , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.J.F.1
  • 22
    • 0025628728
    • Environmental robustness in automatic speech recognition
    • A. Acero and R. Stern, "Environmental robustness in automatic speech recognition," in Proc. ICASSP, 1990.
    • (1990) Proc. ICASSP
    • Acero, A.1    Stern, R.2
  • 23
    • 0028996861
    • Multivariate Gaussian based cepstral normalization for robust speech recognition
    • P. J. Moreno, B. Raj, E. Gouvea, and R. M. Stern, "Multivariate Gaussian based cepstral normalization for robust speech recognition," in Proc. ICASSP '95, 1995.
    • (1995) Proc. ICASSP '95
    • Moreno, P.J.1    Raj, B.2    Gouvea, E.3    Stern, R.M.4
  • 24
    • 0030365580
    • Cepstral compensation by polynomial approximation for environment-independent speech recognition
    • R. M. Raj, E. B. Gouvea, P. J. Moreno, and R. M. Stern, "Cepstral compensation by polynomial approximation for environment-independent speech recognition," in Proc. ICSLP, 1996.
    • (1996) Proc. ICSLP
    • Raj, R.M.1    Gouvea, E.B.2    Moreno, P.J.3    Stern, R.M.4
  • 26
    • 84858892311
    • Univ. Pennsylvania, Philadelphia, PA. [Online]
    • Linguistic Data Consortium (LDC). (1995) Latino database. Univ. Pennsylvania, Philadelphia, PA. [Online] Available: http://www.ldc.upenn.edu/Catalog/LDC95S28.html
    • (1995) Latino Database
  • 27
    • 0028460810
    • An acoustic-phonetic based speaker adaptation technique for improving speaker independent continuous speech recognition
    • Jul.
    • Y. Zhao, "An acoustic-phonetic based speaker adaptation technique for improving speaker independent continuous speech recognition," IEEE Trans. Speech Audio Process., vol. 2, no. 3, pp. 380-394, Jul. 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.3 , pp. 380-394
    • Zhao, Y.1
  • 28
    • 85032751521
    • Dynamic programming search for continuous speech recognition
    • Sep.
    • H. Ney and S. Ortmanns, "Dynamic programming search for continuous speech recognition," IEEE Signal Process. Mag., pp. 64-81, Sep. 1999.
    • (1999) IEEE Signal Process. Mag. , pp. 64-81
    • Ney, H.1    Ortmanns, S.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.