SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 14, Issue 1, 2006, Pages 246-254

Modeling, estimating, and compensating low-bit rate coding distortion in speech recognition

(4) Yoma, Néstor Becerra a,b,c Molina, Carlos b,c Silva, Jorge b,d Busso, Carlos b,e

a IEEE (Chile)

b UNIVERSITY OF CHILE (Chile)

c International Speech Communication Association (Chile)

d University of Southern California (United States)

e UNIVERSITY OF SOUTHERN CALIFORNIA (United States)

Author keywords

Coding distortion; EM estimation algorithm; HMM compensation; Low bit rate coders; Speech recognition

Indexed keywords

CODING DISTORTION; EM ESTIMATION ALGORITHM; HMM COMPENSATION; LOW-BIT RATE CODERS;

ALGORITHMS; BIT ERROR RATE; MATHEMATICAL MODELS; PARAMETER ESTIMATION; SIGNAL PROCESSING; SPEECH CODING;

SPEECH RECOGNITION;

EID: 33744969526 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TSA.2005.852994 Document Type: Conference Paper

Times cited : (11)

References (28)

1
- 0029288202
- Speech recognition in noise environments: A survey
- Y. Gong, "Speech recognition in noise environments: a survey," Speech Commun., vol. 16, pp. 261-291, 1995.
- (1995) Speech Commun. , vol.16 , pp. 261-291
- Gong, Y.¹

2
- 0018320733
- Enhancement of speech corrupted by acoustic noise
- M. Berouti et al., "Enhancement of speech corrupted by acoustic noise," in Proc. ICASSP, 1979.
- (1979) Proc. ICASSP
- Berouti, M.¹

3
- 0019555090
- Cepstral analysis technique for automatic speaker verication
- S. Furui, "Cepstral analysis technique for automatic speaker verication," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-29, no. 2, pp. 254-272, 1981.
- (1981) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-29 , Issue.2 , pp. 254-272
- Furui, S.¹

4
- 85135377175
- Compensation for the effect of the communication chennel in auditory-like analysis of speech (RASTA-PLP)
- H. Hermansky et al., "Compensation for the effect of the communication chennel in auditory-like analysis of speech (RASTA-PLP)," in Proc. Eurospeech 91, 1991, pp. 1367-1370.
- (1991) Proc. Eurospeech 91 , pp. 1367-1370
- Hermansky, H.¹

5
- 0013458095
- Ph.D dissertation, Dept. Elect. Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, Apr.
- J. M. Huerta, "Speech recognition in mobile environments," Ph.D dissertation, Dept. Elect. Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, Apr. 2000.
- (2000) Speech Recognition in Mobile Environments
- Huerta, J.M.¹

6
- 0031625058
- Robust speech recognition for multiple topological scenarios of the GSM mobile phone system
- T. Salonidis and V. Digalakis, "Robust speech recognition for multiple topological scenarios of the GSM mobile phone system," in Proc. ICASSP 98, 1998.
- (1998) Proc. ICASSP 98
- Salonidis, T.¹ Digalakis, V.²

7
- 0030355780
- Effect of speech coders on speech recognition performance
- B. T. Lilly and K. K. Paliwal, "Effect of speech coders on speech recognition performance," in Proc. ICSLP 96, 1996.
- (1996) Proc. ICSLP 96
- Lilly, B.T.¹ Paliwal, K.K.²

8
- 85009255371
- The influence of speech coding algorithms on automatic speech recognition
- S. Euler and J. Zinke, "The influence of speech coding algorithms on automatic speech recognition," in Proc. ICASSP 94, vol. I, 1994, pp. 621-624.
- (1994) Proc. ICASSP 94 , vol.1 , pp. 621-624
- Euler, S.¹ Zinke, J.²

9
- 64549127516
- Rapid CODEC adaptation for cellular phone speech recognition
- Alborg, Denmark
- M. Naito, S. Kuroiwa, T. Kato, T. Shimizu, and N. Higuchi, "Rapid CODEC adaptation for cellular phone speech recognition," in Proc. Eurospeech 2001, Alborg, Denmark, 2001.
- (2001) Proc. Eurospeech 2001
- Naito, M.¹ Kuroiwa, S.² Kato, T.³ Shimizu, T.⁴ Higuchi, N.⁵

10
- 0035365050
- Recognizing voice over IP: A robust front-end for speech recognition on the world wide web
- C. Peláez, A. Gallardo, and F. Díaz-d6e-María, "Recognizing voice over IP: a robust front-end for speech recognition on the world wide web," IEEE Trans. Multimedia, vol. 3, no. 2, pp. 209-218, 2001.
- (2001) IEEE Trans. Multimedia , vol.3 , Issue.2 , pp. 209-218
- Peláez, C.¹ Gallardo, A.² Díaz-De-María, F.³

11
- 0003933583
- Mar.
- ITU-T, Recommendation G.729-Coding of Speech at 8 kbit/s Using Conjugate-Structure Algebraic-Code-Excited Linear-Prediction (CS-CELP), Mar. 1996.
- (1996) ITU-T, Recommendation G.729-Coding of Speech at 8 Kbit/s Using Conjugate-Structure Algebraic-code-excited Linear-prediction (CS-CELP)

12
- 0003944029
- RPE-LTP (Regular Pulse Excitation, Long Term Predictor), ETSI, France, Oct.
- ETSI, GSM-06.10 Full Rate Speech Transcoding. RPE-LTP (Regular Pulse Excitation, Long Term Predictor), ETSI, France, Oct. 1992.
- (1992) GSM-06.10 Full Rate Speech Transcoding

13
- 0003727048
- Marzo
- ITU-T, Recommendation G.723.1 Dual Rate Speech Coder for Multimedia Communications Transmitting at 5.3 and 6.3 kbps, Marzo 1996.
- (1996) ITU-T, Recommendation G.723.1 Dual Rate Speech Coder for Multimedia Communications Transmitting at 5.3 and 6.3 Kbps

14
- 44949275238
- The federal standard 1016 4800 bps CELP voice coder
- J. P. Campbell, T. E. Tremain, and V. C. Welch, "The federal standard 1016 4800 bps CELP voice coder," Digital Signal Process., vol. 1, no. 3, pp. 145-155, 1991.
- (1991) Digital Signal Process. , vol.1 , Issue.3 , pp. 145-155
- Campbell, J.P.¹ Tremain, T.E.² Welch, V.C.³

15
- 85033001107
- 40-,32-,24-, and 16-Kb/s adaptive differential pulse code modulation
- Dec.
- ITU-T, Recommendation G.726, "40-,32-,24-, and 16-Kb/s adaptive differential pulse code modulation", Dec. 1990.
- (1990) ITU-T, Recommendation G.726

16
- 0036508276
- Speaker Verification in noise using a stochastic version of the weighted Viterbi algorithm
- Mar.
- N. B. Yoma and M. Villar, "Speaker Verification in noise using a stochastic version of the weighted Viterbi algorithm," IEEE Trans. Speech Audio Processing, vol. 10, no. 3, pp. 158-166, Mar. 2002.
- (2002) IEEE Trans. Speech Audio Processing , vol.10 , Issue.3 , pp. 158-166
- Yoma, N.B.¹ Villar, M.²

17
- 0003462715
- Edinburgh, U.K.: Edinburgh Univ. Press
- X. D. Huang et al., HMM for Speech Recognition, Edinburgh, U.K.: Edinburgh Univ. Press, 1990.
- (1990) HMM for Speech Recognition
- Huang, X.D.¹

18
- 0032203405
- A general joint additive and convolutive bias compensation approach applied to noise Lombard speech recognition
- Nov.
- M. Afify, Y. Gong, and J. Haton, "A general joint additive and convolutive bias compensation approach applied to noise Lombard speech recognition," IEEE Trans. Speech Audio Processing, vol. 6. no. 6, pp. 524-538, Nov. 1998.
- (1998) IEEE Trans. Speech Audio Processing , vol.6 , Issue.6 , pp. 524-538
- Afify, M.¹ Gong, Y.² Haton, J.³

19
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observation chains
- Apr.
- J. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observation chains," IEEE Trans. Speech Audio Processing, vol. 2, no. 2, pp. 291-298, Apr. 1994.
- (1994) IEEE Trans. Speech Audio Processing , vol.2 , Issue.2 , pp. 291-298
- Gauvain, J.¹ Lee, C.-H.²

20
- 0030287048
- The expectation-maximization algorithm
- T. K. Moon, "The expectation-maximization algorithm," IEEE Signal Processing Mag., vol. 13, no. 6, pp. 47-60, 1996.
- (1996) IEEE Signal Processing Mag. , vol.13 , Issue.6 , pp. 47-60
- Moon, T.K.¹

21
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., vol. 12, no. 2, pp. 75-98, 1998.
- (1998) Comput. Speech Lang. , vol.12 , Issue.2 , pp. 75-98
- Gales, M.J.F.¹

22
- 0025628728
- Environmental robustness in automatic speech recognition
- A. Acero and R. Stern, "Environmental robustness in automatic speech recognition," in Proc. ICASSP, 1990.
- (1990) Proc. ICASSP
- Acero, A.¹ Stern, R.²

23
- 0028996861
- Multivariate Gaussian based cepstral normalization for robust speech recognition
- P. J. Moreno, B. Raj, E. Govea, and R. M. Stern, "Multivariate Gaussian based cepstral normalization for robust speech recognition," in Proc. ICASSP '95, 1995.
- (1995) Proc. ICASSP '95
- Moreno, P.J.¹ Raj, B.² Govea, E.³ Stern, R.M.⁴

24
- 0030365580
- Cepstral compensation by polynomial approximation for environment-independent speech recognition
- R. M. Raj, E. B. Gouvea, P. J. Moreno, and R. M. Stern, "Cepstral compensation by polynomial approximation for environment-independent speech recognition," in Proc. ICSLP, 1996.
- (1996) Proc. ICSLP
- Raj, R.M.¹ Gouvea, E.B.² Moreno, P.J.³ Stern, R.M.⁴

25
- 33744996940
- Ph.D. dissertation, Univ. Edinburgh, Edinburgh, U.K.
- N. B. Yoma, "Speech recognition in noise using weighted matching algorithms," Ph.D. dissertation, Univ. Edinburgh, Edinburgh, U.K., 1998.
- (1998) Speech Recognition in Noise Using Weighted Matching Algorithms
- Yoma, N.B.¹

26
- 84858892311
- Univ. Pennsylvania, Philadelphia, PA. [Online]
- Linguistic Data Consortium (LDC). (1995) Latino database. Univ. Pennsylvania, Philadelphia, PA. [Online] Available: http://www.ldc.upenn.edu/ Catalog/LDC95S28.html
- (1995) Latino Database

27
- 0028460810
- An acoustic-phonetic based speaker adaptation technique for improving speaker independent continuous speech recognition
- Jul.
- Y. Zhao, "An acoustic-phonetic based speaker adaptation technique for improving speaker independent continuous speech recognition," IEEE Trans. Speech Audio Process., vol. 2, no. 3, pp. 380-394, Jul. 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.3 , pp. 380-394
- Zhao, Y.¹

28
- 85032751521
- Dynamic programming search for continuous speech recognition
- Sep.
- H. Ney and S. Ortmanns, "Dynamic programming search for continuous speech recognition." IEEE Signal Process. Mag., pp. 64-81, Sep. 1999.
- (1999) IEEE Signal Process. Mag. , pp. 64-81
- Ney, H.¹ Ortmanns, S.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.