SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2009, Pages 2987-2990

Auditory model based optimization of MFCCs improves automatic speech recognition performance

(3) Chatterjee, Saikat a Koniaris, Christos a Kleijn, W Bastiaan a

a ROYAL INSTITUTE OF TECHNOLOGY (Sweden)

Author keywords

ASR; Auditory model; MFCC

Indexed keywords

AUDITORY MODELS; AUTOMATIC SPEECH RECOGNITION; ENVIRONMENTAL CONDITIONS; FEATURE DOMAIN; HUMAN AUDITORY SYSTEM; LOCAL GEOMETRY; MEL-FREQUENCY CEPSTRAL COEFFICIENTS; RECOGNITION PERFORMANCE; SPEECH RECOGNITION SYSTEMS;

MATHEMATICAL MODELS; OPTIMIZATION; REMELTING; SPEECH COMMUNICATION;

SPEECH RECOGNITION;

EID: 70450221097 PISSN: None EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (9)

References (18)

1
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- Aug
- S.B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Proc., vol. 28, No. 4, pp. 357-366, Aug. 1980.
- (1980) IEEE Trans. Acoust., Speech, Signal Proc , vol.28 , Issue.4 , pp. 357-366
- Davis, S.B.¹ Mermelstein, P.²

2
- 0024392496
- Application of an auditory model to speech recognition
- June
- J.R. Cohen, "Application of an auditory model to speech recognition," J. Acoust. Soc. Amer., pp. 2623-2629, Vol. 85 (6), June 1989.
- (1989) J. Acoust. Soc. Amer , vol.85 , Issue.6 , pp. 2623-2629
- Cohen, J.R.¹

3
- 0028312802
- Auditory models and human performance in tasks related to speech coding and speech recognition
- Jan
- O. Ghitza, "Auditory models and human performance in tasks related to speech coding and speech recognition," IEEE Trans. Speech, Audio Proc., vol. 2, No. 1, pp. 115-132, Jan 1994.
- (1994) IEEE Trans. Speech, Audio Proc , vol.2 , Issue.1 , pp. 115-132
- Ghitza, O.¹

4
- 0031238095
- A model of dynamic auditory perception and its application to robust word recognition
- Sept
- B. Strope and A. Alwan, "A model of dynamic auditory perception and its application to robust word recognition," IEEE Trans. Speech, Audio Proc., vol. 5, No. 5, pp. 451-464, Sept. 1997.
- (1997) IEEE Trans. Speech, Audio Proc , vol.5 , Issue.5 , pp. 451-464
- Strope, B.¹ Alwan, A.²

5
- 0032785783
- Auditory processing of speech signals for robust speech recognition in real-world noisy environments
- Jan
- D.S. Kim, S.Y. Lee and R.M. Kil, "Auditory processing of speech signals for robust speech recognition in real-world noisy environments," IEEE Trans. Speech, Audio Proc., vol. 7, No. 1, pp. 55-69, Jan 1999.
- (1999) IEEE Trans. Speech, Audio Proc , vol.7 , Issue.1 , pp. 55-69
- Kim, D.S.¹ Lee, S.Y.² Kil, R.M.³

6
- 0032828464
- A model of auditory perception as front end for automatic speech recognition
- Oct
- J. Tchorz and B. Kollmeier, "A model of auditory perception as front end for automatic speech recognition," J. Acoust. Soc. Amer., pp. 2040-2050, Vol. 106 (4), Oct. 1999.
- (1999) J. Acoust. Soc. Amer , vol.106 , Issue.4 , pp. 2040-2050
- Tchorz, J.¹ Kollmeier, B.²

7
- 33744994972
- Automatic speech recognition with an adaptation model motivated by auditory processing
- Jan
- M. Holmberg, D. Gelbart and W. Hemmert, "Automatic speech recognition with an adaptation model motivated by auditory processing," IEEE Trans. Speech, Audio Proc., vol. 14, No. 1, pp. 43-49, Jan 2006.
- (2006) IEEE Trans. Speech, Audio Proc , vol.14 , Issue.1 , pp. 43-49
- Holmberg, M.¹ Gelbart, D.² Hemmert, W.³

8
- 84928837806
- A joint synchrony/mean-rate model of auditory processing
- Jan
- S. Seneff, "A joint synchrony/mean-rate model of auditory processing," J. Phonet., pp. 55-76, Vol. 85 (1), Jan 1988.
- (1988) J. Phonet , vol.85 , Issue.1 , pp. 55-76
- Seneff, S.¹

9
- 0022624057
- Simulation of mechanical to neural transduction in the auditory receptor
- March
- R. Meddis, "Simulation of mechanical to neural transduction in the auditory receptor," J. Acoust. Soc. Amer., pp. 702-711, Vol. 79 (3), March 1988.
- (1988) J. Acoust. Soc. Amer , vol.79 , Issue.3 , pp. 702-711
- Meddis, R.¹

10
- 0029378047
- Two-tone suppression in a cochlear model
- Sept
- J.M. Kates, "Two-tone suppression in a cochlear model," IEEE Trans. Speech, Audio Proc., vol. 3, No. 5, pp. 396-406, Sept. 1995.
- (1995) IEEE Trans. Speech, Audio Proc , vol.3 , Issue.5 , pp. 396-406
- Kates, J.M.¹

11
- 0029952425
- A quantitative model of the effective signal processing in the auditory system. I. Model structure
- Jun
- T. Dau, D. Puschel, and A. Kohlrausch, "A quantitative model of the effective signal processing in the auditory system. I. Model structure," J. Acoust. Soc. Amer., pp. 3615-3622, Vol. 99 (6), Jun 1996.
- (1996) J. Acoust. Soc. Amer , vol.99 , Issue.6 , pp. 3615-3622
- Dau, T.¹ Puschel, D.² Kohlrausch, A.³

12
- 0035125936
- Forward masking: Adaptation or integration?
- Feb
- A.J. Oxenham, "Forward masking: Adaptation or integration?," J. Acoust. Soc. Amer., pp. 732-741, Vol. 109 (2), Feb 2001.
- (2001) J. Acoust. Soc. Amer , vol.109 , Issue.2 , pp. 732-741
- Oxenham, A.J.¹

13
- 27844508054
- A Perceptual model for sinusoidal audio coding based on spectral integration
- S. van de Par, A. Kohlrausch, R. Heusdens, J. Jensen and S.H. Jensen, "A Perceptual model for sinusoidal audio coding based on spectral integration" EURASIP J. Applied Signal Proc., vol. 9, pp. 1292-1304, 2005.
- (2005) EURASIP J. Applied Signal Proc , vol.9 , pp. 1292-1304
- van de Par, S.¹ Kohlrausch, A.² Heusdens, R.³ Jensen, J.⁴ Jensen, S.H.⁵

14
- 0029375948
- Theoretical analysis of the high-rate vector quantization of LPC parameters
- Sept
- W.R. Gardner and B.D. Rao, "Theoretical analysis of the high-rate vector quantization of LPC parameters," IEEE Trans. Speech and Audio Proc., vol. 3, No.5, pp. 367-381, Sept 1995.
- (1995) IEEE Trans. Speech and Audio Proc , vol.3 , Issue.5 , pp. 367-381
- Gardner, W.R.¹ Rao, B.D.²

15
- 47649083103
- The sensitivity matrix: Using advanced auditory models in speech and audio processing
- Jan
- J.H. Plasberg and W.B. Kleijn, "The sensitivity matrix: using advanced auditory models in speech and audio processing," IEEE Trans. Audio, Speech, Language Proc., vol. 15, No. 1, pp. 310-319, Jan 2007.
- (2007) IEEE Trans. Audio, Speech, Language Proc , vol.15 , Issue.1 , pp. 310-319
- Plasberg, J.H.¹ Kleijn, W.B.²

16
- 0027659197
- Signal modeling techniques in speech recognition
- Sept
- J.W. Picone, "Signal modeling techniques in speech recognition," Proc. IEEE, pp. 1215-1247, Vol. 81, No. 9, Sept. 1993.
- (1993) Proc. IEEE , vol.81 , Issue.9 , pp. 1215-1247
- Picone, J.W.¹

17
- 0024768209
- Speaker-independent phone recognition using hidden Markov models
- Nov
- K.F. Lee and H.W. Hon, "Speaker-independent phone recognition using hidden Markov models," IEEE Trans. Acoust., Speech, Signal Proc., vol. 37, No. 11, pp. 1641-1648, Nov. 1989.
- (1989) IEEE Trans. Acoust., Speech, Signal Proc , vol.37 , Issue.11 , pp. 1641-1648
- Lee, K.F.¹ Hon, H.W.²

18
- 70350491776
- Environmental robustness
- Springer, pp, Oct
- J. Droppo and A. Acero, "Environmental robustness," Handbook of Speech Processing, Springer, pp. 658-659, Oct. 2007.
- (2007) Handbook of Speech Processing , pp. 658-659
- Droppo, J.¹ Acero, A.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.