Volume 6, Issue 1, 1998, Pages 90-94

Speech analysis and recognition using interval statistics generated from a composite auditory model

Author keywords

[No Author keywords available]

Indexed keywords

AUDITION; COMPUTER SIMULATION; MARKOV PROCESSES; MATHEMATICAL MODELS; SIGNAL TO NOISE RATIO; SPEECH ANALYSIS; STATISTICAL METHODS;

EID: 0031647650     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/89.650316     Document Type: Article
Times cited : (13)

References (34)
  • 1
    • 0018617277
    • "Encoding of steady-state vowels in the auditory nerve: Representation in terms of discharge rate,"
    • Aug.
    • M. B. Sachs and E. D. Young, "Encoding of steady-state vowels in the auditory nerve: Representation in terms of discharge rate," J. Acoust. Soc. Amer., vol. 66, pp. 470-479, Aug. 1979.
    • (1979) J. Acoust. Soc. Amer. , vol.66 , pp. 470-479
    • Sachs, M.B.1    Young, E.D.2
  • 2
    • 0018606571
    • "Representation of steady-state vowels in the temporal aspects of the discharge patterns of populations of auditory-nerve fibers,"
    • Nov.
    • E. D. Young and M. B. Sachs, "Representation of steady-state vowels in the temporal aspects of the discharge patterns of populations of auditory-nerve fibers," J. Acoust. Soc. Amer., vol. 66, pp. 1381-1403, Nov. 1979.
    • (1979) J. Acoust. Soc. Amer. , vol.66 , pp. 1381-1403
    • Young, E.D.1    Sachs, M.B.2
  • 3
    • 0021403669
    • "Speech coding in the auditory nerve, I: Vowel-like sounds,"
    • B. Delgutte and N. Y. S. Kiang, "Speech coding in the auditory nerve, I: Vowel-like sounds," J. Acoust. Soc. Amer., vol. 75, pp. 866-878, 1984.
    • (1984) J. Acoust. Soc. Amer. , vol.75 , pp. 866-878
    • Delgutte, B.1    Kiang, N.Y.S.2
  • 4
    • 0021403721
    • "Speech coding in the auditory nerve: IV. Sounds with consonant-like dynamic characteristics,"
    • B. Delgutte and N. Y. S. Kiang, "Speech coding in the auditory nerve: IV. Sounds with consonant-like dynamic characteristics," J. Acoust. Soc. Amer., vol. 75, pp. 897-907, 1984.
    • (1984) J. Acoust. Soc. Amer. , vol.75 , pp. 897-907
  • 5
    • 0021403730
    • "Speech coding in the auditory nerve, V: Vowels in background noise,"
    • B. Delgutte and N. Y. S. Kiang, "Speech coding in the auditory nerve, V: Vowels in background noise," J. Acoust. Soc. Amer., vol. 75, pp. 908-918, 1984.
    • (1984) J. Acoust. Soc. Amer. , vol.75 , pp. 908-918
  • 6
    • 0022545937
    • "A temporal analysis of auditory-nerve fiber responses to spoken stop consonant-vowel syllables,"
    • June
    • L. H. Carney and C. D. Geisler, "A temporal analysis of auditory-nerve fiber responses to spoken stop consonant-vowel syllables," J. Acoust. Soc. Amer., vol. 79, pp. 1896-1914, June 1986.
    • (1986) J. Acoust. Soc. Amer. , vol.79 , pp. 1896-1914
    • Carney, L.H.1    Geisler, C.D.2
  • 7
    • 0023487893
    • "Responses of auditory nerve fibers to nasal consonant-vowel syllables,"
    • L. Deng and C. D. Geisler, "Responses of auditory nerve fibers to nasal consonant-vowel syllables," J. Acoust. Soc. Amer., vol. 82, pp. 1977-1988, 1987.
    • (1987) J. Acoust. Soc. Amer. , vol.82 , pp. 1977-1988
    • Deng, L.1    Geisler, C.D.2
  • 9
    • 0023841401
    • "Vowel processing by a model of the auditory periphery: A comparison to eighth-nerve responses,"
    • K. L. Payton, "Vowel processing by a model of the auditory periphery: A comparison to eighth-nerve responses," J. Acoust. Soc. Amer., vol. 83, pp. 145-162, 1988.
    • (1988) J. Acoust. Soc. Amer. , vol.83 , pp. 145-162
    • Payton, K.L.1
  • 10
    • 84928841878
    • "The acoustic features of speech sounds in a model of auditory processing: Vowels and voiceless fricatives,"
    • S. Shamma, "The acoustic features of speech sounds in a model of auditory processing: Vowels and voiceless fricatives," J. Phonet., vol. 16, pp. 77-91, 1988.
    • (1988) J. Phonet. , vol.16 , pp. 77-91
    • Shamma, S.1
  • 11
    • 0023516708
    • "A composite auditory model for processing speech sounds,"
    • Dec.
    • L. Deng and C. D. Geisler, "A composite auditory model for processing speech sounds," J. Acoust. Soc. Amer., vol. 82, pp. 2001-2012, Dec. 1987.
    • (1987) J. Acoust. Soc. Amer. , vol.82 , pp. 2001-2012
    • Deng, L.1    Geisler, C.D.2
  • 12
    • 0026477793
    • "Processing of acoustic signals in a cochlear model incorporating laterally coupled suppressive elements,"
    • Jan.
    • L. Deng, "Processing of acoustic signals in a cochlear model incorporating laterally coupled suppressive elements," Neural Networks, vol. 5, pp. 19-34, Jan. 1992.
    • (1992) Neural Networks , vol.5 , pp. 19-34
    • Deng, L.1
  • 13
    • 0028053082
    • "A computational model of the auditory periphery for speech and hearing research, I: Ascending path,"
    • Jan.
    • C. Giguere and P. Woodland, "A computational model of the auditory periphery for speech and hearing research, I: Ascending path," J. Acoust. Soc. Amer., vol. 95, pp. 331-342, Jan. 1994.
    • (1994) J. Acoust. Soc. Amer. , vol.95 , pp. 331-342
    • Giguere, C.1    Woodland, P.2
  • 14
    • 0028091217
    • "A computational model of the auditory periphery for speech and hearing research, II: Descending path,"
    • Jan.
    • C. Giguere and P. Woodland, "A computational model of the auditory periphery for speech and hearing research, II: Descending path," J. Acoust. Soc. Amer., vol. 95, pp. 343-349, Jan. 1994.
    • (1994) J. Acoust. Soc. Amer. , vol.95 , pp. 343-349
  • 15
    • 0023505424
    • "Responses of auditory-nerve fibers to multiple-tone complexes,"
    • Dec.
    • L. Deng, C. D. Geisler, and S. Greenberg, "Responses of auditory-nerve fibers to multiple-tone complexes," J. Acoust. Soc. Amer., vol. 82, pp. 1989-2000, Dec. 1987.
    • (1987) J. Acoust. Soc. Amer. , vol.82 , pp. 1989-2000
    • Deng, L.1    Geisler, C.D.2    Greenberg, S.3
  • 16
    • 0024162368
    • "Temporal coding of resonances by low-frequency auditory nerve fibers: Single-fiber responses and a population model,"
    • L. H. Carney and T. Yin, "Temporal coding of resonances by low-frequency auditory nerve fibers: Single-fiber responses and a population model," J. Neurophys., vol. 60, pp. 1653-1677, 1988.
    • (1988) J. Neurophys. , vol.60 , pp. 1653-1677
    • Carney, L.H.1    Yin, T.2
  • 17
    • 0025109524
    • "Time-domain analysis of auditory-nerve-fiber firing rates,"
    • Sept.
    • H. E. Secker-Walker and C. L. Searle, "Time-domain analysis of auditory-nerve-fiber firing rates," J. Acoust. Soc. Amer., vol. 88, pp. 1427-1436, Sept. 1990.
    • (1990) J. Acoust. Soc. Amer. , vol.88 , pp. 1427-1436
    • Secker-Walker, H.E.1    Searle, C.L.2
  • 18
    • 0027528776
    • "A model for the responses of low-frequency auditory nerve fibers in cat,"
    • Jan.
    • L. H. Carney, "A model for the responses of low-frequency auditory nerve fibers in cat," J. Acoust. Soc. Amer., vol. 93, pp. 401-417, Jan. 1993.
    • (1993) J. Acoust. Soc. Amer. , vol.93 , pp. 401-417
    • Carney, L.H.1
  • 20
    • 84928839596
    • "A composite model of the auditory periphery for the processing of speech,"
    • Jan.
    • L. Deng, C. Geisler, and S. Greenberg, "A composite model of the auditory periphery for the processing of speech," J. Phonet., vol. 16, pp. 93-108, Jan. 1988.
    • (1988) J. Phonet. , vol.16 , pp. 93-108
    • Deng, L.1    Geisler, C.2    Greenberg, S.3
  • 21
    • 0027588975
    • "Dynamic formant tracking of noisy speech using temporal analysis on outputs from a nonlinear cochlear model,"
    • May
    • L. Deng and I. Kheirallah, "Dynamic formant tracking of noisy speech using temporal analysis on outputs from a nonlinear cochlear model," IEEE Trans. Biomed. Eng., vol. 40, pp. 456-467, May 1993.
    • (1993) IEEE Trans. Biomed. Eng. , vol.40 , pp. 456-467
    • Deng, L.1    Kheirallah, I.2
  • 22
    • 0014090339
    • "Middle-ear characteristics of anesthetized cats,"
    • J. J. Guinan, Jr. and W. T. Peake, "Middle-ear characteristics of anesthetized cats," J. Acoust. Soc. Amer., vol. 41, pp. 1237-1261, 1967.
    • (1967) J. Acoust. Soc. Amer. , vol.41 , pp. 1237-1261
    • Guinan, J.J., Jr.1    Peake, W.T.2
  • 23
    • 0026189555
    • "Phonemic hidden Markov models with continuous mixture output densities for large vocabulary word recognition,"
    • July
    • L. Deng et al., "Phonemic hidden Markov models with continuous mixture output densities for large vocabulary word recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. 39, pp. 1677-1681, July 1991.
    • (1991) IEEE Trans. Acoust., Speech, Signal Processing , vol.39 , pp. 1677-1681
    • Deng, L.1
  • 25
    • 0026458724
    • "Structural design of a hidden Markov model based speech recognizer using multi-valued phonetic features: Comparison with segmental speech unit,"
    • Dec.
    • L. Deng and K. Erler, "Structural design of a hidden Markov model based speech recognizer using multi-valued phonetic features: Comparison with segmental speech unit," J. Acoust. Soc. Amer., vol. 92, pp. 3058-3067, Dec. 1992.
    • (1992) J. Acoust. Soc. Amer. , vol.92 , pp. 3058-3067
    • Deng, L.1    Erler, K.2
  • 26
    • 0025145948
    • "Modeling microsegments of stop consonants in a hidden Markov based word recognizer,"
    • L. Deng, M. Lennig, and P. Mermelstein, "Modeling microsegments of stop consonants in a hidden Markov based word recognizer," J. Acoust. Soc. Amer., vol. 87, pp. 2738-2747, 1990.
    • (1990) J. Acoust. Soc. Amer. , vol.87 , pp. 2738-2747
    • Deng, L.1    Lennig, M.2    Mermelstein, P.3
  • 28
    • 0027460571
    • "Adequacy of auditory models to predict internal human representation of speech sounds,"
    • Apr.
    • O. Ghitza, "Adequacy of auditory models to predict internal human representation of speech sounds," J. Acoust. Soc. Amer., vol. 93, pp. 2160-2171, Apr. 1993.
    • (1993) J. Acoust. Soc. Amer. , vol.93 , pp. 2160-2171
    • Ghitza, O.1
  • 29
    • 0028312802
    • "Auditory models and human performance in tasks related to speech coding and speech recognition,"
    • Jan.
    • O. Ghitza, "Auditory models and human performance in tasks related to speech coding and speech recognition," IEEE Trans. Speech Audio Processing, vol. 2, pp. 115-132, Jan. 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 115-132
  • 30
    • 84991416125
    • "Auditory nerve representation as a front-end for speech recognition in a noisy environment,"
    • O. Ghitza, "Auditory nerve representation as a front-end for speech recognition in a noisy environment," Comput. Speech Lang., vol. 1, pp. 109-131, 1986.
    • (1986) Comput. Speech Lang. , vol.1 , pp. 109-131
  • 32
    • 84925726310
    • "Auditory front-end in DTW word recognition under noisy, reverberant, and multi-speaker conditions,"
    • T. Obara and T. Hirahara, "Auditory front-end in DTW word recognition under noisy, reverberant, and multi-speaker conditions," J. Acoust. Soc. Amer., vol. 90, p. S274, 1990.
    • (1990) J. Acoust. Soc. Amer. , vol.90 , p. S274
    • Obara, T.1    Hirahara, T.2
  • 33
    • 0028195651
    • "Waveform-based speech recognition using hidden filter models: Parameter selection and sensitivity to power normalization,"
    • Jan.
    • H. Sheikhzadeh and L. Deng, "Waveform-based speech recognition using hidden filter models: Parameter selection and sensitivity to power normalization," IEEE Trans. Speech Audio Processing, vol. 2, pp. 80-89, Jan. 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 80-89
    • Sheikhzadeh, H.1    Deng, L.2
  • 34
    • 0027678649
    • "A stochastic model of speech incorporating hierarchical nonstationarity,"
    • Oct.
    • L. Deng, "A stochastic model of speech incorporating hierarchical nonstationarity," IEEE Trans. Speech Audio Processing, vol. 1, pp. 471-475, Oct. 1993.
    • (1993) IEEE Trans. Speech Audio Processing , vol.1 , pp. 471-475
    • Deng, L.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.