메뉴 건너뛰기




Volumn 2, Issue , 1998, Pages 721-724

Incorporating information from syllable-length time scales into automatic speech recognition

Author keywords

[No Author keywords available]

Indexed keywords

AUTOMATIC SPEECH RECOGNITION; AUTOMATIC SPEECH RECOGNITION SYSTEM; BASELINE SYSTEMS; EXPERIMENTAL SYSTEM; RECOGNITION SYSTEMS; RECOGNITION UNITS; SPEECH PERCEPTION; WORD ERROR RATE;

EID: 84892186467     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.1998.675366     Document Type: Conference Paper
Times cited : (56)

References (19)
  • 1
    • 0030369532 scopus 로고    scopus 로고
    • Intelligibility of speech with filtered time trajectories of spectral envelopes
    • Takayuki Arai, Misha Pavel, Hynek Hermansky, and Carlos Avendano. Intelligibility of speech with filtered time trajectories of spectral envelopes. In ICSLP, volume 4, pages 2490-2493,1996.
    • (1996) ICSLP, Volume 4 , pp. 2490-2493
    • Arai, T.1    Pavel, M.2    Hermansky, H.3    Avendano, C.4
  • 2
    • 33745246234 scopus 로고    scopus 로고
    • Multiresolution channel normalization for ASR in reverberant environments
    • Rhodes, Greece, September. ESCA
    • Carlos Avendano, Sangita Tibrewala, and Hynek Hermansky. Multiresolution channel normalization for ASR in reverberant environments. In Eurospeech, volume 3, pages 11071110, Rhodes, Greece, September 1997. ESCA.
    • (1997) Eurospeech , vol.3 , pp. 11071110
    • Avendano, C.1    Tibrewala, S.2    Hermansky, H.3
  • 4
    • 85135196323 scopus 로고
    • New telephone speech corpora at CSLU
    • September
    • R. A. Cole, M. Noel, T. Lander, and T. Durham. New telephone speech corpora at CSLU. In Eurospeech, volume 1, pages 821-824, September 1995.
    • (1995) Eurospeech , vol.1 , pp. 821-824
    • Cole, R.A.1    Noel, M.2    Lander, T.3    Durham, T.4
  • 7
    • 0343249601 scopus 로고    scopus 로고
    • Using multiple time scales in a multi-stream speech recognition system
    • Rhodes, Greece, October
    • Stephane Dupont, Herve Bourlard, and Christophe Ris. Using multiple time scales in a multi-stream speech recognition system. In Eurospeech, pages 3-6, Rhodes, Greece, October 1997.
    • (1997) Eurospeech , pp. 3-6
    • Dupont, S.1    Bourlard, H.2    Ris, C.3
  • 10
    • 0030711174 scopus 로고    scopus 로고
    • The modulation spectrogram: In pursuit of an invariant representation of speech
    • Munich, Germany, April. IEEE
    • Steven Greenberg and Brian E. D. Kingsbury. The modulation spectrogram: In pursuit of an invariant representation of speech. In ICASSP, volume 3, pages 1647-1650, Munich, Germany, April 1997. IEEE.
    • (1997) ICASSP , vol.3 , pp. 1647-1650
    • Greenberg, S.1    Kingsbury, B.E.D.2
  • 12
    • 0015553712 scopus 로고
    • The modulation transfer function in room acoustics as a predictor of speech intelligibility
    • T. Houtgast and H. J. M. Steeneken. The modulation transfer function in room acoustics as a predictor of speech intelligibility. Acustica, 28:66-73,1973.
    • (1973) Acustica , vol.28 , pp. 66-73
    • Houtgast, T.1    Steeneken, H.J.M.2
  • 13
  • 14
    • 0030682292 scopus 로고    scopus 로고
    • Recognizing reverberant speech with rasta-plp
    • Munich, Germany, April. IEEE
    • Brian E. D. Kingsbury and Nelson Morgan. Recognizing reverberant speech with RASTA-PLP. In ICASSP, volume 2, pages 1259-1262, Munich, Germany, April 1997. IEEE.
    • (1997) ICASSP , vol.2 , pp. 1259-1262
    • Kingsbury, D.B.E.1    Morgan, N.2
  • 15
    • 0015307394 scopus 로고
    • Preperceptual images, processing time and perceptual units in auditory perception
    • Dominic W. Massaro. Preperceptual images, processing time and perceptual units in auditory perception. Psychological Review, 79(2):124-145, 1972.
    • (1972) Psychological Review , vol.79 , Issue.2 , pp. 124-145
    • Massaro, D.W.1
  • 16
    • 0029725455 scopus 로고    scopus 로고
    • Efficient evaluation of the LVCSR search space using the noway decoder
    • Atlanta, Georgia, May. IEEE
    • Steve Renals and Mike Hochberg. Efficient evaluation of the LVCSR search space using the noway decoder. In ICASSP, volume 1, pages 149-152, Atlanta, Georgia, May 1996. IEEE.
    • (1996) ICASSP , vol.1 , pp. 149-152
    • Renals, S.1    Hochberg, M.2
  • 17
    • 0018906941 scopus 로고
    • A physical method for measuring speech-transmission quality
    • January
    • Herman J. M. Steeneken and Tammo Houtgast. A physical method for measuring speech-transmission quality. Journal of the Acoustical Society of America, 67(1 ):318-326, January 1980.
    • (1980) Journal of the Acoustical Society of America , vol.67 , Issue.1 , pp. 318-326
    • Steeneken, H.J.M.1    Houtgast, T.2
  • 18
    • 84892184434 scopus 로고    scopus 로고
    • Perceptual processing of speech and other perceptual patterns: Some similarities and differences
    • 1998 Oxford University Press,. To appear
    • Richard M. Warren. Perceptual processing of speech and other perceptual patterns: Some similarities and differences. In Steven Greenberg and William Ainsworth, editors, Listening to Speech: An Auditory Perspective. Oxford University Press, 1998. To appear.
    • (1998) Listening to Speech: An Auditory Perspective
    • Warren, R.M.1
  • 19
    • 0030643233 scopus 로고    scopus 로고
    • Integrating syllable boundary information into speech recognition
    • Munich, Germany, April. IEEE
    • Su-Lin Wu, Michael L. Shire, Steven Greenberg, and Nelson Morgan. Integrating syllable boundary information into speech recognition. In ICASSP, volume 1, Munich, Germany, April 1997. IEEE.
    • (1997) ICASSP , vol.1
    • Wu, S.-L.1    Shire, M.L.2    Greenberg, S.3    Morgan, N.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.