메뉴 건너뛰기




Volumn , Issue , 2009, Pages 2823-2826

Static and dynamic modulation spectrum for speech recognition

Author keywords

Adaptive compression; Feature extraction for speech recognition; Frequency Domain Linear Prediction (FDLP); Modulation spectrum

Indexed keywords

ADAPTIVE COMPRESSION; ADAPTIVE LOOPS; FEATURE EXTRACTION TECHNIQUES; FREQUENCY DOMAINS; LINEAR PREDICTION; MODULATION SPECTRUM; PHONEME RECOGNITION; SPECTRAL COMPONENTS; SPEECH RECOGNITION SYSTEMS; STATIC AND DYNAMIC; SUB-BANDS; TELEPHONE SPEECH; TEMPORAL ENVELOPES;

EID: 70450218182     PISSN: None     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (15)

References (21)
  • 1
    • 0025041264 scopus 로고
    • Perceptual Linear Predictive (PLP) Analysis of Speech
    • H. Hermansky, "Perceptual Linear Predictive (PLP) Analysis of Speech", J. Acoust. Soc. Am., Vol. 87(4), pp. 1738-1752, 1990.
    • (1990) J. Acoust. Soc. Am , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 2
    • 0028287770 scopus 로고
    • Effect of Reducing Slow Temporal Modulations on Speech Reception
    • R. Drullman, J.M. Festen and R. Plomp,"Effect of Reducing Slow Temporal Modulations on Speech Reception", J. Acoust. Soc. Am., Vol. 95(5), pp. 2670-2680, 1994.
    • (1994) J. Acoust. Soc. Am , vol.95 , Issue.5 , pp. 2670-2680
    • Drullman, R.1    Festen, J.M.2    Plomp, R.3
  • 3
    • 0028823541 scopus 로고
    • Speech Recognition with Primarily Temporal Cues
    • R.V Shannon, F.G. Zeng, V. Kamath, J. Wygonski, and M. Ekelid, "Speech Recognition with Primarily Temporal Cues", Science, Vol. 270(5234), pp. 303-304, 1995.
    • (1995) Science , vol.270 , Issue.5234 , pp. 303-304
    • Shannon, R.V.1    Zeng, F.G.2    Kamath, V.3    Wygonski, J.4    Ekelid, M.5
  • 4
    • 0019060580 scopus 로고
    • Predicting speech intelligibility in rooms from the modulation transfer function, I. General room acoustics
    • T. Houtgast, H.J.M. Steeneken and R. Plomp, "Predicting speech intelligibility in rooms from the modulation transfer function, I. General room acoustics", Acoustica 46, pp. 60-72, 1980.
    • (1980) Acoustica , vol.46 , pp. 60-72
    • Houtgast, T.1    Steeneken, H.J.M.2    Plomp, R.3
  • 5
    • 0034842487 scopus 로고    scopus 로고
    • Scalable and progressive audio codec
    • M.S. Vinton and L.E. Atlas, "Scalable and progressive audio codec", Proc. ICASSP, pp. 3277-3280, 2001.
    • (2001) Proc. ICASSP , pp. 3277-3280
    • Vinton, M.S.1    Atlas, L.E.2
  • 6
    • 70450185608 scopus 로고    scopus 로고
    • Noise Suppression Based on Extending a Speech-Dominated Modulation Band
    • T.H. Falk, S. Stadler, W.B. Kleijn and W.Y. Chan, "Noise Suppression Based on Extending a Speech-Dominated Modulation Band", Interspeech, pp. 970-973, 2007.
    • (2007) Interspeech , pp. 970-973
    • Falk, T.H.1    Stadler, S.2    Kleijn, W.B.3    Chan, W.Y.4
  • 7
    • 85009254284 scopus 로고    scopus 로고
    • TRAPS - Classifiers of Temporal Patterns
    • Sydney, Australia
    • H. Hermansky and S. Sharma, "TRAPS - Classifiers of Temporal Patterns", Proc. of ICSLP, Sydney, Australia, Vol. 3, pp. 1003-1006, 1998.
    • (1998) Proc. of ICSLP , vol.3 , pp. 1003-1006
    • Hermansky, H.1    Sharma, S.2
  • 8
    • 0032136330 scopus 로고    scopus 로고
    • Robust speech recognition using the modulation spectrogram
    • B.E.D. Kingsbury, N. Morgan and S. Greenberg, "Robust speech recognition using the modulation spectrogram", Speech Comm., Vol. 25 (1-3), pp. 117-132, 1998.
    • (1998) Speech Comm , vol.25 , Issue.1-3 , pp. 117-132
    • Kingsbury, B.E.D.1    Morgan, N.2    Greenberg, S.3
  • 9
    • 0033709098 scopus 로고    scopus 로고
    • Tandem Connectionist Feature Extraction for Conventional HMM Systems
    • H. Hermansky, D.P.W. Ellis, and S. Sharma, "Tandem Connectionist Feature Extraction for Conventional HMM Systems", Proc. of ICASSP, Vol. 3, pp. 1635-1638, 2000.
    • (2000) Proc. of ICASSP , vol.3 , pp. 1635-1638
    • Hermansky, H.1    Ellis, D.P.W.2    Sharma, S.3
  • 10
    • 0032828464 scopus 로고    scopus 로고
    • A model of auditory perception as front end for automatic speech recognition
    • J. Tchorz and B. Kollmeier,"A model of auditory perception as front end for automatic speech recognition", J. Acoust. Soc. Am., Vol. 106(4), pp. 2040-2050, 1999.
    • (1999) J. Acoust. Soc. Am , vol.106 , Issue.4 , pp. 2040-2050
    • Tchorz, J.1    Kollmeier, B.2
  • 11
    • 58649102246 scopus 로고    scopus 로고
    • Modulation spectrum based features for phoneme recognition in noisy speech
    • S. Ganapathy, S. Thomas, and H. Hermansky, "Modulation spectrum based features for phoneme recognition in noisy speech", JASA Express Letters, Vol. 125 (1), pp. EL8-EL12, 2009.
    • (2009) JASA Express Letters , vol.125 , Issue.1
    • Ganapathy, S.1    Thomas, S.2    Hermansky, H.3
  • 13
    • 0016495091 scopus 로고
    • Linear Prediction: A Tutorial Review
    • J. Makhoul, "Linear Prediction: A Tutorial Review", Proc. of the IEEE, Vol 63(4), pp. 561-580, 1975.
    • (1975) Proc. of the IEEE , vol.63 , Issue.4 , pp. 561-580
    • Makhoul, J.1
  • 14
    • 36248966385 scopus 로고    scopus 로고
    • Autoregressive modelling of temporal envelopes
    • M. Athineos and D.P.W. Ellis, "Autoregressive modelling of temporal envelopes",IEEE Trans. Speech and Audio Proc., Vol. 55, pp. 5237-5245, 2007.
    • (2007) IEEE Trans. Speech and Audio Proc , vol.55 , pp. 5237-5245
    • Athineos, M.1    Ellis, D.P.W.2
  • 16
  • 17
    • 33745213373 scopus 로고    scopus 로고
    • Multi-resolution RASTA filtering for TANDEM-based ASR
    • H. Hermansky and P. Fousek, "Multi-resolution RASTA filtering for TANDEM-based ASR", Proc. of INTERSPEECH, pp. 361-364, 2005.
    • (2005) Proc. of INTERSPEECH , pp. 361-364
    • Hermansky, H.1    Fousek, P.2
  • 18
    • 0030711174 scopus 로고    scopus 로고
    • The modulation spectrogram: In pursuit of an invariant representation of speech
    • S. Greenberg and B.E.D. Kingsbury, "The modulation spectrogram: in pursuit of an invariant representation of speech", Proc. ICASSP, Vol. 3, pp. 1647-1650, 1997.
    • (1997) Proc. ICASSP , vol.3 , pp. 1647-1650
    • Greenberg, S.1    Kingsbury, B.E.D.2
  • 19
    • 33745533302 scopus 로고    scopus 로고
    • The Development of AMI System for Transcription of Speech in Meetings
    • T. Hain et al., "The Development of AMI System for Transcription of Speech in Meetings", Proc. of MLMI, pp. 344356, 2005.
    • (2005) Proc. of MLMI , pp. 344356
    • Hain, T.1
  • 21
    • 0141699847 scopus 로고    scopus 로고
    • ETSI ES 202 050 v1.1.1 STQ; Distributed speech recognition; Advanced front-end feature extraction algorithm; Compression algorithms
    • "ETSI ES 202 050 v1.1.1 STQ; Distributed speech recognition; Advanced front-end feature extraction algorithm; Compression algorithms", 2002.
    • (2002)


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.