SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2009, Pages 2823-2826

Static and dynamic modulation spectrum for speech recognition

(3) Ganapathy, Sriram a Thomas, Samuel a Hermansky, Hynek a,b

a JOHNS HOPKINS UNIVERSITY (United States)

b Johns Hopkins University (United States)

Author keywords

Adaptive compression; Feature extraction for speech recognition; Frequency Domain Linear Prediction (FDLP); Modulation spectrum

Indexed keywords

ADAPTIVE COMPRESSION; ADAPTIVE LOOPS; FEATURE EXTRACTION TECHNIQUES; FREQUENCY DOMAINS; LINEAR PREDICTION; MODULATION SPECTRUM; PHONEME RECOGNITION; SPECTRAL COMPONENTS; SPEECH RECOGNITION SYSTEMS; STATIC AND DYNAMIC; SUB-BANDS; TELEPHONE SPEECH; TEMPORAL ENVELOPES;

ELECTRIC LOAD SHEDDING; FEATURE EXTRACTION; FREQUENCY DOMAIN ANALYSIS; FREQUENCY ESTIMATION; MODULATION; REMELTING; SPEECH COMMUNICATION;

SPEECH RECOGNITION;

EID: 70450218182 PISSN: None EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (15)

References (21)

1
- 0025041264
- Perceptual Linear Predictive (PLP) Analysis of Speech
- H. Hermansky, "Perceptual Linear Predictive (PLP) Analysis of Speech", J. Acoust. Soc. Am., Vol. 87(4), pp. 1738-1752, 1990.
- (1990) J. Acoust. Soc. Am , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

2
- 0028287770
- Effect of Reducing Slow Temporal Modulations on Speech Reception
- R. Drullman, J.M. Festen and R. Plomp,"Effect of Reducing Slow Temporal Modulations on Speech Reception", J. Acoust. Soc. Am., Vol. 95(5), pp. 2670-2680, 1994.
- (1994) J. Acoust. Soc. Am , vol.95 , Issue.5 , pp. 2670-2680
- Drullman, R.¹ Festen, J.M.² Plomp, R.³

3
- 0028823541
- Speech Recognition with Primarily Temporal Cues
- R.V Shannon, F.G. Zeng, V. Kamath, J. Wygonski, and M. Ekelid, "Speech Recognition with Primarily Temporal Cues", Science, Vol. 270(5234), pp. 303-304, 1995.
- (1995) Science , vol.270 , Issue.5234 , pp. 303-304
- Shannon, R.V.¹ Zeng, F.G.² Kamath, V.³ Wygonski, J.⁴ Ekelid, M.⁵

4
- 0019060580
- Predicting speech intelligibility in rooms from the modulation transfer function, I. General room acoustics
- T. Houtgast, H.J.M. Steeneken and R. Plomp, "Predicting speech intelligibility in rooms from the modulation transfer function, I. General room acoustics", Acoustica 46, pp. 60-72, 1980.
- (1980) Acoustica , vol.46 , pp. 60-72
- Houtgast, T.¹ Steeneken, H.J.M.² Plomp, R.³

5
- 0034842487
- Scalable and progressive audio codec
- M.S. Vinton and L.E. Atlas, "Scalable and progressive audio codec", Proc. ICASSP, pp. 3277-3280, 2001.
- (2001) Proc. ICASSP , pp. 3277-3280
- Vinton, M.S.¹ Atlas, L.E.²

6
- 70450185608
- Noise Suppression Based on Extending a Speech-Dominated Modulation Band
- T.H. Falk, S. Stadler, W.B. Kleijn and W.Y. Chan, "Noise Suppression Based on Extending a Speech-Dominated Modulation Band", Interspeech, pp. 970-973, 2007.
- (2007) Interspeech , pp. 970-973
- Falk, T.H.¹ Stadler, S.² Kleijn, W.B.³ Chan, W.Y.⁴

7
- 85009254284
- TRAPS - Classifiers of Temporal Patterns
- Sydney, Australia
- H. Hermansky and S. Sharma, "TRAPS - Classifiers of Temporal Patterns", Proc. of ICSLP, Sydney, Australia, Vol. 3, pp. 1003-1006, 1998.
- (1998) Proc. of ICSLP , vol.3 , pp. 1003-1006
- Hermansky, H.¹ Sharma, S.²

8
- 0032136330
- Robust speech recognition using the modulation spectrogram
- B.E.D. Kingsbury, N. Morgan and S. Greenberg, "Robust speech recognition using the modulation spectrogram", Speech Comm., Vol. 25 (1-3), pp. 117-132, 1998.
- (1998) Speech Comm , vol.25 , Issue.1-3 , pp. 117-132
- Kingsbury, B.E.D.¹ Morgan, N.² Greenberg, S.³

9
- 0033709098
- Tandem Connectionist Feature Extraction for Conventional HMM Systems
- H. Hermansky, D.P.W. Ellis, and S. Sharma, "Tandem Connectionist Feature Extraction for Conventional HMM Systems", Proc. of ICASSP, Vol. 3, pp. 1635-1638, 2000.
- (2000) Proc. of ICASSP , vol.3 , pp. 1635-1638
- Hermansky, H.¹ Ellis, D.P.W.² Sharma, S.³

10
- 0032828464
- A model of auditory perception as front end for automatic speech recognition
- J. Tchorz and B. Kollmeier,"A model of auditory perception as front end for automatic speech recognition", J. Acoust. Soc. Am., Vol. 106(4), pp. 2040-2050, 1999.
- (1999) J. Acoust. Soc. Am , vol.106 , Issue.4 , pp. 2040-2050
- Tchorz, J.¹ Kollmeier, B.²

11
- 58649102246
- Modulation spectrum based features for phoneme recognition in noisy speech
- S. Ganapathy, S. Thomas, and H. Hermansky, "Modulation spectrum based features for phoneme recognition in noisy speech", JASA Express Letters, Vol. 125 (1), pp. EL8-EL12, 2009.
- (2009) JASA Express Letters , vol.125 , Issue.1
- Ganapathy, S.¹ Thomas, S.² Hermansky, H.³

12
- 0003573244
- Kluwer Academic Publishers
- H. Boulard and N. Morgan, Connectionist Speech Recognition - A Hybrid Approach, Kluwer Academic Publishers, 1994.
- (1994) Connectionist Speech Recognition - A Hybrid Approach
- Boulard, H.¹ Morgan, N.²

13
- 0016495091
- Linear Prediction: A Tutorial Review
- J. Makhoul, "Linear Prediction: A Tutorial Review", Proc. of the IEEE, Vol 63(4), pp. 561-580, 1975.
- (1975) Proc. of the IEEE , vol.63 , Issue.4 , pp. 561-580
- Makhoul, J.¹

14
- 36248966385
- Autoregressive modelling of temporal envelopes
- M. Athineos and D.P.W. Ellis, "Autoregressive modelling of temporal envelopes",IEEE Trans. Speech and Audio Proc., Vol. 55, pp. 5237-5245, 2007.
- (2007) IEEE Trans. Speech and Audio Proc , vol.55 , pp. 5237-5245
- Athineos, M.¹ Ellis, D.P.W.²

15
- 0032634932
- Computing the Discrete-Time Analytic Signal via FFT
- L.S. Marple, "Computing the Discrete-Time Analytic Signal via FFT", IEEE Trans. on Acoustics, Speech and Signal Processing, Vol. 47, pp. 2600-2603, 1999.
- (1999) IEEE Trans. on Acoustics, Speech and Signal Processing , vol.47 , pp. 2600-2603
- Marple, L.S.¹

16
- 70450130929
- Exploiting Contextual Information for Improved Phoneme Recognition
- J. Pinto, B. Yegnanarayana, H. Hermansky and M. M. Doss, "Exploiting Contextual Information for Improved Phoneme Recognition", Proc. of Interspeech, pp. 1817-1820, 2007.
- (2007) Proc. of Interspeech , pp. 1817-1820
- Pinto, J.¹ Yegnanarayana, B.² Hermansky, H.³ Doss, M.M.⁴

17
- 33745213373
- Multi-resolution RASTA filtering for TANDEM-based ASR
- H. Hermansky and P. Fousek, "Multi-resolution RASTA filtering for TANDEM-based ASR", Proc. of INTERSPEECH, pp. 361-364, 2005.
- (2005) Proc. of INTERSPEECH , pp. 361-364
- Hermansky, H.¹ Fousek, P.²

18
- 0030711174
- The modulation spectrogram: In pursuit of an invariant representation of speech
- S. Greenberg and B.E.D. Kingsbury, "The modulation spectrogram: in pursuit of an invariant representation of speech", Proc. ICASSP, Vol. 3, pp. 1647-1650, 1997.
- (1997) Proc. ICASSP , vol.3 , pp. 1647-1650
- Greenberg, S.¹ Kingsbury, B.E.D.²

19
- 33745533302
- The Development of AMI System for Transcription of Speech in Meetings
- T. Hain et al., "The Development of AMI System for Transcription of Speech in Meetings", Proc. of MLMI, pp. 344356, 2005.
- (2005) Proc. of MLMI , pp. 344356
- Hain, T.¹

20
- 0028517164
- RASTA processing of speech
- H. Hermansky and N. Morgan, "RASTA processing of speech", IEEE Trans. Speech and Audio Proc., vol. 2, pp. 578-589, 1994.
- (1994) IEEE Trans. Speech and Audio Proc , vol.2 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

21
- 0141699847
- ETSI ES 202 050 v1.1.1 STQ; Distributed speech recognition; Advanced front-end feature extraction algorithm; Compression algorithms
- "ETSI ES 202 050 v1.1.1 STQ; Distributed speech recognition; Advanced front-end feature extraction algorithm; Compression algorithms", 2002.
- (2002)

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.