SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2008, Pages 2262-2265

Short- and long-term dynamic features for robust speech recognition

(3) Fukuda, Takashi a Ichikawa, Osamu a Nishimura, Masafumi a

Author keywords

Automatic speech recognition; Dynamic feature; Long term temporal information; Noise robustness

Indexed keywords

AUTOMATIC SPEECH RECOGNITION; AUTOMATIC SPEECH RECOGNITION SYSTEM; CEPSTRUM; DYNAMIC FEATURE; DYNAMIC FEATURES; FEATURE PARAMETERS; HIGH-DIMENSIONAL FEATURE SPACE; LONG TERM DYNAMICS; MEL-FREQUENCY CEPSTRAL COEFFICIENTS; NOISE-ROBUSTNESS; ROBUST SPEECH RECOGNITION; SPECTRAL VARIATION; SPEECH CORPORA; TEMPORAL INFORMATION;

FEATURE EXTRACTION; SPEECH COMMUNICATION;

SPEECH RECOGNITION;

EID: 84867218137 PISSN: None EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (4)

References (15)

1
- 0022667694
- Speaker-independent isolated word recognition using dynamic features of speech spectrum
- S. Furui, "Speaker-independent isolated word recognition using dynamic features of speech spectrum", IEEE Trans. Acoust., Speech and Signal Processing, Vol. ASSP-34, No. 1, pp. 52-59 (1986).
- (1986) IEEE Trans. Acoust., Speech and Signal Processing , vol.ASSP-34 , Issue.1 , pp. 52-59
- Furui, S.¹

2
- 0028517164
- RASTA processing of speech
- H. Hermansky and N. Morgan, "RASTA processing of speech", IEEE Trans. Speech and Audio Processing, Vol. 2, No. 4, pp. 578-589 (1994).
- (1994) IEEE Trans. Speech and Audio Processing , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

3
- 1842796666
- On the important modulation-frequency bands of speech for human speaker recognition
- T. Arai, M. Takahashi, N. Kanedera, Y. Takano, and Y. Murahara, "On the important modulation-frequency bands of speech for human speaker recognition", Proc. ICSLP, Vol. III, pp. 774-777 (2000).
- (2000) Proc. ICSLP , vol.3 , pp. 774-777
- Arai, T.¹ Takahashi, M.² Kanedera, N.³ Takano, Y.⁴ Murahara, Y.⁵

4
- 0019555090
- Cepstral analysis technique for automatic speaker verification
- S.Furui, "Cepstral analysis technique for automatic speaker verification," IEEE Trans. Acoust., Speech and Signal Processing, Vol. 29, No. 2, pp. 254-272 (1981).
- (1981) IEEE Trans. Acoust., Speech and Signal Processing , vol.29 , Issue.2 , pp. 254-272
- Furui, S.¹

5
- 0032658253
- TRAPS - Classifiers of Temporal Patterns
- H. Hermansky and S. Sharma, "TRAPS - Classifiers of Temporal Patterns", Proc. ICASSP '99, Vol. I, pp. 289-292 (1999).
- (1999) Proc. ICASSP '99 , vol.1 , pp. 289-292
- Hermansky, H.¹ Sharma, S.²

6
- 27144509179
- Learning long-term temporal features in LVCSR using neural networks
- B. Chen, Q. Zhu, and N. Morgan, "Learning long-term temporal features in LVCSR using neural networks", Proc. ICSLP, pp. 612-615 (2004).
- (2004) Proc. ICSLP , pp. 612-615
- Chen, B.¹ Zhu, Q.² Morgan, N.³

7
- 4544224866
- TRAPping conversational speech: Extending TRAP/Tandem approaches to conversational telephone speech recognition
- N. Morgan, B. Chen, Q. Zhu, and A. Stolcke, "TRAPping conversational speech: Extending TRAP/Tandem approaches to conversational telephone speech recognition", Proc. ICASSP'04, Vol. I, pp. 537-540 (2004).
- (2004) Proc. ICASSP'04 , vol.1 , pp. 537-540
- Morgan, N.¹ Chen, B.² Zhu, Q.³ Stolcke, A.⁴

8
- 0038694713
- The analysis of speech in different temporal integration windows: Cerebral lateralization as asymmetric sampling in time
- D. Poeppel, "The analysis of speech in different temporal integration windows: cerebral lateralization as asymmetric sampling in time", Speech Communication, Vol. 41, pp. 245- 255 (2003).
- (2003) Speech Communication , vol.41 , pp. 245-255
- Poeppel, D.¹

9
- 60849117157
- Static and dynamic spectral features: Their noise robustness and optimal weights for ASR
- C. Yang, F. K. Soong, and T. Lee, "Static and dynamic spectral features: Their noise robustness and optimal weights for ASR," IEEE Trans. on Audio, Speech, and Language Processing, Vol. 15, No. 3, pp. 1087-1097, 2007.
- (2007) IEEE Trans. on Audio, Speech, and Language Processing , vol.15 , Issue.3 , pp. 1087-1097
- Yang, C.¹ Soong, F.K.² Lee, T.³

10
- 24144495584
- Construction and evaluation a large in-car speech corpus
- K. Takeda, H. Fujimura, K. Itoh, N. Kawaguchi, S. Matsubara, and F. Itakura, "Construction and evaluation a large in-car speech corpus," IEICE Trans. on Information and Systems, Vol. E88-D, No. 3, pp. 553-561 (2005).
- (2005) IEICE Trans. on Information and Systems , vol.E88-D , Issue.3 , pp. 553-561
- Takeda, K.¹ Fujimura, H.² Itoh, K.³ Kawaguchi, N.⁴ Matsubara, S.⁵ Itakura, F.⁶

11
- 0027957839
- Effect of temporal envelope smearing on speech perception
- R. Drullman, J. M. Festen, and R. Plomp, "Effect of temporal envelope smearing on speech perception," J. Acoust. Soc. Amer., Vol. 95, pp. 1053-1064 (1994).
- (1994) J. Acoust. Soc. Amer. , vol.95 , pp. 1053-1064
- Drullman, R.¹ Festen, J.M.² Plomp, R.³

12
- 0028287770
- Effect of reducing slow temporal modulations on speech perception
- R. Drullman, J. M. Festen, and R. Plomp, "Effect of reducing slow temporal modulations on speech perception," J. Acoust. Soc. Amer., Vol. 95, pp. 2670-2680 (1994).
- (1994) J. Acoust. Soc. Amer. , vol.95 , pp. 2670-2680
- Drullman, R.¹ Festen, J.M.² Plomp, R.³

13
- 0034817674
- Time and frequency filtering of filter-bank energies for robust HMM speech recognition
- C. Nadeu, D. Macho, and J. Hernando, "Time and frequency filtering of filter-bank energies for robust HMM speech recognition," Speech Communication, Vol. 34, pp. 93-114 (2001).
- (2001) Speech Communication , vol.34 , pp. 93-114
- Nadeu, C.¹ Macho, D.² Hernando, J.³

14
- 84856269531
- Desired characteristics of modulation spectrum for robust automatic speech recognition
- N. Kanedera, H. Hermansky, and T. Arai, "Desired characteristics of modulation spectrum for robust automatic speech recognition," Proc. ICASSP'98, pp.613-616 (1998).
- (1998) Proc. ICASSP'98 , pp. 613-616
- Kanedera, N.¹ Hermansky, H.² Arai, T.³

15
- 84867218934
- Phone-duration-dependent long-term dynamic features for a stochastic model-based voice activity detection
- T. Fukuda, O. Ichikawa, and M. Nishimura, "Phone-duration-dependent long-term dynamic features for a stochastic model-based voice activity detection," Interspeech 2008.
- Interspeech 2008
- Fukuda, T.¹ Ichikawa, O.² Nishimura, M.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.