SCOPUS Information Search Platform

Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010

Volume , Issue , 2010, Pages 1181-1184

Using spectro-temporal features to improve AFE feature extraction for ASR

(2) Ravuri, Suman V. a,b; Morgan, Nelson a,b

a INTERNATIONAL COMPUTER SCIENCE INSTITUTE (United States)

b UNIVERSITY OF CALIFORNIA (United States)

Author keywords

Automatic speech recognition; Spectro temporal features

Indexed keywords

SPEECH COMMUNICATION;

AUTOMATIC SPEECH RECOGNITION; FRONT END; NOISY CONDITIONS; ROBUST RECOGNITION; TEMPORAL APPROACH; TEMPORAL FEATURES; WIENER FILTERING; FEATURES EXTRACTION; FILTERING METHOD; SPECTRO-TEMPORAL FEATURE; WIENER-FILTERING;

SPEECH RECOGNITION;

EID: 79959814963 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (15)

References (19)

1
- 4544245839
- Two-stage mel-warped wiener filter for robust speech recognition
- Agarwal, A., Cheng, Y.M., "Two-stage Mel-warped Wiener Filter for Robust Speech Recognition". The 1999 International Workshop on Automatic Speech Recognition and Understanding, pp. 67-70, 1999.
- (1999) The 1999 International Workshop on Automatic Speech Recognition and Understanding , pp. 67-70
- Agarwal, A.¹ Cheng, Y.M.²

2
- 0030355935
- A new ASR approach based on independent processing and recombination of partial frequency bands
- Philadelphia, PA
- Bourlard, H. and Dupont, S., "A new ASR approach based on independent processing and recombination of partial frequency bands", In Proc. of Intl. Conf. on Spoken Language Processing, Philadelphia, PA, pp. 422-425, 1996.
- (1996) Proc. of Intl. Conf. on Spoken Language Processing , pp. 422-425
- Bourlard, H.¹ Dupont, S.²

3
- 0040290402
- Spectro-temporal modulation transfer functions and speech intelligibility
- Chi, T., Gao, Y., Guyton, M.C., Ru, P., and Shamma, S.A., "Spectro-temporal modulation transfer functions and speech intelligibility", J. Acoust. Soc. Am., 106(5):2719-2732, 1999.
- (1999) J. Acoust. Soc. Am. , vol.106 , Issue.5 , pp. 2719-2732
- Chi, T.¹ Gao, Y.² Guyton, M.C.³ Ru, P.⁴ Shamma, S.A.⁵

4
- 85135278272
- Telephone speech corpus development at CSLU
- Yokohama, Japan
- Cole, R., Fanty, M., Noel, M. and Lander, T. "Telephone speech corpus development at CSLU", in Proc. Int. Conf. Spoken Lang. Proc., Yokohama, Japan, pp. 1815-1818, 1994.
- (1994) Proc. Int. Conf. Spoken Lang. Proc. , pp. 1815-1818
- Cole, R.¹ Fanty, M.² Noel, M.³ Lander, T.⁴

5
- 51449087857
- Hierarchical spectro-temporal features for robust speech recognition
- Las Vegas, USA
- Domont, X., Heckmann, M., Joublin, F., Goerick, C., "Hierarchical spectro-temporal features for robust speech recognition", In Proc. ICASSP, Las Vegas, USA, pp. 4417-4420, 2008.
- (2008) Proc. ICASSP , pp. 4417-4420
- Domont, X.¹ Heckmann, M.² Joublin, F.³ Goerick, C.⁴

6
- 4544262301
- ETSI standard doc ETSI ES 202 050 Ver.1.1.1 (2002-10)
- ETSI standard doc. "Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Advanced Feature Extraction Algorithm", ETSI ES 202 050 Ver.1.1.1 (2002-10).
- Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Advanced Feature Extraction Algorithm

7
- 84867191742
- Noisy numbers data and numbers testbeds
- Berkeley, CA
- Gelbart, D., "Noisy numbers data and numbers testbeds", International Computer Science Institute, Berkeley, CA. http://www.icsi.berkeley.edu/speech/papers/gelbart-ms/.
- International Computer Science Institute
- Gelbart, D.¹

8
- 0033709098
- Tandem connectionist feature extraction for conventional HMM systems
- Istanbul, Turkey
- Hermansky, H., Ellis, D., Sharma, S., "Tandem connectionist feature extraction for conventional HMM systems", in Proc. ICASSP, Istanbul, Turkey, pp. 1635-1638, 2000.
- (2000) Proc. ICASSP , pp. 1635-1638
- Hermansky, H.¹ Ellis, D.² Sharma, S.³

9
- 33745213373
- Multi-resolution RASTA filtering for TANDEM-based ASR
- 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
- Hermansky, H., Fousek, P., "Multi-resolution RASTA filtering for TANDEM-based ASR", In Proceedings of Interspeech, Lisbon, Portugal, pp. 361-364, 2005. (Pubitemid 43908074)
- (2005) 9th European Conference on Speech Communication and Technology , pp. 361-364
- Hermansky, H.¹ Fousek, P.²

10
- 0002787767
- The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
- Paris, France
- Hirsch, H.G., and Pearce, D., "The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions", in ISCA ITRW ASR: Challenges for the Next Millennium, Paris, France, pp. 18-20, 2000.
- (2000) ISCA ITRW ASR: Challenges for the Next Millennium , pp. 18-20
- Hirsch, H.G.¹ Pearce, D.²

11
- 0032676337
- On the relative importance of various components of the modulation spectrum for automatic speech recognition
- Kanedera, N., Arai, T., Hermansky, H., Pavel, M., "On the relative importance of various components of the modulation spectrum for automatic speech recognition", Speech Communication, 28:43-55, 1999.
- (1999) Speech Communication , vol.28 , pp. 43-55
- Kanedera, N.¹ Arai, T.² Hermansky, H.³ Pavel, M.⁴

12
- 85009227802
- Localized spectro-temporal features for automatic speech recognition
- Kleinschmidt, M., "Localized spectro-temporal features for automatic speech recognition", in Proceedings of Eurospeech, pp. 2573-2576, 2003.
- (2003) Proceedings of Eurospeech , pp. 2573-2576
- Kleinschmidt, M.¹

13
- 44849137298
- Improved tone modeling for mandarin broadcast news speech recognition
- Pittsburgh, PA
- Lei, X., Siu, M., Hwang, M.Y., Ostendorf, M., and Lee, T. "Improved Tone Modeling for Mandarin Broadcast News Speech Recognition", in Proc. of Intl. Conf. of Spoken Language Processing, Pittsburgh, PA, pp. 1237-1240, 2006.
- (2006) Proc. of Intl. Conf. of Spoken Language Processing , pp. 1237-1240
- Lei, X.¹ Siu, M.² Hwang, M.Y.³ Ostendorf, M.⁴ Lee, T.⁵

14
- 34047272330
- Discrimination of speech from nonspeech based on multiscale spectro-temporal modulations
- Mesgarani, N., Slaney, M., and Shamma, S., "Discrimination of speech from nonspeech based on multiscale spectro-temporal modulations", IEEE Trans. Audio, Speech, and Language Proc., 14(3):920-929, 2006.
- (2006) IEEE Trans. Audio, Speech, and Language Proc. , vol.14 , Issue.3 , pp. 920-929
- Mesgarani, N.¹ Slaney, M.² Shamma, S.³

15
- 0141676589
- New entropy based combination rules in HMM/ANN multi-stream ASR
- Hong Kong
- Misra, H., Bourlard, H., Tyagi, V., "New entropy based combination rules in HMM/ANN multi-stream ASR", in Proc. ICASSP, pp. II-741-4 vol.2, Hong Kong, 2003.
- (2003) Proc. ICASSP , vol.2
- Misra, H.¹ Bourlard, H.² Tyagi, V.³

16
- 85032751546
- Pushing the envelope - Aside
- DOI 10.1109/MSP.2005.1511826
- Morgan, N., Zhu, Q., Stolcke, A., Sonmez, K., Sivadas, S., Shinozaki, T., Ostendorf, M., Jain, P., Hermansky, H., Ellis, D., Doddington, G., Chen, B., Cetin, O., Bourlard, H., and Athineos, M., "Pushing the envelope - aside", IEEE Signal Processing Magazine, 22(5):81-88, 2005. (Pubitemid 41508702)
- (2005) IEEE Signal Processing Magazine , vol.22 , Issue.5 , pp. 81-88
- Morgan, N.¹ Zhu, Q.² Stolcke, A.³ Sonmez, K.⁴ Sivadas, S.⁵ Shinozaki, T.⁶ Ostendorf, M.⁷ Jain, P.⁸ Hermansky, H.⁹ Ellis, D.¹⁰ Doddington, G.¹¹ Chen, B.¹² Cetin, O.¹³ Bourlard, H.¹⁴ Athineos, M.¹⁵

17
- 84867222011
- On the combination of auditory and modulation frequency channels for ASR applications
- Brisbane, Australia
- Valente, H. and Hermansky, H., "On the combination of auditory and modulation frequency channels for ASR applications", In Proceedings of Interspeech, Brisbane, Australia, pp. 2242-2245, 2008.
- (2008) Proceedings of Interspeech , pp. 2242-2245
- Valente, H.¹ Hermansky, H.²

18
- 84867220821
- Multi-stream spectro-temporal features for robust speech recognition
- Brisbane, Australia
- Zhao, S.Y., Morgan, N. "Multi-stream spectro-temporal features for robust speech recognition", In Proceedings of Interspeech, Brisbane, Australia, pp. 898-901, 2008.
- (2008) Proceedings of Interspeech , pp. 898-901
- Zhao, S.Y.¹ Morgan, N.²

19
- 70450216114
- Multi-stream to many-stream: Using spectro-temporal features for ASR
- Brighton, UK
- Zhao, S., Ravuri, S., and Morgan, N. "Multi-Stream to Many-Stream: Using Spectro-temporal Features for ASR", In Proceedings of Interspeech, Brighton, UK, pp. 2951-2954, 2009.
- (2009) Proceedings of Interspeech , pp. 2951-2954
- Zhao, S.¹ Ravuri, S.² Morgan, N.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.