2010, Pages 1181-1184

Using spectro-temporal features to improve AFE feature extraction for ASR

Author keywords

Automatic speech recognition; Spectro-temporal features

Indexed keywords

SPEECH COMMUNICATION;

EID: 79959814963     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 15

References (19)
  • 2
    • 0030355935
    • A new ASR approach based on independent processing and recombination of partial frequency bands
    • Philadelphia, PA
    • Bourlard, H. and Dupont, S., "A new ASR approach based on independent processing and recombination of partial frequency bands", In Proc. of Intl. Conf. on Spoken Language Processing, Philadelphia, PA, pp. 422-425, 1996.
    • (1996) Proc. of Intl. Conf. on Spoken Language Processing , pp. 422-425
    • Bourlard, H.1    Dupont, S.2
  • 3
    • 0040290402
    • Spectro-temporal modulation transfer functions and speech intelligibility
    • Chi, T., Gao, Y., Guyton, M.C., Ru, P., and Shamma, S.A., "Spectro-temporal modulation transfer functions and speech intelligibility", J. Acoust. Soc. Am., 106(5):2719-2732, 1999.
    • (1999) J. Acoust. Soc. Am. , vol.106 , Issue.5 , pp. 2719-2732
    • Chi, T.1    Gao, Y.2    Guyton, M.C.3    Ru, P.4    Shamma, S.A.5
  • 5
    • 51449087857
    • Hierarchical spectro-temporal features for robust speech recognition
    • Las Vegas, USA
    • Domont, X., Heckmann, M., Joublin, F., Goerick, C., "Hierarchical spectro-temporal features for robust speech recognition", In Proc. ICASSP, Las Vegas, USA, pp. 4417-4420, 2008.
    • (2008) Proc. ICASSP , pp. 4417-4420
    • Domont, X.1    Heckmann, M.2    Joublin, F.3    Goerick, C.4
  • 7
    • 84867191742
    • Noisy numbers data and numbers testbeds
    • Berkeley, CA
    • Gelbart, D., "Noisy numbers data and numbers testbeds", International Computer Science Institute, Berkeley, CA. http://www.icsi.berkeley.edu/speech/papers/gelbart-ms/.
    • International Computer Science Institute
    • Gelbart, D.1
  • 8
    • 0033709098
    • Tandem connectionist feature extraction for conventional HMM systems
    • Istanbul, Turkey
    • Hermansky, H., Ellis, D., Sharma, S., "Tandem connectionist feature extraction for conventional HMM systems", in Proc. ICASSP, Istanbul, Turkey, pp. 1635-1638, 2000.
    • (2000) Proc. ICASSP , pp. 1635-1638
    • Hermansky, H.1    Ellis, D.2    Sharma, S.3
  • 9
    • 33745213373
    • Multi-resolution RASTA filtering for TANDEM-based ASR
    • 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
    • Hermansky, H., Fousek, P., "Multi-resolution RASTA filtering for TANDEM-based ASR", In Proceedings of Interspeech, Lisbon, Portugal, pp. 361-364, 2005.
    • (2005) 9th European Conference on Speech Communication and Technology , pp. 361-364
    • Hermansky, H.1    Fousek, P.2
  • 10
    • 0002787767
    • The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
    • Paris, France
    • Hirsch, H.G., and Pearce, D., "The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions", in ISCA ITRW ASR: Challenges for the Next Millennium, Paris, France, pp. 18-20, 2000.
    • (2000) ISCA ITRW ASR: Challenges for the Next Millennium , pp. 18-20
    • Hirsch, H.G.1    Pearce, D.2
  • 11
    • 0032676337
    • On the relative importance of various components of the modulation spectrum for automatic speech recognition
    • Kanedera, N., Arai, T., Hermansky, H., Pavel, M., "On the relative importance of various components of the modulation spectrum for automatic speech recognition", Speech Communication, 28:43-55, 1999.
    • (1999) Speech Communication , vol.28 , pp. 43-55
    • Kanedera, N.1    Arai, T.2    Hermansky, H.3    Pavel, M.4
  • 12
    • 85009227802
    • Localized spectro-temporal features for automatic speech recognition
    • Kleinschmidt, M., "Localized spectro-temporal features for automatic speech recognition", in Proceedings of Eurospeech, pp. 2573-2576, 2003.
    • (2003) Proceedings of Eurospeech , pp. 2573-2576
    • Kleinschmidt, M.1
  • 14
    • 34047272330
    • Discrimination of speech from nonspeech based on multiscale spectro-temporal modulations
    • Mesgarani, N., Slaney, M., and Shamma, S., "Discrimination of speech from nonspeech based on multiscale spectro-temporal modulations", IEEE Trans. Audio, Speech, and Language Proc., 14(3):920-929, 2006.
    • (2006) IEEE Trans. Audio, Speech, and Language Proc. , vol.14 , Issue.3 , pp. 920-929
    • Mesgarani, N.1    Slaney, M.2    Shamma, S.3
  • 15
    • 0141676589
    • New entropy based combination rules in HMM/ANN multi-stream ASR
    • Hong Kong
    • Misra, H., Bourlard, H., Tyagi, V., "New entropy based combination rules in HMM/ANN multi-stream ASR", in Proc. ICASSP, pp. II-741-4 vol.2, Hong Kong, 2003.
    • (2003) Proc. ICASSP , vol.2
    • Misra, H.1    Bourlard, H.2    Tyagi, V.3
  • 17
    • 84867222011
    • On the combination of auditory and modulation frequency channels for ASR applications
    • Brisbane, Australia
    • Valente, H. and Hermansky, H., "On the combination of auditory and modulation frequency channels for ASR applications", In Proceedings of Interspeech, Brisbane, Australia, pp. 2242-2245, 2008.
    • (2008) Proceedings of Interspeech , pp. 2242-2245
    • Valente, H.1    Hermansky, H.2
  • 18
    • 84867220821
    • Multi-stream spectro-temporal features for robust speech recognition
    • Brisbane, Australia
    • Zhao, S.Y., Morgan, N., "Multi-stream spectro-temporal features for robust speech recognition", In Proceedings of Interspeech, Brisbane, Australia, pp. 898-901, 2008.
    • (2008) Proceedings of Interspeech , pp. 898-901
    • Zhao, S.Y.1    Morgan, N.2
  • 19
    • 70450216114
    • Multi-stream to many-stream: Using spectro-temporal features for ASR
    • Brighton, UK
    • Zhao, S., Ravuri, S., and Morgan, N., "Multi-Stream to Many-Stream: Using Spectro-temporal Features for ASR", In Proceedings of Interspeech, Brighton, UK, pp. 2951-2954, 2009.
    • (2009) Proceedings of Interspeech , pp. 2951-2954
    • Zhao, S.1    Ravuri, S.2    Morgan, N.3


* This information was extracted and analyzed by KISTI from Elsevier's SCOPUS DB.