메뉴 건너뛰기




Volumn , Issue , 2013, Pages 872-875

Spectro-temporal directional derivative features for automatic speech recognition

Author keywords

Automatic speech recognition; Directional wavelet transforms; Spectro temporal features

Indexed keywords

IMAGE PROCESSING; WAVELET TRANSFORMS;

EID: 84906282217     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (5)

References (25)
  • 1
    • 85009233038 scopus 로고    scopus 로고
    • Improving word accuracy with gabor feature extraction
    • M. Kleinschmidt and D. Gelbart, "Improving word accuracy with gabor feature extraction, " in Proc. ICSLP, vol. 5, 2002, pp. 16-38.
    • (2002) Proc. ICSLP , vol.5 , pp. 16-38
    • Kleinschmidt, M.1    Gelbart, D.2
  • 2
    • 85009227802 scopus 로고    scopus 로고
    • Localized spectro-temporal features for automatic speech recognition
    • Citeseer
    • M. Kleinschmidt, "Localized spectro-temporal features for automatic speech recognition, " in Proc. Eurospeech, vol. 87. Citeseer, 2003.
    • (2003) Proc. Eurospeech , vol.87
    • Kleinschmidt, M.1
  • 3
    • 34547509128 scopus 로고    scopus 로고
    • Representation of phonemes in primary auditory cortex: How the brain analyzes speech
    • IV-765
    • N. Mesgarani, S. David, and S. Shamma, "Representation of phonemes in primary auditory cortex: how the brain analyzes speech, " in Proc. ICASSP, vol. 4, 2007, pp. IV-765.
    • (2007) Proc. ICASSP , vol.4
    • Mesgarani, N.1    David, S.2    Shamma, S.3
  • 4
    • 0038711696 scopus 로고    scopus 로고
    • A spectro-temporal modulation index (stmi) for assessment of speech intelligibility
    • M. Elhilali, T. Chi, and S. A. Shamma, "A spectro-temporal modulation index (stmi) for assessment of speech intelligibility, " Speech communication, vol. 41, no. 2, pp. 331-348, 2003.
    • (2003) Speech Communication , vol.41 , Issue.2 , pp. 331-348
    • Elhilali, M.1    Chi, T.2    Shamma, S.A.3
  • 5
  • 6
    • 84865769808 scopus 로고    scopus 로고
    • Comparing different flavors of spectro-temporal features for asr
    • B. Meyer, S. Ravuri, M. Schädler, and N. Morgan, "Comparing different flavors of spectro-temporal features for asr, " in Proc. of Inter Speech, 2011, pp. 1269-1272.
    • (2011) Proc. of Inter Speech , pp. 1269-1272
    • Meyer, B.1    Ravuri, S.2    Schädler, M.3    Morgan, N.4
  • 7
    • 84890497049 scopus 로고    scopus 로고
    • Hooking up spectro-temporal filters with auditory-inspired representations for robust automatic speech recognition
    • B. Meyer, C. Spille, B. Kollmeier, and N. Morgan, "Hooking up spectro-temporal filters with auditory-inspired representations for robust automatic speech recognition, " in Proc. Inter Speech, vol. 15, 2012, p. 20.
    • (2012) Proc. Inter Speech , vol.15 , pp. 20
    • Meyer, B.1    Spille, C.2    Kollmeier, B.3    Morgan, N.4
  • 8
    • 84878611488 scopus 로고    scopus 로고
    • Normalization of spectrotemporal gabor filter bank features for improved robust automatic speech recognition systems
    • M. R. Schädler and B. Kollmeier, "Normalization of spectrotemporal gabor filter bank features for improved robust automatic speech recognition systems, " in Proc. Inter Speech, 2012.
    • (2012) Proc. Inter Speech
    • Schädler, M.R.1    Kollmeier, B.2
  • 9
    • 84878395103 scopus 로고    scopus 로고
    • Longer features: They do a speech detector good
    • T. Tsai and N. Morgan, "Longer features: They do a speech detector good, " in Proc. Inter Speech, 2012.
    • (2012) Proc. Inter Speech
    • Tsai, T.1    Morgan, N.2
  • 10
    • 84863799482 scopus 로고    scopus 로고
    • Spectro-temporal modulation subspace-spanning filter bank features for robust automatic speech recognition
    • M. R. Schädler, B. T. Meyer, and B. Kollmeier, "Spectro- temporal modulation subspace-spanning filter bank features for robust automatic speech recognition, " The Journal of the Acoustical Society of America, vol. 131, p. 4134, 2012.
    • (2012) The Journal of the Acoustical Society of America , vol.131 , pp. 4134
    • Schädler, M.R.1    Meyer, B.T.2    Kollmeier, B.3
  • 11
    • 0141624530 scopus 로고
    • An efficient auditory filterbank based on the gammatone function
    • R. Patterson, I. Nimmo-Smith, J. Holdsworth, and P. Rice, "An efficient auditory filterbank based on the gammatone function, " APU report, vol. 2341, 1988.
    • (1988) APU Report , vol.2341
    • Patterson, R.1    Nimmo-Smith, I.2    Holdsworth, J.3    Rice, P.4
  • 12
    • 0032136330 scopus 로고    scopus 로고
    • Robust speech recognition using the modulation spectrogram
    • B. E. Kingsbury, N. Morgan, and S. Greenberg, "Robust speech recognition using the modulation spectrogram, " Speech Communication, vol. 25, no. 1, pp. 117-132, 1998.
    • (1998) Speech Communication , vol.25 , Issue.1 , pp. 117-132
    • Kingsbury, B.E.1    Morgan, N.2    Greenberg, S.3
  • 15
    • 0029487233 scopus 로고
    • The steerable pyramid: A flexible architecture for multi-scale derivative computation
    • E. Simoncelli and W. Freeman, "The steerable pyramid: A flexible architecture for multi-scale derivative computation, " in Proc. ICIP, vol. 3, 1995, pp. 444-447.
    • (1995) Proc. ICIP , vol.3 , pp. 444-447
    • Simoncelli, E.1    Freeman, W.2
  • 20
    • 84883097102 scopus 로고    scopus 로고
    • On the importance of various modulation frequencies for speech recognition
    • N. Kanedera, T. Arai, H. Hermansky, and M. Pavel, "On the importance of various modulation frequencies for speech recognition, " in Proc. Eurospeech, vol. 97, 1997, pp. 1079-1082.
    • (1997) Proc. Eurospeech , vol.97 , pp. 1079-1082
    • Kanedera, N.1    Arai, T.2    Hermansky, H.3    Pavel, M.4
  • 21
    • 33646064275 scopus 로고    scopus 로고
    • Multi-resolution rasta filtering for tandem-based asr
    • H. Hermansky and P. Fousek, "Multi-resolution rasta filtering for tandem-based asr, " in Proc. Inter Speech, 2005.
    • (2005) Proc. Inter Speech
    • Hermansky, H.1    Fousek, P.2
  • 22
    • 70450182191 scopus 로고    scopus 로고
    • Tandem representations of spectral envelope and modulation frequency features for asr
    • S. Thomas, S. Ganapathy, and H. Hermansky, "Tandem representations of spectral envelope and modulation frequency features for asr, " in Proc. Inter Speech, 2009.
    • (2009) Proc. Inter Speech
    • Thomas, S.1    Ganapathy, S.2    Hermansky, H.3
  • 23
    • 0034427366 scopus 로고    scopus 로고
    • Curvelets, multi resolution representation, and scaling laws
    • E. Candes and D. Donoho, "Curvelets, multiresolution representation, and scaling laws, " in Proc. SPIE, vol. 4119, no. 1, 2000.
    • (2000) Proc. SPIE , vol.4119 , Issue.1
    • Candes, E.1    Donoho, D.2
  • 24
    • 28944432472 scopus 로고    scopus 로고
    • The contourlet transform: An efficient directional multi resolution image representation
    • M. Do and M. Vetterli, "The contourlet transform: An efficient directional multi resolution image representation, " Image Processing, IEEE Transactions on, vol. 14, no. 12, pp. 2091-2106, 2005.
    • (2005) Image Processing, IEEE Transactions on , vol.14 , Issue.12 , pp. 2091-2106
    • Do, M.1    Vetterli, M.2
  • 25
    • 0030369274 scopus 로고    scopus 로고
    • Inclusion of temporal information into features for speech recognition
    • B. Milner, "Inclusion of temporal information into features for speech recognition, " in Proc. ICSLP, vol. 1, 1996, pp. 256-259.
    • (1996) Proc. ICSLP , vol.1 , pp. 256-259
    • Milner, B.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.