메뉴 건너뛰기




Volumn 08-12-September-2016, Issue , 2016, Pages 1368-1372

Neural responses to speech-specific modulations derived from a spectro-temporal filter bank

Author keywords

Automatic speech recognition; ECoG; Robust feature extraction; Speech perception

Indexed keywords

BANDPASS FILTERS; ELECTROENCEPHALOGRAPHY; ELECTROPHYSIOLOGY; FEATURE EXTRACTION; FILTER BANKS; MODULATION; NEURONS; SPEECH; SPEECH COMMUNICATION; SPEECH PROCESSING;

EID: 84994365864     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: 10.21437/Interspeech.2016-1327     Document Type: Conference Paper
Times cited : (1)

References (27)
  • 1
    • 0032139768 scopus 로고    scopus 로고
    • Should recognizers have ears?
    • Hermansky, H. (1998). "Should recognizers have ears?," Speech Commun., 25, 3-24.
    • (1998) Speech Commun. , vol.25 , pp. 3-24
    • Hermansky, H.1
  • 2
    • 85032751341 scopus 로고    scopus 로고
    • Hearing is believing: Biologically inspired methods for robust automatic speech recognition
    • Stern, R. M., and Morgan, N. (2012). "Hearing is believing: Biologically inspired methods for robust automatic speech recognition," IEEE Signal Process. Mag., 29, 34-43.
    • (2012) IEEE Signal Process. Mag. , vol.29 , pp. 34-43
    • Stern, R.M.1    Morgan, N.2
  • 3
    • 34247580087 scopus 로고    scopus 로고
    • Reaching over the gap: A review of efforts to link human and automatic speech recognition research
    • Scharenborg, O. (2007). "Reaching over the gap: A review of efforts to link human and automatic speech recognition research," Speech Commun., 49, 336-347.
    • (2007) Speech Commun. , vol.49 , pp. 336-347
    • Scharenborg, O.1
  • 4
    • 33644661135 scopus 로고    scopus 로고
    • A glimpsing model of speech perception in noise
    • Cooke, M. (2006). "A glimpsing model of speech perception in noise," J. Acoust. Soc. Am., 119, 1562-1573.
    • (2006) J. Acoust. Soc. Am. , vol.119 , pp. 1562-1573
    • Cooke, M.1
  • 5
    • 0346217031 scopus 로고    scopus 로고
    • Bridging automatic speech recognition and psycholinguistics: Extending Shortlist to an end-to-end model of human speech recognition
    • Scharenborg, O., ten Bosch, L., Boves, L., and Norris, D. (2003). "Bridging automatic speech recognition and psycholinguistics: Extending Shortlist to an end-to-end model of human speech recognition," J. Acoust. Soc. Am., 114, 3032.
    • (2003) J. Acoust. Soc. Am. , vol.114 , pp. 3032
    • Scharenborg, O.1    Ten Bosch, L.2    Boves, L.3    Norris, D.4
  • 6
    • 0037824480 scopus 로고    scopus 로고
    • Gabor analysis of auditory mid-brain receptive fields: Spectro-temporal and binaural composition
    • Qiu, A., Schreiner, C., and Escabi, M. (2003). "Gabor analysis of auditory mid-brain receptive fields: spectro-temporal and binaural composition," Journal of Neurophysiology, 90, pp. 456-476.
    • (2003) Journal of Neurophysiology , vol.90 , pp. 456-476
    • Qiu, A.1    Schreiner, C.2    Escabi, M.3
  • 7
    • 85009233038 scopus 로고    scopus 로고
    • Improving word accuracy with Gabor feature extraction
    • Kleinschmidt, M. and Gelbart, D. (2002). "Improving word accuracy with Gabor feature extraction," in Proc. Interspeech, pp. 25-28.
    • (2002) Proc. Interspeech , pp. 25-28
    • Kleinschmidt, M.1    Gelbart, D.2
  • 8
    • 84863799482 scopus 로고    scopus 로고
    • Spectrotemporal modulation subspace-spanning filter bank features for robust automatic speech recognition
    • Schädler, M.R., Meyer, B.T., Kollmeier. B. (2012). "Spectrotemporal modulation subspace-spanning filter bank features for robust automatic speech recognition," J. Acoust. Soc. Am. Volume 131, Issue 5, pp. 4134-4151.
    • (2012) J. Acoust. Soc. Am , vol.131 , Issue.5 , pp. 4134-4151
    • Schädler, M.R.1    Meyer, B.T.2    Kollmeier, B.3
  • 9
    • 84865769808 scopus 로고    scopus 로고
    • Comparing different flavors of spectro-temporal features for ASR
    • Meyer, B. T., Ravuri, S. R., Schädler, M. R., Morgan, N. (2011). "Comparing different flavors of spectro-temporal features for ASR," in Proc. Interspeech, pp. 1269-1272.
    • (2011) Proc. Interspeech , pp. 1269-1272
    • Meyer, B.T.1    Ravuri, S.R.2    Schädler, M.R.3    Morgan, N.4
  • 10
    • 84910029373 scopus 로고    scopus 로고
    • Should deep neural nets have ears? the role of auditory features in deep learning approaches
    • Castro Martinez, A.M., Moritz, N., Meyer, B.T. (2014). "Should deep neural nets have ears? The role of auditory features in deep learning approaches," in Proc. Interspeech, pp. 2435-2439.
    • (2014) Proc. Interspeech , pp. 2435-2439
    • Castro Martinez, A.M.1    Moritz, N.2    Meyer, B.T.3
  • 11
    • 84878395103 scopus 로고    scopus 로고
    • Longer Features:They do a speech detector good
    • Portland, OR, USA
    • Tsai, T. J., and Morgan, N. (2012). "Longer Features:They do a speech detector good," Proc. Interspeech 2012, Portland, OR, USA.
    • (2012) Proc. Interspeech 2012
    • Tsai, T.J.1    Morgan, N.2
  • 12
    • 84867619222 scopus 로고    scopus 로고
    • Spectro-temporal Gabor features for speaker recognition
    • Lei, H., Meyer, B., Mirghafori, N. (2012). "Spectro-temporal Gabor features for speaker recognition," in Proc. ICASSP.
    • (2012) Proc. ICASSP
    • Lei, H.1    Meyer, B.2    Mirghafori, N.3
  • 13
  • 14
    • 84887022818 scopus 로고    scopus 로고
    • Representation of speech in human auditory cortex: Is it special?
    • Steinschneider, M., Nourski, K.V., Fischman, Y.I. (2013). "Representation of Speech in Human Auditory Cortex: Is it Special?," Hearing Research, 305, pp. 57-73.
    • (2013) Hearing Research , vol.305 , pp. 57-73
    • Steinschneider, M.1    Nourski, K.V.2    Fischman, Y.I.3
  • 17
    • 0035086278 scopus 로고    scopus 로고
    • Induced electrocorticographic gamma activity during auditory perception
    • Crone, N.E., Boatman, D., Gordon, B., Hao, L. (2001). "Induced electrocorticographic gamma activity during auditory perception," Clinical Neurophysiology, 112, pp.565-582.
    • (2001) Clinical Neurophysiology , vol.112 , pp. 565-582
    • Crone, N.E.1    Boatman, D.2    Gordon, B.3    Hao, L.4
  • 18
    • 42949095207 scopus 로고    scopus 로고
    • Spatiotemporal dynamics of word processing in the human brain
    • Canolty, R.T. (2007). "Spatiotemporal dynamics of word processing in the human brain," Frontiers in Neuroscience, 1, pp.185-196.
    • (2007) Frontiers in Neuroscience , vol.1 , pp. 185-196
    • Canolty, R.T.1
  • 19
    • 84958819082 scopus 로고    scopus 로고
    • Human Superior Temporal Gyrus organization of spectrotemporal modulation tuning derived from speech stimuli
    • Hullett, P.W., Hamilton, L.S., Mesgarani, N., Schreiner, C.E., Chang, E.F. (2016). "Human Superior Temporal Gyrus organization of spectrotemporal modulation tuning derived from speech stimuli," Journal of Neuroscience, 36, pp.2014-2026.
    • (2016) Journal of Neuroscience , vol.36 , pp. 2014-2026
    • Hullett, P.W.1    Hamilton, L.S.2    Mesgarani, N.3    Schreiner, C.E.4    Chang, E.F.5
  • 21
    • 84860761516 scopus 로고    scopus 로고
    • Selective cortical representation of attended speaker in multi-talker speech perception
    • Mesgarani, N., Chang, E.F. (2012). "Selective cortical representation of attended speaker in multi-talker speech perception," Nature, 485, pp.233-236.
    • (2012) Nature , vol.485 , pp. 233-236
    • Mesgarani, N.1    Chang, E.F.2
  • 26
    • 84883097102 scopus 로고    scopus 로고
    • On the importance of various modulation frequencies for speech recognition
    • Kanedera, N., Arai, T., Hermansky, H., and Pavel, M. (1997). "On the importance of various modulation frequencies for speech recognition," Proc. Eurospeech, pp. 1079-1082.
    • (1997) Proc. Eurospeech , pp. 1079-1082
    • Kanedera, N.1    Arai, T.2    Hermansky, H.3    Pavel, M.4
  • 27
    • 34247487053 scopus 로고    scopus 로고
    • The cortical organization of speech processing
    • Hickok, G., Poeppel, D. (2007). "The cortical organization of speech processing," Nature Reviews Neuroscience, 8, pp.393-402
    • (2007) Nature Reviews Neuroscience , vol.8 , pp. 393-402
    • Hickok, G.1    Poeppel, D.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.