Volume 20, Issue 4, 2012, Pages 1362-1371

Sparse Auditory Reproducing Kernel (SPARK) features for noise-robust speech recognition

Author keywords

Auditory HMAX; gammatone functions; reproducing kernel Hilbert space (RKHS); robust speech recognition; sparse features

Indexed keywords

AUDITORY HMAX; BASIS FUNCTIONS; COMPUTATIONALLY EFFICIENT; DATA SETS; FEATURE EXTRACTION ALGORITHMS; FEATURE PRUNING; KERNEL FUNCTION; NOISE ROBUST SPEECH RECOGNITION; OVER-COMPLETE; REPRODUCING KERNEL; REPRODUCING KERNEL HILBERT SPACES; ROBUST SPEECH RECOGNITION; SPARSE FEATURES; SPEECH FEATURES; SPEECH RECOGNIZER; SPEECH SIGNALS;

EID: 84857464869     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2011.2179294     Document Type: Article
Times cited: 16

References (64)
  • 1
    • 0029288202
    • Speech recognition in noisy environments: A survey
    • Apr.
    • Y. Gong, "Speech recognition in noisy environments: A survey," Speech Commun., vol. 16, pp. 261-291, Apr. 1995.
    • (1995) Speech Commun. , vol.16 , pp. 261-291
    • Gong, Y.1
  • 2
    • 0032075027
    • The past, present, and future of speech processing
    • May
    • B. H. Juang and T. H. Chen, "The past, present, and future of speech processing," IEEE Signal Process. Mag., vol. 15, pp. 24-48, May 1998.
    • (1998) IEEE Signal Process. Mag. , vol.15 , pp. 24-48
    • Juang, B.H.1    Chen, T.H.2
  • 3
    • 0018455310
    • Suppression of acoustic noise in speech using spectral subtraction
    • Apr.
    • S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 4, pp. 113-120, Apr. 1979.
    • (1979) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-27 , Issue.4 , pp. 113-120
    • Boll, S.F.1
  • 4
    • 0028996860
    • Robust speech recognition based on stochastic matching
    • A. Sankar and C.-H. Lee, "Robust speech recognition based on stochastic matching," in Proc. ICASSP, 1995, pp. 121-124.
    • (1995) Proc. ICASSP , pp. 121-124
    • Sankar, A.1    Lee, C.-H.2
  • 5
    • 0029769867
    • Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
    • PII S1063667696013326
    • M. G. Rahim and B.-H. Juang, "Signal bias removal by maximum likelihood estimation for robust telephone speech recognition," IEEE Trans. Speech Audio Process., vol. 4, no. 1, pp. 19-30, Jan. 1996. (Pubitemid 126752986)
    • (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , Issue.1 , pp. 19-30
    • Rahim, M.G.1    Juang, B.-H.2
  • 6
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density HMMs
    • Apr.
    • C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density HMMs," Comput. Speech Lang., vol. 9, pp. 171-185, Apr. 1995.
    • (1995) Comput. Speech Lang. , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 7
    • 85017310148
    • An improved approach to the hidden Markov model decomposition of speech and noise
    • M. J. F. Gales and S. Young, "An improved approach to the hidden Markov model decomposition of speech and noise," in Proc. ICASSP, 1992, pp. 233-236.
    • (1992) Proc. ICASSP , pp. 233-236
    • Gales, M.J.F.1    Young, S.2
  • 8
    • 85009113852
    • HMM adaptation using vector Taylor series for noisy speech recognition
    • A. Acero, L. Deng, T. Kristjansson, and J. Zhang, "HMM adaptation using vector Taylor series for noisy speech recognition," in Proc. ICSLP, 2000, pp. 869-872.
    • (2000) Proc. ICSLP , pp. 869-872
    • Acero, A.1    Deng, L.2    Kristjansson, T.3    Zhang, J.4
  • 9
    • 27644486095
    • A method of joint compensation of additive and convolutive distortions for speaker-independent speech recognition
    • DOI 10.1109/TSA.2005.851963
    • Y. Gong, "A method of joint compensation of additive and convolutive distortions for speaker-independent speech recognition," IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 975-983, Sep. 2005. (Pubitemid 41558911)
    • (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.5 , pp. 975-983
    • Gong, Y.1
  • 10
    • 62249130045
    • A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions
    • Jul.
    • J. Li, L. Deng, D. Yu, Y. Gong, and A. Acero, "A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions," Comput. Speech Lang., vol. 23, pp. 389-405, Jul. 2009.
    • (2009) Comput. Speech Lang. , vol.23 , pp. 389-405
    • Li, J.1    Deng, L.2    Yu, D.3    Gong, Y.4    Acero, A.5
  • 11
    • 34547528168
    • Adaptive training with joint uncertainty decoding for robust recognition of noisy data
    • H. Liao and M. J. F. Gales, "Adaptive training with joint uncertainty decoding for robust recognition of noisy data," in Proc. ICASSP, 2007, pp. 389-392.
    • (2007) Proc. ICASSP , pp. 389-392
    • Liao, H.1    Gales, M.J.F.2
  • 12
    • 44849125798
    • High-performance HMM adaptation with joint compensation of additive and convolutive distortions via vector Taylor series
    • J. Li, L. Deng, Y. Gong, and A. Acero, "High-performance HMM adaptation with joint compensation of additive and convolutive distortions via vector Taylor series," in Proc. ASRU, 2007, pp. 65-70.
    • (2007) Proc. ASRU , pp. 65-70
    • Li, J.1    Deng, L.2    Gong, Y.3    Acero, A.4
  • 13
    • 70349194599
    • Noise adaptive training using a vector Taylor series approach for noise robust automatic speech recognition
    • O. Kalinli, M. L. Seltzer, and A. Acero, "Noise adaptive training using a vector Taylor series approach for noise robust automatic speech recognition," in Proc. ICASSP, 2009, pp. 3825-3828.
    • (2009) Proc. ICASSP , pp. 3825-3828
    • Kalinli, O.1    Seltzer, M.L.2    Acero, A.3
  • 15
    • 0028517648
    • New LP-derived features for speaker identification
    • Oct.
    • K. T. Assaleh and R. J. Mammone, "New LP-derived features for speaker identification," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 630-638, Oct. 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 630-638
    • Assaleh, K.T.1    Mammone, R.J.2
  • 16
    • 0019555090
    • Cepstral analysis technique for automatic speaker verification
    • S. Furui, "Cepstral analysis technique for automatic speaker verification," IEEE Trans. Acoust., Speech, Signal Process., vol. 29, no. 2, pp. 254-272, Apr. 1981. (Pubitemid 11495877)
    • (1981) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-29 , Issue.2 , pp. 254-272
    • Furui, S.1
  • 17
    • 0025041264
    • Perceptual linear predictive (PLP) analysis of speech
    • DOI 10.1121/1.399423
    • H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," J. Acoust. Soc. Amer., vol. 87, pp. 1738-1752, Apr. 1990. (Pubitemid 20256470)
    • (1990) Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 18
    • 0016067897
    • Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
    • B. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. Amer., vol. 55, pp. 1304-1322, 1974.
    • (1974) J. Acoust. Soc. Amer. , vol.55 , pp. 1304-1322
    • Atal, B.1
  • 19
    • 0141479107
    • Feature space normalization in adverse acoustic conditions
    • S. Molau, F. Hilger, and H. Ney, "Feature space normalization in adverse acoustic conditions," in Proc. ICASSP, 2003, pp. 656-659.
    • (2003) Proc. ICASSP , pp. 656-659
    • Molau, S.1    Hilger, F.2    Ney, H.3
  • 21
    • 85009142188
    • Maximum likelihood nonlinear transformation for environment adaptation in speech recognition systems
    • M. Padmanabhan and S. Dharanipragada, "Maximum likelihood nonlinear transformation for environment adaptation in speech recognition systems," in Proc. Eurospeech, 2001, pp. 2359-2362.
    • (2001) Proc. Eurospeech , pp. 2359-2362
    • Padmanabhan, M.1    Dharanipragada, S.2
  • 24
    • 85009070292
    • Large vocabulary speech recognition under adverse acoustic environments
    • L. Deng, A. Acero, M. Plumpe, and X. Huang, "Large vocabulary speech recognition under adverse acoustic environments," in Proc. ICSLP, 2000, pp. 806-809.
    • (2000) Proc. ICSLP , pp. 806-809
    • Deng, L.1    Acero, A.2    Plumpe, M.3    Huang, X.4
  • 25
    • 70450205161
    • Feature extraction for robust speech recognition using a power-law nonlinearity and power-bias subtraction
    • C. Kim and R. M. Stern, "Feature extraction for robust speech recognition using a power-law nonlinearity and power-bias subtraction," in Proc. Interspeech, 2009, pp. 28-31.
    • (2009) Proc. Interspeech , pp. 28-31
    • Kim, C.1    Stern, R.M.2
  • 26
    • 84991416125
    • Auditory nerve representation as a front-end for speech recognition in a noisy environment
    • O. Ghitza, "Auditory nerve representation as a front-end for speech recognition in a noisy environment," Comput. Speech Lang., vol. 1, pp. 109-131, 1986.
    • (1986) Comput. Speech Lang. , vol.1 , pp. 109-131
    • Ghitza, O.1
  • 27
    • 69249159165
    • A computational auditory scene analysis system for speech segregation and robust speech recognition
    • Y. Shao, S. Srinivasan, Z. Jin, and D. L. Wang, "A computational auditory scene analysis system for speech segregation and robust speech recognition," Comput. Speech Lang., vol. 24, 2010.
    • (2010) Comput. Speech Lang. , vol.24
    • Shao, Y.1    Srinivasan, S.2    Jin, Z.3    Wang, D.L.4
  • 28
    • 78049408631
    • Robust speaker identification using an auditory-based feature
    • Q. Li and Y. Huang, "Robust speaker identification using an auditory-based feature," in Proc. ICASSP, 2010, pp. 4514-4517.
    • (2010) Proc. ICASSP , pp. 4514-4517
    • Li, Q.1    Huang, Y.2
  • 29
    • 85008045118
    • Auditory model based design and optimization of feature vectors for automatic speech recognition
    • Aug.
    • S. Chatterjee and W. B. Kleijn, "Auditory model based design and optimization of feature vectors for automatic speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 6, pp. 1813-1825, Aug. 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.6 , pp. 1813-1825
    • Chatterjee, S.1    Kleijn, W.B.2
  • 30
    • 33744994972
    • Automatic speech recognition with an adaptation model motivated by auditory processing
    • DOI 10.1109/TSA.2005.860349
    • M. Holmberg, D. Gelbart, and W. Hemmert, "Automatic speech recognition with an adaptation model motivated by auditory processing," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 1, pp. 43-49, Jan. 2006. (Pubitemid 43863451)
    • (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.1 , pp. 43-49
    • Holmberg, M.1    Gelbart, D.2    Hemmert, W.3
  • 31
    • 0032828464
    • A model of auditory perception as front end for automatic speech recognition
    • Oct.
    • J. Tchorz and B. Kollmeier, "A model of auditory perception as front end for automatic speech recognition," J. Acoust. Soc. Amer., vol. 106, pp. 2040-2050, Oct. 1999.
    • (1999) J. Acoust. Soc. Amer. , vol.106 , pp. 2040-2050
    • Tchorz, J.1    Kollmeier, B.2
  • 32
    • 0031238095
    • A model of dynamic auditory perception and its application to robust word recognition
    • PII S1063667697063906
    • B. Strope and A. Alwan, "A model of dynamic auditory perception and its application to robust word recognition," IEEE Trans. Speech Audio Process., vol. 5, no. 5, pp. 451-464, Sep. 1997. (Pubitemid 127746017)
    • (1997) IEEE Transactions on Speech and Audio Processing , vol.5 , Issue.5 , pp. 451-464
    • Strope, B.1    Alwan, A.2
  • 33
    • 0032785783
    • Auditory processing of speech signals for robust speech recognition in real-world noisy environments
    • Jan.
    • D. S. Kim, S. Y. Lee, and R. M. Kil, "Auditory processing of speech signals for robust speech recognition in real-world noisy environments," IEEE Trans. Speech Audio Process., vol. 7, no. 1, pp. 55-69, Jan. 1999.
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.1 , pp. 55-69
    • Kim, D.S.1    Lee, S.Y.2    Kil, R.M.3
  • 34
    • 64549133282
    • Robust speech feature extraction by growth transformation in reproducing kernel Hilbert space
    • Aug.
    • S. Chakrabartty, Y. Deng, and G. Cauwenberghs, "Robust speech feature extraction by growth transformation in reproducing kernel Hilbert space," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 6, pp. 1842-1849, Aug. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.6 , pp. 1842-1849
    • Chakrabartty, S.1    Deng, Y.2    Cauwenberghs, G.3
  • 35
    • 70350155133
    • Non-linear filtering in reproducing kernel Hilbert spaces for noise-robust speaker verification
    • A. Fazel and S. Chakrabartty, "Non-linear filtering in reproducing kernel Hilbert spaces for noise-robust speaker verification," in Proc. ISCAS, 2009, pp. 113-116.
    • (2009) Proc. ISCAS , pp. 113-116
    • Fazel, A.1    Chakrabartty, S.2
  • 36
    • 0029938380
    • Emergence of simple-cell receptive field properties by learning a sparse code for natural images
    • DOI 10.1038/381607a0
    • B. A. Olshausen and D. J. Field, "Emergence of simple-cell receptive field properties by learning a sparse code for natural images," Nature, vol. 381, pp. 607-609, Jun. 1996. (Pubitemid 26177476)
    • (1996) Nature , vol.381 , Issue.6583 , pp. 607-609
    • Olshausen, B.A.1    Field, D.J.2
  • 37
    • 14544277086
    • Efficient coding of time-relative structure using spikes
    • DOI 10.1162/0899766052530839
    • E. C. Smith and M. S. Lewicki, "Efficient coding of time-relative structure using spikes," Neural Comput., vol. 17, pp. 19-45, Jan. 2005. (Pubitemid 40305881)
    • (2005) Neural Computation , vol.17 , Issue.1 , pp. 19-45
    • Smith, E.1    Lewicki, M.S.2
  • 38
    • 33644513420
    • Efficient auditory coding
    • DOI 10.1038/nature04485, PII N04485
    • E. C. Smith and M. S. Lewicki, "Efficient auditory coding," Nature, vol. 439, pp. 978-982, Feb. 2006. (Pubitemid 43292416)
    • (2006) Nature , vol.439 , Issue.7079 , pp. 978-982
    • Smith, E.C.1    Lewicki, M.S.2
  • 39
    • 0001050571
    • Auditory filters and excitation patterns as representations of frequency resolution
    • R. Patterson and B. Moore, "Auditory filters and excitation patterns as representations of frequency resolution," Freq. Select. in Hear., pp. 123-177, 1986.
    • (1986) Freq. Select. in Hear. , pp. 123-177
    • Patterson, R.1    Moore, B.2
  • 40
    • 23744508888
    • Multiresolution spectrotemporal analysis of complex sounds
    • DOI 10.1121/1.1945807
    • T. Chi, P. Ru, and S. Shamma, "Multiresolution spectrotemporal analysis of complex sounds," J. Acoust. Soc. Amer., vol. 118, pp. 887-906, 2005. (Pubitemid 41129224)
    • (2005) Journal of the Acoustical Society of America , vol.118 , Issue.2 , pp. 887-906
    • Chi, T.1    Ru, P.2    Shamma, S.A.3
  • 41
    • 0035145191
    • Hierarchical organization of the human auditory cortex revealed by functional magnetic resonance imaging
    • DOI 10.1162/089892901564108
    • C. M. Wessinger, J. VanMeter, B. Tian, J. V. Lare, J. Pekar, and J. P. Rauschecker, "Hierarchical organization of the human auditory cortex revealed by functional magnetic resonance imaging," J. Cognitive Neurosci., vol. 13, pp. 1-7, 2001. (Pubitemid 32121491)
    • (2001) Journal of Cognitive Neuroscience , vol.13 , Issue.1 , pp. 1-7
    • Wessinger, C.M.1    Vanmeter, J.2    Tian, B.3    Van Lare, J.4    Pekar, J.5    Rauschecker, J.P.6
  • 42
    • 77956601209
    • Hierarchical organization of human auditory cortex: Evidence from acoustic invariance in the response to intelligible speech
    • K. Okada, F. Rong, J. Venezia, W. Matchin, I.-H. Hsieh, K. Saberi, J. T. Serences, and G. Hickok, "Hierarchical organization of human auditory cortex: Evidence from acoustic invariance in the response to intelligible speech," Cereb. Cortex, vol. 20, pp. 2486-2495, 2010.
    • (2010) Cereb. Cortex , vol.20 , pp. 2486-2495
    • Okada, K.1    Rong, F.2    Venezia, J.3    Matchin, W.4    Hsieh, I.-H.5    Saberi, K.6    Serences, J.T.7    Hickok, G.8
  • 43
    • 14544293518
    • Hierarchical and asymmetric temporal sensitivity in human auditory cortices
    • DOI 10.1038/nn1409
    • A. Boemio, S. Fromm, A. Braun, and D. Poeppel, "Hierarchical and asymmetric temporal sensitivity in human auditory cortices," Nature Neurosci., vol. 8, pp. 389-395, 2005. (Pubitemid 40300197)
    • (2005) Nature Neuroscience , vol.8 , Issue.3 , pp. 389-395
    • Boemio, A.1    Fromm, S.2    Braun, A.3    Poeppel, D.4
  • 44
    • 0034653816
    • Spectral-temporal receptive fields of nonlinear auditory neurons obtained using natural sounds
    • F. Theunissen, K. Sen, and A. J. Doupe, "Spectral-temporal receptive fields of nonlinear auditory neurons obtained using natural sounds," J. Neurosci., vol. 20, no. 6, pp. 2315-2331, 2000. (Pubitemid 30230085)
    • (2000) Journal of Neuroscience , vol.20 , Issue.6 , pp. 2315-2331
    • Theunissen, F.E.1    Sen, K.2    Doupe, A.J.3
  • 45
    • 0142090233
    • Spectrotemporal structure of receptive fields in areas AI and AAF of mouse auditory cortex
    • DOI 10.1152/jn.00751.2002
    • J. F. Linden, R. C. Liu, M. Sahani, C. E. Schreiner, and M. M. Merzenich, "Spectrotemporal structure of receptive fields in areas AI and AAF of mouse auditory cortex," J. Neurophysiol., vol. 90, no. 4, pp. 2660-2675, 2003. (Pubitemid 37266531)
    • (2003) Journal of Neurophysiology , vol.90 , Issue.4 , pp. 2660-2675
    • Linden, J.F.1    Liu, R.C.2    Sahani, M.3    Schreiner, C.E.4    Merzenich, M.M.5
  • 46
    • 85009233038
    • Improving word accuracy with Gabor feature extraction
    • M. Kleinschmidt and D. Gelbart, "Improving word accuracy with Gabor feature extraction," in Proc. ICSLP, 2002.
    • (2002) Proc. ICSLP
    • Kleinschmidt, M.1    Gelbart, D.2
  • 47
    • 85009227802
    • Localized spectro-temporal features for automatic speech recognition
    • M. Kleinschmidt, "Localized spectro-temporal features for automatic speech recognition," in Proc. Eurospeech, 2003.
    • (2003) Proc. Eurospeech
    • Kleinschmidt, M.1
  • 48
    • 34047272330
    • Discrimination of speech from nonspeech based on multiscale spectro-temporal modulations
    • DOI 10.1109/TSA.2005.858055
    • N. Mesgarani, M. Slaney, and S. A. Shamma, "Discrimination of speech from nonspeech based on multiscale spectro-temporal modulations," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 3, pp. 920-930, May 2006. (Pubitemid 46547653)
    • (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.3 , pp. 920-930
    • Mesgarani, N.1    Slaney, M.2    Shamma, S.A.3
  • 49
    • 85063071800
    • Discriminative word-spotting using ordered spectro-temporal patch features
    • Sep.
    • T. Ezzat and T. Poggio, "Discriminative word-spotting using ordered spectro-temporal patch features," in Proc. SAPA Workshop, Sep. 2008, pp. 35-40.
    • (2008) Proc. SAPA Workshop , pp. 35-40
    • Ezzat, T.1    Poggio, T.2
  • 50
    • 51449089975
    • Localized spectro-temporal cepstral analysis of speech
    • May
    • J. Bouvrie, T. Ezzat, and T. Poggio, "Localized spectro-temporal cepstral analysis of speech," in Proc. ICASSP, May 2008, pp. 4733-4736.
    • (2008) Proc. ICASSP , pp. 4733-4736
    • Bouvrie, J.1    Ezzat, T.2    Poggio, T.3
  • 51
    • 0033316361
    • Hierarchical models of object recognition in cortex
    • DOI 10.1038/14819
    • M. Riesenhuber and T. Poggio, "Hierarchical models of object recognition in cortex," Nature Neurosci., vol. 2, pp. 1019-1025, 1999. (Pubitemid 30599567)
    • (1999) Nature Neuroscience , vol.2 , Issue.11 , pp. 1019-1025
    • Riesenhuber, M.1    Poggio, T.2
  • 53
    • 0031220487
    • Effects of phase on the perception of intervocalic stop consonants
    • PII S016763939700054X
    • L. Liu, J. He, and G. Palm, "Effects of phase on the perception of intervocalic stop consonants," Speech Commun., vol. 22, no. 4, pp. 403-417, 1997. (Pubitemid 127433607)
    • (1997) Speech Communication , vol.22 , Issue.4 , pp. 403-417
    • Liu, L.1    He, J.2    Palm, G.3
  • 55
    • 0003913694
    • An efficient implementation of the Patterson-Holdsworth auditory filter bank
    • M. Slaney, An efficient implementation of the Patterson-Holdsworth auditory filter bank, Apple Computer Tech. Rep., 1993, no. 35.
    • (1993) Apple Computer Tech. Rep. , Issue.35
    • Slaney, M.1
  • 56
    • 0025110885
    • Derivation of auditory filter shapes from notched-noise data
    • B. R. Glasberg and B. C. J. Moore, "Derivation of auditory filter shapes from notched-noise data," Hear. Res., vol. 47, pp. 103-108, 1990.
    • (1990) Hear. Res. , vol.47 , pp. 103-108
    • Glasberg, B.R.1    Moore, B.C.J.2
  • 57
    • 0004412846
    • SVOS final report: The auditory filterbank
    • R. D. Patterson, Holdsworth, I. Nimmo-Smith, and P. Rice, "SVOS final report: The auditory filterbank," APU Rep., 1988, no. 2341.
    • (1988) APU Rep. , Issue.2341
    • Patterson, R.D.1    Holdsworth2    Nimmo-Smith, I.3    Rice, P.4
  • 58
    • 0003241883
    • Spline models for observational data
    • Philadelphia, PA: SIAM
    • G. Wahba, "Spline models for observational data," in Series in Applied Mathematics. Philadelphia, PA: SIAM, 1990, vol. 59.
    • (1990) Series in Applied Mathematics , vol.59
    • Wahba, G.1
  • 59
    • 0001219859
    • Regularization theory and neural networks architectures
    • F. Girosi, M. Jones, and T. Poggio, "Regularization theory and neural networks architectures," Neural Comput., vol. 7, pp. 219-269, 1995.
    • (1995) Neural Comput. , vol.7 , pp. 219-269
    • Girosi, F.1    Jones, M.2    Poggio, T.3
  • 60
    • 34547539413
    • Gammatone features and feature combination for large vocabulary speech recognition
    • R. Schluter, L. Bezrukov, H. Wagner, and H. Ney, "Gammatone features and feature combination for large vocabulary speech recognition," in Proc. ICASSP, 2007, pp. 649-652.
    • (2007) Proc. ICASSP , pp. 649-652
    • Schluter, R.1    Bezrukov, L.2    Wagner, H.3    Ney, H.4
  • 61
    • 0038669544
    • The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
    • H. G. Hirsch and D. Pearce, "The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions," in Proc. ASR, 2000, pp. 181-188.
    • (2000) Proc. ASR , pp. 181-188
    • Hirsch, H.G.1    Pearce, D.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.