SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 20, Issue 4, 2012, Pages 1362-1371

Sparse Auditory Reproducing Kernel (SPARK) features for noise-robust speech recognition

(2) Fazel, Amin a Chakrabartty, Shantanu a

a Michigan State University (United States)

Author keywords

Auditory HMAX; gammatone functions; reproducing kernel Hilbert space (RKHS); robust speech recognition; sparse features

Indexed keywords

AUDITORY HMAX; BASIS FUNCTIONS; COMPUTATIONALLY EFFICIENT; DATA SETS; FEATURE EXTRACTION ALGORITHMS; FEATURE PRUNING; KERNEL FUNCTION; NOISE ROBUST SPEECH RECOGNITION; OVER-COMPLETE; REPRODUCING KERNEL; REPRODUCING KERNEL HILBERT SPACES; ROBUST SPEECH RECOGNITION; SPARSE FEATURES; SPEECH FEATURES; SPEECH RECOGNIZER; SPEECH SIGNALS;

SPEECH RECOGNITION;

ELECTRIC SPARKS;

EID: 84857464869 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2011.2179294 Document Type: Article

Times cited : (16)

References (64)

1
- 0029288202
- Speech recognition in noisy environments: A survey
- Apr.
- Y. Gong, "Speech recognition in noisy environments: A survey," Speech Commun., vol. 16, pp. 261-291, Apr. 1995.
- (1995) Speech Commun. , vol.16 , pp. 261-291
- Gong, Y.¹

2
- 0032075027
- The past, present, and future of speech processing
- May
- B. H. Juang and T. H. Chen, "The past, present, and future of speech processing," IEEE Signal Process. Mag., vol. 15, pp. 24-48, May 1998.
- (1998) IEEE Signal Process. Mag. , vol.15 , pp. 24-48
- Juang, B.H.¹ Chen, T.H.²

3
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- Apr.
- S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 4, pp. 113-120, Apr. 1979.
- (1979) IEEE Trans. Acoust., Speech, Signal Process. , vol.27 ASSP , Issue.4 , pp. 113-120
- Boll, S.F.¹

4
- 0028996860
- Robust speech recognition based on stochastic matching
- A. Sankar and C.-H. Lee, "Robust speech recognition based on stochastic matching," in Proc. ICASSP, 1995, pp. 121-124.
- (1995) Proc. ICASSP , pp. 121-124
- Sankar, A.¹ Lee, C.-H.²

5
- 0029769867
- Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
- PII S1063667696013326
- M. G. Rahim and B.-H. Juang, "Signal bias removal by maximum likelihood estimation for robust telephone speech recognition," IEEE Trans. Speech Audio Process, vol. 4, no. 1, pp. 19-30, Jan. 1996. (Pubitemid 126752986)
- (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , Issue.1 , pp. 19-30
- Rahim, M.G.¹ Juang, B.-H.²

6
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density HMMs
- Apr.
- C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density HMMs," Comput. Speech Lang., vol. 9, pp. 171-185, Apr. 1995.
- (1995) Comput. Speech Lang. , vol.9 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

7
- 85017310148
- An improved approach to the hidden Markov model decomposition of speech and noise
- M. J. F. Gales and S. Young, "An improved approach to the hidden Markov model decomposition of speech and noise," in Proc. ICASSP, 1992, pp. 233-236.
- (1992) Proc. ICASSP , pp. 233-236
- Gales, M.J.F.¹ Young, S.²

8
- 85009113852
- HMM adaptation using vector Taylor series for noisy speech recognition
- A. Acero, L. Deng, T. Kristjansson, and J. Zhang, "HMM adaptation using vector Taylor series for noisy speech recognition," in Proc. ICSLP, 2000, pp. 869-872.
- (2000) Proc. ICSLP , pp. 869-872
- Acero, A.¹ Deng, L.² Kristjansson, T.³ Zhang, J.⁴

9
- 27644486095
- A method of joint compensation of additive and convolutive distortions for speaker-independent speech recognition
- DOI 10.1109/TSA.2005.851963
- Y. Gong, "A method of joint compensation of additive and convolutive distortions for speaker-independent speech recognition," IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 975-983, Sep. 2005. (Pubitemid 41558911)
- (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.5 , pp. 975-983
- Gong, Y.¹

10
- 62249130045
- A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions
- Jul.
- J. Li, L. Deng, D. Yu, Y. Gong, and A. Acero, "A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions," Comput. Speech Lang., vol. 23, pp. 389-405, Jul. 2009.
- (2009) Comput. Speech Lang. , vol.23 , pp. 389-405
- Li, J.¹ Deng, L.² Yu, D.³ Gong, Y.⁴ Acero, A.⁵

11
- 34547528168
- Adaptive training with joint uncertainty decoding for robust recognition of noisy data
- H. Liao and M. J. F. Gales, "Adaptive training with joint uncertainty decoding for robust recognition of noisy data," in Proc. ICASSP, 2007, pp. 389-392.
- (2007) Proc. ICASSP , pp. 389-392
- Liao, H.¹ Gales, M.J.F.²

12
- 44849125798
- High-performance HMM adaptation with joint compensation of additive and convolutive distortions via vector Taylor series
- J. Li,L.Deng,Y.Gong, andA.Acero, "High-performance HMM adaptation with joint compensation of additive and convolutive distortions via vector Taylor series," in Proc. ASRU, 2007, pp. 65-70.
- (2007) Proc. ASRU , pp. 65-70
- Li, L.¹ Deng, Y.² Gong, A.³ Acero, J.⁴

13
- 70349194599
- Noise adaptive training using a vector Talyor series approach for noise robust automatic speech recognition
- O.Kalinli,M. L. Seltzer, andA.Acero, "Noise adaptive training using a vector Talyor series approach for noise robust automatic speech recognition," in Proc. ICASSP, 2009, pp. 3825-3828.
- (2009) Proc. ICASSP , pp. 3825-3828
- Kalinli, O.¹ Seltzer, M.L.² Acero, A.³

14
- 0004244302
- Englewood Cliffs NJ: Prentice-Hall
- L. R. Rabiner and B. H. Juang, Fundamentals of Speech Recognition. Englewood Cliffs, NJ: Prentice-Hall, 1993.
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.R.¹ Juang, B.H.²

15
- 0028517648
- New LP-derived features for speaker identification
- Oct.
- K. T. Assaleh and R. J. Mammone, "New LP-derived features for speaker identification," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 630-638, Oct. 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 630-638
- Assaleh, K.T.¹ Mammone, R.J.²

16
- 0019555090
- Cepstral analysis technique for automatic speaker verification
- S. Furui, "Cepstral analysis techniques for automatic speaker verification," IEEE Trans. Acoust., Speech, Signal Process., vol. 29, no. 2, pp. 254-272, Apr. 1981. (Pubitemid 11495877)
- (1981) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-29 , Issue.2 , pp. 254-272
- Furui, S.¹

17
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- DOI 10.1121/1.399423
- H. Hermansky, "Perceptual linear predictive (PLP) analysis for speech," J. Acoust. Soc. Amer., vol. 87, pp. 1738-1752, Apr. 1990. (Pubitemid 20256470)
- (1990) Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

18
- 0016067897
- Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
- B. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. Amer., vol. 55, pp. 1304-1322, 1974.
- (1974) J. Acoust. Soc. Amer. , vol.55 , pp. 1304-1322
- Atal, B.¹

19
- 0141479107
- Feature space normalization in adverse acoustic conditions
- S. Molau, F. Hilger, and H. Ney, "Feature space normalization in adverse acoustic conditions," in Proc. ICASSP, 2003, pp. 656-659.
- (2003) Proc. ICASSP , pp. 656-659
- Molau, S.¹ Hilger, F.² Ney, H.³

20
- 0028517164
- Rasta processing of speech
- Oct.
- H. Hermansky and N. Morgan, "Rasta processing of speech," IEEE Trans. Speech Audio Process, vol. 2, no. 4, pp. 578-589, Oct. 1994.
- (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

21
- 85009142188
- Maximum likelihood nonlinear transformation for environment adaptation in speech recognition systems
- M. Padmanabhan and S. Dharanipragada, "Maximum likelihood nonlinear transformation for environment adaptation in speech recognition systems," in Proc. Eurospeech, 2001, pp. 2359-2362.
- (2001) Proc. Eurospeech , pp. 2359-2362
- Padmanabhan, M.¹ Dharanipragada, S.²

22
- 85009242725
- Evaluation of a noise robust DSR front-end on Aurora databases
- D. Macho, L. Mauuary, B. Noe, Y. M. Cheng, D. Ealey, D. Jouvet, H. Kelleher, D. Pearce, and F. Saadoun, "Evaluation of a noise robust DSR front-end on Aurora databases," in Proc. ICSLP, 2002, pp. 17-20.
- (2002) Proc. ICSLP , pp. 17-20
- MacHo, D.¹ Mauuary, L.² Noe, B.³ Cheng, Y.M.⁴ Ealey, D.⁵ Jouvet, D.⁶ Kelleher, H.⁷ Pearce, D.⁸ Saadoun, F.⁹

23
- 0442317754
- ETSI ES 202 050 Vers. 1.1.5
- "Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Advanced Front-End Feature Extraction Algorithm; Compression Algorithms," 2007, ETSI ES 202 050 Vers. 1.1.5.
- (2007) Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Advanced Front-End Feature Extraction Algorithm; Compression Algorithms,"

24
- 85009070292
- Large vocabulary speech recognition under adverse acoustic environments
- L. Deng, A. Acero, M. Plumpe, and X. Huang, "Large vocabulary speech recognition under adverse acoustic environments," in Proc. ICSLP, 2000, pp. 806-809.
- (2000) Proc. ICSLP , pp. 806-809
- Deng, L.¹ Acero, A.² Plumpe, M.³ Huang, X.⁴

25
- 70450205161
- Feature extraction for robust speech recognition using a power-law nonlinearity and power-bias subtraction
- C. Kim and R. M. Stern, "Feature extraction for robust speech recognition using a power-law nonlinearity and power-bias subtraction," in Proc. Interspeech, 2009, pp. 28-31.
- (2009) Proc. Interspeech , pp. 28-31
- Kim, C.¹ Stern, R.M.²

26
- 84991416125
- Auditory nerve representation as a front-end for speech recognition in a noisy environment
- O. Ghitza, "Auditory nerve representation as a front-end for speech recognition in a noisy environment," Comput. Speech Lang., vol. 1, pp. 109-131, 1986.
- (1986) Comput. Speech Lang. , vol.1 , pp. 109-131
- Ghitza, O.¹

27
- 69249159165
- A computational auditory scene analysis system for speech segregation and robust speech recognition
- Y. Shao, S. Srinivasan, Z. Jin, and D. L. Wang, "A computational auditory scene analysis system for speech segregation and robust speech recognition," Comput. Speech Lang., vol. 24, 2010.
- (2010) Comput. Speech Lang. , vol.24
- Shao, Y.¹ Srinivasan, S.² Jin, Z.³ Wang, D.L.⁴

28
- 78049408631
- Robust speaker identification using an auditory-based feature
- Q. Li and Y. Huang, "Robust speaker identification using an auditory-based feature," in Proc. ICASSP, 2010, pp. 4514-4517.
- (2010) Proc. ICASSP , pp. 4514-4517
- Li, Q.¹ Huang, Y.²

29
- 85008045118
- Auditory model based design and optimization of feature vectors for automatic speech recognition
- Aug.
- S. Chatterjee and W. B. Kleijn, "Auditory model based design and optimization of feature vectors for automatic speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 6, pp. 1813-1825, Aug. 2011.
- (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.6 , pp. 1813-1825
- Chatterjee, S.¹ Kleijn, W.B.²

30
- 33744994972
- Automatic speech recognition with an adaptation model motivated by auditory processing
- DOI 10.1109/TSA.2005.860349
- M. Holmberg, D. Gelbart, and W. Hemmert, "Automatic speech recognition with an adaptation model motivated by auditory processing," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 1, pp. 43-49, Jan. 2006. (Pubitemid 43863451)
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.1 , pp. 43-49
- Holmberg, M.¹ Gelbart, D.² Hemmert, W.³

31
- 0032828464
- A model of auditory perception as front end for automatic speech recognition
- Oct.
- J. Tchorz and B. Kollmeier, "A model of auditory perception as front end for automatic speech recognition," J. Acoust. Soc. Amer., vol. 106, pp. 2040-2050, Oct. 1999.
- (1999) J. Acoust. Soc. Amer. , vol.106 , pp. 2040-2050
- Tchorz, J.¹ Kollmeier, B.²

32
- 0031238095
- A model of dynamic auditory perception and its application to Robust Word recognition
- PII S1063667697063906
- B. Strope and A. Alwan, "A model of dynamic auditory perception and its application to robust word recognition," IEEE Trans. Speech Audio Process., vol. 5, no. 5, pp. 451-464, Sep. 1997. (Pubitemid 127746017)
- (1997) IEEE Transactions on Speech and Audio Processing , vol.5 , Issue.5 , pp. 451-464
- Strope, B.¹ Alwan, A.²

33
- 0032785783
- Auditory processing of speech signals for robust speech recognition in real-world noisy environments
- Jan.
- D. S. Kim, S. Y. Lee, and R. M. Kil, "Auditory processing of speech signals for robust speech recognition in real-world noisy environments," IEEE Trans. Speech Audio Process., vol. 7, no. 1, pp. 55-69, Jan. 1999.
- (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.1 , pp. 55-69
- Kim, D.S.¹ Lee, S.Y.² Kil, R.M.³

34
- 64549133282
- Robust speech feature extraction by growth transformation in reproducing kernel Hilbert space
- Aug.
- S. Chakrabartty, Y. Deng, and G. Cauwenberghs, "Robust speech feature extraction by growth transformation in reproducing kernel Hilbert space," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 6, pp. 1842-1849, Aug. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.6 , pp. 1842-1849
- Chakrabartty, S.¹ Deng, Y.² Cauwenberghs, G.³

35
- 70350155133
- Non-linear filtering in reproducing kernel Hilbert spaces for noise-robust speaker verification
- A. Fazel and S. Chakrabartty, "Non-linear filtering in reproducing kernel Hilbert spaces for noise-robust speaker verification," in Proc. ISCAS, 2009, pp. 113-116.
- (2009) Proc. ISCAS , pp. 113-116
- Fazel, A.¹ Chakrabartty, S.²

36
- 0029938380
- Emergence of simple-cell receptive field properties by learning a sparse code for natural images
- DOI 10.1038/381607a0
- B. A. Olshausen and D. J. Field, "Emergence of simple-cell receptive field properties by learning a sparse code for natural images," Nature, vol. 381, pp. 607-609, Jun. 1996. (Pubitemid 26177476)
- (1996) Nature , vol.381 , Issue.6583 , pp. 607-609
- Olshausen, B.A.¹ Field, D.J.²

37
- 14544277086
- Efficient coding of time-relative structure using spikes
- DOI 10.1162/0899766052530839
- E. C. Smith and M. S. Lewicki, "Efficient coding of time-relative structure using spikes," Neural Comput., vol. 17, pp. 19-45, Jan. 2005. (Pubitemid 40305881)
- (2005) Neural Computation , vol.17 , Issue.1 , pp. 19-45
- Smith, E.¹ Lewicki, M.S.²

38
- 33644513420
- Efficient auditory coding
- DOI 10.1038/nature04485, PII N04485
- E. C. Smith and M. S. Lewicki, "Efficient auditory coding," Nature, vol. 439, pp. 978-982, Feb. 2006. (Pubitemid 43292416)
- (2006) Nature , vol.439 , Issue.7079 , pp. 978-982
- Smith, E.C.¹ Lewicki, M.S.²

39
- 0001050571
- Auditory filters and excitation patterns as representations of frequency resolution
- R. Patterson and B. Moore, "Auditory filters and excitation patterns as representations of frequency resolution," Freq. Select. in Hear., pp. 123-177, 1986.
- (1986) Freq. Select. in Hear. , pp. 123-177
- Patterson, R.¹ Moore, B.²

40
- 23744508888
- Multiresolution spectrotemporal analysis of complex sounds
- DOI 10.1121/1.1945807
- T. Chi, P. Ru, and S. Shamma, "Multiresolution spectrotemporal analysis of complex sounds," J. Acoust. Soc. Amer., vol. 118, pp. 887-906, 2005. (Pubitemid 41129224)
- (2005) Journal of the Acoustical Society of America , vol.118 , Issue.2 , pp. 887-906
- Chi, T.¹ Ru, P.² Shamma, S.A.³

41
- 0035145191
- Hierarchical organization of the human auditory cortex revealed by functional magnetic resonance imaging
- DOI 10.1162/089892901564108
- C. M. Wessinger, J. VanMeter, B. Tian, J. V. Lare, J. Pekar, and J. P. Rauschecker, "Hierarchical organization of the human auditory cortex revealed by functional magnetic resonance imaging," J. Cognitive Neu-rosci., vol. 13, pp. 1-7, 2001. (Pubitemid 32121491)
- (2001) Journal of Cognitive Neuroscience , vol.13 , Issue.1 , pp. 1-7
- Wessinger, C.M.¹ Vanmeter, J.² Tian, B.³ Van Lare, J.⁴ Pekar, J.⁵ Rauschecker, J.P.⁶

42
- 77956601209
- Hierarchical organization of human auditory cortex: Evidence from acoustic invariance in the response to intelligible speech
- K. Okada, F. Rong, J. Venezia, W. Matchin, I.-H. Hsieh, K. Saberi, J. T. Serences, and G. Hickok, "Hierarchical organization of human auditory cortex: Evidence from acoustic invariance in the response to intelligible speech," Cereb. Cortex, vol. 20, pp. 2486-2495, 2010.
- (2010) Cereb. Cortex , vol.20 , pp. 2486-2495
- Okada, K.¹ Rong, F.² Venezia, J.³ Matchin, W.⁴ Hsieh, I.-H.⁵ Saberi, K.⁶ Serences, J.T.⁷ Hickok, G.⁸

43
- 14544293518
- Hierarchical and asymmetric temporal sensitivity in human auditory cortices
- DOI 10.1038/nn1409
- A. Boemio, S. Fromm, A. Braun, and D. Poeppel, "Hierarchical and asymmetric temporal sensitivity in human auditory cortices," Nature Neurosci., vol. 8, pp. 389-395, 2005. (Pubitemid 40300197)
- (2005) Nature Neuroscience , vol.8 , Issue.3 , pp. 389-395
- Boemio, A.¹ Fromm, S.² Braun, A.³ Poeppel, D.⁴

44
- 0034653816
- Spectral-temporal receptive fields of nonlinear auditory neurons obtained using natural sounds
- F. Theunissen, K. Sen, and A. J. Doupe, "Spectral-temporal receptive fields of nonlinear auditory neurons obtained using natural sounds," J. Neurosci., vol. 20, no. 6, pp. 2315-2331, 2000. (Pubitemid 30230085)
- (2000) Journal of Neuroscience , vol.20 , Issue.6 , pp. 2315-2331
- Theunissen, F.E.¹ Sen, K.² Doupe, A.J.³

45
- 0142090233
- Spectrotemporal structure of receptive fields in areas AI and AAF of mouse auditory cortex
- DOI 10.1152/jn.00751.2002
- J. F. Linden, R. C. Liu, M. Sahani, C. E. Schreiner, and M. M. Merzenich, "Spectrotemporal structure of receptive fields in areas ai and aaf of mouse auditory cortex," J. Neurophysiol., vol. 90, no. 4, pp. 2660-2675, 2003. (Pubitemid 37266531)
- (2003) Journal of Neurophysiology , vol.90 , Issue.4 , pp. 2660-2675
- Linden, J.F.¹ Liu, R.C.² Sahani, M.³ Schreiner, C.E.⁴ Merzenich, M.M.⁵

46
- 85009233038
- Improving word accuracy with gabor feature extraction
- M.Kleinschmidt and D. Gelbart, "Improving word accuracy with gabor feature extraction," in Proc. ICSLP, 2002.
- (2002) Proc. ICSLP
- Kleinschmidt, M.¹ Gelbart, D.²

47
- 85009227802
- Localized spectro-temporal features for automatic speech recognition
- M. Kleinschmidt, "Localized spectro-temporal features for automatic speech recognition," in Proc. Eurospeech, 2003.
- (2003) Proc. Eurospeech
- Kleinschmidt, M.¹

48
- 34047272330
- Discrimination of speech from nonspeech based on multiscale spectro-temporal modulations
- DOI 10.1109/TSA.2005.858055
- N. Mesgarani, M. Slaney, and S. A. Shamma, "Discrimination of speech from nonspeech based on multiscale spectro-temporal modulations," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 3, pp. 920-930, May 2006. (Pubitemid 46547653)
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.3 , pp. 920-930
- Mesgarani, N.¹ Slaney, M.² Shamma, S.A.³

49
- 85063071800
- Discriminative word-spotting using ordered spectro-temporal patch features
- Sep.
- T. Ezzat and T. Poggio, "Discriminative word-spotting using ordered spectro-temporal patch features," in Proc. SAPA Workshop, Sep. 2008, pp. 35-40.
- (2008) Proc. SAPA Workshop , pp. 35-40
- Ezzat, T.¹ Poggio, T.²

50
- 51449089975
- Localized spectro-temporal cep-stral analysis of speech
- May
- J. Bouvrie, T. Ezzat, and T. Poggio, "Localized spectro-temporal cep-stral analysis of speech," in Proc. ICASSP, May 2008, pp. 4733-4736.
- (2008) Proc. ICASSP , pp. 4733-4736
- Bouvrie, J.¹ Ezzat, T.² Poggio, T.³

51
- 0033316361
- Hierarchical models of object recognition in cortex
- DOI 10.1038/14819
- M. Riesenhuber and T. Poggio, "Hierarchical models of object recognition in cortex," Nature Neurosci., vol. 2, pp. 1019-1025, 1999. (Pubitemid 30599567)
- (1999) Nature Neuroscience , vol.2 , Issue.11 , pp. 1019-1025
- Riesenhuber, M.¹ Poggio, T.²

52
- 84857467917
- Tech. Rep., MIT-CSAIL-TR-2010-051, CBCL-292
- J. Bouvrie, T. Poggio, L. Rosasco, S. Smale, and A. Wibisono, Generalization and Properties of the Neural Response Mass. Inst. of Technol., 2010, Tech. Rep., MIT-CSAIL-TR-2010-051, CBCL-292.
- (2010) Generalization and Properties of the Neural Response Mass. Inst. of Technol.
- Bouvrie, J.¹ Poggio, T.² Rosasco, L.³ Smale, S.⁴ Wibisono, A.⁵

53
- 0031220487
- Effects of phase on the perception of intervocalic stop consonants
- PII S016763939700054X
- L. Liu, J. He, and G. Palm, "Effects of phase on the perception of intervocalic stop consonants,"Speech Commun., vol. 22, no. 4,pp. 403-417, 1997. (Pubitemid 127433607)
- (1997) Speech Communication , vol.22 , Issue.4 , pp. 403-417
- Liu, L.¹ He, J.² Palm, G.³

54
- 0034843163
- Using phase spectrum information for improved speech recognition performance
- R. Schluter and H. Ney, "Using phase spectrum information for improved speech recognition performance," in Proc. ICASSP, 2001, pp. 133-136. (Pubitemid 32839205)
- (2001) ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings , vol.1 , pp. 133-136
- Schluter, R.¹ Ney, H.²

55
- 0003913694
- An efficient implementation of the Patterson-Holdsworth auditory filter bank
- M. Slaney, An efficient implementation of the Patterson-Holdsworth auditory filter bank, Apple Computer Tech. Rep., 1993, no. 35.
- (1993) Apple Computer Tech. Rep. , Issue.35
- Slaney, M.¹

56
- 0025110885
- Derivation of auditory filter shapes from notched-noise data
- B. R. Glasberg and B. C. J. Moore, "Derivation of auditory filter shapes from notched-noise data," Hear. Res., vol. 47, pp. 103-108, 1990.
- (1990) Hear. Res. , vol.47 , pp. 103-108
- Glasberg, B.R.¹ Moore, B.C.J.²

57
- 0004412846
- SVOS final report: The auditory filterbank
- R. D. Patterson, Holdsworth, I. Nimmo-Smith, and P. Rice, "SVOS final report: The auditory filterbank," APU Rep., 1988, no. 2341.
- (1988) APU Rep. , Issue.2341
- Patterson, R.D.¹ Nimmo-Smith, H.I.² Rice, P.³

58
- 0003241883
- Splines models for observational data
- Philadelphia, PA: SIAM
- G. Wahba, "Splines models for observational data," inSeries in Applied Mathematics. Philadelphia, PA: SIAM, 1990, vol. 59.
- (1990) InSeries in Applied Mathematics , vol.59
- Wahba, G.¹

59
- 0001219859
- Regularization theory and neural networks architectures
- F. Girosi, M. Jones, and T. Poggio, "Regularization theory and neural networks architectures," Neural Comput., vol. 7, pp. 219-269, 1995.
- (1995) Neural Comput. , vol.7 , pp. 219-269
- Girosi, F.¹ Jones, M.² Poggio, T.³

60
- 34547539413
- Gammatone features and feature combination for large vocabulary speech recognition
- R. Schluter, L. Bezrukov, H. Wagner, and H. Ney, "Gammatone features and feature combination for large vocabulary speech recognition," in Proc. ICASSP, 2007, pp. 649-652.
- (2007) Proc. ICASSP , pp. 649-652
- Schluter, R.¹ Bezrukov, L.² Wagner, H.³ Ney, H.⁴

61
- 0038669544
- The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
- H. G Hirsch and D. Pearce, "The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions," in Proc. ASR, 2000, pp. 181-188.
- (2000) Proc. ASR , pp. 181-188
- Hirsch, H.G.¹ Pearce, D.²

62
- 0038404463
- ITU-T Recommendation
- "Transmission Performance Characteristics of Pulse Code Modulation Channels," 1996, ITU-T Recommendation G.712.
- (1996) Transmission Performance Characteristics of Pulse Code Modulation Channels , pp. 712

63
- 84873975005
- [Online] Available
- "HTK Speech Recognition Toolkit," 2011 [Online]. Available: htk.eng.cam.ac.uk/
- (2011) HTK Speech Recognition Toolkit

64
- 0009589650
- ETSI ES Version 1.1.3
- "Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Front-End Feature Extraction Algorithm; Compression Algorithms," 2003, ETSI ES 201 108 Version 1.1.3.
- (2003) Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Front-End Feature Extraction Algorithm; Compression Algorithms , pp. 201108

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.