메뉴 건너뛰기




Volumn , Issue , 2014, Pages 2346-2350

Phone classification by a hierarchy of invariant representation layers

Author keywords

Auditory cortex; Convolutional network; Invariance; Phonetic classification

Indexed keywords

CLASSIFICATION (OF INFORMATION); COMPLEX NETWORKS; FEATURE EXTRACTION; INVARIANCE; SPEECH; SPEECH COMMUNICATION; TELEPHONE SETS;

EID: 84910037127     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (5)

References (38)
  • 1
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Aug
    • S. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences, " IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 28, no. 4, pp. 357-366, Aug. 1980.
    • (1980) IEEE Transactions on Acoustics, Speech and Signal Processing , vol.28 , Issue.4 , pp. 357-366
    • Davis, S.1    Mermelstein, P.2
  • 2
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech, " The Journal of the Acoustical Society of America, vol. 87, no. 4, pp. 1738-1752, 1990.
    • (1990) The Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 3
    • 0031187171 scopus 로고    scopus 로고
    • Speech recognition by machines and humans
    • Jul
    • R. P. Lippmann, "Speech recognition by machines and humans, " Speech Communication, vol. 22, no. 1, pp. 1-15, Jul. 1997.
    • (1997) Speech Communication , vol.22 , Issue.1 , pp. 1-15
    • Lippmann, R.P.1
  • 5
    • 84897584256 scopus 로고    scopus 로고
    • Phonetic feature encoding in human superior temporal gyrus
    • Jan
    • N. Mesgarani, C. Cheung, K. Johnson, and E. F. Chang, "Phonetic feature encoding in human superior temporal gyrus, " Science, vol. 343, no. 6174, pp. 1006-1010, Jan. 2014.
    • (2014) Science , vol.343 , Issue.6174 , pp. 1006-1010
    • Mesgarani, N.1    Cheung, C.2    Johnson, K.3    Chang, E.F.4
  • 6
    • 82855178812 scopus 로고    scopus 로고
    • Hierarchical representations in the auditory cortex
    • Jun
    • T. O. Sharpee, C. A. Atencio, and C. E. Schreiner, "Hierarchical representations in the auditory cortex, " Curr. Opin. Neurobiol., vol. 21, no. 5, pp. 761-767, Jun. 2011.
    • (2011) Curr. Opin. Neurobiol. , vol.21 , Issue.5 , pp. 761-767
    • Sharpee, T.O.1    Atencio, C.A.2    Schreiner, C.E.3
  • 7
    • 85032751341 scopus 로고    scopus 로고
    • Hearing is believing: Biologically inspired methods for robust automatic speech recognition
    • Nov
    • R. Stern and N. Morgan, "Hearing is believing: Biologically inspired methods for robust automatic speech recognition, " IEEE Signal Process. Mag., vol. 29, no. 6, pp. 34-43, Nov. 2012.
    • (2012) IEEE Signal Process. Mag , vol.29 , Issue.6 , pp. 34-43
    • Stern, R.1    Morgan, N.2
  • 11
    • 77952744810 scopus 로고    scopus 로고
    • Sparse representations in audio and music: From coding to source separation
    • June
    • M. Plumbley, T. Blumensath, L. Daudet, R. Gribonval, and M. Davies, "Sparse representations in audio and music: From coding to source separation, " Proceedings of the IEEE, vol. 98, no. 6, pp. 995-1005, June 2010.
    • (2010) Proceedings of the IEEE , vol.98 , Issue.6 , pp. 995-1005
    • Plumbley, M.1    Blumensath, T.2    Daudet, L.3    Gribonval, R.4    Davies, M.5
  • 17
    • 33645410496 scopus 로고
    • Receptive fields, binocular interaction and functional architecture in the cat's visual cortex
    • Jan
    • D. H. Hubel and T. N. Wiesel, "Receptive fields, binocular interaction and functional architecture in the cat's visual cortex, " Journal of Physiology, vol. 160, no. 1, pp. 106-154, Jan. 1962.
    • (1962) Journal of Physiology , vol.160 , Issue.1 , pp. 106-154
    • Hubel, D.H.1    Wiesel, T.N.2
  • 21
    • 84863380535 scopus 로고    scopus 로고
    • Unsupervised feature learning for audio classification using convolutional deep belief networks
    • H. Lee, P. T. Pham, Y. Largman, and A. Y. Ng, "Unsupervised feature learning for audio classification using convolutional deep belief networks, " in Advances in Neural Information Processing Systems (NIPS) 22, 2009, pp. 1096-1104.
    • (2009) Advances in Neural Information Processing Systems (NIPS) , vol.22 , pp. 1096-1104
    • Lee, H.1    Pham, P.T.2    Largman, Y.3    Ng, A.Y.4
  • 26
    • 84905267489 scopus 로고    scopus 로고
    • Deep scattering spectrum
    • J. Anden and S. Mallat, "Deep scattering spectrum, " 2013, IEEE Trans. Signal Processing (submitted). [Online]. Available: Http://arxiv.org/abs/1304.6763.
    • (2013) IEEE Trans. Signal Processing
    • Anden, J.1    Mallat, S.2
  • 30
    • 84976447702 scopus 로고    scopus 로고
    • A comparison of the data requirements of automatic speech recognition systems and human listeners
    • Geneva, Switzerland
    • R. K. Moore, "A comparison of the data requirements of automatic speech recognition systems and human listeners, " in Proc. EUROSPEECH, 8th European Conf. on Speech Communication and Technology, Geneva, Switzerland, 2003, pp. 2582-2584.
    • (2003) Proc. EUROSPEECH, 8th European Conf. on Speech Communication and Technology , pp. 2582-2584
    • Moore, R.K.1
  • 31
    • 80053971654 scopus 로고    scopus 로고
    • Video-based descriptors for object recognition
    • Sep
    • T. Lee and S. Soatto, "Video-based descriptors for object recognition, " Image and Vision Computing, vol. 29, no. 10, pp. 639-652, Sep. 2011.
    • (2011) Image and Vision Computing , vol.29 , Issue.10 , pp. 639-652
    • Lee, T.1    Soatto, S.2
  • 32
    • 80055110996 scopus 로고    scopus 로고
    • From Finite Groups to Lie Groups, ser. Universitext. Springer
    • Y. Kosmann-Schwarzbach, Groups and Symmetries, From Finite Groups to Lie Groups, ser. Universitext. Springer, 2010.
    • (2010) Groups and Symmetries
    • Kosmann-Schwarzbach, Y.1
  • 33
    • 84963012029 scopus 로고
    • Some theorems on distribution functions
    • Oct
    • H. Cramer and H. Wold, "Some theorems on distribution functions, " Journal of the London Mathematical Society, vol. 1-11, no. 4, pp. 290-294, Oct. 1936.
    • (1936) Journal of the London Mathematical Society , vol.1-11 , Issue.4 , pp. 290-294
    • Cramer, H.1    Wold, H.2
  • 34
    • 0022014331 scopus 로고
    • Spatiotemporal energy models for the perception of motion
    • Feb
    • E. Adelson and J. Bergen, "Spatiotemporal energy models for the perception of motion, " Journal of the Optical Society of America A, vol. 2, no. 2, pp. 284-299, Feb. 1985.
    • (1985) Journal of the Optical Society of America A , vol.2 , Issue.2 , pp. 284-299
    • Adelson, E.1    Bergen, J.2
  • 35
    • 0033316361 scopus 로고    scopus 로고
    • Hierarchical models of object recognition
    • Nov
    • M. Riesenhuber and T. Poggio, "Hierarchical models of object recognition, " Nature Neurosience, vol. 2, no. 11, pp. 1019-1025, Nov. 2000.
    • (2000) Nature Neurosience , vol.2 , Issue.11 , pp. 1019-1025
    • Riesenhuber, M.1    Poggio, T.2
  • 36
    • 59849113779 scopus 로고    scopus 로고
    • A statistical, formant-pattern model for segregating vowel type and vocal-tract length in developmental formant data
    • Apr
    • R. E. Turner, T. C. Walters, J. J. M. Monaghan, and R. D. Patterson, "A statistical, formant-pattern model for segregating vowel type and vocal-tract length in developmental formant data, " J. Acoust. Soc. Am., vol. 125, no. 4, pp. 2374-2386, Apr. 2009.
    • (2009) J. Acoust. Soc. Am. , vol.125 , Issue.4 , pp. 2374-2386
    • Turner, R.E.1    Walters, T.C.2    Monaghan, J.J.M.3    Patterson, R.D.4
  • 38


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.