메뉴 건너뛰기




Volumn , Issue , 2014, Pages 890-894

Acoustic modeling with deep neural networks using raw time signal for LVCSR

Author keywords

Acoustic modeling; Neural networks; Raw signal

Indexed keywords

EXTRACTION; FEATURE EXTRACTION; FILTER BANKS; NEURAL NETWORKS; SIGNAL PROCESSING; SPEECH COMMUNICATION; TIME DOMAIN ANALYSIS;

EID: 84910065702     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (176)

References (20)
  • 1
  • 4
    • 0024861871 scopus 로고
    • Approximation by superpositions of a sigmoidal function
    • G. Cybenko, "Approximation by superpositions of a sigmoidal function, " Mathematics of Control, Signals and Systems, vol. 2, no. 4, pp. 303-314, 1989.
    • (1989) Mathematics of Control, Signals and Systems , vol.2 , Issue.4 , pp. 303-314
    • Cybenko, G.1
  • 5
    • 0024880831 scopus 로고
    • Multilayer feedforward networks are universal approximators
    • Jul
    • K. Hornik, M. B. Stinchcombe, and H. White, "Multilayer feedforward networks are universal approximators, " Neural Networks, vol. 2, no. 5, pp. 359-366, Jul. 1989.
    • (1989) Neural Networks , vol.2 , Issue.5 , pp. 359-366
    • Hornik, K.1    Stinchcombe, M.B.2    White, H.3
  • 8
    • 84858985237 scopus 로고    scopus 로고
    • Improved acoustic feature combination for LVCSR by neural networks
    • Florence, Italy, Aug
    • C. Plahl, R. Schlüter, and H. Ney, "Improved acoustic feature combination for LVCSR by neural networks, " in Proc. Interspeech, Florence, Italy, Aug. 2011, pp. 1237-1240.
    • (2011) Proc. Interspeech , pp. 1237-1240
    • Plahl, C.1    Schlüter, R.2    Ney, H.3
  • 9
    • 84906273908 scopus 로고    scopus 로고
    • Estimating phoneme class conditional probabilities from raw speech signal using convolutional neural networks
    • Lyon, France, Aug
    • D. Palaz, R. Collobert, and M. Magimai.-Doss, "Estimating phoneme class conditional probabilities from raw speech signal using convolutional neural networks, " in Proc. Interspeech, Lyon, France, Aug. 2013, pp. 1766-1770.
    • (2013) Proc. Interspeech , pp. 1766-1770
    • Palaz, D.1    Collobert, R.2    Magimai.-Doss, M.3
  • 11
    • 84985742249 scopus 로고
    • Linear predictive hidden Markov models and the speech signal
    • Paris, France, May
    • A. B. Poritz, "Linear predictive hidden Markov models and the speech signal, " in Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, vol. 7, Paris, France, May 1982, pp. 1291- 1294.
    • (1982) Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing , vol.7 , pp. 1291-1294
    • Poritz, A.B.1
  • 12
    • 13244265597 scopus 로고    scopus 로고
    • Revisiting autoregressive hidden Markov modeling of speech signals
    • Feb
    • Y. Ephraim and W. J. J. Roberts, "Revisiting autoregressive hidden Markov modeling of speech signals, " IEEE Signal Processing Letters, vol. 12, no. 2, pp. 166-169, Feb. 2005.
    • (2005) IEEE Signal Processing Letters , vol.12 , Issue.2 , pp. 166-169
    • Ephraim, Y.1    Roberts, W.J.J.2
  • 13
    • 84910063277 scopus 로고    scopus 로고
    • Subband acoustic waveform front-end for robust speech recognition using support vector machines
    • Brighton, UK, Sep
    • J. Yousafzai, Z. Cvetkovíc, and P. Sollich, "Subband acoustic waveform front-end for robust speech recognition using support vector machines, " in Proc. Interspeech, Brighton, UK, Sep. 2009, pp. 2679-2682.
    • (2009) Proc. Interspeech , pp. 2679-2682
    • Yousafzai, J.1    Cvetkovíc, Z.2    Sollich, P.3
  • 14
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Aug
    • S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences, " IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 28, no. 4, pp. 357-366, Aug. 1980.
    • (1980) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 16
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech, " Journal of the Acoustical Society of America, vol. 87, no. 4, pp. 1738-1752, 1990.
    • (1990) Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 19
    • 77956509090 scopus 로고    scopus 로고
    • Rectified linear units improve restricted Boltzmann machines
    • Haifa, Israel, Jun
    • V. Nair and G. E. Hinton, "Rectified linear units improve restricted Boltzmann machines, " in Proc. of the 27th Int. Conf. on Machine Learning, Haifa, Israel, Jun. 2010, pp. 807-814.
    • (2010) Proc. of the 27th Int. Conf. on Machine Learning , pp. 807-814
    • Nair, V.1    Hinton, G.E.2
  • 20
    • 0025110885 scopus 로고
    • Derivation of auditory filter shapes from notched-noise data
    • Aug
    • B. R. Glasberg and B. C. J. Moore, "Derivation of auditory filter shapes from notched-noise data, " Hearing Research, vol. 47, no. 1-2, pp. 103-138, Aug. 1990.
    • (1990) Hearing Research , vol.47 , Issue.1-2 , pp. 103-138
    • Glasberg, B.R.1    Moore, B.C.J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.