Volume 2015-January, 2015, Pages 26-30

Convolutional neural networks for acoustic modeling of raw time signal in LVCSR

Author keywords

Acoustic modeling; Convolutional neural networks; Raw time signal

Indexed keywords

CONVOLUTION; EXTRACTION; FEATURE EXTRACTION; FILTER BANKS; NEURAL NETWORKS; SPEECH COMMUNICATION

EID: 84959110637     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 106

References (21)
  • 1
    • 84910065702
    • Acoustic modeling with deep neural networks using raw time signal for LVCSR
    • Singapore, Sep.
    • Z. Tüske, P. Golik, R. Schlüter, and H. Ney, "Acoustic modeling with deep neural networks using raw time signal for LVCSR," in Proc. Interspeech, Singapore, Sep. 2014, pp. 890-894.
    • (2014) Proc. Interspeech , pp. 890-894
    • Tüske, Z.1    Golik, P.2    Schlüter, R.3    Ney, H.4
  • 2
    • 0019053271
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Aug.
    • S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 28, no. 4, pp. 357-366, Aug. 1980.
    • (1980) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 3
    • 0025041264
    • Perceptual linear predictive (PLP) analysis of speech
    • H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," Journal of the Acoustical Society of America, vol. 87, no. 4, pp. 1738-1752, 1990.
    • (1990) Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 7
    • 84906273908
    • Estimating phoneme class conditional probabilities from raw speech signal using convolutional neural networks
    • Lyon, France, Aug.
    • D. Palaz, R. Collobert, and M. Magimai.-Doss, "Estimating phoneme class conditional probabilities from raw speech signal using convolutional neural networks," in Proc. Interspeech, Lyon, France, Aug. 2013, pp. 1766-1770.
    • (2013) Proc. Interspeech , pp. 1766-1770
    • Palaz, D.1    Collobert, R.2    Magimai.-Doss, M.3
  • 11
    • 84876231242
    • ImageNet classification with deep convolutional neural networks
    • F. Pereira, C. Burges, L. Bottou, and K. Weinberger, Eds. Curran Associates, Inc.
    • A. Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet classification with deep convolutional neural networks," in Advances in Neural Information Processing Systems 25, F. Pereira, C. Burges, L. Bottou, and K. Weinberger, Eds. Curran Associates, Inc., 2012, pp. 1097-1105.
    • (2012) Advances in Neural Information Processing Systems , vol.25 , pp. 1097-1105
    • Krizhevsky, A.1    Sutskever, I.2    Hinton, G.E.3
  • 13
    • 84906257050
    • Neural network acoustic models for the DARPA RATS program
    • Lyon, France, Aug.
    • H. Soltau, H. Kuo, L. Mangu, G. Saon, and T. Beran, "Neural network acoustic models for the DARPA RATS program," in Proc. Interspeech, Lyon, France, Aug. 2013, pp. 3092-3096.
    • (2013) Proc. Interspeech , pp. 3092-3096
    • Soltau, H.1    Kuo, H.2    Mangu, L.3    Saon, G.4    Beran, T.5
  • 15
    • 77956509090
    • Rectified linear units improve restricted Boltzmann machines
    • Haifa, Israel, Jun.
    • V. Nair and G. E. Hinton, "Rectified linear units improve restricted Boltzmann machines," in Proc. of the 27th Int. Conf. on Machine Learning, Haifa, Israel, Jun. 2010, pp. 807-814.
    • (2010) Proc. of the 27th Int. Conf. on Machine Learning , pp. 807-814
    • Nair, V.1    Hinton, G.E.2
  • 16
    • 84959129131
    • Accessed: 2015-03-27
    • (2013) Quaero Programme. Accessed: 2015-03-27. [Online]. Available: http://www.quaero.org
    • (2013) Quaero Programme
  • 17
    • 84858976070
    • Feature engineering in context-dependent deep neural networks for conversational speech transcription
    • Honolulu, HI, USA, Dec.
    • F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Honolulu, HI, USA, Dec. 2011, pp. 24-29.
    • (2011) Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) , pp. 24-29
    • Seide, F.1    Li, G.2    Chen, X.3    Yu, D.4
  • 20
    • 33745213373
    • Multi-resolution RASTA filtering for TANDEM-based ASR
    • Lisbon, Portugal, Sep.
    • H. Hermansky and P. Fousek, "Multi-resolution RASTA filtering for TANDEM-based ASR," in Proc. Interspeech, Lisbon, Portugal, Sep. 2005, pp. 361-364.
    • (2005) Proc. Interspeech , pp. 361-364
    • Hermansky, H.1    Fousek, P.2
  • 21
    • 84910036228
    • Robust CNN-based speech recognition with Gabor filter kernels
    • Singapore, Sep.
    • S. Chang and N. Morgan, "Robust CNN-based speech recognition with Gabor filter kernels," in Proc. Interspeech, Singapore, Sep. 2014, pp. 905-909.
    • (2014) Proc. Interspeech , pp. 905-909
    • Chang, S.1    Morgan, N.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.