메뉴 건너뛰기




Volumn , Issue , 2013, Pages 1766-1770

Estimating phoneme class conditional probabilities from raw speech signal using convolutional neural networks

Author keywords

Artificial neu ral networks; Automatic speech recognition; Convolutional neural networks; Data driven feature extraction; Phonemes

Indexed keywords

FEATURE EXTRACTION; IMAGE PROCESSING; LEARNING SYSTEMS; SPEECH RECOGNITION;

EID: 84906273908     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (111)

References (23)
  • 1
    • 0032203257 scopus 로고    scopus 로고
    • Gradient-based learning applied to document recognition
    • Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition, " Proceedings of the IEEE, vol. 86, no. 11, pp. 2278-2324, 1998.
    • (1998) Proceedings of the IEEE , vol.86 , Issue.11 , pp. 2278-2324
    • Lecun, Y.1    Bottou, L.2    Bengio, Y.3    Haffner, P.4
  • 4
    • 0002291365 scopus 로고
    • Generalization and network design strategies
    • R. Pfeifer, Z. Schreter, F. Fogelman, and L. Steels, Eds. Zurich, Switzerland: Elsevier
    • Y. LeCun, "Generalization and network design strategies, " in Connectionism in Perspective, R. Pfeifer, Z. Schreter, F. Fogelman, and L. Steels, Eds. Zurich, Switzerland: Elsevier, 1989.
    • (1989) Connectionism in Perspective
    • Lecun, Y.1
  • 6
    • 13244265597 scopus 로고    scopus 로고
    • Revisiting autoregressive hidden Markov modeling of speech signals
    • Feb
    • Y. Ephraim and W. J. J. Roberts, "Revisiting autoregressive hidden markov modeling of speech signals, " IEEE Signal Processing Letters, vol. 12, no. 2, pp. 166-169, Feb. 2005.
    • (2005) IEEE Signal Processing Letters , vol.12 , Issue.2 , pp. 166-169
    • Ephraim, Y.1    Roberts, W.J.J.2
  • 7
    • 54349106040 scopus 로고    scopus 로고
    • Switching linear dynamical systems for noise robust speech recognition
    • Aug
    • B. Mesot and D. Barber, "Switching linear dynamical systems for noise robust speech recognition, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 6, pp. 1850-1858, Aug. 2008.
    • (2008) IEEE Transactions on Audio, Speech, and Language Processing , vol.15 , Issue.6 , pp. 1850-1858
    • Mesot, B.1    Barber, D.2
  • 8
    • 0028195651 scopus 로고
    • Waveform-based speech recognition using hidden filter models: Parameter selection and sensitivity to power normalization
    • IEEE Transactions on
    • H. Sheikhzadeh and L. Deng, "Waveform-based speech recognition using hidden filter models: Parameter selection and sensitivity to power normalization, " Speech and Audio Processing, IEEE Transactions on, vol. 2, no. 1, p. 8089, 1994.
    • (1994) Speech and Audio Processing , vol.2 , Issue.1 , pp. 8089
    • Sheikhzadeh, H.1    Deng, L.2
  • 9
    • 70450190485 scopus 로고    scopus 로고
    • Tuning support vector machines for robust phoneme classification with acoustic waveforms
    • J. Yousafzai, Z. Cvetkovic, and P. Sollich, "Tuning support vector machines for robust phoneme classification with acoustic waveforms, " in INTERSPEECH, 2009, pp. 2391-2394.
    • (2009) Interspeech , pp. 2391-2394
    • Yousafzai, J.1    Cvetkovic, Z.2    Sollich, P.3
  • 10
    • 84055211743 scopus 로고    scopus 로고
    • Acoustic modeling using deep belief networks
    • IEEE Transactions on, jan
    • A. Mohamed, G. Dahl, and G. Hinton, "Acoustic modeling using deep belief networks, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 20, no. 1, pp. 14 -22, jan. 2012.
    • (2012) Audio, Speech, and Language Processing , vol.20 , Issue.1 , pp. 14-22
    • Mohamed, A.1    Dahl, G.2    Hinton, G.3
  • 11
    • 84867605836 scopus 로고    scopus 로고
    • Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition
    • IEEE International Conference on, 2012
    • O. Abdel-Hamid, A.-r. Mohamed, H. Jiang, and G. Penn, "Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition, " in Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on, 2012, pp. 4277-4280.
    • (2012) Acoustics, Speech and Signal Processing (ICASSP) , pp. 4277-4280
    • Abdel-Hamid, O.1    Mohamed, A.-R.2    Jiang, H.3    Penn, G.4
  • 12
  • 16
    • 0000583248 scopus 로고
    • Probabilistic interpretation of feedforward classification network outputs, with relationships to statistical pattern recognition
    • NATO ASI series ed. F. Fogelman Soulie and J. Herault, Eds
    • J. Bridle, "Probabilistic interpretation of feedforward classification network outputs, with relationships to statistical pattern recognition, " in Neuro-computing: Algorithms, Architectures and Applications, NATO ASI series ed., F. Fogelman Soulie and J. Herault, Eds., 1990, pp. 227-236.
    • (1990) Neuro-computing: Algorithms, Architectures and Applications , pp. 227-236
    • Bridle, J.1
  • 17
    • 33847215211 scopus 로고
    • Stochastic gradient learning in neural networks
    • Nimes, France: EC2
    • L. Bottou, "Stochastic gradient learning in neural networks, " in Proceedings of Neuro-Nmes 91. Nimes, France: EC2, 1991.
    • (1991) Proceedings of Neuro-Nmes , vol.91
    • Bottou, L.1
  • 18
  • 20
    • 0029306621 scopus 로고
    • Continuous speech recognition
    • IEEE, May
    • N. Morgan and H. Bourlard, "Continuous speech recognition, " Signal Processing Magazine, IEEE, vol. 12, no. 3, pp. 24 -42, May 1995.
    • (1995) Signal Processing Magazine , vol.12 , Issue.3 , pp. 24-42
    • Morgan, N.1    Bourlard, H.2
  • 22
    • 70349212558 scopus 로고    scopus 로고
    • Phoneme recognition using spectral envelope and modulation frequency features
    • IEEE International Conference on
    • S. Thomas, S. Ganapathy, and H. Hermansky, "Phoneme recognition using spectral envelope and modulation frequency features, " in Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on, 2009, pp. 4453-4456.
    • (2009) Acoustics, Speech and Signal Processing, 2009, ICASSP 2009 , pp. 4453-4456
    • Thomas, S.1    Ganapathy, S.2    Hermansky, H.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.