SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2012, Pages 4277-4280

Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition

(4) Abdel Hamid, Ossama a Mohamed, Abdel Rahman b Jiang, Hui a Penn, Gerald b

a YORK UNIVERSITY (Canada)

b UNIVERSITY OF TORONTO (Canada)

Author keywords

acoustic modeling; local filtering; max pooling; neural networks; speech recognition

Indexed keywords

ACOUSTIC MODELING; CNN MODELS; CONVOLUTIONAL NEURAL NETWORK; DATA SETS; FREQUENCY DOMAINS; HIDDEN LAYERS; MAX-POOLING; RELATIVE ERRORS; SPEAKER-INDEPENDENT SPEECH RECOGNITION; SPECTRAL VARIATION; SPEECH RECOGNITION PERFORMANCE; SPEECH SIGNALS; TEST SETS; TRANSLATION INVARIANCE;

IMAGE PROCESSING; NEURAL NETWORKS;

SPEECH RECOGNITION;

EID: 84867605836 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2012.6288864 Document Type: Conference Paper

Times cited : (853)

References (11)

1
- 80051624332
- Acoustic modeling using deep belief networks
- A. Mohamed, G. Dahl, and G. Hinton, "Acoustic modeling using deep belief networks," Audio, Speech, and Language Processing, IEEE Transactions on, 2011.
- (2011) Audio, Speech, and Language Processing, IEEE Transactions on
- Mohamed, A.¹ Dahl, G.² Hinton, G.³

2
- 84858972572
- Making deep belief networks effective for large vocabulary continuous speech recognition
- T. N. Sainath, B. Kingsbury, B. Ramabhadran, P. Fousek, P. Novak, and A. Mohamed, "Making deep belief networks effective for large vocabulary continuous speech recognition," in ASRU, 2011.
- (2011) ASRU
- Sainath, T.N.¹ Kingsbury, B.² Ramabhadran, B.³ Fousek, P.⁴ Novak, P.⁵ Mohamed, A.⁶

3
- 84865801985
- Conversational speech transcription using context-dependent deep neural networks
- G. Li F. Seide and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Interspeech 2011.
- (2011) Interspeech
- Li, G.¹ Seide, F.² Yu, D.³

4
- 0032203257
- Gradient-based learning applied to document recognition
- Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," in Proceedings of the IEEE, 1998, pp. 2278-2324.
- Proceedings of the IEEE, 1998 , pp. 2278-2324
- Lecun, Y.¹ Bottou, L.² Bengio, Y.³ Haffner, P.⁴

5
- 5044231640
- Learning methods for generic object recognition with invariance to pose and lighting
- IEEE Press
- Y. LeCun, F. Huang, and L. Bottou, "Learning methods for generic object recognition with invariance to pose and lighting," in Proceedings of CVPR'04. 2004, IEEE Press.
- Proceedings of CVPR'04. 2004
- LeCun, Y.¹ Huang, F.² Bottou, L.³

6
- 0002263996
- Convolutional networks for images, speech, and time-series
- M. A. Arbib, Ed. MIT Press
- Y. LeCun and Y. Bengio, "Convolutional networks for images, speech, and time-series," in The Handbook of Brain Theory and Neural Networks, M. A. Arbib, Ed. 1995, MIT Press.
- (1995) The Handbook of Brain Theory and Neural Networks
- LeCun, Y.¹ Bengio, Y.²

7
- 84863380535
- Unsupervised feature learning for audio classification using convolutional deep belief networks
- H. Lee, P. Pham, Y. Largman, and A. Ng, "Unsupervised feature learning for audio classification using convolutional deep belief networks," in Advances in Neural Information Processing Systems 22, pp. 1096-1104. 2009.
- (2009) Advances in Neural Information Processing Systems , vol.22 , pp. 1096-1104
- Lee, H.¹ Pham, P.² Largman, Y.³ Ng, A.⁴

8
- 0141629128
- Experiments in vocal tract normalization
- A. Andreou, T. Kamm, and J. Cohen, "Experiments in vocal tract normalization," in Proc. the CAIP Workshop: Frontiers in Speech Recognition II, 1994.
- Proc. the CAIP Workshop: Frontiers in Speech Recognition II, 1994
- Andreou, A.¹ Kamm, T.² Cohen, J.³

9
- 0032050110
- Maximum likelihood linear transformations for hmm-based speech recognition
- M.J.F. Gales, "Maximum likelihood linear transformations for hmm-based speech recognition," Computer Speech and Language, vol. 12, pp. 75-98, 1998.
- (1998) Computer Speech and Language , vol.12 , pp. 75-98
- Gales, M.J.F.¹

10
- 0024768209
- Speaker-independent phone recognition using hidden markov models
- November
- K. F. Lee and H. W. Hon, "Speaker-independent phone recognition using hidden markov models," IEEE Transactions on Audio, Speech and Language Processing, vol. 37, no. 11, pp. 1641-1648, November 1989.
- (1989) IEEE Transactions on Audio, Speech and Language Processing , vol.37 , Issue.11 , pp. 1641-1648
- Lee, K.F.¹ Hon, H.W.²

11
- 85162069624
- Phone recognition with the mean-covariance restricted boltzmann machine
- G. E. Dahl, M. Ranzato, A. Mohamed, and G. E. Hinton, "Phone recognition with the mean-covariance restricted boltzmann machine," in Advances in Neural Information Processing Systems, 2010, number 23.
- (2010) Advances in Neural Information Processing Systems , Issue.23
- Dahl, G.E.¹ Ranzato, M.² Mohamed, A.³ Hinton, G.E.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.