SCOPUS Information Retrieval Platform

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volume , Issue , 2013, Pages 1849-1853

Deep segmental neural networks for speech recognition

(4) Abdel-Hamid, Ossama (a); Deng, Li (b); Yu, Dong (b); Jiang, Hui (a)

a YORK UNIVERSITY (Canada)

b MICROSOFT RESEARCH (United States)

Author keywords

Deep segmental neural network; Segmental conditional random field; Segmental model

Indexed keywords

HIDDEN MARKOV MODELS; HYBRID SYSTEMS; RANDOM PROCESSES;

DECODING ALGORITHM; DEEP NEURAL NETWORKS; EVALUATION EXPERIMENTS; LARGE VOCABULARY SPEECH RECOGNITION; SEGMENTAL CONDITIONAL RANDOM FIELDS; VARIABLE LENGTH;

SPEECH RECOGNITION;

EID: 84906282118 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (40)

References (22)

1
- 84055222005
- Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
- G. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 20, no. 1, pp. 30-42, Jan. 2012.
- (2012) Audio, Speech, and Language Processing, IEEE Transactions on , vol.20 , Issue.1 , pp. 30-42
- Dahl, G.¹ Yu, D.² Deng, L.³ Acero, A.⁴

2
- 84865801985
- Conversational speech transcription using context-dependent deep neural networks
- F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Interspeech 2011, 2011.
- (2011) Interspeech 2011
- Seide, F.¹ Li, G.² Yu, D.³

3
- 84858972572
- Making deep belief networks effective for large vocabulary continuous speech recognition
- T. N. Sainath, B. Kingsbury, B. Ramabhadran, P. Fousek, P. Novak, and A. Mohamed, "Making deep belief networks effective for large vocabulary continuous speech recognition," in ASRU, 2011.
- (2011) ASRU
- Sainath, T.N.¹ Kingsbury, B.² Ramabhadran, B.³ Fousek, P.⁴ Novak, P.⁵ Mohamed, A.⁶

4
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
- G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups," Signal Processing Magazine, IEEE, vol. 29, no. 6, pp. 82-97, Nov. 2012.
- (2012) Signal Processing Magazine , vol.29 , Issue.6 , pp. 82-97
- Hinton, G.¹ Deng, L.² Yu, D.³ Dahl, G.⁴ Mohamed, A.⁵ Jaitly, N.⁶ Senior, A.⁷ Vanhoucke, V.⁸ Nguyen, P.⁹ Sainath, T.¹⁰ Kingsbury, B.¹¹

5
- 84871387302
- The deep tensor neural network with applications to large vocabulary speech recognition
- D. Yu, L. Deng, and F. Seide, "The deep tensor neural network with applications to large vocabulary speech recognition," IEEE Trans. Audio, Speech, and Language Proc., vol. 21, no. 2, pp. 388-396, 2013.
- (2013) IEEE Trans. Audio, Speech, and Language Proc. , vol.21 , Issue.2 , pp. 388-396
- Yu, D.¹ Deng, L.² Seide, F.³

6
- 84890545163
- A deep convolutional neural network using heterogeneous pooling for trading acoustic invariance with phonetic confusion
- L. Deng, O. Abdel-Hamid, and D. Yu, "A deep convolutional neural network using heterogeneous pooling for trading acoustic invariance with phonetic confusion," in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, May 2013.
- (2013) Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
- Deng, L.¹ Abdel-Hamid, O.² Yu, D.³

7
- 84890491198
- Recent advances in deep learning for speech research at Microsoft
- L. Deng, J. Li, J.-T. Huang, K. Yao, D. Yu, F. Seide, M. Seltzer, G. Zweig, X. He, J. Williams, Y. Gong, and A. Acero, "Recent advances in deep learning for speech research at Microsoft," in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, May 2013.
- (2013) Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
- Deng, L.¹ Li, J.² Huang, J.-T.³ Yao, K.⁴ Yu, D.⁵ Seide, F.⁶ Seltzer, M.⁷ Zweig, G.⁸ He, X.⁹ Williams, J.¹⁰ Gong, Y.¹¹ Acero, A.¹²

8
- 84873303660
- Speech recognition using long-span temporal patterns in a deep network model
- S. M. Siniscalchi, D. Yu, L. Deng, and C.-H. Lee, "Speech recognition using long-span temporal patterns in a deep network model," IEEE Signal Processing Letters, March 2013.
- (2013) IEEE Signal Processing Letters
- Siniscalchi, S.M.¹ Yu, D.² Deng, L.³ Lee, C.H.⁴

9
- 0030245363
- From HMM's to segment models: A unified view of stochastic modeling for speech recognition
- M. Ostendorf, V. Digalakis, and O. Kimball, "From HMM's to segment models: A unified view of stochastic modeling for speech recognition," Speech and Audio Processing, IEEE Transactions on, vol. 4, no. 5, pp. 360-378, 1996.
- (1996) Speech and Audio Processing, IEEE Transactions on , vol.4 , Issue.5 , pp. 360-378
- Ostendorf, M.¹ Digalakis, V.² Kimball, O.³

10
- 0026854213
- A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal
- L. Deng, "A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal," Signal Processing, vol. 27, no. 1, pp. 65-78, 1992.
- (1992) Signal Processing , vol.27 , Issue.1 , pp. 65-78
- Deng, L.¹

11
- 0028516022
- Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states
- L. Deng, M. Aksmanovic, X. Sun, and C. Wu, "Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states," Speech and Audio Processing, IEEE Transactions on, vol. 2, no. 4, pp. 507-520, Oct. 1994.
- (1994) Speech and Audio Processing, IEEE Transactions on , vol.2 , Issue.4 , pp. 507-520
- Deng, L.¹ Aksmanovic, M.² Sun, X.³ Wu, C.⁴

12
- 77949370075
- A segmental CRF approach to large vocabulary continuous speech recognition
- G. Zweig and P. Nguyen, "A segmental CRF approach to large vocabulary continuous speech recognition," in Automatic Speech Recognition Understanding, 2009. ASRU 2009. IEEE Workshop on, Dec. 13-17, 2009, pp. 152-157.
- (2009) Automatic Speech Recognition Understanding, 2009 , pp. 152-157
- Zweig, G.¹ Nguyen, P.²

13
- 80051659716
- Speech recognition with segmental conditional random fields: A summary of the JHU CLSP 2010 summer workshop
- G. Zweig, P. Nguyen, D. Van-Compernolle, K. Demuynck, L. Atlas, P. Clark, G. Sell, M. Wang, F. Sha, H. Hermansky, D. Karakos, A. Jansen, S. Thomas, G. Sivaram, S. Bowman, and J. Kao, "Speech recognition with segmental conditional random fields: A summary of the JHU CLSP 2010 summer workshop," in Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on, May 2011, pp. 5044-5047.
- (2011) Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on , pp. 5044-5047
- Zweig, G.¹ Nguyen, P.² Van-Compernolle, D.³ Demuynck, K.⁴ Atlas, L.⁵ Clark, P.⁶ Sell, G.⁷ Wang, M.⁸ Sha, F.⁹ Hermansky, H.¹⁰ Karakos, D.¹¹ Jansen, A.¹² Thomas, S.¹³ Sivaram, G.¹⁴ Bowman, S.¹⁵ Kao, J.¹⁶

14
- 84055211743
- Acoustic modeling using deep belief networks
- A. Mohamed, G. Dahl, and G. Hinton, "Acoustic modeling using deep belief networks," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 20, no. 1, pp. 14-22, Jan. 2012.
- (2012) Audio, Speech, and Language Processing, IEEE Transactions on , vol.20 , Issue.1 , pp. 14-22
- Mohamed, A.¹ Dahl, G.² Hinton, G.³

15
- 84867605836
- Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition
- O. Abdel-Hamid, A. Mohamed, H. Jiang, and G. Penn, "Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition," in Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on, March 2012, pp. 4277-4280.
- (2012) Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on , pp. 4277-4280
- Abdel-Hamid, O.¹ Mohamed, A.² Jiang, H.³ Penn, G.⁴

16
- 84867598637
- Classification and recognition with direct segment models
- G. Zweig, "Classification and recognition with direct segment models," in Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on, March 2012, pp. 4161-4164.
- (2012) Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on , pp. 4161-4164
- Zweig, G.¹

17
- 84906230797
- Continuous speech recognition using segmental neural nets
- S. Austin, G. Zavaliagkos, J. Makhoul, and R. Schwartz, "Continuous speech recognition using segmental neural nets," in Neural Networks, 1992. IJCNN., International Joint Conference on, vol. 2, June 1992, pp. 314-319.
- (1992) Neural Networks, 1992 , vol.2 , pp. 314-319
- Austin, S.¹ Zavaliagkos, G.² Makhoul, J.³ Schwartz, R.⁴

18
- 84878565391
- Efficient segmental conditional random fields for one-pass phone recognition
- Y. He and E. Fosler-Lussier, "Efficient segmental conditional random fields for one-pass phone recognition," in Interspeech 2012, 2012.
- (2012) Interspeech 2012
- He, Y.¹ Fosler-Lussier, E.²

19
- 78049406405
- Backpropagation training for multilayer conditional random field based phone recognition
- R. Prabhavalkar and E. Fosler-Lussier, "Backpropagation training for multilayer conditional random field based phone recognition," in Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on, March 2010, pp. 5534-5537.
- (2010) Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on , pp. 5534-5537
- Prabhavalkar, R.¹ Fosler-Lussier, E.²

20
- 79959840616
- Investigation of full-sequence training of deep belief networks for speech recognition
- A.-R. Mohamed, D. Yu, and L. Deng, "Investigation of full-sequence training of deep belief networks for speech recognition," in Interspeech, 2010, pp. 2846-2849.
- (2010) Interspeech , pp. 2846-2849
- Mohamed, A.-R.¹ Yu, D.² Deng, L.³

21
- 79959828814
- Deep-structured hidden conditional random fields for phonetic recognition
- D. Yu and L. Deng, "Deep-structured hidden conditional random fields for phonetic recognition," in Interspeech, 2010.
- (2010) Interspeech
- Yu, D.¹ Deng, L.²

22
- 79951759981
- Neural conditional random fields
- T.-M.-T. Do and T. Artieres, "Neural conditional random fields," in Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, vol. 9, 5 2010.
- (2010) Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics , vol.9 , Issue.5
- Do, T.-M.-T.¹ Artieres, T.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.