메뉴 건너뛰기




Volumn , Issue , 2014, Pages 1915-1919

Ensemble deep learning for speech recognition

Author keywords

Deep learning; Ensemble learning; Log linear system combination; Speech recognition; Stacking

Indexed keywords

CONVEX OPTIMIZATION; LEARNING SYSTEMS; LINEAR SYSTEMS; OPTIMIZATION; RECURRENT NEURAL NETWORKS; SPEECH; SPEECH COMMUNICATION;

EID: 84910048046     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (230)

References (40)
  • 2
    • 84879854889 scopus 로고    scopus 로고
    • Representation learning: A review and new perspectives
    • Bengio, Y., Courville, A., and Vincent, P. "Representation learning: A review and new perspectives, " IEEE Trans. PAMI, vol. 38, pp. 1798-1828, 2013.
    • (2013) IEEE Trans. PAMI , vol.38 , pp. 1798-1828
    • Bengio, Y.1    Courville, A.2    Vincent, P.3
  • 4
    • 0030196364 scopus 로고    scopus 로고
    • Stacked regression
    • Breiman, L. "Stacked regression, " Machine Learning, Vol. 24, pp. 49-64, 1996.
    • (1996) Machine Learning , vol.24 , pp. 49-64
    • Breiman, L.1
  • 6
    • 85083950550 scopus 로고    scopus 로고
    • A primal-dual method for training recurrent neural networks constrained by the echo-state property
    • April
    • Chen, J. and Deng, L. "A primal-dual method for training recurrent neural networks constrained by the echo-state property, " Proc. Int. Conf. Learning Representations, April, 2014.
    • (2014) Proc. Int. Conf. Learning Representations
    • Chen, J.1    Deng, L.2
  • 7
    • 84055222005 scopus 로고    scopus 로고
    • Contextdependent, pre-trained deep neural networks for large vocabulary speech recognition
    • Dahl, G., Yu, D., Deng, L., and Acero, A. "Contextdependent, pre-trained deep neural networks for large vocabulary speech recognition, " IEEE Trans. Audio, Speech, & Language Proc., Vol. 20, pp. 30-42, 2012.
    • (2012) IEEE Trans. Audio, Speech, & Language Proc. , vol.20 , pp. 30-42
    • Dahl, G.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 8
    • 84905280906 scopus 로고    scopus 로고
    • Sequence classification using the high-level features extracted from deep neural networks
    • Deng, L. and Chen, J. "Sequence classification using the high-level features extracted from deep neural networks, " Proc. ICASSP, 2014.
    • (2014) Proc. ICASSP
    • Deng, L.1    Chen, J.2
  • 9
    • 84890545163 scopus 로고    scopus 로고
    • A deep convolutional neural network using heterogeneous pooling for trading acoustic invariance with phonetic confusion
    • Deng, L., Abdel-Hamid, O., and Yu, D. "A deep convolutional neural network using heterogeneous pooling for trading acoustic invariance with phonetic confusion, " Proc. ICASSP, 2013.
    • (2013) Proc. ICASSP
    • Deng, L.1    Abdel-Hamid, O.2    Yu, D.3
  • 11
    • 84890526837 scopus 로고    scopus 로고
    • New types of deep neural network learning for speech recognition and related applications: An overview
    • Deng, L., Hinton, G., and Kingsbury, B. "New types of deep neural network learning for speech recognition and related applications: An overview, " Proc. ICASSP, 2013.
    • (2013) Proc. ICASSP
    • Deng, L.1    Hinton, G.2    Kingsbury, B.3
  • 12
    • 84890468916 scopus 로고    scopus 로고
    • Deep learning for speech recognition and related applications
    • Deng, L., Yu, D., and Hinton, G. "Deep Learning for Speech Recognition and Related Applications" NIPS Workshop, 2009.
    • (2009) NIPS Workshop
    • Deng, L.1    Yu, D.2    Hinton, G.3
  • 13
    • 84867614591 scopus 로고    scopus 로고
    • Scalable stacking and learning for building deep architectures
    • Deng, L., Yu, D., and Platt, J. "Scalable stacking and learning for building deep architectures, " Proc. ICASSP, 2012.
    • (2012) Proc. ICASSP
    • Deng, L.1    Yu, D.2    Platt, J.3
  • 14
    • 84890543083 scopus 로고    scopus 로고
    • Speech recognition with deep recurrent neural networks
    • Graves, A., Mohamed, A., and Hinton, G. "Speech recognition with deep recurrent neural networks, " Proc. ICASSP, 2013.
    • (2013) Proc. ICASSP
    • Graves, A.1    Mohamed, A.2    Hinton, G.3
  • 15
    • 84893701254 scopus 로고    scopus 로고
    • Hybrid speech recognition with deep bidirectional LSTM
    • Graves, A., Jaitly, N., and Mohamed, A. "Hybrid speech recognition with deep bidirectional LSTM, " Proc. ASRU, 2013.
    • (2013) Proc. ASRU
    • Graves, A.1    Jaitly, N.2    Mohamed, A.3
  • 17
    • 33746600649 scopus 로고    scopus 로고
    • Reducing the dimensionality of data with neural networks
    • July
    • Hinton, G. and Salakhutdinov, R. "Reducing the dimensionality of data with neural networks, " Science, vol. 313. no. 5786, pp. 504 - 507, July 2006.
    • (2006) Science , vol.313 , Issue.5786 , pp. 504-507
    • Hinton, G.1    Salakhutdinov, R.2
  • 18
    • 84878539964 scopus 로고    scopus 로고
    • Application of pre-trained deep neural networks to large vocabulary speech recognition
    • Jaitly, N., Nguyen, P., and Vanhoucke, V. "Application of pre-trained deep neural networks to large vocabulary speech recognition, " Proc. Interspeech, 2012.
    • (2012) Proc. Interspeech
    • Jaitly, N.1    Nguyen, P.2    Vanhoucke, V.3
  • 19
    • 84878379108 scopus 로고    scopus 로고
    • Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization
    • Kingsbury, B., Sainath, T., and Soltau, H. "Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization, " Proc. Interspeech, 2012.
    • (2012) Proc. Interspeech
    • Kingsbury, B.1    Sainath, T.2    Soltau, H.3
  • 21
    • 80053451847 scopus 로고    scopus 로고
    • Learning recurrent neural networks with Hessian-free optimization
    • Martens, J. and Sutskever, I. "Learning recurrent neural networks with Hessian-free optimization, " Proc. ICML, 2011.
    • (2011) Proc. ICML
    • Martens, J.1    Sutskever, I.2
  • 25
    • 79959840616 scopus 로고    scopus 로고
    • Investigation of fullsequence training of deep belief networks for speech recognition
    • Mohamed, A., Yu, D., and Deng, L. "Investigation of fullsequence training of deep belief networks for speech recognition, " Proc. Interspeech, 2010.
    • (2010) Proc. Interspeech
    • Mohamed, A.1    Yu, D.2    Deng, L.3
  • 26
    • 84255177123 scopus 로고    scopus 로고
    • Deep and wide: Multiple layers in automatic speech recognition
    • January
    • Morgan, N. "Deep and wide: Multiple layers in automatic speech recognition, " IEEE Trans. Audio, Speech, and Language Processing, Vol. 20 (1), January 2012.
    • (2012) IEEE Trans. Audio, Speech, and Language Processing , vol.20 , Issue.1
    • Morgan, N.1
  • 27
    • 84897497795 scopus 로고    scopus 로고
    • On the difficulty of training recurrent neural networks
    • Pascanu, R., Mikolov, T., and Bengio, Y. "On the difficulty of training recurrent neural networks, " Proc. ICML, 2013.
    • (2013) Proc. ICML
    • Pascanu, R.1    Mikolov, T.2    Bengio, Y.3
  • 28
    • 0028392167 scopus 로고
    • An application of recurrent nets to phone probability estimation
    • Robinson, A. "An application of recurrent nets to phone probability estimation, " IEEE Trans. Neural Networks, Vol. 5, pp. 298-305, 1994.
    • (1994) IEEE Trans. Neural Networks , vol.5 , pp. 298-305
    • Robinson, A.1
  • 29
    • 84886829539 scopus 로고    scopus 로고
    • Optimization techniques to improve training speed of deep neural networks for large speech tasks
    • Nov
    • Sainath, T., Kingsbury, B., Soltau, H., and Ramabhadran, B. "Optimization Techniques to Improve Training Speed of Deep Neural Networks for Large Speech Tasks, " IEEE Trans. Audio, Speech, and Language Processing, vol.21, no.11, pp.2267-2276, Nov. 2013.
    • (2013) IEEE Trans. Audio, Speech, and Language Processing , vol.21 , Issue.11 , pp. 2267-2276
    • Sainath, T.1    Kingsbury, B.2    Soltau, H.3    Ramabhadran, B.4
  • 32
    • 84858972572 scopus 로고    scopus 로고
    • Making deep belief networks effective for large vocabulary continuous speech recognition
    • Sainath, T., Kingsbury, B., Ramabhadran, B., Novak, P., and Mohamed, A. "Making deep belief networks effective for large vocabulary continuous speech recognition, " Proc. ASRU, 2011.
    • (2011) Proc. ASRU
    • Sainath, T.1    Kingsbury, B.2    Ramabhadran, B.3    Novak, P.4    Mohamed, A.5
  • 33
    • 84865801985 scopus 로고    scopus 로고
    • Conversational speech transcription using context-dependent deep neural networks
    • Seide, F., Li, G., and Yu, D. "Conversational speech transcription using context-dependent deep neural networks, " Proc. Interspeech, 2011.
    • (2011) Proc. Interspeech
    • Seide, F.1    Li, G.2    Yu, D.3
  • 34
    • 80053459857 scopus 로고    scopus 로고
    • Generating text with recurrent neural networks
    • Sutskever, I., Martens J., and Hinton, G. "Generating text with recurrent neural networks, " Proc. ICML, 2011.
    • (2011) Proc. ICML
    • Martens, J.1    Sutskever, I.2    Hinton, G.3
  • 37
    • 0026692226 scopus 로고
    • Stacked generalization
    • Wolpert, D. "Stacked generalization, " Neural Networks, vol. 5, no. 2, pp. 241-259, 1992.
    • (1992) Neural Networks , vol.5 , Issue.2 , pp. 241-259
    • Wolpert, D.1
  • 39
    • 84871387302 scopus 로고    scopus 로고
    • The deep tensor neural network with applications to large vocabulary speech recognition
    • Yu, D., Deng, L., and Seide, F. "The deep tensor neural network with applications to large vocabulary speech recognition, " IEEE Trans. Audio, Speech, and Language Processing, vol. 21, no. 2, pp. 388-396, 2013.
    • (2013) IEEE Trans. Audio, Speech, and Language Processing , vol.21 , Issue.2 , pp. 388-396
    • Yu, D.1    Deng, L.2    Seide, F.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.