메뉴 건너뛰기




Volumn , Issue , 2014, Pages 1058-1062

1-bit stochastic gradient descent and its application to data-parallel distributed training of speech DNNs

Author keywords

[No Author keywords available]

Indexed keywords

ION BEAMS; PROGRAM PROCESSORS; STOCHASTIC SYSTEMS;

EID: 84910069984     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (1108)

References (31)
  • 2
    • 84865801985 scopus 로고    scopus 로고
    • Conversational speech transcription using context-dependent deep neural networks
    • F. Seide, G. Li, and D. Yu, "Conversational Speech Transcription Using Context-Dependent Deep Neural Networks, " Interspeech, 2011.
    • (2011) Interspeech
    • Seide, F.1    Li, G.2    Yu, D.3
  • 7
    • 84890512601 scopus 로고    scopus 로고
    • Asynchronous stochastic gradient descent for DNN training
    • S. Zhang, C. Zhang, Z. You, R. Zheng, and B. Xu, "Asynchronous Stochastic Gradient Descent for DNN Training, " ICASSP, 2013.
    • (2013) ICASSP
    • Zhang, S.1    Zhang, C.2    You, Z.3    Zheng, R.4    Xu, B.5
  • 8
    • 84905269646 scopus 로고    scopus 로고
    • On parallelizability of stochastic gradient descent for speech DNNs
    • F. Seide, H. Fu, J. Droppo, G. Li, D. Yu, "On Parallelizability of Stochastic Gradient Descent for Speech DNNs, " ICASSP 2014.
    • (2014) ICASSP
    • Seide, F.1    Fu, H.2    Droppo, J.3    Li, G.4    Yu, D.5
  • 9
    • 84910105867 scopus 로고    scopus 로고
    • "Delta-Sigma Modulation, " Wikipedia, http://en.wikipedia.org/wiki/Delta-sigmamodulation.
    • Delta-Sigma Modulation
  • 10
    • 0028464214 scopus 로고
    • Context-dependent connectionist probabilty estimatation in a hybrid hidden markov model- neural net speech recognition system
    • H. Franco et al., "Context-Dependent Connectionist Probabilty Estimatation in a Hybrid Hidden Markov Model- Neural Net Speech Recognition System, " Computer Speech and Language, vol. 8, pp. 211-222, 1994.
    • (1994) Computer Speech and Language , vol.8 , pp. 211-222
    • Franco, H.1
  • 11
    • 84890496567 scopus 로고    scopus 로고
    • A cluster- based multiple deep neural networks method for large vocabulary continuous speech recognition
    • P. Zhou, C. Liu, Q. Liu, L. Dai, and H. Jiang, "A Cluster- Based Multiple Deep Neural Networks Method for Large Vocabulary Continuous Speech Recognition, " ICASSP, 2013.
    • (2013) ICASSP
    • Zhou, P.1    Liu, C.2    Liu, Q.3    Dai, L.4    Jiang, H.5
  • 13
    • 84906227589 scopus 로고    scopus 로고
    • Restructuring of deep neural network acoustic models with singular value decomposition
    • J. Xue, J. Li, and Y. Gong, "Restructuring of Deep Neural Network Acoustic Models with Singular Value Decomposition, " Interspeech 2013.
    • (2013) Interspeech
    • Xue, J.1    Li, J.2    Gong, Y.3
  • 14
    • 84906237512 scopus 로고    scopus 로고
    • Investigations on hessian- free optimization for cross-entropy training of deep neural networks
    • S. Wiesler, J. Li, and J. Xue, "Investigations on Hessian- Free Optimization for Cross-Entropy Training of Deep Neural Networks, " Interspeech, 2013.
    • (2013) Interspeech
    • Wiesler, S.1    Li, J.2    Xue, J.3
  • 15
    • 84878379108 scopus 로고    scopus 로고
    • Scalable minimum bayes risk training of deep neural network acoustic models using distributed hessian-free optimization
    • B. Kingsbury, T. Sainath, and H. Soltau, "Scalable Minimum Bayes Risk Training of Deep Neural Network Acoustic Models Using Distributed Hessian-free Optimization, " Interspeech, 2012.
    • (2012) Interspeech
    • Kingsbury, B.1    Sainath, T.2    Soltau, H.3
  • 16
    • 84905239342 scopus 로고    scopus 로고
    • Improving deep neural network acoustic models using generalized maxout networks
    • X. Zhang, J. Trmal, D. Povey, S. Khudanpur, "Improving Deep Neural Network Acoustic Models Using Generalized Maxout Networks, " ICASSP 2014.
    • (2014) ICASSP
    • Zhang, X.1    Trmal, J.2    Povey, D.3    Khudanpur, S.4
  • 17
    • 80051762104 scopus 로고    scopus 로고
    • Distributed optimization and statistical learning via the alternating direction method of multipliers
    • S. Boyd, N. Parikh, E. Chu, B. Peleato, and J. Eckstein, "Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers, " in Foundations and Trends in Machine Learning, Vol. 3, No. 1 (2010) 1-122.
    • (2010) Foundations and Trends in Machine Learning , vol.3 , Issue.1 , pp. 1-122
    • Boyd, S.1    Parikh, N.2    Chu, E.3    Peleato, B.4    Eckstein, J.5
  • 19
    • 84878397276 scopus 로고    scopus 로고
    • Pipelined back-propagation for context-dependent deep neural networks
    • X. Chen, A. Eversole, G. Li, D. Yu, and F. Seide, "Pipelined Back-Propagation for Context-Dependent Deep Neural Networks, " Interspeech, 2012.
    • (2012) Interspeech
    • Chen, X.1    Eversole, A.2    Li, G.3    Yu, D.4    Seide, F.5
  • 22
    • 33745805403 scopus 로고    scopus 로고
    • A fast learning algorithm for deep belief nets
    • G. Hinton, S. Osindero, and Y. Teh, "A Fast Learning Algorithm for Deep Belief Nets", Neural Computation, vol. 18, pp. 1527-1554, 2006.
    • (2006) Neural Computation , vol.18 , pp. 1527-1554
    • Hinton, G.1    Osindero, S.2    Teh, Y.3
  • 23
    • 84858976070 scopus 로고    scopus 로고
    • Feature engineering in context-dependent deep neural networks for conversational speech transcription
    • F. Seide, G. Li, X. Chen, and D. Yu, "Feature Engineering in Context-Dependent Deep Neural Networks for Conversational Speech Transcription, " Proc. ASRU, Waikoloa Village, 2011.
    • (2011) Proc. ASRU, Waikoloa Village
    • Seide, F.1    Li, G.2    Chen, X.3    Yu, D.4
  • 24
    • 70349213445 scopus 로고    scopus 로고
    • Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling
    • B. Kingsbury, "Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling, " ICASSP, 2009.
    • (2009) ICASSP
    • Kingsbury, B.1
  • 25
    • 0022471098 scopus 로고
    • Learning representations by back-propagating errors
    • Oct
    • D. Rumelhart, G. Hinton, and R. Williams, "Learning Representations By Back-Propagating Errors, " Nature, vol. 323, Oct. 1986.
    • (1986) Nature , vol.323
    • Rumelhart, D.1    Hinton, G.2    Williams, R.3
  • 26
    • 77956541496 scopus 로고    scopus 로고
    • Deep learning via Hessian-free optimization
    • J. Martens, "Deep learning via Hessian-free optimization, " ICML, 2010.
    • (2010) ICML
    • Martens, J.1
  • 27
  • 29
    • 84910105866 scopus 로고    scopus 로고
    • "Evaluation Campaign, " IWSLT 2013, http://www.iwslt2013.org/59.php.
    • (2013) Evaluation Campaign
  • 31
    • 84887388950 scopus 로고    scopus 로고
    • An empirical study of learning rates in deep neural networks for speech recognition
    • A. Senior, G. Heigold, M.-A. Ranzato, K. Yang, "An Empirical Study of Learning Rates in Deep Neural Networks for Speech Recognition, " ICASSP, 2013.
    • (2013) ICASSP
    • Senior, A.1    Heigold, G.2    Ranzato, M.-A.3    Yang, K.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.