메뉴 건너뛰기




Volumn 2015-August, Issue , 2015, Pages 4974-4978

A nonmonotone learning rate strategy for SGD training of deep neural networks

Author keywords

deep learning; learning rate; nonmonotonicity; speech recognition; stepsize

Indexed keywords

AUDIO SIGNAL PROCESSING; DEEP LEARNING; LEARNING ALGORITHMS; NONLINEAR PROGRAMMING; SITE SELECTION; SPEECH COMMUNICATION; SPEECH RECOGNITION; STOCHASTIC MODELS; STOCHASTIC SYSTEMS;

EID: 84946057367     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2015.7178917     Document Type: Conference Paper
Times cited : (19)

References (22)
  • 2
    • 0242291662 scopus 로고    scopus 로고
    • second edition Numerical optimization
    • J. Nocedal and S. Wright, "Numerical optimization, second edition," Numerical optimization, pp. 497-528, 2006.
    • (2006) Numerical Optimization , pp. 497-528
    • Nocedal, J.1    Wright, S.2
  • 7
    • 80052250414 scopus 로고    scopus 로고
    • Adaptive subgradient methods for online learning and stochastic optimization
    • J. Duchi, E. Hazan, and Y. Singer, "Adaptive subgradient methods for online learning and stochastic optimization," The Journal of Machine Learning Research, vol. 12, pp. 2121-2159, 2011.
    • (2011) The Journal of Machine Learning Research , vol.12 , pp. 2121-2159
    • Duchi, J.1    Hazan, E.2    Singer, Y.3
  • 9
    • 84946050415 scopus 로고    scopus 로고
    • Accessed:-09-30
    • "Quicknet," http://www1.icsi.berkeley.edu/Speech/qn.html. Accessed: 2014-09-30.
    • (2014)
  • 11
    • 0022766519 scopus 로고
    • A nonmonotone line search technique for Newton's method
    • L. Grippo, F. Lampariello, and S. Lucidi, "A nonmonotone line search technique for Newton's method," SIAM Journal on Numerical Analysis, vol. 23, no. 4, pp. 707-716,1986.
    • (1986) SIAM Journal on Numerical Analysis , vol.23 , Issue.4 , pp. 707-716
    • Grippo, L.1    Lampariello, F.2    Lucidi, S.3
  • 12
    • 9944262108 scopus 로고    scopus 로고
    • A nonmonotone line search technique and its application to unconstrained optimization
    • H. Zhang and W. Hager, "A nonmonotone line search technique and its application to unconstrained optimization," SIAM Journal on Optimization, vol. 14, no. 4, pp. 1043-1056,2004.
    • (2004) SIAM Journal on Optimization , vol.14 , Issue.4 , pp. 1043-1056
    • Zhang, H.1    Hager, W.2
  • 13
    • 33745923778 scopus 로고    scopus 로고
    • The cyclic barzilai-borwein method for unconstrained optimization
    • Y. Dai, W. Hager, K. Schittkowski, and H. Zhang, "The cyclic barzilai-borwein method for unconstrained optimization," IMA Journal of Numerical Analysis, vol. 26, no. 3, pp. 604-627,2006.
    • (2006) IMA Journal of Numerical Analysis , vol.26 , Issue.3 , pp. 604-627
    • Dai, Y.1    Hager, W.2    Schittkowski, K.3    Zhang, H.4
  • 14
    • 0032186984 scopus 로고    scopus 로고
    • Incremental gradient algorithms with stepsizes bounded away from zero
    • M. Solodov, "Incremental gradient algorithms with stepsizes bounded away from zero," Computational Optimization and Applications, vol. 11, no. 1, pp. 23-35,1998.
    • (1998) Computational Optimization and Applications , vol.11 , Issue.1 , pp. 23-35
    • Solodov, M.1
  • 15
    • 0032222083 scopus 로고    scopus 로고
    • An incremental gradient (-projection) method with momentum term and adaptive stepsize rule
    • P. Tseng, "An incremental gradient (-projection) method with momentum term and adaptive stepsize rule," SIAM Journal on Optimization, vol. 8, no. 2, pp. 506-531, 1998.
    • (1998) SIAM Journal on Optimization , vol.8 , Issue.2 , pp. 506-531
    • Tseng, P.1
  • 17
    • 0030521134 scopus 로고    scopus 로고
    • Convergence analysis of gradient descent stochastic algorithms
    • A Shapiro and Y Wardi, "Convergence analysis of gradient descent stochastic algorithms," Journal of optimization theory and applications, vol. 91, no. 2, pp. 439-454,1996.
    • (1996) Journal of Optimization Theory and Applications , vol.91 , Issue.2 , pp. 439-454
    • Shapiro, A.1    Wardi, Y.2
  • 18
    • 84892854517 scopus 로고    scopus 로고
    • Stochastic first-and zeroth-order methods for nonconvex stochastic programming
    • Saeed Ghadimi and Guanghui Lan, "Stochastic first-and zeroth-order methods for nonconvex stochastic programming," SIAM Journal on Optimization, vol. 23, no. 4, pp. 2341-2368, 2013.
    • (2013) SIAM Journal on Optimization , vol.23 , Issue.4 , pp. 2341-2368
    • Ghadimi, S.1    Lan, G.2
  • 20
    • 84893691530 scopus 로고    scopus 로고
    • Speaker adaptation of neural network acoustic models using i-vectors
    • G. Saon, H. Soltau, D. Nahamoo, and M. Picheny, "Speaker adaptation of neural network acoustic models using i-vectors," in Proceedings of ASRU, 2013.
    • (2013) Proceedings of ASRU
    • Saon, G.1    Soltau, H.2    Nahamoo, D.3    Picheny, M.4
  • 21
    • 84858976070 scopus 로고    scopus 로고
    • Feature engineering in context-dependent deep neural networks for conversational speech transcription
    • F. Seide, G. Li, X. Chien, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proceedings of ASRU, 2011.
    • (2011) Proceedings of ASRU
    • Seide, F.1    Li, G.2    Chien, X.3    Yu, D.4
  • 22
    • 77956907243 scopus 로고    scopus 로고
    • On over-fitting in model selection and subsequent selection bias in performance evaluation
    • G. Cawley and N. Talbot, "On over-fitting in model selection and subsequent selection bias in performance evaluation," The Journal of Machine Learning Research, vol. 11, pp. 2079-2107,2010.
    • (2010) The Journal of Machine Learning Research , vol.11 , pp. 2079-2107
    • Cawley, G.1    Talbot, N.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.