메뉴 건너뛰기




Volumn 1, Issue , 2015, Pages 448-456

Batch normalization: Accelerating deep network training by reducing internal covariate shift

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; LEARNING SYSTEMS;

EID: 84969584486     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (31101)

References (24)
  • 1
    • 84862277874 scopus 로고    scopus 로고
    • Understanding the difficulty of training deep feedforward neural networks
    • May
    • Bengio, Yoshua and Glorot, Xavier. Understanding the difficulty of training deep feedforward neural networks. In Proceedings of AISTATS 2010, volume 9, pp. 249-256, May 2010.
    • (2010) Proceedings of AISTATS 2010 , vol.9 , pp. 249-256
    • Bengio, Y.1    Glorot, X.2
  • 4
    • 80052250414 scopus 로고    scopus 로고
    • Adaptive subgradient methods for online learning and stochastic optimization
    • July
    • Duchi, John, Hazan, Elad, and Singer, Yoram. Adaptive subgradient methods for online learning and stochastic optimization. J. Mach. Learn. Res., 12: 2121-2159, July 2011. ISSN 1532-4435.
    • (2011) J. Mach. Learn. Res. , vol.12 , pp. 2121-2159
    • Duchi, J.1    Hazan, E.2    Singer, Y.3
  • 5
    • 85083951034 scopus 로고    scopus 로고
    • Knowledge matters: Importance of prior information for optimization
    • abs/1301.4083
    • G ülçehre, Caglar and Bengio, Yoshua. Knowledge matters: Importance of prior information for optimization. CoRR, abs/1301.4083, 2013.
    • (2013) CoRR
    • Gülçehre, C.1    Bengio, Y.2
  • 6
    • 84937472647 scopus 로고    scopus 로고
    • Delving deep into rectifiers: Surpassing human-level performance on imageNet classification
    • February
    • He, K., Zhang, X., Ren, S., and Sun, J. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification. ArXiv e-prints, February 2015.
    • (2015) ArXiv E-prints
    • He, K.1    Zhang, X.2    Ren, S.3    Sun, J.4
  • 7
    • 0042826822 scopus 로고    scopus 로고
    • Independent component analysis: Algorithms and applications
    • May
    • Hyvarinen, A. and Oja, E. Independent component analysis: Algorithms and applications. Neural Netw., 13 (4-5): 411-130, May 2000.
    • (2000) Neural Netw , vol.13 , Issue.4-5 , pp. 411-430
    • Hyvarinen, A.1    Oja, E.2
  • 9
    • 0032203257 scopus 로고    scopus 로고
    • Gradient-based learning applied to document recognition
    • November
    • LeCun, Y, Bottou, L., Bengio, Y., and Haffner, P. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86 (11): 2278-2324, November 1998a.
    • (1998) Proceedings of the IEEE , vol.86 , Issue.11 , pp. 2278-2324
    • LeCun, Y.1    Bottou, L.2    Bengio, Y.3    Haffner, P.4
  • 11
    • 52249097028 scopus 로고    scopus 로고
    • Nonlinear image representation using divisive normalization
    • IEEE Computer Society, Jun 23-28
    • Lyu, S and Simoncelli, E P. Nonlinear image representation using divisive normalization. In Proc. Computer Vision and Pattern Recognition, pp. 1-8. IEEE Computer Society, Jun 23-28 2008. doi: 10.1109/CVPR.2008.4587821.
    • (2008) Proc. Computer Vision and Pattern Recognition , pp. 1-8
    • Lyu, S.1    Simoncelli, E.P.2
  • 12
    • 77956509090 scopus 로고    scopus 로고
    • Rectified linear units improve restricted Boltzmann machines
    • Omnipress
    • Nair, Vinod and Hinton, Geoffrey E. Rectified linear units improve restricted boltzmann machines. In ICML, pp. 807-814. Omnipress, 2010.
    • (2010) ICML , pp. 807-814
    • Nair, V.1    Hinton, G.E.2
  • 14
    • 84969522474 scopus 로고    scopus 로고
    • Parallel training of deep neural networks with natural gradient and parameter averaging
    • abs/1410.7455
    • Povey, Daniel, Zhang, Xiaohui, and Khudanpur, Sanjeev. Parallel training of deep neural networks with natural gradient and parameter averaging. CoRR, abs/1410.7455, 2014.
    • (2014) CoRR
    • Povey, D.1    Zhang, X.2    Khudanpur, S.3
  • 17
    • 84969522090 scopus 로고    scopus 로고
    • Exact solutions to the nonlinear dynamics of learning in deep linear neural networks
    • abs/1312.6120
    • Saxe, Andrew M., McClelland, James L., and Ganguli, Surya. Exact solutions to the nonlinear dynamics of learning in deep linear neural networks. CoRR, abs/1312.6120, 2013.
    • (2013) CoRR
    • Saxe, A.M.1    McClelland, J.L.2    Ganguli, S.3
  • 18
    • 0037527188 scopus 로고    scopus 로고
    • Improving predictive inference under covariate shift by weighting the log-likelihood function
    • October
    • Shimodaira, Hidetoshi. Improving predictive inference under covariate shift by weighting the log-likelihood function. Journal of Statistical Planning and Inference, 90 (2): 227-244, October 2000.
    • (2000) Journal of Statistical Planning and Inference , vol.90 , Issue.2 , pp. 227-244
    • Shimodaira, H.1
  • 19
    • 84904163933 scopus 로고    scopus 로고
    • Dropout: A simple way to prevent neural networks from overfitting
    • January
    • Srivastava, Nitish, Hinton, Geoffrey, Krizhevsky, Alex, Sutskever, Ilya, and Salakhutdinov, Ruslan. Dropout: A simple way to prevent neural networks from overfitting. J. Mach. Learn. Res., 15 (1): 1929-1958, January 2014.
    • (2014) J. Mach. Learn. Res. , vol.15 , Issue.1 , pp. 1929-1958
    • Srivastava, N.1    Hinton, G.2    Krizhevsky, A.3    Sutskever, I.4    Salakhutdinov, R.5
  • 20
    • 84897510162 scopus 로고    scopus 로고
    • On the importance of initialization and momentum in deep learning
    • JMLR.org
    • Sutskever, Ilya, Martens, James, Dahl, George E., and Hinton, Geoffrey E. On the importance of initialization and momentum in deep learning. In ICML (3), volume 28 of JMLR Proceedings, pp. 1139-1147. JMLR.org, 2013.
    • (2013) ICML (3) of JMLR Proceedings , vol.28 , pp. 1139-1147
    • Sutskever, I.1    Martens, J.2    Dahl, G.E.3    Hinton, G.E.4
  • 22
    • 85162533997 scopus 로고    scopus 로고
    • A convergence analysis of log-linear training
    • Shawe-Taylor, J., Zemel, R.S., Bartlett, P., Pereira, F.C.N., and Weinberger, K.Q. (eds.), Granada, Spain, December
    • Wiesler, Simon and Ney, Hermann. A convergence analysis of log-linear training. In Shawe-Taylor, J., Zemel, R.S., Bartlett, P., Pereira, F.C.N., and Weinberger, K.Q. (eds.), Advances in Neural Information Processing Systems 24, pp. 657-665, Granada, Spain, December 2011.
    • (2011) Advances in Neural Information Processing Systems , vol.24 , pp. 657-665
    • Wiesler, S.1    Ney, H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.