



Volume , Issue , 2016, Pages

GeePS: Scalable deep learning on distributed GPUs with a GPU-specialized parameter server

Author keywords

[No Author keywords available]

Indexed keywords

NETWORK LAYERS; PROGRAM PROCESSORS

EID: 84971575164     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2901318.2901323     Document Type: Conference Paper
Times cited: 310

References (36)
  • 1
    • EID: 84954514573
    • NVIDIA cuBLAS https://developer.nvidia.com/cublas.
    • NVIDIA cuBLAS
  • 2
    • EID: 84971553276
    • NVIDIA cuDNN https://developer.nvidia.com/cudnn.
    • NVIDIA cuDNN
  • 4
    • EID: 84919919193
    • Distributed stochastic gradient MCMC
    • S. Ahn, B. Shahbaba, and M. Welling. Distributed stochastic gradient MCMC. In ICML, 2014.
    • (2014) ICML
    • Ahn, S.1    Shahbaba, B.2    Welling, M.3
  • 5
    • EID: 34547975052
    • Scaling learning algorithms towards AI
    • Y. Bengio, Y. LeCun, et al. Scaling learning algorithms towards AI. Large-scale kernel machines, 34(5), 2007.
    • (2007) Large-scale Kernel Machines , vol.34 , Issue.5
    • Bengio, Y.1    LeCun, Y.2
  • 7
    • EID: 85069497682
    • Project Adam: Building an efficient and scalable deep learning training system
    • T. Chilimbi, Y. Suzue, J. Apacible, and K. Kalyanaraman. Project Adam: Building an efficient and scalable deep learning training system. In OSDI, 2014.
    • (2014) OSDI
    • Chilimbi, T.1    Suzue, Y.2    Apacible, J.3    Kalyanaraman, K.4
  • 8
    • EID: 84866714584
    • Multi-column deep neural networks for image classification
    • D. Ciresan, U. Meier, and J. Schmidhuber. Multi-column deep neural networks for image classification. In CVPR, 2012.
    • (2012) CVPR
    • Ciresan, D.1    Meier, U.2    Schmidhuber, J.3
  • 12
    • EID: 84971509545
    • Scalable deep learning on distributed GPUs with a GPU-specialized parameter server
    • H. Cui, G. R. Ganger, and P. B. Gibbons. Scalable deep learning on distributed GPUs with a GPU-specialized parameter server. CMU PDL Technical Report (CMU-PDL-15-107), 2015.
    • (2015) CMU PDL Technical Report
    • Cui, H.1    Ganger, G.R.2    Gibbons, P.B.3
  • 17
    • EID: 84891720231
    • PRObE: A thousand-node experimental cluster for computer systems research
    • G. Gibson, G. Grider, A. Jacobson, and W. Lloyd. PRObE: A thousand-node experimental cluster for computer systems research. USENIX ;login:, 2013.
    • (2013) USENIX ;login:
    • Gibson, G.1    Grider, G.2    Jacobson, A.3    Lloyd, W.4
  • 23
    • EID: 84876231242
    • ImageNet classification with deep convolutional neural networks
    • A. Krizhevsky, I. Sutskever, and G. E. Hinton. ImageNet classification with deep convolutional neural networks. In NIPS, 2012.
    • (2012) NIPS
    • Krizhevsky, A.1    Sutskever, I.2    Hinton, G.E.3
  • 25
    • EID: 82155188108
    • Piccolo: Building fast, distributed programs with partitioned tables
    • R. Power and J. Li. Piccolo: Building fast, distributed programs with partitioned tables. In OSDI, 2010.
    • (2010) OSDI
    • Power, R.1    Li, J.2
  • 36
    • EID: 84912111128
    • Asynchronous distributed ADMM algorithm for global variable consensus optimization
    • R. Zhang and J. Kwok. Asynchronous distributed ADMM algorithm for global variable consensus optimization. In ICML, 2014.
    • (2014) ICML
    • Zhang, R.1    Kwok, J.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.