Volume 29, Issue 20, 2017, Pages

FPGA-accelerated deep convolutional neural networks for high throughput and energy efficiency

Author keywords

Accelerator; Caffe; CNN; FPGA; Matrix Multiplier

Indexed keywords

ACCELERATION; APPLICATION PROGRAMS; COMPUTER PROGRAMMING; CONVOLUTION; DEEP NEURAL NETWORKS; ENERGY EFFICIENCY; FIELD PROGRAMMABLE GATE ARRAYS (FPGA); MATRIX ALGEBRA; MEMORY ARCHITECTURE; PARTICLE ACCELERATORS; PROGRAM PROCESSORS; SYSTEM-ON-CHIP

EID: 84966447574     PISSN: 15320626     EISSN: 15320634     Source Type: Journal    
DOI: 10.1002/cpe.3850     Document Type: Conference Paper
Times cited: 47

References (28)
  • 6
    • 84919463060
    • GPU implementation of a parallel two-list algorithm for the subset-sum problem
    • Wan L, Li K, Liu J, Li K. GPU implementation of a parallel two-list algorithm for the subset-sum problem. Concurrency Computation Practice Experience 2015; 27(1):119–145.
    • (2015) Concurrency Computation Practice Experience , vol.27 , Issue.1 , pp. 119-145
    • Wan, L.1    Li, K.2    Liu, J.3    Li, K.4
  • 7
    • 84928685154
    • An iteration-based hybrid parallel algorithm for tridiagonal systems of equations on multi-core architectures
    • Tang G, Yang W, Li K, Ye Y, Xiao G, Li K. An iteration-based hybrid parallel algorithm for tridiagonal systems of equations on multi-core architectures. Concurrency Computation Practice Experience 2015; 27(17):5076–5095.
    • (2015) Concurrency Computation Practice Experience , vol.27 , Issue.17 , pp. 5076-5095
    • Tang, G.1    Yang, W.2    Li, K.3    Ye, Y.4    Xiao, G.5    Li, K.6
  • 9
    • 70450060046
    • FPL 2009. International Conference on Field Programmable Logic and Applications, 2009, IEEE, Prague, Czech Republic
    • Farabet C, Poulet C, Han JY, LeCun Y. CNP: an FPGA-based processor for convolutional networks. FPL 2009. International Conference on Field Programmable Logic and Applications, 2009: IEEE, Prague, Czech Republic, 2009; 32–37.
    • (2009) CNP: an FPGA-based processor for convolutional networks , pp. 32-37
    • Farabet, C.1    Poulet, C.2    Han, J.Y.3    LeCun, Y.4
  • 20
  • 25
    • 84920152252
    • Accuracy evaluation of deep belief networks with fixed-point arithmetic
    • Jiang J, Hu R, Mikel L, Dou Y. Accuracy evaluation of deep belief networks with fixed-point arithmetic. Computer Modelling & New Technologies 2014; 18(6):7–14.
    • (2014) Computer Modelling & New Technologies , vol.18 , Issue.6 , pp. 7-14
    • Jiang, J.1    Hu, R.2    Mikel, L.3    Dou, Y.4
  • 26
    • 84966674121
    • Learning both weights and connections for efficient neural networks
    • Han S, Pool J, Tran J, Dally WJ. Learning both weights and connections for efficient neural networks. arXiv preprint 2015: arXiv:1506.02626.
    • (2015) arXiv preprint
    • Han, S.1    Pool, J.2    Tran, J.3    Dally, W.J.4
  • 27
    • 84919470072
    • Performance analysis and optimization for SPMV on GPU using probabilistic modeling
    • Li K, Yang W, Li K. Performance analysis and optimization for SPMV on GPU using probabilistic modeling. IEEE Transactions on Parallel and Distributed Systems 2015; 26(1):196–205.
    • (2015) IEEE Transactions on Parallel and Distributed Systems , vol.26 , Issue.1 , pp. 196-205
    • Li, K.1    Yang, W.2    Li, K.3
  • 28
    • 84939230567
    • Performance optimization using partitioned SPMV on GPUs and multicore CPUs
    • Yang W, Li K, Mo Z, Li K. Performance optimization using partitioned SPMV on GPUs and multicore CPUs. IEEE Transactions on Computers 2015; 64(9):2623–2636.
    • (2015) IEEE Transactions on Computers , vol.64 , Issue.9 , pp. 2623-2636
    • Yang, W.1    Li, K.2    Mo, Z.3    Li, K.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.