SCOPUS Information Search Platform

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volume 9528, 2015, Pages 64-76

Unified virtual memory support for deep CNN accelerator on SoC FPGA

Authors (5): Xiao, Tao (a); Qiao, Yuran (a); Shen, Junzhong (a); Yang, Qianming (a); Wen, Mei (a)

(a) National University of Defense Technology (China)

Author keywords

Accelerator; Coherence; Deep CNN; SoC; Unified virtual memory

Indexed keywords

ACCELERATION; COHERENT LIGHT; COMPUTER ARCHITECTURE; DYNAMIC RANDOM ACCESS STORAGE; ENERGY EFFICIENCY; FIELD PROGRAMMABLE GATE ARRAYS (FPGA); INFORMATION MANAGEMENT; MEMORY ARCHITECTURE; NETWORK ARCHITECTURE; NEURAL NETWORKS; PARALLEL ARCHITECTURES; PARTICLE ACCELERATORS; RECONFIGURABLE HARDWARE;

COMPUTATIONAL TASK; CONVOLUTIONAL NEURAL NETWORK; DEEP CNN; EFFICIENT MANAGEMENTS; HARDWARE ACCELERATORS; MEMORY MANAGEMENT; PHYSICAL MEMORY; VIRTUAL MEMORY;

SYSTEM-ON-CHIP;

EID: 84959333505 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-319-27119-4_5 Document Type: Conference Paper

Times cited: 4

References (19)

1
- 79953830059
- A taxonomy of accelerator architectures and their programming models
- Cascaval, C., Chatterjee, S., Franke, H., Gildea, K., Pattnaik, P.: A taxonomy of accelerator architectures and their programming models. IBM J. Res. Dev. 54(5), 473–482 (2010)
- (2010) IBM J. Res. Dev , vol.54 , Issue.5 , pp. 473-482
- Cascaval, C.¹ Chatterjee, S.² Franke, H.³ Gildea, K.⁴ Pattnaik, P.⁵

2
- 84897749415
- Disengaged scheduling for fair, protected access to fast computational accelerators
- ACM
- Menychtas, K., Shen, K., Scott, M. L.: Disengaged scheduling for fair, protected access to fast computational accelerators. In: Proceedings of the 19th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2014), pp. 301–316. ACM (2014)
- (2014) Proceedings of the 19th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2014) , pp. 301-316
- Menychtas, K.¹ Shen, K.² Scott, M.L.³

3
- 85026934451
- Abdelgawad, H. M., Safar, M., Wahba, A. M.: High level synthesis of canny edge detection algorithm on zynq platform
- High Level Synthesis of Canny Edge Detection Algorithm on Zynq Platform
- Abdelgawad, H.M.¹ Safar, M.² Wahba, A.M.³

4
- 84988235942
- Vallina, F. M., Kohn, C., Joshi, P.: Zynq all programmable soc sobel filter implementation using the vivado hls tool, vol. XAPP890, pp. 1–16 (2012)
- (2012) Zynq All Programmable Soc Sobel Filter Implementation Using the Vivado Hls Tool , vol.XAPP890 , pp. 1-16
- Vallina, F.M.¹ Kohn, C.² Joshi, P.³

5
- 84962921765
- Optimizing fpga-based accelerator design for deep convolutional neural networks
- Zhang, C., Li, P., Sun, G., Guan, Y., Xiao, B., Cong, J.: Optimizing fpga-based accelerator design for deep convolutional neural networks. In: Proceedings of the 2015 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, pp. 161–170. ACM (2015)
- (2015) Proceedings of the 2015 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays , pp. 161-170
- Zhang, C.¹ Li, P.² Sun, G.³ Guan, Y.⁴ Xiao, B.⁵ Cong, J.⁶

6
- 84908529622
- A 240 g-ops/s mobile coprocessor for deep neural networks
- Gokhale, V., Jin, J., Dundar, A., Martini, B., Culurciello, E.: A 240 g-ops/s mobile coprocessor for deep neural networks. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 696–701 (2014)
- (2014) 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) , pp. 696-701
- Gokhale, V.¹ Jin, J.² Dundar, A.³ Martini, B.⁴ Culurciello, E.⁵

7
- 84913580146
- Caffe: Convolutional architecture for fast feature embedding
- ACM
- Jia, Y., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., Guadarrama, S., Darrell, T.: Caffe: convolutional architecture for fast feature embedding. In: Proceedings of the ACM International Conference on Multimedia, pp. 675–678. ACM (2014)
- (2014) Proceedings of the ACM International Conference on Multimedia , pp. 675-678
- Jia, Y.¹ Shelhamer, E.² Donahue, J.³ Karayev, S.⁴ Long, J.⁵ Girshick, R.⁶ Guadarrama, S.⁷ Darrell, T.⁸

8
- 0004287409
- Linux Device Drivers
- Corbet, J., Rubini, A., Kroah-Hartman, G.: Linux Device Drivers. O’Reilly Media Inc., Sebastopol (2005)
- (2005) Linux Device Drivers
- Corbet, J.¹ Rubini, A.² Kroah-Hartman, G.³

9
- 84885922370
- Energy and performance exploration of accelerator coherency port using xilinx zynq
- ACM
- Sadri, M., Weis, C., Wehn, N., Benini, L.: Energy and performance exploration of accelerator coherency port using xilinx zynq. In: Proceedings of the 10th FPGAworld Conference, p. 5. ACM (2013)
- (2013) Proceedings of the 10th FPGAworld Conference , p. 5
- Sadri, M.¹ Weis, C.² Wehn, N.³ Benini, L.⁴

10
- 84959418007
- Zynq-7000 All Programmable SoC Technical Reference Manual (UG585)
- Zynq-7000 All Programmable SoC Technical Reference Manual (UG585), Xilinx Inc., March 2013
- (2013)

11
- 70450060046
- Cnp: An fpga-based processor for convolutional networks
- IEEE
- Farabet, C., Poulet, C., Han, J. Y., LeCun, Y.: Cnp: an fpga-based processor for convolutional networks. In: International Conference on Field Programmable Logic and Applications, FPL 2009, pp. 32–37. IEEE (2009)
- (2009) International Conference on Field Programmable Logic and Applications, FPL 2009 , pp. 32-37
- Farabet, C.¹ Poulet, C.² Han, J.Y.³ Lecun, Y.⁴

12
- 71049121470
- A massively parallel coprocessor for convolutional neural networks
- IEEE
- Sankaradas, M., Jakkula, V., Cadambi, S., Chakradhar, S., Durdanovic, I., Cosatto, E., Graf, H. P.: A massively parallel coprocessor for convolutional neural networks. In: 20th IEEE International Conference on Application-specific Systems, Architectures and Processors, ASAP 2009, pp. 53–60. IEEE (2009)
- (2009) 20th IEEE International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2009 , pp. 53-60
- Sankaradas, M.¹ Jakkula, V.² Cadambi, S.³ Chakradhar, S.⁴ Durdanovic, I.⁵ Cosatto, E.⁶ Graf, H.P.⁷

13
- 84876231242
- Imagenet classification with deep convolutional neural networks
- Krizhevsky, A., Sutskever, I., Hinton, G. E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
- (2012) Advances in Neural Information Processing Systems , pp. 1097-1105
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.E.³

14
- 84959418008
- PetaLinux Tools User Guide: Board Bringup Guide (UG980)
- PetaLinux Tools User Guide: Board Bringup Guide (UG980), Xilinx Inc., June 2014
- (2014) Board Bringup Guide (UG980)

15
- 84959418009
- Amd heterogeneous uniform memory access
- Rogers, P., Fellow, C.: Amd heterogeneous uniform memory access (2013)
- (2013)
- Rogers, P.¹ Fellow, C.²

16
- 84959418010
- AMD Kaveri review: A8-7600 and A10-7850K tested
- Garg, I. C.: Amd kaveri review: A8-7600 and a10-7850k tested, January 2014. http://www.anandtech.com/show/7677/amd-kaveri-review-a8-7600-a10-7850k/6
- (2014) Amd Kaveri Review: A8-7600 and A10-7850K Tested
- Garg, I.C.¹

17
- 67650507097
- Maintaining I/O data coherence in embedded multicore systems
- Berg, T.: Maintaining I/O data coherence in embedded multicore systems. IEEE Micro 29(3), 10–19 (2009)
- (2009) IEEE Micro , vol.29 , Issue.3 , pp. 10-19
- Berg, T.¹

18
- 84865547094
- big.LITTLE processing with ARM Cortex-A15 & Cortex-A7
- Greenhalgh, P.: big.LITTLE processing with ARM Cortex-A15 & Cortex-A7. ARM White Paper (2011)
- (2011) ARM White Paper
- Greenhalgh, P.¹

19
- 84864276428
- Efficient memory allocations on a many-core accelerator
- IEEE
- Koutras, I., Bartzas, A., Soudris, D.: Efficient memory allocations on a many-core accelerator. In: ARCS Workshops (ARCS), pp. 1–6. IEEE (2012)
- (2012) ARCS Workshops (ARCS) , pp. 1-6
- Koutras, I.¹ Bartzas, A.² Soudris, D.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.