메뉴 건너뛰기




Volumn , Issue , 2018, Pages 93-96

Understanding Performance Differences of FPGAs and GPUs

Author keywords

Analytical model; FPGA; GPU; Performance comparison

Indexed keywords

ANALYTICAL MODELS; BANDWIDTH; BENCHMARKING; COMPUTERS; GRAPHICS PROCESSING UNIT; PROGRAM PROCESSORS;

EID: 85057754695     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/FCCM.2018.00023     Document Type: Conference Paper
Times cited : (96)

References (9)
  • 1
    • 70649092154 scopus 로고    scopus 로고
    • Rodinia: A benchmark suite for heterogeneous computing
    • S. Che et al., "Rodinia: A benchmark suite for heterogeneous computing, " in IISWC'2009, pp. 44-54. [Online]. Available: http://www. cs. virginia. edu/-skadron/wiki/rodinia/index. php
    • IISWC'2009 , pp. 44-54
    • Che, S.1
  • 2
    • 79953076698 scopus 로고    scopus 로고
    • High-level synthesis for FPGAs: From prototyping to deployment
    • J. Cong et al., "High-level synthesis for fpgas: From prototyping to deployment, " TCAD'2011, vol. 30, no. 4, pp. 473-491.
    • TCAD'2011 , vol.30 , Issue.4 , pp. 473-491
    • Cong, J.1
  • 3
    • 85027720661 scopus 로고    scopus 로고
    • accessed: 2018-01-21
    • "Amazon EC2 F1 Instances, " https://aws. amazon. com/ec2/instance-types/f1/, accessed: 2018-01-21.
    • Amazon EC2 F1 Instances
  • 4
    • 85064380638 scopus 로고    scopus 로고
    • Xilinx 16nm datacenter device family with in-package hbm and ccix interconnect
    • G. Singh et al., "Xilinx 16nm datacenter device family with in-package hbm and ccix interconnect, " in HotChips'2017.
    • HotChips'2017
    • Singh, G.1
  • 5
    • 85023605978 scopus 로고    scopus 로고
    • An optimal microarchitecture for stencil computation acceleration based on non-uniform partitioning of data reuse buffers
    • J. Cong et al., "An optimal microarchitecture for stencil computation acceleration based on non-uniform partitioning of data reuse buffers, " in DAC'2014, pp. 1-6.
    • DAC'2014 , pp. 1-6
    • Cong, J.1
  • 6
    • 85023621468 scopus 로고    scopus 로고
    • Caffeine: Towards uniformed representation and acceleration for deep convolutional neural networks
    • C. Zhang et al., "Caffeine: Towards uniformed representation and acceleration for deep convolutional neural networks, " in ICCAD'2016, pp. 1-8.
    • ICCAD'2016 , pp. 1-8
    • Zhang, C.1
  • 7
    • 52349084750 scopus 로고    scopus 로고
    • Accelerating compute-intensive applications with GPUs and FPGAs
    • S. Che et al., "Accelerating compute-intensive applications with GPUs and fpgas, " in SASP'2008, pp. 101-107.
    • SASP'2008 , pp. 101-107
    • Che, S.1
  • 8
    • 84906329903 scopus 로고    scopus 로고
    • On the characterization of OpenCL dwarfs on fixed and reconfigurable platforms
    • K. Krommydas et al., "On the characterization of OpenCL dwarfs on fixed and reconfigurable platforms, " in ASAP'2014, pp. 153-160.
    • ASAP'2014 , pp. 153-160
    • Krommydas, K.1
  • 9
    • 85034443289 scopus 로고    scopus 로고
    • Evaluating and optimizing OpenCL kernels for high performance computing with FPGAs
    • H. R. Zohouri et al., "Evaluating and optimizing OpenCL kernels for high performance computing with fpgas, " in SC'2016, pp. 35: 1-35: 12.
    • SC'2016 , pp. 351-3512
    • Zohouri, H.R.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.