메뉴 건너뛰기




Volumn , Issue , 2016, Pages 46-56

Analyzing the energy-efficiency of sparse matrix multiplication on heterogeneous systems: A comparative study of GPU, Xeon Phi and FPGA

Author keywords

[No Author keywords available]

Indexed keywords

ACCELERATION; BIG DATA; COMPUTER HARDWARE; DATA MINING; DATA TRANSFER; FIELD PROGRAMMABLE GATE ARRAYS (FPGA); HARDWARE; MATRIX ALGEBRA; RECONFIGURABLE HARDWARE; TELECOMMUNICATION NETWORKS;

EID: 84978634383     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ISPASS.2016.7482073     Document Type: Conference Paper
Times cited : (26)

References (70)
  • 1
    • 84978755800 scopus 로고    scopus 로고
    • accessed: 2015-09-04
    • "The Green500 List-June 2015," http://www.green500.org/lists/ green201506, accessed: 2015-09-04.
    • The Green500 List-June 2015
  • 6
    • 77649253148 scopus 로고    scopus 로고
    • Performance comparison of graphics processors to reconfigurable logic: A case study
    • B. Cope, P. Cheung, W. Luk, and L. Howes, "Performance Comparison of Graphics Processors to Reconfigurable Logic: A Case Study," IEEE Trans. on Computers, vol. 59, no. 4, 2010.
    • (2010) IEEE Trans. on Computers , vol.59 , Issue.4
    • Cope, B.1    Cheung, P.2    Luk, W.3    Howes, L.4
  • 8
    • 84982813068 scopus 로고    scopus 로고
    • Sda: Software-defined accelerator for large-scale DNN systems
    • J. Ouyang, S. Lin, W. Qi, Y. Wang, B. Yu, and S. Jiang, "SDA: Software-Defined Accelerator for Large-Scale DNN Systems," in HotChips26, 2014.
    • (2014) HotChips26
    • Ouyang, J.1    Lin, S.2    Qi, W.3    Wang, Y.4    Yu, B.5    Jiang, S.6
  • 9
    • 84978687066 scopus 로고    scopus 로고
    • Intel xeon+FPGA platform for the data center
    • Workshop on Recorifigurable Computing for the Masses
    • P. K. Gupta, "Intel Xeon+FPGA Platform for the Data Center," in Field Programmable Logic and Applications (FPL), Workshop on Recorifigurable Computing for the Masses, 2014.
    • (2014) Field Programmable Logic and Applications (FPL)
    • Gupta, P.K.1
  • 21
    • 81355161778 scopus 로고    scopus 로고
    • The university of Florida sparse matrix collection
    • T. A. Davis and Y. Hu, "The University of Florida Sparse Matrix Collection," ACM Trans. Math. Softw., vol. 38, no. 1, 2011.
    • (2011) ACM Trans. Math. Softw. , vol.38 , Issue.1
    • Davis, T.A.1    Hu, Y.2
  • 25
    • 77951180817 scopus 로고    scopus 로고
    • Instruction set innovations for the convey HC-l computer
    • T. Brewer, "Instruction Set Innovations for the Convey HC-l Computer," Micro, IEEE, vol. 30, no. 2, 2010.
    • (2010) Micro, IEEE , vol.30 , Issue.2
    • Brewer, T.1
  • 28
    • 84875673115 scopus 로고    scopus 로고
    • Version 4.304.55 ed., NVIDIA Corp.
    • NVML API REFERENCE MANUAL, Version 4.304.55 ed., NVIDIA Corp., 2012.
    • (2012) NVML Api Reference Manual
  • 31
    • 84978635984 scopus 로고    scopus 로고
    • Electronic Educational Devices, accessed: 2015-09-08
    • Watts up? and Watts up? PRO Operators Manual, https://www. wattsupmeters.com, Electronic Educational Devices, accessed: 2015-09-08.
    • Watts Up? and Watts Up? PRO Operators Manual
  • 32
    • 85043146402 scopus 로고    scopus 로고
    • V2.1 ed., Standard Performance Evaluation Corporation (SPEC), SPEC Power and Performance Committee
    • Power and Peiformance Benchmark Methodology, V2.1 ed., Standard Performance Evaluation Corporation (SPEC), SPEC Power and Performance Committee, 2012.
    • (2012) Power and Peiformance Benchmark Methodology
  • 35
    • 0030243819 scopus 로고    scopus 로고
    • Energy dissipation in general purpose microprocessors
    • R. Gonzalez and M. Horowitz, "Energy dissipation in general purpose microprocessors," Solid-State Circuits, vol. 31, no. 9, 1996.
    • (1996) Solid-State Circuits , vol.31 , Issue.9
    • Gonzalez, R.1    Horowitz, M.2
  • 43
    • 84903765018 scopus 로고    scopus 로고
    • cusparse, accessed: 2015-09-04. 56
    • "NVIDIA CUDA Sparse Matrix library," https://developer.nvidia.coml cusparse, accessed: 2015-09-04. 56
    • NVIDIA CUDA Sparse Matrix Library
  • 47
    • 0003550735 scopus 로고
    • SPARSKIT: A basic tool kit for sparse matrix computations
    • version 2
    • Y. Saad, "SPARSKIT: a basic tool kit for sparse matrix computations," Tech. Rep., 1994, version 2.
    • (1994) Tech. Rep.
    • Saad, Y.1
  • 49
    • 84864051848 scopus 로고    scopus 로고
    • ClSpMV: A cross-platform OpenCL SpMV framework on GPUs
    • B.-Y. Su and K. Keutzer, "clSpMV: A Cross-Platform OpenCL SpMV Framework on GPUs," in Supercomputing (ISC), 2012.
    • (2012) Supercomputing (ISC)
    • Su, B.-Y.1    Keutzer, K.2
  • 50
    • 84911360428 scopus 로고    scopus 로고
    • A unified sparse matrix data format for efficient general sparse matrixvector multiply on modern processors with wide SIMD units
    • M. Kreutzer, G. Hager, G. Wellein, H. Fehske, and A. R. Bishop, "A unified sparse matrix data format for efficient general sparse matrixvector multiply on modern processors with wide SIMD units," SIAM Journal on Scientific Computing, vol. 36, no. 5, 2014.
    • (2014) SIAM Journal on Scientific Computing , vol.36 , Issue.5
    • Kreutzer, M.1    Hager, G.2    Wellein, G.3    Fehske, H.4    Bishop, A.R.5
  • 60
    • 84978756887 scopus 로고    scopus 로고
    • Floating-point megafunctions
    • -, "Floating-Point Megafunctions," User Guide, 2013.
    • (2013) User Guide
    • Altera Corp1
  • 63
    • 84978767122 scopus 로고    scopus 로고
    • Sdaccel development environment
    • Xilinx, Inc., "SDAccel Development Environment," User Guide, 2015.
    • (2015) User Guide
    • Xilinx, Inc.,1
  • 65
    • 47249127725 scopus 로고    scopus 로고
    • The case for energy-proportional computing
    • L. A. Barroso and U. Hölzle, "The Case for Energy-Proportional Computing," Computer, vol. 40, no. 12, 2007.
    • (2007) Computer , vol.40 , Issue.12
    • Barroso, L.A.1    Hölzle, U.2
  • 66
    • 85021450123 scopus 로고    scopus 로고
    • Energy aware consolidation for cloud computing
    • S. Srikantaiah, A. Kansal, and F. Zhao, "Energy Aware Consolidation for Cloud Computing," in HotPower, 2008.
    • (2008) HotPower
    • Srikantaiah, S.1    Kansal, A.2    Zhao, F.3
  • 67
    • 84901242759 scopus 로고    scopus 로고
    • A survey on techniques for improving the energy efficiency of large-scale distributed systems
    • A.-C. Orgerie, M. D. d. Assuncao, and L. Lefevre, " A Survey on Techniques for Improving the Energy Efficiency of Large-scale Distributed Systems," ACM Comput. Surv., vol. 46, no. 4, 2014.
    • (2014) ACM Comput. Surv. , vol.46 , Issue.4
    • Orgerie, A.-C.1    Assuncao, M.D.D.2    Lefevre, L.3
  • 69
    • 84940769996 scopus 로고    scopus 로고
    • Energy-efficient microserver based on a 12-core l.8ghz 188k-coremark 28nm bulk CMOS 64b soc for big-data applications with 159gb/sll memory bandwidth system density
    • R. Luijten, D. Pham, R. Clauberg, M. Cossale, H. Nguyen, and M. Pandya, "Energy-Efficient Microserver Based on a 12-Core l.8GHz 188K-CoreMark 28nm Bulk CMOS 64b SoC for Big-Data Applications with 159GB/slL Memory Bandwidth System Density," in SolidState Circuits Conference (ISSCC), 2015.
    • (2015) SolidState Circuits Conference (ISSCC)
    • Luijten, R.1    Pham, D.2    Clauberg, R.3    Cossale, M.4    Nguyen, H.5    Pandya, M.6


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.