SCOPUS 정보 검색 플랫폼

ISPASS 2016 - International Symposium on Performance Analysis of Systems and Software

Volumn , Issue , 2016, Pages 46-56

Analyzing the energy-efficiency of sparse matrix multiplication on heterogeneous systems: A comparative study of GPU, Xeon Phi and FPGA

(4) Giefers, Heiner a Staar, Peter a Bekas, Costas a Hagleitner, Christoph a

a IBM RESEARCH ZURICH (Switzerland)

Author keywords

[No Author keywords available]

Indexed keywords

ACCELERATION; BIG DATA; COMPUTER HARDWARE; DATA MINING; DATA TRANSFER; FIELD PROGRAMMABLE GATE ARRAYS (FPGA); HARDWARE; MATRIX ALGEBRA; RECONFIGURABLE HARDWARE; TELECOMMUNICATION NETWORKS;

COMPARATIVE STUDIES; COMPUTE-INTENSIVE TASKS; HARDWARE ACCELERATORS; HETEROGENEOUS SYSTEMS; HIGH PERFORMANCE COMPUTING; MODERN COMPUTER SYSTEMS; OPTIMAL SOLUTIONS; SYSTEM'S PERFORMANCE;

ENERGY EFFICIENCY;

EID: 84978634383 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ISPASS.2016.7482073 Document Type: Conference Paper

Times cited : (26)

References (70)

1
- 84978755800
- accessed: 2015-09-04
- "The Green500 List-June 2015," http://www.green500.org/lists/ green201506, accessed: 2015-09-04.
- The Green500 List-June 2015

2
- 84886548074
- Trends in energy-efficient computing: A perspective from the Green500
- B. Subramaniam, W. Saunders, T. Scogland, and w.-c. Feng, "Trends in energy-efficient computing: A perspective from the Green500," in Green Computing Coriference (IGCC), 2013.
- (2013) Green Computing Coriference (IGCC)
- Subramaniam, B.¹ Saunders, W.² Scogland, T.³ Feng, W.-C.⁴

3
- 84863393413
- A performance and energy comparison of FPGAs, GPUs, and multicores for slidingwindow applications
- J. Fowers, G. Brown, P. Cooke, and G. Stitt, " A Performance and Energy Comparison of FPGAs, GPUs, and Multicores for Slidingwindow Applications," in Int. Symp. on Field-programmable Gate Arrays (FPGA), 2012.
- (2012) Int. Symp. on Field-programmable Gate Arrays (FPGA)
- Fowers, J.¹ Brown, G.² Cooke, P.³ Stitt, G.⁴

4
- 77957919571
- BLAS comparison on FPGA, CPU and GPU
- S. Kestur, J. Davis, and O. Williams, "BLAS Comparison on FPGA, CPU and GPU," in Annual Symposium on VLSI (ISVLSI), 2010.
- (2010) Annual Symposium on VLSI (ISVLSI)
- Kestur, S.¹ Davis, J.² Williams, O.³

5
- 79952909372
- Bridging the GPGPU-FPGA efficiency gap
- C. W. F1etcher, I. A. Lebedev, N. B. Asadi, D. R. Burke, and J. Wawrzy nek, "Bridging the GPGPU-FPGA Efficiency Gap," in Field Programmable Gate Arrays (FPGA), 2011.
- (2011) Field Programmable Gate Arrays (FPGA)
- Fletcher, C.W.¹ Lebedev, I.A.² Asadi, N.B.³ Burke, D.R.⁴ Wawrzy Nek, J.⁵

6
- 77649253148
- Performance comparison of graphics processors to reconfigurable logic: A case study
- B. Cope, P. Cheung, W. Luk, and L. Howes, "Performance Comparison of Graphics Processors to Reconfigurable Logic: A Case Study," IEEE Trans. on Computers, vol. 59, no. 4, 2010.
- (2010) IEEE Trans. on Computers , vol.59 , Issue.4
- Cope, B.¹ Cheung, P.² Luk, W.³ Howes, L.⁴

7
- 84905454486
- A reconfigurable fabric for accelerating large-scale datacenter services
- A. Putnam, A. Caulfield, E. Chung, D. Chiou, K. Constantinides, J. Demme, H. Esmaeilzadeh, J. Fowers, G. Gopal, J. Gray, M. Haselman, S. Hauck, S. Heil, A. Hormati, J.-Y. Kim, S. Lanka, J. Larus, E. Peterson, S. Pope, A. Smith, J. Thong, P. Xiao, and D. Burger, "A Reconfigurable Fabric for Accelerating Large-Scale Datacenter Services," in Int. Symp. on Computer Architecture (ISCA), 2014.
- (2014) Int. Symp. on Computer Architecture (ISCA)
- Putnam, A.¹ Caulfield, A.² Chung, E.³ Chiou, D.⁴ Constantinides, K.⁵ Demme, J.⁶ Esmaeilzadeh, H.⁷ Fowers, J.⁸ Gopal, G.⁹ Gray, J.¹⁰ Haselman, M.¹¹ Hauck, S.¹² Heil, S.¹³ Hormati, A.¹⁴ Kim, J.-Y.¹⁵ Lanka, S.¹⁶ Larus, J.¹⁷ Peterson, E.¹⁸ Pope, S.¹⁹ Smith, A.²⁰ more..

8
- 84982813068
- Sda: Software-defined accelerator for large-scale DNN systems
- J. Ouyang, S. Lin, W. Qi, Y. Wang, B. Yu, and S. Jiang, "SDA: Software-Defined Accelerator for Large-Scale DNN Systems," in HotChips26, 2014.
- (2014) HotChips26
- Ouyang, J.¹ Lin, S.² Qi, W.³ Wang, Y.⁴ Yu, B.⁵ Jiang, S.⁶

9
- 84978687066
- Intel xeon+FPGA platform for the data center
- Workshop on Recorifigurable Computing for the Masses
- P. K. Gupta, "Intel Xeon+FPGA Platform for the Data Center," in Field Programmable Logic and Applications (FPL), Workshop on Recorifigurable Computing for the Masses, 2014.
- (2014) Field Programmable Logic and Applications (FPL)
- Gupta, P.K.¹

10
- 84922876530
- CAPI: A coherent accelerator processor interface
- J. Stuecheli, B. Blaner, C. Johns, and M. Siegel, "CAPI: A Coherent Accelerator Processor Interface," IBM Journal of Research and Development, vol. 59, no. 1, 2015.
- (2015) IBM Journal of Research and Development , vol.59 , Issue.1
- Stuecheli, J.¹ Blaner, B.² Johns, C.³ Siegel, M.⁴

11
- 35648995516
- EECS Department, University of California, Berkeley, Tech. Rep.
- K. Asanovic, R. Bodik, B. C. Catanzaro, J. J. Gebis, P. Husbands, K. Keutzer, D. A. Patterson, W. L. Plishker, I. Shalf, S. W. Williams, and K. A. Yelick, "The Landscape of Parallel Computing Research: A View from Berkeley," EECS Department, University of California, Berkeley, Tech. Rep., 2006.
- (2006) The Landscape of Parallel Computing Research: A View from Berkeley
- Asanovic, K.¹ Bodik, R.² Catanzaro, B.C.³ Gebis, J.J.⁴ Husbands, P.⁵ Keutzer, K.⁶ Patterson, D.A.⁷ Plishker, W.L.⁸ Shalf, I.⁹ Williams, S.W.¹⁰ Yelick, K.A.¹¹

12
- 77954719557
- The scalable heterogeneous computing (shoc) benchmark suite
- A. Danalis, G. Marin, C. McCurdy, J. Meredith, P. Roth, K. Spafford, V. Tipparaju, and J. Vetter, "The Scalable HeterOgeneous Computing (SHOC) Benchmark Suite," in 3rd Workshop on General-Purpose Computation on Graphics Processors (GPGPU), 2010.
- (2010) 3rd Workshop on General-Purpose Computation on Graphics Processors (GPGPU)
- Danalis, A.¹ Marin, G.² McCurdy, C.³ Meredith, J.⁴ Roth, P.⁵ Spafford, K.⁶ Tipparaju, V.⁷ Vetter, J.⁸

13
- 70649092154
- Rodinia: A benchmark suite for heterogeneous computing
- S. Che, M. Boyer, J. Meng, D. Tarjan, J. Sheaffer, S.-H. Lee, and K. Skadron, "Rodinia: A benchmark suite for heterogeneous computing," in Int. Symp. on Workload Characterization (llSWC), 2009.
- (2009) Int. Symp. on Workload Characterization (LlSWC)
- Che, S.¹ Boyer, M.² Meng, J.³ Tarjan, D.⁴ Sheaffer, J.⁵ Lee, S.-H.⁶ Skadron, K.⁷

14
- 84861017830
- Opencl and the 13 dwarfs: A work in progress
- W-c. Feng, H. Lin, T. Scogland, and J. Zhang, "OpenCL and the 13 Dwarfs: A Work in Progress," in Int. Conf. on Peiformance Engineering (ICPE), 2012.
- (2012) Int. Conf. on Peiformance Engineering (ICPE)
- Feng, W.-C.¹ Lin, H.² Scogland, T.³ Zhang, J.⁴

15
- 84903825248
- A unified methodology for a fast benchmarking of parallel architecture
- A. Guerre, J.-T. Acquaviva, and Y. Lhuillier, "A unified methodology for a fast benchmarking of parallel architecture," in Design, Automation and Test in Europe (DATE), 2014.
- (2014) Design, Automation and Test in Europe (DATE)
- Guerre, A.¹ Acquaviva, J.-T.² Lhuillier, Y.³

16
- 84906342283
- Analyzing the energyefficiency of dense linear algebra kerneis by power-profiling a hybrid CPUIFPGA system
- H. Giefers, R. Polig, and C. Hagleitner, "Analyzing the energyefficiency of dense linear algebra kerneis by power-profiling a hybrid CPUIFPGA system," in Application-specijic Systems, Architectures and Processors (ASAP), 2014.
- (2014) Application-specijic Systems, Architectures and Processors (ASAP)
- Giefers, H.¹ Polig, R.² Hagleitner, C.³

17
- 84918776204
- The power-performance tradeoffs of the intel xeon phi on HPC applications
- B. Li, H.-C. Chang, S. L. Song, c.-Y. Su, T. Meyer, J. Mooring, and K. Cameron, "The Power-Performance Tradeoffs of the Intel Xeon Phi on HPC Applications," in Int. Workshop on Large Scale Parallel Processing (LSPP), 2014.
- (2014) Int. Workshop on Large Scale Parallel Processing (LSPP)
- Li, B.¹ Chang, H.-C.² Song, S.L.³ Su, C.-Y.⁴ Meyer, T.⁵ Mooring, J.⁶ Cameron, K.⁷

18
- 84863347222
- A performance analysis framework for identifying potential benefits in GPGPU applications
- J. Sim, A. Dasgupta, H. Kim, and R. Vuduc, "A performance analysis framework for identifying potential benefits in GPGPU applications," in Principles and Practice of Parallel Programming (PPoPP), 2012.
- (2012) Principles and Practice of Parallel Programming (PPoPP)
- Sim, J.¹ Dasgupta, A.² Kim, H.³ Vuduc, R.⁴

19
- 77952579552
- Demystifying GPU microarchitecture through microbenchmarking
- M. S.-A. H. Wong, M.-M. Papadopoulou and A. Moshovos, "Demystifying GPU microarchitecture through microbenchmarking," in Peiformance Analysis of Systems Software (ISPASS), 2010.
- (2010) Peiformance Analysis of Systems Software (ISPASS)
- Wong, M.S.-A.H.¹ Papadopoulou, M.-M.² Moshovos, A.³

20
- 84887917163
- CUSPARSE library: A set of basic linear algebrasubroutines for sparse matrices
- M. Naumov, L. S. Chien, P. Vandermersch, and U. Kapasi, "CUSPARSE Library: A Set of Basic Linear AlgebraSubroutines for Sparse Matrices," in GPU Technology Coriference, 2010.
- (2010) GPU Technology Coriference
- Naumov, M.¹ Chien, L.S.² Vandermersch, P.³ Kapasi, U.⁴

21
- 81355161778
- The university of Florida sparse matrix collection
- T. A. Davis and Y. Hu, "The University of Florida Sparse Matrix Collection," ACM Trans. Math. Softw., vol. 38, no. 1, 2011.
- (2011) ACM Trans. Math. Softw. , vol.38 , Issue.1
- Davis, T.A.¹ Hu, Y.²

22
- 84879835573
- Efficient sparse matrix-vector multiplication on x86-based many-core processors
- X. Liu, M. Smelyanskiy, E. Chow, and P. Dubey, "Efficient Sparse Matrix-vector Multiplication on x86-based Many-core Processors," in Int. Con! on Supercomputing (ISC), 2013.
- (2013) Int. Con! on Supercomputing (ISC)
- Liu, X.¹ Smelyanskiy, M.² Chow, E.³ Dubey, P.⁴

23
- 84891525228
- Performance evaluation of sparse matrix multiplication kerneis on intel xeon phi
- E. Saule, K. Kaya, and Ü. VataIyürek, "Performance Evaluation of Sparse Matrix Multiplication Kerneis on Intel Xeon Phi," in Parallel Processing and Applied Mathematics (PPAM), 2013.
- (2013) Parallel Processing and Applied Mathematics (PPAM)
- Saule, E.¹ Kaya, K.² Vataiyürek, U.³

24
- 77949382525
- FPGA vs. GPU for sparse matrix vector multiply
- Y. Zhang, Y. Shalabi, R. Jain, K. Nagar, and J. Bakos, "FPGA vs. GPU for sparse matrix vector multiply," in Field-Programmable Technology (FPT), 2009.
- (2009) Field-Programmable Technology (FPT)
- Zhang, Y.¹ Shalabi, Y.² Jain, R.³ Nagar, K.⁴ Bakos, J.⁵

25
- 77951180817
- Instruction set innovations for the convey HC-l computer
- T. Brewer, "Instruction Set Innovations for the Convey HC-l Computer," Micro, IEEE, vol. 30, no. 2, 2010.
- (2010) Micro, IEEE , vol.30 , Issue.2
- Brewer, T.¹

26
- 79958758626
- A sparse matrix personality for the convey HC-l
- K. Nagar and J. Bakos, " A Sparse Matrix Personality for the Convey HC-l," in Field-Programmable Custom Computing Machines (FCCM), 2011.
- (2011) Field-Programmable Custom Computing Machines (FCCM)
- Nagar, K.¹ Bakos, J.²

27
- 84864958615
- Towards a universal FPGA matrix-vector multiplication architecture
- S. K. Amd, John D. Davis and E. S. Chung, "Towards a Universal FPGA Matrix-Vector Multiplication Architecture," in Field-Programmable Custom Computing Machines (FCCM), 2012.
- (2012) Field-Programmable Custom Computing Machines (FCCM)
- Amd, S.K.¹ Davis, J.D.² Chung, E.S.³

28
- 84875673115
- Version 4.304.55 ed., NVIDIA Corp.
- NVML API REFERENCE MANUAL, Version 4.304.55 ed., NVIDIA Corp., 2012.
- (2012) NVML Api Reference Manual

29
- 84938811918
- Measuring GPU power with the K20 built-in sensor
- M. Burtscher, I. Zecena, and Z. Zong, "Measuring GPU Power with the K20 Built-in Sensor," in Proceedings ofWorkshop on General Purpose Processing Using GPUs, ser. GPGPU-7, 2014.
- (2014) Proceedings OfWorkshop on General Purpose Processing Using GPUs, Ser. GPGPU-7
- Burtscher, M.¹ Zecena, I.² Zong, Z.³

30
- 77957942221
- RAPL: Memory power estimation and capping
- H. David, E. Gorbatov, U. R. Hanebutte, R. Khanna, and C. Le, "RAPL: Memory Power Estimation and Capping," in Int. Symp. on Low Power Electronics and Design (ISLPED), 2010.
- (2010) Int. Symp. on Low Power Electronics and Design (ISLPED)
- David, H.¹ Gorbatov, E.² Hanebutte, U.R.³ Khanna, R.⁴ Le, C.⁵

31
- 84978635984
- Electronic Educational Devices, accessed: 2015-09-08
- Watts up? and Watts up? PRO Operators Manual, https://www. wattsupmeters.com, Electronic Educational Devices, accessed: 2015-09-08.
- Watts Up? and Watts Up? PRO Operators Manual

32
- 85043146402
- V2.1 ed., Standard Performance Evaluation Corporation (SPEC), SPEC Power and Performance Committee
- Power and Peiformance Benchmark Methodology, V2.1 ed., Standard Performance Evaluation Corporation (SPEC), SPEC Power and Performance Committee, 2012.
- (2012) Power and Peiformance Benchmark Methodology

33
- 85056434690
- V G. Oklobdzija, The Computer Engineering Handbook, 2001.
- (2001) The Computer Engineering Handbook
- Oklobdzija, V.G.¹

34
- 0034316092
- Poweraware microarchitecture: Design and modeling challenges for nextgeneration microprocessors
- D. Brooks, P. Bose, S. Schuster, H. Jacobson, P. Kudva, A. Buyuktosunoglu, J.-D. Wellman, V Zyuban, M. Gupta, and P. Cook, "PowerAware Microarchitecture: Design and Modeling Challenges for NextGeneration Microprocessors," IEEE Micro, vol. 20, no. 6, 2000.
- (2000) IEEE Micro , vol.20 , Issue.6
- Brooks, D.¹ Bose, P.² Schuster, S.³ Jacobson, H.⁴ Kudva, P.⁵ Buyuktosunoglu, A.⁶ Wellman, J.-D.⁷ Zyuban, V.⁸ Gupta, M.⁹ Cook, P.¹⁰

35
- 0030243819
- Energy dissipation in general purpose microprocessors
- R. Gonzalez and M. Horowitz, "Energy dissipation in general purpose microprocessors," Solid-State Circuits, vol. 31, no. 9, 1996.
- (1996) Solid-State Circuits , vol.31 , Issue.9
- Gonzalez, R.¹ Horowitz, M.²

36
- 0028736474
- Low-power digital design
- M. Horowitz, T. Indermaur, and R. Gonzalez, "Low-Power Digital Design," in IEEE Symp. Low Power Electronics, 1994.
- (1994) IEEE Symp. Low Power Electronics
- Horowitz, M.¹ Indermaur, T.² Gonzalez, R.³

37
- 84938840293
- Intel
- Intel Xeon Phi Coprocessor. Datasheet, Intel, 2014.
- (2014) Intel Xeon Phi Coprocessor. Datasheet

38
- 84978639078
- NVIDIA
- Tesla K20 GPU Accelerator, BD-06455-00Lv07, NVIDIA, 2013.
- (2013) Tesla K20 GPU Accelerator, BD-06455-00Lv07

39
- 84994848735
- Nallatech
- PCle-385N Product Brief, Nallatech, 2014.
- (2014) PCle-385N Product Brief

40
- 84894450601
- 13th ed., Altera Corp., Dec.
- Altera SDK for OpenCL. Programming Guide, 13th ed., Altera Corp., Dec. 2013.
- (2013) Altera SDK for OpenCL. Programming Guide

41
- 84978781254
- Intel
- Intel Xeon Processor E5-2600 v2. Product Brief, Intel, 2014.
- (2014) Intel Xeon Processor E5-2600 v2. Product Brief

42
- 84906719549
- Altera, Jul.
- Stratix V Device Handbook, Altera, Jul. 2014.
- (2014) Stratix V Device Handbook

43
- 84903765018
- cusparse, accessed: 2015-09-04. 56
- "NVIDIA CUDA Sparse Matrix library," https://developer.nvidia.coml cusparse, accessed: 2015-09-04. 56
- NVIDIA CUDA Sparse Matrix Library

44
- 0010828897
- B. Dally, P. Hanrahan, and R. Fedkiw, " A Streaming Supercomputer. Stanford Computer Systems Laboratory White Paper," 2001.
- (2001) A Streaming Supercomputer. Stanford Computer Systems Laboratory White Paper
- Dally, B.¹ Hanrahan, P.² Fedkiw, R.³

45
- 70450227686
- Performance evaluation of the sparse matrix-vector multiplication on modern architectures
- G. Goumas, K. Kourtis, N. Anastopoulos, V Karakasis, and N. Koziris, "Performance evaluation of the sparse matrix-vector multiplication on modern architectures," The Journal of Supercomputing, vol. 50, no. 1, 2009.
- (2009) The Journal of Supercomputing , vol.50 , Issue.1
- Goumas, G.¹ Kourtis, K.² Anastopoulos, N.³ Karakasis, V.⁴ Koziris, N.⁵

46
- 10044233808
- Ph.D. dissertation, University of California, Berkeley, CA, USA
- R. W Vuduc, "Automatie Performance Tuning of Sparse Matrix Kerneis," Ph.D. dissertation, University of California, Berkeley, CA, USA, 2004.
- (2004) Automatie Performance Tuning of Sparse Matrix Kerneis
- Vuduc, R.W.¹

47
- 0003550735
- SPARSKIT: A basic tool kit for sparse matrix computations
- version 2
- Y. Saad, "SPARSKIT: a basic tool kit for sparse matrix computations," Tech. Rep., 1994, version 2.
- (1994) Tech. Rep.
- Saad, Y.¹

48
- 84896855863
- YaSpMV: Yet another SpMV framework on GPUs
- S. Yan, C. Li, Y. Zhang, and H. Zhou, "yaSpMV: Yet Another SpMV Framework on GPUs," in Principles and Practice of Parallel Programming (PPoPP), 2014.
- (2014) Principles and Practice of Parallel Programming (PPoPP)
- Yan, S.¹ Li, C.² Zhang, Y.³ Zhou, H.⁴

49
- 84864051848
- ClSpMV: A cross-platform OpenCL SpMV framework on GPUs
- B.-Y. Su and K. Keutzer, "clSpMV: A Cross-Platform OpenCL SpMV Framework on GPUs," in Supercomputing (ISC), 2012.
- (2012) Supercomputing (ISC)
- Su, B.-Y.¹ Keutzer, K.²

50
- 84911360428
- A unified sparse matrix data format for efficient general sparse matrixvector multiply on modern processors with wide SIMD units
- M. Kreutzer, G. Hager, G. Wellein, H. Fehske, and A. R. Bishop, "A unified sparse matrix data format for efficient general sparse matrixvector multiply on modern processors with wide SIMD units," SIAM Journal on Scientific Computing, vol. 36, no. 5, 2014.
- (2014) SIAM Journal on Scientific Computing , vol.36 , Issue.5
- Kreutzer, M.¹ Hager, G.² Wellein, G.³ Fehske, H.⁴ Bishop, A.R.⁵

51
- 77749340082
- Model-driven autotuning of sparse matrix-vector multiply on GPUs
- J. W Choi, A. Singh, and R. W Vuduc, "Model-driven Autotuning of Sparse Matrix-vector Multiply on GPUs," in Principles and Practice of Parallel Programming (PPoPP), 2010.
- (2010) Principles and Practice of Parallel Programming (PPoPP)
- Choi, J.W.¹ Singh, A.² Vuduc, R.W.³

52
- 84978736009
- Tech. Rep., reference Guide
- R. Pozo and K. Remington, "SparseLib++ vl.5-Sparse Matrix Class Library," Tech. Rep., 1996, reference Guide.
- (1996) SparseLib++ vl.5-Sparse Matrix Class Library
- Pozo, R.¹ Remington, K.²

53
- 84891784800
- accessed: 2015-09-16
- T. A. Davis, "SuiteSparse: A Suite of Sparse Matrix Software," http: //www.suitesparse.com. accessed: 2015-09-16.
- SuiteSparse: A Suite of Sparse Matrix Software
- Davis, T.A.¹

54
- 84936931250
- Efficient sparse matrix-vector multiplication on GPUS using the csr storage format
- J. L. Greathouse and M. Daga, "Efficient sparse matrix-vector multiplication on gpus using the csr storage format," in High Performance Computing, Networking, Storage and Analysis (SC), 2014.
- (2014) High Performance Computing, Networking, Storage and Analysis (SC)
- Greathouse, J.L.¹ Daga, M.²

55
- 84983319724
- Stochastic matrix-function estimators: Scalable bigdata kerneis with high performance
- P. W J. Staar, P. K. Barkoutsos, R. Istrate, A. C. I. Malossi, I. Tavernelli, N. Moll, H. Giefers, C. Hagleitner, C. Bekas, and A. Curioni, "Stochastic Matrix-Function Estimators: Scalable BigData Kerneis with High Performance," in Parallel and Distributed Processing Symposium (IPDPS), 2016.
- (2016) Parallel and Distributed Processing Symposium (IPDPS)
- Staar, P.W.J.¹ Barkoutsos, P.K.² Istrate, R.³ Malossi, A.C.I.⁴ Tavernelli, I.⁵ Moll, N.⁶ Giefers, H.⁷ Hagleitner, C.⁸ Bekas, C.⁹ Curioni, A.¹⁰

56
- 0032683760
- The case for application-specific benchmarking
- M. Seltzer, D. Krinsky, K. Smith, and X. Zhang, "The Case for Application-Specific Benchmarking," in Hot Topics in Operating Systems (HOTOS), 1999.
- (1999) Hot Topics in Operating Systems (HOTOS)
- Seltzer, M.¹ Krinsky, D.² Smith, K.³ Zhang, X.⁴

57
- 79957493885
- Where is the data? Why you cannot debate CPU vs. GPU performance without the answer
- C. Gregg and K. Hazelwood, "Where is the Data? Why you Cannot Debate CPU vS. GPU Performance Without the Answer," in Peiformance Analysis of Systems and Software (ISPASS), 201l.
- Peiformance Analysis of Systems and Software (ISPASS)
- Gregg, C.¹ Hazelwood, K.²

58
- 84978770765
- Altera Corp., "IP Compiler for PCI Express," 2013.
- (2013) IP Compiler for PCI Express
- Altera Corp¹

59
- 84978656854
- -, "External Memory Interface Handbook," 2015.
- (2015) External Memory Interface Handbook
- Altera Corp¹

60
- 84978756887
- Floating-point megafunctions
- -, "Floating-Point Megafunctions," User Guide, 2013.
- (2013) User Guide
- Altera Corp¹

61
- 84978721750
- Floating-point operator v7.l
- Xilinx, Inc., "Floating-Point Operator v7.l," LogiCORE IP Product Guide, 2015.
- (2015) LogiCORE IP Product Guide
- Xilinx, Inc.,¹

62
- 84988038566
- A survey and evaluation of FPGA high-level synthesis tools
- Preprint
- R. Nane, V-Mo Sima, C. Pilato, J. Choi, B. Fort, A. Canis, Y. Chen, H. Hsiao, S. Brown, F. Ferrandi, J. Anderson, and K. Bertels, "A survey and evaluation of fpga high-level synthesis tools," ComputerAided Design of Integrated Circuits and Systems, IEEE Transactions on, 2016, Preprint.
- (2016) ComputerAided Design of Integrated Circuits and Systems, IEEE Transactions on
- Nane, R.¹ Sima, V.² Pilato, C.³ Choi, J.⁴ Fort, B.⁵ Canis, A.⁶ Chen, Y.⁷ Hsiao, H.⁸ Brown, S.⁹ Ferrandi, F.¹⁰ Anderson, J.¹¹ Bertels, K.¹²

63
- 84978767122
- Sdaccel development environment
- Xilinx, Inc., "SDAccel Development Environment," User Guide, 2015.
- (2015) User Guide
- Xilinx, Inc.,¹

64
- 84955581883
- Comparative analysis of opencl vs. HDL with image-processing kerneis on Stratix-V FPGA
- K. Hili, S. Craciun, A. George, and H. Lam, "Comparative analysis of OpenCL vs. HDL with image-processing kerneis on Stratix-V FPGA," in Application-specijic Systems, Architectures and Processors (ASAP), 2015.
- (2015) Application-specijic Systems, Architectures and Processors (ASAP)
- Hili, K.¹ Craciun, S.² George, A.³ Lam, H.⁴

65
- 47249127725
- The case for energy-proportional computing
- L. A. Barroso and U. Hölzle, "The Case for Energy-Proportional Computing," Computer, vol. 40, no. 12, 2007.
- (2007) Computer , vol.40 , Issue.12
- Barroso, L.A.¹ Hölzle, U.²

66
- 85021450123
- Energy aware consolidation for cloud computing
- S. Srikantaiah, A. Kansal, and F. Zhao, "Energy Aware Consolidation for Cloud Computing," in HotPower, 2008.
- (2008) HotPower
- Srikantaiah, S.¹ Kansal, A.² Zhao, F.³

67
- 84901242759
- A survey on techniques for improving the energy efficiency of large-scale distributed systems
- A.-C. Orgerie, M. D. d. Assuncao, and L. Lefevre, " A Survey on Techniques for Improving the Energy Efficiency of Large-scale Distributed Systems," ACM Comput. Surv., vol. 46, no. 4, 2014.
- (2014) ACM Comput. Surv. , vol.46 , Issue.4
- Orgerie, A.-C.¹ Assuncao, M.D.D.² Lefevre, L.³

68
- 84960124111
- Hewlett-Packard
- HP Moonshot System Family Guide, Hewlett-Packard, 2014.
- (2014) HP Moonshot System Family Guide

69
- 84940769996
- Energy-efficient microserver based on a 12-core l.8ghz 188k-coremark 28nm bulk CMOS 64b soc for big-data applications with 159gb/sll memory bandwidth system density
- R. Luijten, D. Pham, R. Clauberg, M. Cossale, H. Nguyen, and M. Pandya, "Energy-Efficient Microserver Based on a 12-Core l.8GHz 188K-CoreMark 28nm Bulk CMOS 64b SoC for Big-Data Applications with 159GB/slL Memory Bandwidth System Density," in SolidState Circuits Conference (ISSCC), 2015.
- (2015) SolidState Circuits Conference (ISSCC)
- Luijten, R.¹ Pham, D.² Clauberg, R.³ Cossale, M.⁴ Nguyen, H.⁵ Pandya, M.⁶

70
- 84964876912
- Performance and productivity evaluation of hybrid-threading hls versus hdls
- G. Wang, H. Lam, A. George, and G. Edwards, "Performance and Productivity Evaluation of Hybrid-Threading HLS versus HDLs," in High Peiformance Extreme Computing Coriference (HPEC), 2015.
- (2015) High Peiformance Extreme Computing Coriference (HPEC)
- Wang, G.¹ Lam, H.² George, A.³ Edwards, G.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.