[3] C. Augonnet et al. StarPU: A unified platform for task scheduling on heterogeneous multicore architectures. CCPE, 23(2):187-198, 2011.
[5] A. M. Caulfield et al. A cloud-scale acceleration architecture. In ISCA, 2016.
[6] L.-W. Chang et al. Efficient kernel synthesis for performance portable programming. In MICRO, 2016.
[7] L.-W. Chang et al. A programming system for future proofing performance critical libraries. In PPoPP, 2016.
[8] E. S. Chung et al. Single-chip heterogeneous computing: Does the future include custom logic, FPGAs, and GPGPUs? In MICRO, 2010.
[9] A. Duran et al. OmpSs: A proposal for programming heterogeneous multi-core architectures. PPL, 21(2):173-193, 2011.
[10] V. Garcia-Flores et al. Evaluating the effect of last-level cache sharing on integrated GPU-CPU systems with heterogeneous applications. In IISWC, 2016.
[11] J. Gómez-Luna et al. Chai: Collaborative heterogeneous applications for integrated-architectures. In ISPASS, 2017 (in press).
[13] Intel. Intel FPGA SDK for OpenCL. Programming Guide, October 2016.
[14] A. Morad et al. Generalized multiAmdahl: Optimization of heterogeneous multi-accelerator SoC. IEEE CAL, 13(1):37-40, Jan. 2014.
[15] S. Mukherjee et al. Exploring the features of OpenCL 2.0. In IWOCL, 2015.
[16] S. Mukherjee et al. A comprehensive performance analysis of HSA and OpenCL 2.0. In ISPASS, 2016.
[17] NVIDIA. NVIDIA Tesla P100. White paper, 2016.
[19] A. Putnam et al. A reconfigurable fabric for accelerating large-scale datacenter services. In ISCA, 2014.
[20] J. Ragan-Kelley et al. Halide: A language and compiler for optimizing parallelism, locality, and recomputation in image processing pipelines. In PLDI, 2013.
[21] Y. Sun et al. Hetero-Mark, a benchmark suite for CPU-GPU collaborative computing. In IISWC, 2016.
[22] Xilinx. Zynq UltraScale+ MPSoCs. White paper, June 2016.