-
6
-
-
70649092154
-
Rodinia: A benchmark suite for heterogeneous computing
-
Austin, TX, USA
-
S. Che et al. Rodinia: A Benchmark Suite for Heterogeneous Computing. In International Symposium on Workload Characterization (IISWC), pages 44-54, Austin, TX, USA, 2009.
-
(2009)
International Symposium on Workload Characterization (IISWC)
, pp. 44-54
-
-
Che, S.1
-
7
-
-
79951707102
-
Memory latency reduction via thread throttling
-
Atlanta, Georgia, USA
-
H.-Y. Cheng et al. Memory Latency Reduction via Thread Throttling. In International Symposium on Microarchitecture (MICRO), pages 53-64, Atlanta, Georgia, USA, 2010.
-
(2010)
International Symposium on Microarchitecture (MICRO)
, pp. 53-64
-
-
Cheng, H.-Y.1
-
9
-
-
47349104432
-
Dynamic warp formation and scheduling for efficient gpu control flow
-
Chicago, Illinois, USA
-
W. W. L. Fung et al. Dynamic Warp Formation and Scheduling for Efficient GPU Control Flow. In International Symposium on Microarchitecture (MICRO), pages 407-420, Chicago, Illinois, USA, 2007.
-
(2007)
International Symposium on Microarchitecture (MICRO)
, pp. 407-420
-
-
Fung, W.W.L.1
-
10
-
-
53749092570
-
Parallel computing experiences with cuda
-
M. Garland et al. Parallel Computing Experiences with CUDA. Micro, IEEE, 28(4), 2008.
-
(2008)
Micro, IEEE
, vol.28
, Issue.4
-
-
Garland, M.1
-
11
-
-
80052533471
-
Energy-efficient mechanisms for managing thread context in throughput processors
-
San Jose, California, USA
-
M. Gebhart et al. Energy-Efficient Mechanisms for Managing Thread Context in Throughput Processors. In International Symposium on Computer architecture (ISCA), pages 235-246, San Jose, California, USA, 2011.
-
(2011)
International Symposium on Computer Architecture (ISCA)
, pp. 235-246
-
-
Gebhart, M.1
-
18
-
-
84881151222
-
GPUWattch: Enabling energy optimizations in gpgpus
-
Tel-Aviv, Israel
-
J. Leng et al. GPUWattch: Enabling Energy Optimizations in GPGPUs. In International Symposium on Computer Architecture (ISCA), pages 487-498, Tel-Aviv, Israel, 2013.
-
(2013)
International Symposium on Computer Architecture (ISCA)
, pp. 487-498
-
-
Leng, J.1
-
20
-
-
84863342255
-
Improving gpu performance via large warps and two-level warp scheduling
-
Porto Alegre, Brazil
-
V. Narasiman et al. Improving GPU Performance via Large Warps and Two-Level Warp Scheduling. In International Symposium on Microarchitecture (MICRO), pages 308-317, Porto Alegre, Brazil, 2011.
-
(2011)
International Symposium on Microarchitecture (MICRO)
, pp. 308-317
-
-
Narasiman, V.1
-
23
-
-
84904009579
-
-
NVIDIA. Fermi: NVIDIA's Next Generation CUDA Compute Architecture
-
NVIDIA. Fermi: NVIDIA's Next Generation CUDA Compute Architecture, 2011.
-
(2011)
-
-
-
26
-
-
84904009580
-
-
NVIDIA. NVIDIA PerfKit: NVIDIA Performance Toolkit, 2013
-
NVIDIA. NVIDIA PerfKit: NVIDIA Performance Toolkit, 2013.
-
-
-
-
30
-
-
84888866287
-
Parboil: A revised benchmark suite for scientific and commercial throughput computing
-
J. Stratton et al. Parboil: A Revised Benchmark Suite for Scientific and Commercial Throughput Computing. Center for Reliable and High-Performance Computing, 2012.
-
(2012)
Center for Reliable and High-Performance Computing
-
-
Stratton, J.1
|