-
2
-
-
70649092154
-
Rodinia: A benchmark suite for heterogeneous computing
-
S. Che et al. Rodinia: A benchmark suite for heterogeneous computing. In Intl. Symp. Workload Characterization, 2009.
-
(2009)
Intl. Symp. Workload Characterization
-
-
Che, S.1
-
3
-
-
83155184570
-
Dymaxion: Optimizing memory access patterns for heterogeneous systems
-
S. Che, J.W. Sheaffer, and K. Skadron. Dymaxion: Optimizing memory access patterns for heterogeneous systems. In Intl. Conf. High Performance Computing, Networking, Storage and Analysis, 2011.
-
(2011)
Intl. Conf. High Performance Computing, Networking, Storage and Analysis
-
-
Che, S.1
Sheaffer, J.W.2
Skadron, K.3
-
4
-
-
79951707102
-
Memory latency reduction via thread throttling
-
H.-Y. Cheng et al. Memory latency reduction via thread throttling. In Intl. Symp. Microarchitecture, 2010.
-
(2010)
Intl. Symp. Microarchitecture
-
-
Cheng, H.-Y.1
-
8
-
-
80052533471
-
Energy-efficient mechanisms for managing thread context in throughput processors
-
M. Gebhart et al. Energy-efficient mechanisms for managing thread context in throughput processors. In Intl. Symp. Computer Architecture, 2011.
-
(2011)
Intl. Symp. Computer Architecture
-
-
Gebhart, M.1
-
9
-
-
84870700173
-
Auto-tuning a high-level language targeted to GPU codes
-
S. Grauer-Gray et al. Auto-tuning a high-level language targeted to GPU codes. In Innovative Parallel Computing, 2012.
-
(2012)
Innovative Parallel Computing
-
-
Grauer-Gray, S.1
-
10
-
-
67650635164
-
Many-core vs many-thread machines: Stay away from the valley
-
January-June
-
Z. Guz et al. Many-core vs. many-thread machines: Stay away from the valley. IEEE Computer Architecture Letters, 8(1):25-28, January-June 2009.
-
(2009)
IEEE Computer Architecture Letters
, vol.8
, Issue.1
, pp. 25-28
-
-
Guz, Z.1
-
11
-
-
0030677581
-
The design and analysis of a cache architecture for texture mapping
-
Z. S. Hakura and A. Gupta. The design and analysis of a cache architecture for texture mapping. In Intl. Symp. Computer Architecture, 1997.
-
(1997)
Intl. Symp. Computer Architecture
-
-
Hakura, Z.S.1
Gupta, A.2
-
12
-
-
77954998134
-
High performance cache replacement using re-reference interval prediction (RRIP)
-
A. Jaleel et al. High performance cache replacement using re-reference interval prediction (RRIP). In Intl. Symp. on Computer Architecture, 2010.
-
(2010)
Intl. Symp. on Computer Architecture
-
-
Jaleel, A.1
-
14
-
-
84904014280
-
Many-thread aware prefetching mechanisms for GPGPU applications
-
J. Lee et al. Many-thread aware prefetching mechanisms for GPGPU applications. In Intl. Symp. Microarchitecture, 2010.
-
(2010)
Intl. Symp. Microarchitecture
-
-
Lee, J.1
-
16
-
-
77950987305
-
Avoiding cache thrashing due to private data placement in last-level cache for manycore scaling
-
J. Meng and K. Skadron. Avoiding cache thrashing due to private data placement in last-level cache for manycore scaling. In Intl. Conf. Computer Design, 2009.
-
(2009)
Intl. Conf. Computer Design
-
-
Meng, J.1
Skadron, K.2
-
17
-
-
77954976292
-
Dynamic warp subdivision for integrated branch and memory divergence tolerance
-
J. Meng, D. Tarjan, and K. Skadron. Dynamic warp subdivision for integrated branch and memory divergence tolerance. In Intl. Symp. Computer Architecture, 2010.
-
(2010)
Intl. Symp. Computer Architecture
-
-
Meng, J.1
Tarjan, D.2
Skadron, K.3
-
19
-
-
84863342255
-
Improving GPU performance via large warps and two-level warp scheduling
-
V. Narasiman et al. Improving GPU performance via large warps and two-level warp scheduling. In Intl. Symp. Microarchitecture, 2011.
-
(2011)
Intl. Symp. Microarchitecture
-
-
Narasiman, V.1
-
22
-
-
0003850954
-
-
Prentice Hall, 2nd edition
-
J. M. Rabaey, A. Chandrakasan, and B. Nikolic. Digital Integrated Circuits: A Design Perspective, chapter 12. Prentice Hall, 2nd edition, 2003.
-
(2003)
Digital Integrated Circuits: A Design Perspective, Chapter 12
-
-
Rabaey, J.M.1
Chandrakasan, A.2
Nikolic, B.3
|