-
2
-
-
38149022809
-
-
"Teraflops research chip," http://techresearch.intel.com/ articles/Tera-Scale/1449.htm.
-
Teraflops Research Chip
-
-
-
3
-
-
49749147124
-
Comparison of memory write policies for NoC based multicore cache coherent systems
-
P. G. de Massas and F. Pétrot, "Comparison of memory write policies for NoC based multicore cache coherent systems," Proc. of DATE, 2008.
-
Proc. of DATE, 2008
-
-
De Massas, P.G.1
Pétrot, F.2
-
4
-
-
0034848112
-
Route packets, not wires: On-chip interconnection networks
-
W. J. Dally and B. Towles, "Route packets, not wires: on-chip interconnection networks," Proc. of DAC, 2001.
-
Proc. of DAC, 2001
-
-
Dally, W.J.1
Towles, B.2
-
5
-
-
62349096250
-
Contention-aware application mapping for network-on-chip communication architectures
-
C. L. Chou and R. Marculescu, "Contention-aware application mapping for network-on-chip communication architectures," Proc. of ICCD, 2008.
-
Proc. of ICCD, 2008
-
-
Chou, C.L.1
Marculescu, R.2
-
6
-
-
79957493237
-
Distance associativity for high-performance energy-efficient non-uniform cache architectures
-
Z. Chishti et al., "Distance associativity for high-performance energy-efficient non-uniform cache architectures," Proc. of Micro, 2003.
-
Proc. of Micro, 2003
-
-
Chishti, Z.1
-
7
-
-
40349103382
-
An adaptive, non-uniform cache structure for wire-delay dominated on-chip caches
-
C. Kim et al., "An adaptive, non-uniform cache structure for wire-delay dominated on-chip caches," Proc. of ASPLOS, 2002.
-
Proc. of ASPLOS, 2002
-
-
Kim, C.1
-
12
-
-
33748870886
-
Multifacet's general execution-driven multiprocessor simulator (GEMS) toolset
-
M. M. K. Martin et al., "Multifacet's general execution-driven multiprocessor simulator (GEMS) toolset," SIGARCH Comput. Archit. News, 2005.
-
(2005)
SIGARCH Comput. Archit. News
-
-
Martin, M.M.K.1
-
13
-
-
0036469676
-
Simics: A full system simulation platform
-
P. S. Magnusson et al., "Simics: A full system simulation platform," IEEE Computer, 2002.
-
(2002)
IEEE Computer
-
-
Magnusson, P.S.1
-
15
-
-
0003450887
-
-
Technical Report, Western Research Laboratory
-
P. Shivakumar and N. P. Jouppi, "Cacti 3.0: An integrated cache timing, power, and area model," Technical Report, Western Research Laboratory.
-
Cacti 3.0: An Integrated Cache Timing, Power, and Area Model
-
-
Shivakumar, P.1
Jouppi, N.P.2
-
19
-
-
0000459730
-
Combining loop transformations considering caches and scheduling
-
M. E. Wolf et al., "Combining loop transformations considering caches and scheduling," Proc. of MICRO, 1996.
-
Proc. of MICRO, 1996
-
-
Wolf, M.E.1
-
20
-
-
79957531269
-
Application mapping for chip multiprocessors
-
G. Chen et al., "Application mapping for chip multiprocessors," Proc. of DAC, 2008.
-
Proc. of DAC, 2008
-
-
Chen, G.1
-
21
-
-
16244409520
-
Multi-objective mapping for mesh-based noc architectures
-
G. Ascia et al., "Multi-objective mapping for mesh-based noc architectures," Proc. of CODES+ISSS, 2004.
-
Proc. of CODES+ISSS, 2004
-
-
Ascia, G.1
-
22
-
-
34547538303
-
A flexible data to L2 cache mapping approach for future multicore processors
-
L. Jin et al., "A flexible data to L2 cache mapping approach for future multicore processors," Proc. of MSPC, 2006.
-
Proc. of MSPC, 2006
-
-
Jin, L.1
-
23
-
-
79960161840
-
Cache topology aware computation mapping for multicores
-
M. Kandemir et al., "Cache topology aware computation mapping for multicores," Proc. of PLDI, 2010.
-
Proc. of PLDI, 2010
-
-
Kandemir, M.1
-
24
-
-
79957446180
-
A modular simulation framework for spatial and temporal task mapping onto multi-processor SoC platform
-
T. Kempf et al., "A modular simulation framework for spatial and temporal task mapping onto multi-processor SoC platform," Proc. of DATE.
-
Proc. of DATE
-
-
Kempf, T.1
-
25
-
-
34047117937
-
Communication-aware allocation and scheduling framework for stream-oriented multiprocessor systems-on-chip
-
M. Ruggiero et al., "Communication-aware allocation and scheduling framework for stream-oriented multiprocessor systems-on-chip," Proc. of DATE, 2006.
-
Proc. of DATE, 2006
-
-
Ruggiero, M.1
-
26
-
-
34547183989
-
Integrated scratchpad memory optimization and task scheduling for MPSOC architectures
-
V. Suhendra et al., "Integrated scratchpad memory optimization and task scheduling for MPSOC architectures," Proc. of CASES, 2006.
-
Proc. of CASES, 2006
-
-
Suhendra, V.1
-
27
-
-
33847213882
-
Mapping applications to NoC platforms with multithreaded processor resources
-
R. Pop and S. Kumar, "Mapping applications to NoC platforms with multithreaded processor resources." The NORCHIP Conference, 2005.
-
The NORCHIP Conference, 2005
-
-
Pop, R.1
Kumar, S.2
-
28
-
-
77749302593
-
Scheduling threads for constructive cache sharing on CMPs
-
S. Chen et al., "Scheduling threads for constructive cache sharing on CMPs," Proc. of SPAA, 2007.
-
Proc. of SPAA, 2007
-
-
Chen, S.1
-
30
-
-
77954741851
-
Compiler techniques for reducing data cache miss rate on a multithreaded architecture
-
S. Sarkar and D. M. Tullsen, "Compiler techniques for reducing data cache miss rate on a multithreaded architecture," Proc. of HiPEAC, 2008.
-
Proc. of HiPEAC, 2008
-
-
Sarkar, S.1
Tullsen, D.M.2
-
31
-
-
76749137634
-
Optimizing shared cache behavior of chip multiprocessors
-
M. Kandemir et al., "Optimizing shared cache behavior of chip multiprocessors," Proc. of MICRO, 2009.
-
Proc. of MICRO, 2009
-
-
Kandemir, M.1
-
32
-
-
77954733665
-
Using processor affinity in loop scheduling on shared-memory multiprocessors
-
E. P. Markatos and T. J. LeBlanc, "Using processor affinity in loop scheduling on shared-memory multiprocessors," Proc. of IPDPS, 1994.
-
Proc. of IPDPS, 1994
-
-
Markatos, E.P.1
LeBlanc, T.J.2
-
33
-
-
79957437565
-
Data access partitioning for fine-grain parallelism on multicore architectures
-
M. Chu et al., "Data access partitioning for fine-grain parallelism on multicore architectures," Proc. of Micro, 2007.
-
Proc. of Micro, 2007
-
-
Chu, M.1
-
34
-
-
70449628310
-
Data layout transformation for enhancing data locality on NUCA chip multiprocessors
-
A. Lu et al., "Data layout transformation for enhancing data locality on NUCA chip multiprocessors," Proc. of PACT, 2009.
-
Proc. of PACT, 2009
-
-
Lu, A.1
-
35
-
-
77957563463
-
Does cache sharing on modern CMP matter to the performance of contemporary multithreaded programs?
-
E. Zhang et al., "Does cache sharing on modern CMP matter to the performance of contemporary multithreaded programs?" Proc. of PPOPP, 2010.
-
Proc. of PPOPP, 2010
-
-
Zhang, E.1
-
36
-
-
27544432313
-
Optimizing replication, communication, and capacity allocation in CMPs
-
Z. Chishti et al., "Optimizing replication, communication, and capacity allocation in CMPs," Proc. of ISCA, 2005.
-
Proc. of ISCA, 2005
-
-
Chishti, Z.1
-
37
-
-
76749139374
-
A hierarchical model of data locality
-
C. Zhang et al., "A hierarchical model of data locality," Proc. of POPL, 2006.
-
Proc. of POPL, 2006
-
-
Zhang, C.1
|