SCOPUS 정보 검색 플랫폼

Proceedings - International Symposium on Code Generation and Optimization, CGO 2011

Volumn , Issue , 2011, Pages 191-200

Neighborhood-aware data locality optimization for NoC-based multicores

(4) Kandemir, Mahmut a Zhang, Yuanrui a Liu, Jun a Yemliha, Taylan b

a The Center for Nanotechnology Education and Utilization (United States)

b Syracuse University (United States)

Author keywords

[No Author keywords available]

Indexed keywords

APPLICATION PROGRAMS; COMPILER ALGORITHMS; CRITICAL ISSUES; DATA ACCESS; DATA LOCALITY; DATA LOCALITY OPTIMIZATION; DATA MOVEMENTS; DATA REUSE; MULTI CORE; MULTI-CORE SYSTEMS; MULTI-CORES; MULTI-THREADED APPLICATION; NETWORK ON CHIP; ON-CHIP CACHE; OVERALL EXECUTION;

EXPERIMENTS; NETWORK COMPONENTS; PROGRAM COMPILERS; SCHEDULING ALGORITHMS; SENSITIVITY ANALYSIS; VLSI CIRCUITS;

OPTIMIZATION;

EID: 79957447964 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/CGO.2011.5764687 Document Type: Conference Paper

Times cited : (10)

References (39)

1
- 78049338046
- "Single-chip cloud computer," http://techresearch.intel.com/ articles/Tera-Scale/1826.htm.
- Single-chip Cloud Computer

2
- 38149022809
- "Teraflops research chip," http://techresearch.intel.com/ articles/Tera-Scale/1449.htm.
- Teraflops Research Chip

3
- 49749147124
- Comparison of memory write policies for NoC based multicore cache coherent systems
- P. G. de Massas and F. Pétrot, "Comparison of memory write policies for NoC based multicore cache coherent systems," Proc. of DATE, 2008.
- Proc. of DATE, 2008
- De Massas, P.G.¹ Pétrot, F.²

4
- 0034848112
- Route packets, not wires: On-chip interconnection networks
- W. J. Dally and B. Towles, "Route packets, not wires: on-chip interconnection networks," Proc. of DAC, 2001.
- Proc. of DAC, 2001
- Dally, W.J.¹ Towles, B.²

5
- 62349096250
- Contention-aware application mapping for network-on-chip communication architectures
- C. L. Chou and R. Marculescu, "Contention-aware application mapping for network-on-chip communication architectures," Proc. of ICCD, 2008.
- Proc. of ICCD, 2008
- Chou, C.L.¹ Marculescu, R.²

6
- 79957493237
- Distance associativity for high-performance energy-efficient non-uniform cache architectures
- Z. Chishti et al., "Distance associativity for high-performance energy-efficient non-uniform cache architectures," Proc. of Micro, 2003.
- Proc. of Micro, 2003
- Chishti, Z.¹

7
- 40349103382
- An adaptive, non-uniform cache structure for wire-delay dominated on-chip caches
- C. Kim et al., "An adaptive, non-uniform cache structure for wire-delay dominated on-chip caches," Proc. of ASPLOS, 2002.
- Proc. of ASPLOS, 2002
- Kim, C.¹

8
- 21644472427
- Managing wire delay in large chip-multiprocessor caches
- B. M. Beckmann and D. A. Wood, "Managing wire delay in large chip-multiprocessor caches," Proc. of Micro, 2004.
- Proc. of Micro, 2004
- Beckmann, B.M.¹ Wood, D.A.²

9
- 0004112961
- Addison Wesley
- J. F. Kurose and K. W. Ross, "Computer networking: A top-down approach featuring the internet," Addison Wesley, 2003.
- (2003) Computer Networking: A Top-down Approach Featuring the Internet
- Kurose, J.F.¹ Ross, K.W.²

10
- 0003904906
- Technical Report, University of Maryland
- W. Kelly et al., "The omega library interface guide," Technical Report, University of Maryland, 1995.
- (1995) The Omega Library Interface Guide
- Kelly, W.¹

11
- 77954723427
- Technical Report, Microsoft
- "Phoenix compiler infrastructure," Technical Report, Microsoft.
- Phoenix Compiler Infrastructure

12
- 33748870886
- Multifacet's general execution-driven multiprocessor simulator (GEMS) toolset
- M. M. K. Martin et al., "Multifacet's general execution-driven multiprocessor simulator (GEMS) toolset," SIGARCH Comput. Archit. News, 2005.
- (2005) SIGARCH Comput. Archit. News
- Martin, M.M.K.¹

13
- 0036469676
- Simics: A full system simulation platform
- P. S. Magnusson et al., "Simics: A full system simulation platform," IEEE Computer, 2002.
- (2002) IEEE Computer
- Magnusson, P.S.¹

14
- 66749099556
- Technical Report, Princeton University
- N. Agarwal et al., "Garnet: A detailed interconnection network model inside a full-system simulation framework," Technical Report, Princeton University.
- Garnet: A Detailed Interconnection Network Model Inside A Full-system Simulation Framework
- Agarwal, N.¹

15
- 0003450887
- Technical Report, Western Research Laboratory
- P. Shivakumar and N. P. Jouppi, "Cacti 3.0: An integrated cache timing, power, and area model," Technical Report, Western Research Laboratory.
- Cacti 3.0: An Integrated Cache Timing, Power, and Area Model
- Shivakumar, P.¹ Jouppi, N.P.²

16
- 51549095074
- Technical Report, Princeton University
- C. Bienia et al., "The PARSEC benchmark suite: characterization and architectural implications," Technical Report, Princeton University, 2008.
- (2008) The PARSEC Benchmark Suite: Characterization and Architectural Implications
- Bienia, C.¹

17
- 67650816174
- V. Aslot et al., "SPEComp: A new benchmark suite for measuring parallel computer performance," 2001.
- (2001) SPEComp: A New Benchmark Suite for Measuring Parallel Computer Performance
- Aslot, V.¹

18
- 0003605996
- The NAS parallel benchmarks
- D. Weeratunga et al., "The NAS parallel benchmarks," NAS Technical Report, 1994.
- (1994) NAS Technical Report
- Weeratunga, D.¹

19
- 0000459730
- Combining loop transformations considering caches and scheduling
- M. E. Wolf et al., "Combining loop transformations considering caches and scheduling," Proc. of MICRO, 1996.
- Proc. of MICRO, 1996
- Wolf, M.E.¹

20
- 79957531269
- Application mapping for chip multiprocessors
- G. Chen et al., "Application mapping for chip multiprocessors," Proc. of DAC, 2008.
- Proc. of DAC, 2008
- Chen, G.¹

21
- 16244409520
- Multi-objective mapping for mesh-based noc architectures
- G. Ascia et al., "Multi-objective mapping for mesh-based noc architectures," Proc. of CODES+ISSS, 2004.
- Proc. of CODES+ISSS, 2004
- Ascia, G.¹

22
- 34547538303
- A flexible data to L2 cache mapping approach for future multicore processors
- L. Jin et al., "A flexible data to L2 cache mapping approach for future multicore processors," Proc. of MSPC, 2006.
- Proc. of MSPC, 2006
- Jin, L.¹

23
- 79960161840
- Cache topology aware computation mapping for multicores
- M. Kandemir et al., "Cache topology aware computation mapping for multicores," Proc. of PLDI, 2010.
- Proc. of PLDI, 2010
- Kandemir, M.¹

24
- 79957446180
- A modular simulation framework for spatial and temporal task mapping onto multi-processor SoC platform
- T. Kempf et al., "A modular simulation framework for spatial and temporal task mapping onto multi-processor SoC platform," Proc. of DATE.
- Proc. of DATE
- Kempf, T.¹

25
- 34047117937
- Communication-aware allocation and scheduling framework for stream-oriented multiprocessor systems-on-chip
- M. Ruggiero et al., "Communication-aware allocation and scheduling framework for stream-oriented multiprocessor systems-on-chip," Proc. of DATE, 2006.
- Proc. of DATE, 2006
- Ruggiero, M.¹

26
- 34547183989
- Integrated scratchpad memory optimization and task scheduling for MPSOC architectures
- V. Suhendra et al., "Integrated scratchpad memory optimization and task scheduling for MPSOC architectures," Proc. of CASES, 2006.
- Proc. of CASES, 2006
- Suhendra, V.¹

27
- 33847213882
- Mapping applications to NoC platforms with multithreaded processor resources
- R. Pop and S. Kumar, "Mapping applications to NoC platforms with multithreaded processor resources." The NORCHIP Conference, 2005.
- The NORCHIP Conference, 2005
- Pop, R.¹ Kumar, S.²

28
- 77749302593
- Scheduling threads for constructive cache sharing on CMPs
- S. Chen et al., "Scheduling threads for constructive cache sharing on CMPs," Proc. of SPAA, 2007.
- Proc. of SPAA, 2007
- Chen, S.¹

29
- 49749086465
- User-aware dynamic task allocation in networks-on-chip
- C. L. Chou and R. Marculescu, "User-aware dynamic task allocation in networks-on-chip," Proc. of DATE, 2008.
- Proc. of DATE, 2008
- Chou, C.L.¹ Marculescu, R.²

30
- 77954741851
- Compiler techniques for reducing data cache miss rate on a multithreaded architecture
- S. Sarkar and D. M. Tullsen, "Compiler techniques for reducing data cache miss rate on a multithreaded architecture," Proc. of HiPEAC, 2008.
- Proc. of HiPEAC, 2008
- Sarkar, S.¹ Tullsen, D.M.²

31
- 76749137634
- Optimizing shared cache behavior of chip multiprocessors
- M. Kandemir et al., "Optimizing shared cache behavior of chip multiprocessors," Proc. of MICRO, 2009.
- Proc. of MICRO, 2009
- Kandemir, M.¹

32
- 77954733665
- Using processor affinity in loop scheduling on shared-memory multiprocessors
- E. P. Markatos and T. J. LeBlanc, "Using processor affinity in loop scheduling on shared-memory multiprocessors," Proc. of IPDPS, 1994.
- Proc. of IPDPS, 1994
- Markatos, E.P.¹ LeBlanc, T.J.²

33
- 79957437565
- Data access partitioning for fine-grain parallelism on multicore architectures
- M. Chu et al., "Data access partitioning for fine-grain parallelism on multicore architectures," Proc. of Micro, 2007.
- Proc. of Micro, 2007
- Chu, M.¹

34
- 70449628310
- Data layout transformation for enhancing data locality on NUCA chip multiprocessors
- A. Lu et al., "Data layout transformation for enhancing data locality on NUCA chip multiprocessors," Proc. of PACT, 2009.
- Proc. of PACT, 2009
- Lu, A.¹

35
- 77957563463
- Does cache sharing on modern CMP matter to the performance of contemporary multithreaded programs?
- E. Zhang et al., "Does cache sharing on modern CMP matter to the performance of contemporary multithreaded programs?" Proc. of PPOPP, 2010.
- Proc. of PPOPP, 2010
- Zhang, E.¹

36
- 27544432313
- Optimizing replication, communication, and capacity allocation in CMPs
- Z. Chishti et al., "Optimizing replication, communication, and capacity allocation in CMPs," Proc. of ISCA, 2005.
- Proc. of ISCA, 2005
- Chishti, Z.¹

37
- 76749139374
- A hierarchical model of data locality
- C. Zhang et al., "A hierarchical model of data locality," Proc. of POPL, 2006.
- Proc. of POPL, 2006
- Zhang, C.¹

38
- 0037952146
- Morgan Kaufmann
- R. Allen and K. Kennedy, Optimizing compilers for modern architectures: A dependence-based approach. Morgan Kaufmann.
- Optimizing Compilers for Modern Architectures: A Dependence-based Approach
- Allen, R.¹ Kennedy, K.²

39
- 0003927035
- Addison-Wesley Publishing Company
- M. Wolf, High-performance compilers for parallel computing. Addison-Wesley Publishing Company.
- High-performance Compilers for Parallel Computing
- Wolf, M.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.