2011, Pages 191-200

Neighborhood-aware data locality optimization for NoC-based multicores

Author keywords

[No Author keywords available]

Indexed keywords

APPLICATION PROGRAMS; COMPILER ALGORITHMS; CRITICAL ISSUES; DATA ACCESS; DATA LOCALITY; DATA LOCALITY OPTIMIZATION; DATA MOVEMENTS; DATA REUSE; MULTI CORE; MULTI-CORE SYSTEMS; MULTI-CORES; MULTI-THREADED APPLICATION; NETWORK ON CHIP; ON-CHIP CACHE; OVERALL EXECUTION;

EID: 79957447964     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/CGO.2011.5764687     Document Type: Conference Paper
Times cited: 10

References (39)
  • 2
    • 38149022809
    • "Teraflops research chip," http://techresearch.intel.com/articles/Tera-Scale/1449.htm.
    • Teraflops Research Chip
  • 3
    • 49749147124
    • Comparison of memory write policies for NoC based multicore cache coherent systems
    • P. G. de Massas and F. Pétrot, "Comparison of memory write policies for NoC based multicore cache coherent systems," Proc. of DATE, 2008.
    • Proc. of DATE, 2008
    • De Massas, P.G.1    Pétrot, F.2
  • 4
    • 0034848112
    • Route packets, not wires: On-chip interconnection networks
    • W. J. Dally and B. Towles, "Route packets, not wires: on-chip interconnection networks," Proc. of DAC, 2001.
    • Proc. of DAC, 2001
    • Dally, W.J.1    Towles, B.2
  • 5
    • 62349096250
    • Contention-aware application mapping for network-on-chip communication architectures
    • C. L. Chou and R. Marculescu, "Contention-aware application mapping for network-on-chip communication architectures," Proc. of ICCD, 2008.
    • Proc. of ICCD, 2008
    • Chou, C.L.1    Marculescu, R.2
  • 6
    • 79957493237
    • Distance associativity for high-performance energy-efficient non-uniform cache architectures
    • Z. Chishti et al., "Distance associativity for high-performance energy-efficient non-uniform cache architectures," Proc. of Micro, 2003.
    • Proc. of Micro, 2003
    • Chishti, Z.1
  • 7
    • 40349103382
    • An adaptive, non-uniform cache structure for wire-delay dominated on-chip caches
    • C. Kim et al., "An adaptive, non-uniform cache structure for wire-delay dominated on-chip caches," Proc. of ASPLOS, 2002.
    • Proc. of ASPLOS, 2002
    • Kim, C.1
  • 8
    • 21644472427
    • Managing wire delay in large chip-multiprocessor caches
    • B. M. Beckmann and D. A. Wood, "Managing wire delay in large chip-multiprocessor caches," Proc. of Micro, 2004.
    • Proc. of Micro, 2004
    • Beckmann, B.M.1    Wood, D.A.2
  • 12
    • 33748870886
    • Multifacet's general execution-driven multiprocessor simulator (GEMS) toolset
    • M. M. K. Martin et al., "Multifacet's general execution-driven multiprocessor simulator (GEMS) toolset," SIGARCH Comput. Archit. News, 2005.
    • (2005) SIGARCH Comput. Archit. News
    • Martin, M.M.K.1
  • 13
    • 0036469676
    • Simics: A full system simulation platform
    • P. S. Magnusson et al., "Simics: A full system simulation platform," IEEE Computer, 2002.
    • (2002) IEEE Computer
    • Magnusson, P.S.1
  • 19
    • 0000459730
    • Combining loop transformations considering caches and scheduling
    • M. E. Wolf et al., "Combining loop transformations considering caches and scheduling," Proc. of MICRO, 1996.
    • Proc. of MICRO, 1996
    • Wolf, M.E.1
  • 20
    • 79957531269
    • Application mapping for chip multiprocessors
    • G. Chen et al., "Application mapping for chip multiprocessors," Proc. of DAC, 2008.
    • Proc. of DAC, 2008
    • Chen, G.1
  • 21
    • 16244409520
    • Multi-objective mapping for mesh-based NoC architectures
    • G. Ascia et al., "Multi-objective mapping for mesh-based NoC architectures," Proc. of CODES+ISSS, 2004.
    • Proc. of CODES+ISSS, 2004
    • Ascia, G.1
  • 22
    • 34547538303
    • A flexible data to L2 cache mapping approach for future multicore processors
    • L. Jin et al., "A flexible data to L2 cache mapping approach for future multicore processors," Proc. of MSPC, 2006.
    • Proc. of MSPC, 2006
    • Jin, L.1
  • 23
    • 79960161840
    • Cache topology aware computation mapping for multicores
    • M. Kandemir et al., "Cache topology aware computation mapping for multicores," Proc. of PLDI, 2010.
    • Proc. of PLDI, 2010
    • Kandemir, M.1
  • 24
    • 79957446180
    • A modular simulation framework for spatial and temporal task mapping onto multi-processor SoC platform
    • T. Kempf et al., "A modular simulation framework for spatial and temporal task mapping onto multi-processor SoC platform," Proc. of DATE.
    • Proc. of DATE
    • Kempf, T.1
  • 25
    • 34047117937
    • Communication-aware allocation and scheduling framework for stream-oriented multiprocessor systems-on-chip
    • M. Ruggiero et al., "Communication-aware allocation and scheduling framework for stream-oriented multiprocessor systems-on-chip," Proc. of DATE, 2006.
    • Proc. of DATE, 2006
    • Ruggiero, M.1
  • 26
    • 34547183989
    • Integrated scratchpad memory optimization and task scheduling for MPSOC architectures
    • V. Suhendra et al., "Integrated scratchpad memory optimization and task scheduling for MPSOC architectures," Proc. of CASES, 2006.
    • Proc. of CASES, 2006
    • Suhendra, V.1
  • 27
    • 33847213882
    • Mapping applications to NoC platforms with multithreaded processor resources
    • R. Pop and S. Kumar, "Mapping applications to NoC platforms with multithreaded processor resources," The NORCHIP Conference, 2005.
    • The NORCHIP Conference, 2005
    • Pop, R.1    Kumar, S.2
  • 28
    • 77749302593
    • Scheduling threads for constructive cache sharing on CMPs
    • S. Chen et al., "Scheduling threads for constructive cache sharing on CMPs," Proc. of SPAA, 2007.
    • Proc. of SPAA, 2007
    • Chen, S.1
  • 29
    • 49749086465
    • User-aware dynamic task allocation in networks-on-chip
    • C. L. Chou and R. Marculescu, "User-aware dynamic task allocation in networks-on-chip," Proc. of DATE, 2008.
    • Proc. of DATE, 2008
    • Chou, C.L.1    Marculescu, R.2
  • 30
    • 77954741851
    • Compiler techniques for reducing data cache miss rate on a multithreaded architecture
    • S. Sarkar and D. M. Tullsen, "Compiler techniques for reducing data cache miss rate on a multithreaded architecture," Proc. of HiPEAC, 2008.
    • Proc. of HiPEAC, 2008
    • Sarkar, S.1    Tullsen, D.M.2
  • 31
    • 76749137634
    • Optimizing shared cache behavior of chip multiprocessors
    • M. Kandemir et al., "Optimizing shared cache behavior of chip multiprocessors," Proc. of MICRO, 2009.
    • Proc. of MICRO, 2009
    • Kandemir, M.1
  • 32
    • 77954733665
    • Using processor affinity in loop scheduling on shared-memory multiprocessors
    • E. P. Markatos and T. J. LeBlanc, "Using processor affinity in loop scheduling on shared-memory multiprocessors," Proc. of IPDPS, 1994.
    • Proc. of IPDPS, 1994
    • Markatos, E.P.1    LeBlanc, T.J.2
  • 33
    • 79957437565
    • Data access partitioning for fine-grain parallelism on multicore architectures
    • M. Chu et al., "Data access partitioning for fine-grain parallelism on multicore architectures," Proc. of Micro, 2007.
    • Proc. of Micro, 2007
    • Chu, M.1
  • 34
    • 70449628310
    • Data layout transformation for enhancing data locality on NUCA chip multiprocessors
    • A. Lu et al., "Data layout transformation for enhancing data locality on NUCA chip multiprocessors," Proc. of PACT, 2009.
    • Proc. of PACT, 2009
    • Lu, A.1
  • 35
    • 77957563463
    • Does cache sharing on modern CMP matter to the performance of contemporary multithreaded programs?
    • E. Zhang et al., "Does cache sharing on modern CMP matter to the performance of contemporary multithreaded programs?" Proc. of PPOPP, 2010.
    • Proc. of PPOPP, 2010
    • Zhang, E.1
  • 36
    • 27544432313
    • Optimizing replication, communication, and capacity allocation in CMPs
    • Z. Chishti et al., "Optimizing replication, communication, and capacity allocation in CMPs," Proc. of ISCA, 2005.
    • Proc. of ISCA, 2005
    • Chishti, Z.1
  • 37
    • 76749139374
    • A hierarchical model of data locality
    • C. Zhang et al., "A hierarchical model of data locality," Proc. of POPL, 2006.
    • Proc. of POPL, 2006
    • Zhang, C.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.