메뉴 건너뛰기




Volumn , Issue , 2012, Pages 91-102

TAP: A TLP-aware cache management policy for a CPU-GPU heterogeneous architecture

Author keywords

[No Author keywords available]

Indexed keywords

CACHE MANAGEMENT; CACHE MANAGEMENT POLICIES; CACHE MANAGEMENT SCHEMES; CACHE PARTITIONING; DYNAMIC CACHE; HETEROGENEOUS ARCHITECTURES; INTERVAL PREDICTION; MEMORY LATENCIES; NEW MECHANISMS; SHARED CACHE; SHARED RESOURCES; THREAD-LEVEL PARALLELISM;

EID: 84860351946     PISSN: 15300897     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/HPCA.2012.6168947     Document Type: Conference Paper
Times cited : (103)

References (29)
  • 1
    • 84860329522 scopus 로고    scopus 로고
    • MacSim. http://code.google.com/p/macsim/.
    • MacSim
  • 2
    • 84860345475 scopus 로고    scopus 로고
    • AMD. Fusion. http://sites.amd.com/us/fusion/apu/Pages/fusion.aspx.
    • AMD. Fusion
  • 4
    • 76749083861 scopus 로고    scopus 로고
    • Pseudo-LIFO: The foundation of a new family of replacement policies for last-level caches
    • M. Chaudhuri. Pseudo-LIFO: the foundation of a new family of replacement policies for last-level caches. In MICRO-42, pages 401-412, 2009.
    • (2009) MICRO-42 , pp. 401-412
    • Chaudhuri, M.1
  • 6
    • 78149233155 scopus 로고    scopus 로고
    • A dynamic compiler for bulk-synchronous applications in heterogeneous systems
    • Ocelot
    • G. Diamos, A. Kerr, S. Yalamanchili, and N. Clark. Ocelot: A dynamic compiler for bulk-synchronous applications in heterogeneous systems. In PACT-19, 2010.
    • (2010) PACT-19
    • Diamos, G.1    Kerr, A.2    Yalamanchili, S.3    Clark, N.4
  • 7
    • 84903753195 scopus 로고    scopus 로고
    • Intel. Sandy Bridge. http://software.intel.com/en-us/articles/sandy- bridge/.
    • Sandy Bridge
  • 9
    • 77954998134 scopus 로고    scopus 로고
    • High performance cache replacement using re-reference interval prediction (RRIP)
    • A. Jaleel, K. B. Theobald, S. C. Steely, Jr., and J. Emer. High performance cache replacement using re-reference interval prediction (RRIP). In ISCA-32, pages 60-71, 2010.
    • (2010) ISCA-32 , pp. 60-71
    • Jaleel, A.1    Theobald, K.B.2    Steely Jr., S.C.3    Emer, J.4
  • 10
    • 10444238444 scopus 로고    scopus 로고
    • Fair cache sharing and partitioning in a chip multiprocessor architecture
    • S. Kim, D. Chandra, and Y. Solihin. Fair cache sharing and partitioning in a chip multiprocessor architecture. In PACT-13, pages 111-122, 2004.
    • (2004) PACT-13 , pp. 111-122
    • Kim, S.1    Chandra, D.2    Solihin, Y.3
  • 12
    • 77952569781 scopus 로고    scopus 로고
    • Understanding how offchip memory bandwidth partitioning in chip multiprocessors affects system performance
    • jan.
    • F. Liu, X. Jiang, and Y. Solihin. Understanding how offchip memory bandwidth partitioning in chip multiprocessors affects system performance. In HPCA-16, pages 1-12, jan. 2010.
    • (2010) HPCA-16 , pp. 1-12
    • Liu, F.1    Jiang, X.2    Solihin, Y.3
  • 13
    • 49949085575 scopus 로고    scopus 로고
    • MLPaware dynamic cache partitioning
    • Springer
    • M. Moretó, F. J. Cazorla, A. Ramírez, and M. Valero. MLPaware dynamic cache partitioning. In HiPEAC'08, volume 4917, pages 337-352. Springer, 2008.
    • (2008) HiPEAC'08 , vol.4917 , pp. 337-352
    • Moretó, M.1    Cazorla, F.J.2    Ramírez, A.3    Valero, M.4
  • 15
    • 84893316257 scopus 로고    scopus 로고
    • NVIDIA. Project denver. http://blogs.nvidia.com/2011/01/project-denver- processor-to-usher-in-new-era-of-computing/.
    • Project Denver
  • 17
    • 21644454187 scopus 로고    scopus 로고
    • Pinpointing representative portions of large intel®itanium® programs with dynamic instrumentation
    • H. Patil, R. Cohn, M. Charney, R. Kapoor, A. Sun, and A. Karunanidhi. Pinpointing representative portions of large intel®itanium®programs with dynamic instrumentation. In MICRO-37, pages 81-92, 2004.
    • (2004) MICRO-37 , pp. 81-92
    • Patil, H.1    Cohn, R.2    Charney, M.3    Kapoor, R.4    Sun, A.5    Karunanidhi, A.6
  • 19
    • 35348920021 scopus 로고    scopus 로고
    • Adaptive insertion policies for high performance caching
    • M. K. Qureshi, A. Jaleel, Y. N. Patt, S. C. Steely, and J. Emer. Adaptive insertion policies for high performance caching. In ISCA-29, pages 381-391, 2007.
    • (2007) ISCA-29 , pp. 381-391
    • Qureshi, M.K.1    Jaleel, A.2    Patt, Y.N.3    Steely, S.C.4    Emer, J.5
  • 20
    • 33845874613 scopus 로고    scopus 로고
    • A case for MLP-aware cache replacement
    • M. K. Qureshi, D. N. Lynch, O. Mutlu, and Y. N. Patt. A case for MLP-aware cache replacement. In ISCA-28, pages 167-178, 2006.
    • (2006) ISCA-28 , pp. 167-178
    • Qureshi, M.K.1    Lynch, D.N.2    Mutlu, O.3    Patt, Y.N.4
  • 21
    • 34548042910 scopus 로고    scopus 로고
    • Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches
    • M. K. Qureshi and Y. N. Patt. Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches. In MICRO-39, pages 423-432, 2006.
    • (2006) MICRO-39 , pp. 423-432
    • Qureshi, M.K.1    Patt, Y.N.2
  • 22
    • 76749118968 scopus 로고    scopus 로고
    • Sharp control: Controlled shared cache management in chip multiprocessors
    • dec.
    • S. Srikantaiah, M. Kandemir, and Q. Wang. Sharp control: Controlled shared cache management in chip multiprocessors. In MICRO-42, pages 517 -528, dec. 2009.
    • (2009) MICRO-42 , pp. 517-528
    • Srikantaiah, S.1    Kandemir, M.2    Wang, Q.3
  • 23
    • 84949769332 scopus 로고    scopus 로고
    • A new memory monitoring scheme for memory-aware scheduling and partitioning
    • feb.
    • G. Suh, S. Devadas, and L. Rudolph. A new memory monitoring scheme for memory-aware scheduling and partitioning. In HPCA-8, pages 117-128, feb. 2002.
    • (2002) HPCA-8 , pp. 117-128
    • Suh, G.1    Devadas, S.2    Rudolph, L.3
  • 25
    • 55849133016 scopus 로고    scopus 로고
    • IPC-based cache partitioning: An ipc-oriented dynamic shared cache partitioning mechanism
    • aug.
    • G. Suo, X. Yang, G. Liu, J. Wu, K. Zeng, B. Zhang, and Y. Lin. IPC-based cache partitioning: An ipc-oriented dynamic shared cache partitioning mechanism. In ICHIT'08, pages 399 -406, aug. 2008.
    • (2008) ICHIT'08 , pp. 399-406
    • Suo, G.1    Yang, X.2    Liu, G.3    Wu, J.4    Zeng, K.5    Zhang, B.6    Lin, Y.7
  • 26
    • 84872059246 scopus 로고    scopus 로고
    • The IMPACT Research Group, UIUC
    • The IMPACT Research Group, UIUC. Parboil benchmark suite. http://impact.crhc.illinois.edu/parboil.php.
    • Parboil Benchmark Suite
  • 27
    • 70450279102 scopus 로고    scopus 로고
    • PIPP: Promotion/insertion pseudopartitioning of multi-core shared caches
    • Y. Xie and G. H. Loh. PIPP: promotion/insertion pseudopartitioning of multi-core shared caches. In ISCA-31, pages 174-183, 2009.
    • (2009) ISCA-31 , pp. 174-183
    • Xie, Y.1    Loh, G.H.2
  • 28
    • 77949645981 scopus 로고    scopus 로고
    • Scalable shared-cache management by containing thrashing workloads
    • Y. N. Patt, P. Foglia, E. Duesterwald, P. Faraboschi, and X. Martorell, editors, Springer
    • Y. Xie and G. H. Loh. Scalable shared-cache management by containing thrashing workloads. In Y. N. Patt, P. Foglia, E. Duesterwald, P. Faraboschi, and X. Martorell, editors, HiPEAC'10, volume 5952, pages 262-276. Springer, 2010.
    • (2010) HiPEAC'10 , vol.5952 , pp. 262-276
    • Xie, Y.1    Loh, G.H.2
  • 29
    • 77956204832 scopus 로고    scopus 로고
    • Off-chip memory bandwidth minimization through cache partitioning for multi-core platforms
    • C. Yu and P. Petrov. Off-chip memory bandwidth minimization through cache partitioning for multi-core platforms. In DAC'10, pages 132-137, 2010...
    • (2010) DAC'10 , pp. 132-137
    • Yu, C.1    Petrov, P.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.