메뉴 건너뛰기




Volumn , Issue , 2013, Pages 297-309

Imbalanced cache partitioning for balanced data-parallel programs

Author keywords

multicore; reuse distance; shared cache partitioning

Indexed keywords

CACHE PARTITIONING; CHIP-MULTIPROCESSOR; DATA-PARALLEL APPLICATIONS; MULTI CORE; MULTI-THREADED PROGRAMS; REUSE DISTANCE; ROUND-ROBIN FASHIONS; SHARED CACHE;

EID: 84892536329     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2540708.2540734     Document Type: Conference Paper
Times cited : (16)

References (41)
  • 2
    • 61449142683 scopus 로고    scopus 로고
    • Refactoring for data locality
    • Feb.
    • K. Beyls and E. D'Hollander. Refactoring for data locality. Computer, 42(2):62-71, Feb. 2009.
    • (2009) Computer , vol.42 , Issue.2 , pp. 62-71
    • Beyls, K.1    D'Hollander, E.2
  • 4
    • 34548023929 scopus 로고    scopus 로고
    • Cooperative cache partitioning for chip multiprocessors
    • New York, NY, USA, ACM
    • J. Chang and G. S. Sohi. Cooperative cache partitioning for chip multiprocessors. In Proc. 21st Annual Int'l Conf. Supercomputing, ICS '07, pages 242-252, New York, NY, USA, 2007. ACM.
    • (2007) Proc. 21st Annual Int'l Conf. Supercomputing, ICS '07 , pp. 242-252
    • Chang, J.1    Sohi, G.S.2
  • 8
    • 84867497756 scopus 로고    scopus 로고
    • Full-system simulation from embedded to high-performance systems
    • R. Leupers and O. Temam, editors, chapter 3, Springer US
    • J. Engblom, D. Aarno, and B. Werner. Full-system simulation from embedded to high-performance systems. In R. Leupers and O. Temam, editors, Processor and System-on-Chip Simulation, chapter 3, pages 25-45. Springer US, 2010.
    • (2010) Processor and System-on-Chip Simulation , pp. 25-45
    • Engblom, J.1    Aarno, D.2    Werner, B.3
  • 10
    • 33745793237 scopus 로고    scopus 로고
    • Path-basedreuse distance analysis
    • Compiler Construction, Springer Berlin Heidelberg
    • C. Fang, S. Carr, S. Önder, and Z. Wang. Path-basedreuse distance analysis. In Compiler Construction, Lecture Notes in Computer Science, pages 32-46. Springer Berlin Heidelberg, 2006.
    • (2006) Lecture Notes in Computer Science , pp. 32-46
    • Fang, C.1    Carr, S.2    Önder, S.3    Wang, Z.4
  • 11
    • 78651391009 scopus 로고    scopus 로고
    • Quality of service shared cache management in chip multiprocessor architecture
    • Dec.
    • F. Guo, Y. Solihin, L. Zhao, and R. Iyer. Quality of service shared cache management in chip multiprocessor architecture. ACM Trans. Archit. Code Optim., 7(3):14:1-14:33, Dec. 2010.
    • (2010) ACM Trans. Archit. Code Optim. , vol.7 , Issue.3
    • Guo, F.1    Solihin, Y.2    Zhao, L.3    Iyer, R.4
  • 14
    • 8344246922 scopus 로고    scopus 로고
    • Cqos: A framework for enabling qos in shared caches of cmp platforms
    • New York, NY, USA, ACM
    • R. Iyer. Cqos: a framework for enabling qos in shared caches of cmp platforms. In Proc. 18th Annual Int'l Conf. Supercomputing, ICS '04, pages 257-266, New York, NY, USA, 2004. ACM.
    • (2004) Proc. 18th Annual Int'l Conf. Supercomputing, ICS '04 , pp. 257-266
    • Iyer, R.1
  • 16
    • 77951616746 scopus 로고    scopus 로고
    • Is reuse distance applicable to data locality analysis on chip multiprocessors?
    • Proc. 19th Joint European Conf. on Theory and Practice of Software, Int'l Conference on Compiler Construction, Springer-Verlag
    • Y. Jiang, E. Z. Zhang, K. Tian, and X. Shen. Is reuse distance applicable to data locality analysis on chip multiprocessors? In Proc. 19th Joint European Conf. on Theory and Practice of Software, Int'l Conference on Compiler Construction, volume 6011 of CC'10/ETAPS'10, pages 264-282. Springer-Verlag, 2010.
    • (2010) CC'10/ETAPS'10 , vol.6011 , pp. 264-282
    • Jiang, Y.1    Zhang, E.Z.2    Tian, K.3    Shen, X.4
  • 25
    • 34548042910 scopus 로고    scopus 로고
    • Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches
    • IEEE CS
    • M. K. Qureshi and Y. N. Patt. Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches. In Proc. 39th Ann. IEEE/ACM Int'l Symp. Microarchitecture, MICRO 39, pages 423-432. IEEE CS, 2006.
    • (2006) Proc. 39th Ann. IEEE/ACM Int'l Symp. Microarchitecture, MICRO 39 , pp. 423-432
    • Qureshi, M.K.1    Patt, Y.N.2
  • 30
    • 0026925878 scopus 로고
    • Optimal partitioning of cache memory
    • DOI 10.1109/12.165388
    • H. Stone, J. Turek, and J. Wolf. Optimal partitioning of cache memory. IEEE Trans. Computers, 41:1054-1068, 1992. (Pubitemid 23573779)
    • (1992) IEEE Transactions on Computers , vol.41 , Issue.9 , pp. 1054-1068
    • Stone, H.S.1    Turek, J.2    Wolf, J.L.3
  • 31
    • 1642371317 scopus 로고    scopus 로고
    • Dynamic partitioning of shared cache memory
    • Apr.
    • G. E. Suh, L. Rudolph, and S. Devadas. Dynamic partitioning of shared cache memory. J. Supercomput., 28:7-26, Apr. 2004.
    • (2004) J. Supercomput. , vol.28 , pp. 7-26
    • Suh, G.E.1    Rudolph, L.2    Devadas, S.3
  • 35
    • 84874865302 scopus 로고    scopus 로고
    • Efficient reuse distance analysis of multicore scaling for loop-based parallel programs
    • Feb.
    • M.-J. Wu and D. Yeung. Efficient reuse distance analysis of multicore scaling for loop-based parallel programs. ACM Trans. Comput. Syst., 31(1):1:1-1:37, Feb. 2013.
    • (2013) ACM Trans. Comput. Syst. , vol.31 , Issue.1
    • Wu, M.-J.1    Yeung, D.2
  • 36
    • 70450279102 scopus 로고    scopus 로고
    • Pipp: Promotion/insertion pseudo-partitioning of multi-core shared caches
    • New York, NY, USA, ACM
    • Y. Xie and G. H. Loh. Pipp: promotion/insertion pseudo-partitioning of multi-core shared caches. In Proc. 36th Annual Int'l Symp. on Computer Architecture, ISCA '09, pages 174-183, New York, NY, USA, 2009. ACM.
    • (2009) Proc. 36th Annual Int'l Symp. On Computer Architecture, ISCA '09 , pp. 174-183
    • Xie, Y.1    Loh, G.H.2
  • 38
    • 33947360666 scopus 로고    scopus 로고
    • Miss rate prediction across program inputs and cache configurations
    • DOI 10.1109/TC.2007.50
    • Y. Zhong, S. Dropsho, X. Shen, A. Studer, and C. Ding. Miss Rate Prediction Across Program Inputs and Cache Configurations. IEEE Trans. Computers, 56(3):328-343, Mar. 2007. (Pubitemid 46443330)
    • (2007) IEEE Transactions on Computers , vol.56 , Issue.3 , pp. 328-343
    • Zhong, Y.1    Dropsho, S.G.2    Shen, X.3    Studer, A.4    Ding, C.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.