메뉴 건너뛰기




Volumn 44, Issue 3, 2009, Pages 121-132

RapidMRC: Approximating L2 Miss rate curves on commodity systems for online optimizations?

Author keywords

[No Author keywords available]

Indexed keywords

CACHE SIZE; COMMODITY SYSTEMS; L2 CACHE; MISS-RATE; MULTI-CORE PROCESSOR; ON CURRENTS; ONLINE OPTIMIZATION; PERFORMANCE IMPROVEMENTS; PERFORMANCE MONITORING; SOFTWARE TECHNIQUES; SOURCE CODES;

EID: 67650796123     PISSN: 15232867     EISSN: None     Source Type: Journal    
DOI: None     Document Type: Conference Paper
Times cited : (108)

References (48)
  • 1
    • 0033337012 scopus 로고    scopus 로고
    • Selective cache ways: On-demand cache resource allocation
    • D. Albonesi. Selective cache ways: on-demand cache resource allocation. In MICRO, pages 248-259, 1999.
    • (1999) MICRO , pp. 248-259
    • Albonesi, D.1
  • 2
    • 84944730283 scopus 로고    scopus 로고
    • Scheduling algorithms with bus bandwidth considerations for SMPs
    • C. Antonopoulos, D. Nikolopoulos, and T. Papatheodorou. Scheduling algorithms with bus bandwidth considerations for SMPs. In ICPP, pages 547-554, 2003.
    • (2003) ICPP , pp. 547-554
    • Antonopoulos, C.1    Nikolopoulos, D.2    Papatheodorou, T.3
  • 3
    • 42149141725 scopus 로고    scopus 로고
    • PATH: Page access tracking to improve memory management
    • R. Azimi, L. Soares, M. Stumm, T. Walsh, and A. Demke Brown. PATH: page access tracking to improve memory management. In ISMM, pages 31-42, 2007.
    • (2007) ISMM , pp. 31-42
    • Azimi, R.1    Soares, L.2    Stumm, M.3    Walsh, T.4    Demke Brown, A.5
  • 4
    • 32844456410 scopus 로고    scopus 로고
    • Online performance analysis by statistical sampling of microprocessor performance counters
    • R. Azimi, M. Stumm, and R. Wisniewski. Online performance analysis by statistical sampling of microprocessor performance counters. In ICS, pages 101-110, 2005.
    • (2005) ICS , pp. 101-110
    • Azimi, R.1    Stumm, M.2    Wisniewski, R.3
  • 5
    • 0034461413 scopus 로고    scopus 로고
    • Memory hierarchy reconfiguration for energy and performance in general-purpose processor architectures
    • R. Balasubramonian, D. Albonesi, A. Buyuktosunoglu, and S. Dwarkadas. Memory hierarchy reconfiguration for energy and performance in general-purpose processor architectures. In MICRO, pages 245-257, 2000.
    • (2000) MICRO , pp. 245-257
    • Balasubramonian, R.1    Albonesi, D.2    Buyuktosunoglu, A.3    Dwarkadas, S.4
  • 6
    • 2642534571 scopus 로고    scopus 로고
    • StatCache: A probabilistic approach to efficient and accurate data locality analysis
    • E. Berg and E. Hagersten. StatCache: A probabilistic approach to efficient and accurate data locality analysis. In ISPASS, pages 20-27, 2004.
    • (2004) ISPASS , pp. 20-27
    • Berg, E.1    Hagersten, E.2
  • 7
    • 33244462442 scopus 로고    scopus 로고
    • Fast data-locality profiling of native execution
    • E. Berg and E. Hagersten. Fast data-locality profiling of native execution. In SIGMETRICS, pages 169-180, 2005.
    • (2005) SIGMETRICS , pp. 169-180
    • Berg, E.1    Hagersten, E.2
  • 8
    • 33750837706 scopus 로고    scopus 로고
    • A statistical multiprocessor cache model
    • E. Berg, H. Zeffer, and E. Hagersten. A statistical multiprocessor cache model. In ISPASS, pages 89-99, 2006.
    • (2006) ISPASS , pp. 89-99
    • Berg, E.1    Zeffer, H.2    Hagersten, E.3
  • 9
    • 16644378094 scopus 로고    scopus 로고
    • Design and implementation of a dynamic optimization framework for Windows
    • D. Bruening, E. Duesterwald, and S. Amarasinghe. Design and implementation of a dynamic optimization framework for Windows. In FDDO, 2001.
    • (2001) FDDO
    • Bruening, D.1    Duesterwald, E.2    Amarasinghe, S.3
  • 11
    • 21244474546 scopus 로고    scopus 로고
    • Predicting inter-thread cache contention on a chip multi-processor architecture
    • D. Chandra, F. Guo, S. Kim, and Y. Solihin. Predicting inter-thread cache contention on a chip multi-processor architecture. In HPCA, pages 340-351, 2005.
    • (2005) HPCA , pp. 340-351
    • Chandra, D.1    Guo, F.2    Kim, S.3    Solihin, Y.4
  • 12
    • 40349095122 scopus 로고    scopus 로고
    • Managing distributed, shared L2 caches through OS-level page allocation
    • S. Cho and L Jin. Managing distributed, shared L2 caches through OS-level page allocation. In MICRO, pages 455-468, 2006.
    • (2006) MICRO , pp. 455-468
    • Cho, S.1    Jin, L.2
  • 14
    • 33749602130 scopus 로고    scopus 로고
    • Performance of multithreaded chip multiprocessors and implications for operating system design
    • A. Fedorova, M. Seltzer, C. Small, and D. Nussbaum. Performance of multithreaded chip multiprocessors and implications for operating system design. In USENIX ATC, pages 26-26, 2005.
    • (2005) USENIX ATC , pp. 26-26
    • Fedorova, A.1    Seltzer, M.2    Small, C.3    Nussbaum, D.4
  • 15
    • 33750363816 scopus 로고    scopus 로고
    • An analytical model for cache replacement policy performance
    • F. Guo and Y. Solihin. An analytical model for cache replacement policy performance. In SIGMETRICS, pages 228-239, 2006.
    • (2006) SIGMETRICS , pp. 228-239
    • Guo, F.1    Solihin, Y.2
  • 16
    • 8344246922 scopus 로고    scopus 로고
    • CQoS: A framework for enabling QoS in shared caches of CMP platforms
    • R. Iyer. CQoS: a framework for enabling QoS in shared caches of CMP platforms. In ICS, pages 257-266, 2004.
    • (2004) ICS , pp. 257-266
    • Iyer, R.1
  • 18
    • 84894240497 scopus 로고    scopus 로고
    • A low-overhead high-performance unified buffer management scheme that exploits sequential and looping references
    • J. Kim, J. Choi, J. Kim, S. Noh, S. Min, Y. Cho, and C. Kim. A low-overhead high-performance unified buffer management scheme that exploits sequential and looping references. In OSDI, pages 119-34, 2000.
    • (2000) OSDI , pp. 119-134
    • Kim, J.1    Choi, J.2    Kim, J.3    Noh, S.4    Min, S.5    Cho, Y.6    Kim, C.7
  • 19
    • 10444238444 scopus 로고    scopus 로고
    • Fair cache sharing and partitioning in a chip multiprocessor architecture
    • S. Kim, D. Chandra, and Y. Solihin. Fair cache sharing and partitioning in a chip multiprocessor architecture. In PACT, pages 111-122, 2004.
    • (2004) PACT , pp. 111-122
    • Kim, S.1    Chandra, D.2    Solihin, Y.3
  • 20
    • 80053025940 scopus 로고
    • Implementing stack simulation for highly-associative memories
    • Y. Kim, M. Hill, and D. Wood. Implementing stack simulation for highly-associative memories. In SIGMETRICS, pages 212-213, 1991.
    • (1991) SIGMETRICS , pp. 212-213
    • Kim, Y.1    Hill, M.2    Wood, D.3
  • 21
    • 0031348717 scopus 로고    scopus 로고
    • OS-controlled cache predictability for real-time systems
    • J. Liedtke, H. Härtig, and M. Hohmuth. OS-controlled cache predictability for real-time systems. In RTAS, pages 213-227, 1997.
    • (1997) RTAS , pp. 213-227
    • Liedtke, J.1    Härtig, H.2    Hohmuth, M.3
  • 22
    • 57749186047 scopus 로고    scopus 로고
    • Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems
    • J. Lin, Q. Lu, X. Ding, Z. Zhang, X. Zhang, and P. Sadayappan. Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems. In HPCA, pages 367-378, 2008.
    • (2008) HPCA , pp. 367-378
    • Lin, J.1    Lu, Q.2    Ding, X.3    Zhang, Z.4    Zhang, X.5    Sadayappan, P.6
  • 23
    • 2342468635 scopus 로고    scopus 로고
    • Organizing the last line of defense before hitting the memory wall for CMPs
    • C. Liu, A. Sivasubramaniam, and M. Kandemir. Organizing the last line of defense before hitting the memory wall for CMPs. In HPCA, pages 176-185, 2004.
    • (2004) HPCA , pp. 176-185
    • Liu, C.1    Sivasubramaniam, A.2    Kandemir, M.3
  • 25
    • 0014701246 scopus 로고
    • Evaluation techniques and storage hierarchies
    • R. Mattson, J. Gecsei, D. Slutz, and I. Traiger. Evaluation techniques and storage hierarchies. IBM Systems J., 9(2):78-117, 1970.
    • (1970) IBM Systems J , vol.9 , Issue.2 , pp. 78-117
    • Mattson, R.1    Gecsei, J.2    Slutz, D.3    Traiger, I.4
  • 26
    • 63549102138 scopus 로고    scopus 로고
    • Multi-optimization power management for chip multiprocessors
    • K. Meng, R. Joseph, R. Dick, and L. Shang. Multi-optimization power management for chip multiprocessors. In PACT, pages 177-186, 2008.
    • (2008) PACT , pp. 177-186
    • Meng, K.1    Joseph, R.2    Dick, R.3    Shang, L.4
  • 27
    • 34548033967 scopus 로고    scopus 로고
    • JIT instrumentation: A novel approach to dynamically instrument operating systems
    • M. Olszewski, K. Mierle, A. Czajkowski, and A. Demke Brown. JIT instrumentation: a novel approach to dynamically instrument operating systems. In EuroSys, pages 3-16, 2007.
    • (2007) EuroSys , pp. 3-16
    • Olszewski, M.1    Mierle, K.2    Czajkowski, A.3    Demke Brown, A.4
  • 29
    • 34548042910 scopus 로고    scopus 로고
    • Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches
    • M. Qureshi and Y. Patt. Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches. In MICRO, pages 423-432, 2006.
    • (2006) MICRO , pp. 423-432
    • Qureshi, M.1    Patt, Y.2
  • 30
    • 34247108325 scopus 로고    scopus 로고
    • Architectural support for operating system-driven CMP cache management
    • N. Rafique, W. Lim, and M. Thottethodi. Architectural support for operating system-driven CMP cache management. In PACT, pages 2-12, 2006.
    • (2006) PACT , pp. 2-12
    • Rafique, N.1    Lim, W.2    Thottethodi, M.3
  • 31
    • 0031367150 scopus 로고    scopus 로고
    • A resource allocation model for QoS management
    • R. Rajkumar, C. Lee, J. Lehoczky, and D. Siewiorek. A resource allocation model for QoS management. In RTSS, pages 298-308, 1997.
    • (1997) RTSS , pp. 298-308
    • Rajkumar, R.1    Lee, C.2    Lehoczky, J.3    Siewiorek, D.4
  • 32
    • 10444263677 scopus 로고    scopus 로고
    • Architectural support for enhanced SMT job scheduling
    • A. Settle, J. Kihm, A. Janiszewski, and D. Connors. Architectural support for enhanced SMT job scheduling. In PACT, 2004.
    • (2004) PACT
    • Settle, A.1    Kihm, J.2    Janiszewski, A.3    Connors, D.4
  • 33
    • 77953988297 scopus 로고    scopus 로고
    • Using Valgrind to detect undefined value errors with bit-precision
    • J. Seward and N. Nethercote. Using Valgrind to detect undefined value errors with bit-precision. In USENIX ATC, pages 17-30, 2005.
    • (2005) USENIX ATC , pp. 17-30
    • Seward, J.1    Nethercote, N.2
  • 34
    • 33846527415 scopus 로고    scopus 로고
    • Locality approximation using time
    • X. Shen, J. Shaw, B. Meeker, and C. Ding. Locality approximation using time. In POPL, pages 55-61, 2007.
    • (2007) POPL , pp. 55-61
    • Shen, X.1    Shaw, J.2    Meeker, B.3    Ding, C.4
  • 35
    • 0038345698 scopus 로고    scopus 로고
    • Phase tracking and prediction
    • T. Sherwood, S. Sair, and B. Calder. Phase tracking and prediction. In ISCA, pages 336-349, 2003.
    • (2003) ISCA , pp. 336-349
    • Sherwood, T.1    Sair, S.2    Calder, B.3
  • 36
    • 0034443570 scopus 로고    scopus 로고
    • Symbiotic jobscheduling for a simultaneous multithreading processor
    • A. Snavely and D. Tullsen. Symbiotic jobscheduling for a simultaneous multithreading processor. In ASPLOS, pages 234-244, 2000.
    • (2000) ASPLOS , pp. 234-244
    • Snavely, A.1    Tullsen, D.2
  • 37
    • 66749168716 scopus 로고    scopus 로고
    • Reducing the harmful effects of last-level cache polluters with an OS-level, software-only pollute buffer
    • L. Soares, D. Tam, and M. Stumm. Reducing the harmful effects of last-level cache polluters with an OS-level, software-only pollute buffer. In MICRO, 2008.
    • (2008) MICRO
    • Soares, L.1    Tam, D.2    Stumm, M.3
  • 38
    • 67650033179 scopus 로고    scopus 로고
    • Dynamic partitioning of the cache hierarchy in shared data centers
    • G. Soundararajan, J. Chen, M. Sharaf, and C. Amza. Dynamic partitioning of the cache hierarchy in shared data centers. In VLDB, pages 635-646, 2008.
    • (2008) VLDB , pp. 635-646
    • Soundararajan, G.1    Chen, J.2    Sharaf, M.3    Amza, C.4
  • 39
    • 67650085342 scopus 로고    scopus 로고
    • Adaptive set pinning: Managing shared caches in chip multiprocessors
    • S. Srikantaiah, M. Kandemir, and M. Irwin. Adaptive set pinning: Managing shared caches in chip multiprocessors. In ASPLOS, pages 135-144, 2008.
    • (2008) ASPLOS , pp. 135-144
    • Srikantaiah, S.1    Kandemir, M.2    Irwin, M.3
  • 40
    • 0026925878 scopus 로고
    • Optimal partitioning of cache memory
    • H. Stone, J. Turek, and J. Wolf. Optimal partitioning of cache memory. IEEE TOC, 41(9):1054-1068, 1992.
    • (1992) IEEE TOC , vol.41 , Issue.9 , pp. 1054-1068
    • Stone, H.1    Turek, J.2    Wolf, J.3
  • 41
    • 1642371317 scopus 로고    scopus 로고
    • Dynamic partitioning of shared cache memory
    • Apr
    • E. Suh, L Rudolph, and S. Devadas. Dynamic partitioning of shared cache memory. The J. of Supercomputing, 28(1):7-26, Apr. 2004.
    • (2004) The J. of Supercomputing , vol.28 , Issue.1 , pp. 7-26
    • Suh, E.1    Rudolph, L.2    Devadas, S.3
  • 42
    • 57749176037 scopus 로고    scopus 로고
    • Managing shared L2 caches on multicore systems in software
    • D. Tam, R. Azimi, L. Soares, and M. Stumm. Managing shared L2 caches on multicore systems in software. In WIOSCA, pages 26-33, 2007.
    • (2007) WIOSCA , pp. 26-33
    • Tam, D.1    Azimi, R.2    Soares, L.3    Stumm, M.4
  • 43
    • 34548030923 scopus 로고    scopus 로고
    • Thread clustering: Sharing-aware scheduling on SMP-CMP-SMT multiprocessors
    • D. Tam, R. Azimi, and M. Stumm. Thread clustering: Sharing-aware scheduling on SMP-CMP-SMT multiprocessors. In EuroSys, pages 47-58, 2007.
    • (2007) EuroSys , pp. 47-58
    • Tam, D.1    Azimi, R.2    Stumm, M.3
  • 44
    • 0026881152 scopus 로고
    • Improving disk cache hit-ratios through cache partitioning
    • D. Thiebaut, H. Stone, and J. Wolf. Improving disk cache hit-ratios through cache partitioning. IEEE TOC, 41(6):665-676, 1992.
    • (1992) IEEE TOC , vol.41 , Issue.6 , pp. 665-676
    • Thiebaut, D.1    Stone, H.2    Wolf, J.3
  • 45
    • 85076757258 scopus 로고    scopus 로고
    • CRAMM: Virtual memory support for garbage-collected applications
    • T. Yang, E. Berger, S. Kaplan, and J. Moss. CRAMM: virtual memory support for garbage-collected applications. In OSDI, pages 103-116, 2006.
    • (2006) OSDI , pp. 103-116
    • Yang, T.1    Berger, E.2    Kaplan, S.3    Moss, J.4
  • 48
    • 84893513203 scopus 로고    scopus 로고
    • The multi-queue replacement algorithm for second level buffer caches
    • Y. Zhou, J. Philbin, and K. Li. The multi-queue replacement algorithm for second level buffer caches. In USENIX ATC, pages 91-104, 2001.
    • (2001) USENIX ATC , pp. 91-104
    • Zhou, Y.1    Philbin, J.2    Li, K.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.