



Volume , Issue , 2009, Pages 121-132

RapidMRC: Approximating L2 miss rate curves on commodity systems for online optimizations

Author keywords

Experimentation; Management; Measurement; Performance

Indexed keywords

APPLICATION PROGRAMS; CURVE FITTING; MANAGEMENT; MEASUREMENT; MULTICORE PROGRAMMING;

EID: 67650088533     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1508244.1508259     Document Type: Conference Paper
Times cited : (41)

References (48)
  • 2
    • 84944730283
    • Scheduling algorithms with bus bandwidth considerations for SMPs
    • C. Antonopoulos, D. Nikolopoulos, and T. Papatheodorou. Scheduling algorithms with bus bandwidth considerations for SMPs. In ICPP, pages 547-554, 2003.
    • (2003) ICPP , pp. 547-554
    • Antonopoulos, C.1    Nikolopoulos, D.2    Papatheodorou, T.3
  • 3
    • 42149141725
    • PATH: Page access tracking to improve memory management
    • R. Azimi, L. Soares, M. Stumm, T. Walsh, and A. Demke Brown. PATH: page access tracking to improve memory management. In ISMM, pages 31-42, 2007.
    • (2007) ISMM , pp. 31-42
    • Azimi, R.1    Soares, L.2    Stumm, M.3    Walsh, T.4    Brown, A.D.5
  • 4
    • 32844456410
    • Online performance analysis by statistical sampling of microprocessor performance counters
    • DOI 10.1145/1088149.1088163, ICS05 - Proceedings of the 19th ACM International Conference on Supercomputing
    • R. Azimi, M. Stumm, and R. Wisniewski. Online performance analysis by statistical sampling of microprocessor performance counters. In ICS, pages 101-110, 2005. (Pubitemid 43251314)
    • (2005) Proceedings of the International Conference on Supercomputing , pp. 101-110
    • Azimi, R.1    Stumm, M.2    Wisniewski, R.W.3
  • 6
    • 2642534571
    • StatCache: A probabilistic approach to efficient and accurate data locality analysis
    • E. Berg and E. Hagersten. StatCache: A probabilistic approach to efficient and accurate data locality analysis. In ISPASS, pages 20-27, 2004.
    • (2004) ISPASS , pp. 20-27
    • Berg, E.1    Hagersten, E.2
  • 7
    • 33244462442
    • Fast data-locality profiling of native execution
    • SIGMETRICS 2005: International Conference on Measurement and Modeling of Computer Systems - Proceedings
    • E. Berg and E. Hagersten. Fast data-locality profiling of native execution. In SIGMETRICS, pages 169-180, 2005. (Pubitemid 43275419)
    • (2005) Performance Evaluation Review , vol.33 , Issue.1 , pp. 169-180
    • Berg, E.1    Hagersten, E.2
  • 9
    • 16644378094
    • Design and implementation of a dynamic optimization framework for Windows
    • D. Bruening, E. Duesterwald, and S. Amarasinghe. Design and implementation of a dynamic optimization framework for Windows. In FDDO, 2001.
    • (2001) FDDO
    • Bruening, D.1    Duesterwald, E.2    Amarasinghe, S.3
  • 11
    • 21244474546
    • Predicting inter-thread cache contention on a chip multi-processor architecture
    • D. Chandra, F. Guo, S. Kim, and Y. Solihin. Predicting inter-thread cache contention on a chip multi-processor architecture. In HPCA, pages 340-351, 2005.
    • (2005) HPCA , pp. 340-351
    • Chandra, D.1    Guo, F.2    Kim, S.3    Solihin, Y.4
  • 12
    • 40349095122
    • Managing distributed, shared L2 caches through OS-level page allocation
    • DOI 10.1109/MICRO.2006.31, 4041868, Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO-39
    • S. Cho and L. Jin. Managing distributed, shared L2 caches through OS-level page allocation. In MICRO, pages 455-468, 2006. (Pubitemid 351337018)
    • (2006) Proceedings of the Annual International Symposium on Microarchitecture, MICRO , pp. 455-468
    • Cho, S.1    Jin, L.2
  • 14
    • 33749602130
    • Performance of multithreaded chip multiprocessors and implications for operating system design
    • A. Fedorova, M. Seltzer, C. Small, and D. Nussbaum. Performance of multithreaded chip multiprocessors and implications for operating system design. In USENIX ATC, pages 26-26, 2005.
    • (2005) USENIX ATC , pp. 26-26
    • Fedorova, A.1    Seltzer, M.2    Small, C.3    Nussbaum, D.4
  • 15
    • 33750363816
    • An analytical model for cache replacement policy performance
    • DOI 10.1145/1140103.1140304, SIGMETRICS 2006/Performance 2006 - Joint International Conference on Measurement and Modeling of Computer Systems, Proceedings
    • F. Guo and Y. Solihin. An analytical model for cache replacement policy performance. In SIGMETRICS, pages 228-239, 2006. (Pubitemid 44619094)
    • (2006) Performance Evaluation Review , vol.34 , Issue.1 , pp. 228-239
    • Guo, F.1    Solihin, Y.2
  • 16
    • 8344246922
    • CQoS: A framework for enabling QoS in shared caches of CMP platforms
    • R. Iyer. CQoS: a framework for enabling QoS in shared caches of CMP platforms. In ICS, pages 257-266, 2004.
    • (2004) ICS , pp. 257-266
    • Iyer, R.1
  • 17
    • 36349002905
    • QoS policies and architecture for cache/memory in CMP platforms
    • DOI 10.1145/1269899.1254886, SIGMETRICS'07 - Proceedings of the 2007 International Conference on Measurement and Modeling of Computer Systems
    • R. Iyer, L. Zhao, F. Guo, R. Illikkal, D. Newell, Y. Solihin, L. Hsu, and S. Reinhardt. QoS policies and architecture for cache/memory in CMP platforms. In SIGMETRICS, pages 25-36, 2007. (Pubitemid 350158070)
    • (2007) Performance Evaluation Review , vol.35 , Issue.1 , pp. 25-36
    • Iyer, R.1    Zhao, L.2    Guo, F.3    Illikkal, R.4    Makineni, S.5    Newell, D.6    Solihin, Y.7    Hsu, L.8    Reinhardt, S.9
  • 18
    • 84894240497
    • A low-overhead high-performance unified buffer management scheme that exploits sequential and looping references
    • J. Kim, J. Choi, J. Kim, S. Noh, S. Min, Y. Cho, and C. Kim. A low-overhead high-performance unified buffer management scheme that exploits sequential and looping references. In OSDI, pages 119-134, 2000.
    • (2000) OSDI , pp. 119-134
    • Kim, J.1    Choi, J.2    Kim, J.3    Noh, S.4    Min, S.5    Cho, Y.6    Kim, C.7
  • 19
    • 10444238444
    • Fair cache sharing and partitioning in a chip multiprocessor architecture
    • S. Kim, D. Chandra, and Y. Solihin. Fair cache sharing and partitioning in a chip multiprocessor architecture. In PACT, pages 111-122, 2004.
    • (2004) PACT , pp. 111-122
    • Kim, S.1    Chandra, D.2    Solihin, Y.3
  • 20
    • 80053025940
    • Implementing stack simulation for highly-associative memories
    • Y. Kim, M. Hill, and D. Wood. Implementing stack simulation for highly-associative memories. In SIGMETRICS, pages 212-213, 1991.
    • (1991) SIGMETRICS , pp. 212-213
    • Kim, Y.1    Hill, M.2    Wood, D.3
  • 21
    • 0031348717
    • OS-controlled cache predictability for real-time systems
    • J. Liedtke, H. Härtig, and M. Hohmuth. OS-controlled cache predictability for real-time systems. In RTAS, pages 213-227, 1997.
    • (1997) RTAS , pp. 213-227
    • Liedtke, J.1    Härtig, H.2    Hohmuth, M.3
  • 22
    • 57749186047
    • Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems
    • J. Lin, Q. Lu, X. Ding, Z. Zhang, X. Zhang, and P. Sadayappan. Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems. In HPCA, pages 367-378, 2008.
    • (2008) HPCA , pp. 367-378
    • Lin, J.1    Lu, Q.2    Ding, X.3    Zhang, Z.4    Zhang, X.5    Sadayappan, P.6
  • 23
    • 2342468635
    • Organizing the last line of defense before hitting the memory wall for CMPs
    • C. Liu, A. Sivasubramaniam, and M. Kandemir. Organizing the last line of defense before hitting the memory wall for CMPs. In HPCA, pages 176-185, 2004.
    • (2004) HPCA , pp. 176-185
    • Liu, C.1    Sivasubramaniam, A.2    Kandemir, M.3
  • 25
    • 0014701246
    • Evaluation techniques and storage hierarchies
    • R. Mattson, J. Gecsei, D. Slutz, and I. Traiger. Evaluation techniques and storage hierarchies. IBM Systems J., 9(2):78-117, 1970.
    • (1970) IBM Systems J. , vol.9 , Issue.2 , pp. 78-117
    • Mattson, R.1    Gecsei, J.2    Slutz, D.3    Traiger, I.4
  • 26
    • 63549102138
    • Multi-optimization power management for chip multiprocessors
    • K. Meng, R. Joseph, R. Dick, and L. Shang. Multi-optimization power management for chip multiprocessors. In PACT, pages 177-186, 2008.
    • (2008) PACT , pp. 177-186
    • Meng, K.1    Joseph, R.2    Dick, R.3    Shang, L.4
  • 27
    • 34548033967
    • JIT instrumentation: A novel approach to dynamically instrument operating systems
    • DOI 10.1145/1272996.1273000, Operating Systems Review - Proceedings of the 2007 EuroSys Conference
    • M. Olszewski, K. Mierle, A. Czajkowski, and A. Demke Brown. JIT instrumentation: a novel approach to dynamically instrument operating systems. In EuroSys, pages 3-16, 2007. (Pubitemid 47281571)
    • (2007) Operating Systems Review (ACM) , pp. 3-16
    • Olszewski, M.1    Mierle, K.2    Czajkowski, A.3    Brown, A.D.4
  • 29
    • 34548042910
    • Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches
    • DOI 10.1109/MICRO.2006.49, 4041865, Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO-39
    • M. Qureshi and Y. Patt. Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches. In MICRO, pages 423-432, 2006. (Pubitemid 351337015)
    • (2006) Proceedings of the Annual International Symposium on Microarchitecture, MICRO , pp. 423-432
    • Qureshi, M.K.1    Patt, Y.N.2
  • 31
    • 0031367150
    • A resource allocation model for QoS management
    • R. Rajkumar, C. Lee, J. Lehoczky, and D. Siewiorek. A resource allocation model for QoS management. In RTSS, pages 298-308, 1997.
    • (1997) RTSS , pp. 298-308
    • Rajkumar, R.1    Lee, C.2    Lehoczky, J.3    Siewiorek, D.4
  • 32
    • 10444263677
    • Architectural support for enhanced SMT job scheduling
    • A. Settle, J. Kihm, A. Janiszewski, and D. Connors. Architectural support for enhanced SMT job scheduling. In PACT, 2004.
    • (2004) PACT
    • Settle, A.1    Kihm, J.2    Janiszewski, A.3    Connors, D.4
  • 33
    • 77953988297
    • Using Valgrind to detect undefined value errors with bit-precision
    • J. Seward and N. Nethercote. Using Valgrind to detect undefined value errors with bit-precision. In USENIX ATC, pages 17-30, 2005.
    • (2005) USENIX ATC , pp. 17-30
    • Seward, J.1    Nethercote, N.2
  • 35
    • 0038345698
    • Phase tracking and prediction
    • T. Sherwood, S. Sair, and B. Calder. Phase tracking and prediction. In ISCA, pages 336-349, 2003.
    • (2003) ISCA , pp. 336-349
    • Sherwood, T.1    Sair, S.2    Calder, B.3
  • 37
    • 66749168716
    • Reducing the harmful effects of last-level cache polluters with an OS-level, software-only pollute buffer
    • L. Soares, D. Tam, and M. Stumm. Reducing the harmful effects of last-level cache polluters with an OS-level, software-only pollute buffer. In MICRO, 2008.
    • (2008) MICRO
    • Soares, L.1    Tam, D.2    Stumm, M.3
  • 38
    • 67650033179
    • Dynamic partitioning of the cache hierarchy in shared data centers
    • G. Soundararajan, J. Chen, M. Sharaf, and C. Amza. Dynamic partitioning of the cache hierarchy in shared data centers. In VLDB, pages 635-646, 2008.
    • (2008) VLDB , pp. 635-646
    • Soundararajan, G.1    Chen, J.2    Sharaf, M.3    Amza, C.4
  • 41
    • 1642371317
    • Dynamic partitioning of shared cache memory
    • E. Suh, L. Rudolph, and S. Devadas. Dynamic partitioning of shared cache memory. The J. of Supercomputing, 28(1):7-26, Apr. 2004.
    • (2004) J. of Supercomputing , vol.28 , Issue.1 , pp. 7-26
    • Suh, E.1    Rudolph, L.2    Devadas, S.3
  • 42
    • 57749176037
    • Managing shared L2 caches on multicore systems in software
    • D. Tam, R. Azimi, L. Soares, and M. Stumm. Managing shared L2 caches on multicore systems in software. In WIOSCA, pages 26-33, 2007.
    • (2007) WIOSCA , pp. 26-33
    • Tam, D.1    Azimi, R.2    Soares, L.3    Stumm, M.4
  • 43
    • 34548030923
    • Thread clustering: Sharing-aware scheduling on SMP-CMP-SMT multiprocessors
    • DOI 10.1145/1272996.1273004, Operating Systems Review - Proceedings of the 2007 EuroSys Conference
    • D. Tam, R. Azimi, and M. Stumm. Thread clustering: Sharing-aware scheduling on SMP-CMP-SMT multiprocessors. In EuroSys, pages 47-58, 2007. (Pubitemid 47281574)
    • (2007) Operating Systems Review (ACM) , pp. 47-58
    • Tam, D.1    Azimi, R.2    Stumm, M.3
  • 44
    • 0026881152
    • Improving disk cache hit-ratios through cache partitioning
    • D. Thiebaut, H. Stone, and J. Wolf. Improving disk cache hit-ratios through cache partitioning. IEEE TOC, 41(6):665-676, 1992.
    • (1992) IEEE TOC , vol.41 , Issue.6 , pp. 665-676
    • Thiebaut, D.1    Stone, H.2    Wolf, J.3
  • 45
    • 85076757258
    • CRAMM: Virtual memory support for garbage-collected applications
    • T. Yang, E. Berger, S. Kaplan, and J. Moss. CRAMM: virtual memory support for garbage-collected applications. In OSDI, pages 103-116, 2006.
    • (2006) OSDI , pp. 103-116
    • Yang, T.1    Berger, E.2    Kaplan, S.3    Moss, J.4
  • 48
    • 84893513203
    • The multi-queue replacement algorithm for second level buffer caches
    • Y. Zhou, J. Philbin, and K. Li. The multi-queue replacement algorithm for second level buffer caches. In USENIX ATC, pages 91-104, 2001.
    • (2001) USENIX ATC , pp. 91-104
    • Zhou, Y.1    Philbin, J.2    Li, K.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.