메뉴 건너뛰기




Volumn , Issue , 2013, Pages 357-368

To hardware prefetch or not to prefetch? A virtualized environment study and core binding approach

Author keywords

Prefetching; Shared cache; Virtualization

Indexed keywords

APPLICATION PERFORMANCE; HARDWARE AND SOFTWARE; PREFETCHING; QUANTITATIVE MEASUREMENT; SHARED CACHE; SYSTEM ADMINISTRATORS; VIRTUALIZATIONS; VIRTUALIZED ENVIRONMENT;

EID: 84875660340     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2451116.2451155     Document Type: Conference Paper
Times cited : (7)

References (44)
  • 1
    • 34547461216 scopus 로고    scopus 로고
    • A comparison of software and hardware techniques for x86 virtualization
    • ADAMS, K., AND AGESEN, O. A comparison of software and hardware techniques for x86 virtualization. In ASPLOS (2006), pp. 2-13.
    • (2006) ASPLOS , pp. 2-13
    • Adams, K.1    Agesen, O.2
  • 2
    • 84875671100 scopus 로고    scopus 로고
    • BIOS and kernel developer's guide for AMD family 10h processors
    • AMD. BIOS and kernel developer's guide for AMD family 10h processors. White Paper, 2010.
    • (2010) AMD, White Paper
  • 3
    • 70649107128 scopus 로고    scopus 로고
    • A communication characterisation of splash-2 and parsec
    • BARROW-WILLIAMS, N., FENSCH, C., AND MOORE, S. A communication characterisation of Splash-2 and Parsec. In IISWC (2009), pp. 86-97.
    • (2009) IISWC , pp. 86-97
    • Barrow-williams, N.1    Fensch, C.2    Moore, S.3
  • 4
    • 70449652917 scopus 로고    scopus 로고
    • Characterizing the TLB behavior of emerging parallel workloads on chip multiprocessors
    • BHATTACHARJEE, A., AND MARTONOSI, M. Characterizing the TLB behavior of emerging parallel workloads on chip multiprocessors. In PACT (2009), pp. 29-40.
    • (2009) PACT , pp. 29-40
    • Bhattacharjee, A.1    Martonosi, M.2
  • 6
    • 76749142994 scopus 로고    scopus 로고
    • Coordinated control of multiple prefetchers in multi-core systems
    • EBRAHIMI, E., MUTLU, O., LEE, C. J., AND PATT, Y. N. Coordinated control of multiple prefetchers in multi-core systems. In Micro (2009), pp. 316-326.
    • (2009) Micro , pp. 316-326
    • Ebrahimi, E.1    Mutlu, O.2    Lee, C.J.3    Patt, Y.N.4
  • 7
    • 64949179220 scopus 로고    scopus 로고
    • Techniques for bandwidth-efficient prefetching of linked data structures in hybrid prefetching systems
    • EBRAHIMI, E., MUTLU, O., AND PATT, Y. N. Techniques for bandwidth-efficient prefetching of linked data structures in hybrid prefetching systems. In HPCA (2009), pp. 7 - 17.
    • (2009) HPCA , pp. 7-17
    • Ebrahimi, E.1    Mutlu, O.2    Patt, Y.N.3
  • 9
    • 84875675406 scopus 로고    scopus 로고
    • FILEBENCH. Filebench
    • FILEBENCH. Filebench. http://sourceforge.net/apps/mediawiki/filebench.
  • 10
    • 82155203098 scopus 로고    scopus 로고
    • Cuanta: Quantifying effects of shared on-chip resource interference for consolidated virtual machines
    • 22:1-22:14
    • GOVINDAN, S., LIU, J., KANSAL, A., AND SIVASUBRAMANIAM, A. Cuanta: quantifying effects of shared on-chip resource interference for consolidated virtual machines. In SoCC (2011), pp. 22:1-22:14.
    • (2011) SoCC
    • Govindan, S.1    Liu, J.2    Kansal, A.3    Sivasubramaniam, A.4
  • 11
    • 84875648785 scopus 로고    scopus 로고
    • IBM eServer xSeries 366 tuning tips
    • IBM. IBM eServer xSeries 366 tuning tips. Technical Report, 2005.
    • (2005) IBM, Technical Report
  • 12
    • 84875643805 scopus 로고    scopus 로고
    • Virtualization on the IBM system x3950 server
    • IBM. Virtualization on the IBM system x3950 server. Technical Report, 2006.
    • (2006) IBM, Technical Report
  • 13
    • 84875665139 scopus 로고    scopus 로고
    • Tuning IBM system x servers for performance
    • IBM. Tuning IBM system x servers for performance. Technical Report, 2007.
    • (2007) IBM, Technical Report
  • 14
    • 84875660564 scopus 로고    scopus 로고
    • Achieving fast, scalable I/O for virtualized servers
    • INTEL. Achieving fast, scalable I/O for virtualized servers. White Paper, 2009.
    • (2009) INTEL, White Paper
  • 16
    • 77954998134 scopus 로고    scopus 로고
    • High performance cache replacement using re-reference interval prediction (RRIP)
    • JALEEL, A., THEOBALD, K. B., STEELY JR., S. C., AND EMER, J. High performance cache replacement using re-reference interval prediction (RRIP). In ISCA (2010), pp. 60-71.
    • (2010) ISCA , pp. 60-71
    • Jaleel, A.1    Theobald, K.B.2    Steely Jr., S.C.3    Emer, J.4
  • 17
    • 33846554108 scopus 로고    scopus 로고
    • Geiger: Monitoring the buffer cache in a virtual machine environment
    • JONES, S. T., ARPACI-DUSSEAU, A. C., AND ARPACI-DUSSEAU, R. H. Geiger: monitoring the buffer cache in a virtual machine environment. In ASPLOS (2006), pp. 14-24.
    • (2006) ASPLOS , pp. 14-24
    • Jones, S.T.1    Arpaci-dusseau, A.C.2    Arpaci-dusseau, R.H.3
  • 18
    • 79951697650 scopus 로고    scopus 로고
    • Sampling dead block prediction for last-level caches
    • KHAN, S.M., TIAN, Y., AND JIMENEZ, D. A. Sampling dead block prediction for last-level caches. In Micro (2010), pp. 175-186.
    • (2010) Micro , pp. 175-186
    • Khan, S.M.1    Tian, Y.2    Jimenez, D.A.3
  • 19
  • 20
    • 80052522711 scopus 로고    scopus 로고
    • Prefetch-aware shared resource management for multi-core systems
    • LEE, C. J., MUTLU, O., NARASIMAN, V., AND PATT, Y. N. Prefetch-aware shared resource management for multi-core systems. In ISCA (2011), pp. 141-152.
    • (2011) ISCA , pp. 141-152
    • Lee, C.J.1    Mutlu, O.2    Narasiman, V.3    Patt, Y.N.4
  • 21
    • 77952569781 scopus 로고    scopus 로고
    • Understanding how off-chip memory bandwidth partitioning in chip multiprocessors affects system performance
    • LIU, F., JIANG, X., AND SOLIHIN, Y. Understanding how off-chip memory bandwidth partitioning in chip multiprocessors affects system performance. In HPCA (2010), pp. 1-12.
    • (2010) HPCA , pp. 1-12
    • Liu, F.1    Jiang, X.2    Solihin, Y.3
  • 22
    • 79960188403 scopus 로고    scopus 로고
    • Studying the impact of hardware prefetching and bandwidth partitioning in chip-multiprocessors
    • LIU, F., AND SOLIHIN, Y. Studying the impact of hardware prefetching and bandwidth partitioning in chip-multiprocessors. In SIGMETRICS (2011), pp. 37-48.
    • (2011) SIGMETRICS , pp. 37-48
    • Liu, F.1    Solihin, Y.2
  • 23
    • 0031594020 scopus 로고    scopus 로고
    • An analysis of database workload performance on simultaneous multithreaded processors
    • LO, J., BARROSO, L. A., EGGERS, S. J., GHARACHORLOO, K., LEVY, H. M., AND PAREKH, S. S. An analysis of database workload performance on simultaneous multithreaded processors. In ISCA (1998), pp. 39-50.
    • (1998) ISCA , pp. 39-50
    • Lo, J.1    Barroso, L.A.2    Eggers, S.J.3    Gharachorloo, K.4    Levy, H.M.5    Parekh, S.S.6
  • 24
    • 84863391679 scopus 로고    scopus 로고
    • DVM: Towards a datacenter-scale virtual machine
    • MA, Z., SHENG, Z., GU, L., WEN, L., AND ZHANG, G. DVM: towards a datacenter-scale virtual machine. In VEE (2012), pp. 39- 50.
    • (2012) VEE , pp. 39-50
    • Ma, Z.1    Sheng, Z.2    Gu, L.3    Wen, L.4    Zhang, G.5
  • 25
    • 84858771269 scopus 로고    scopus 로고
    • Reducing memory interference in multicore systems via application-aware memory channel partitioning
    • MURALIDHARA, S. P., SUBRAMANIAN, L., MUTLU, O., KANDEMIR, M., AND MOSCIBRODA, T. Reducing memory interference in multicore systems via application-aware memory channel partitioning. In Micro (2011), pp. 374-385.
    • (2011) Micro , pp. 374-385
    • Muralidhara, S.P.1    Subramanian, L.2    Mutlu, O.3    Kandemir, M.4    Moscibroda, T.5
  • 26
    • 77954365569 scopus 로고    scopus 로고
    • Scheduling I/O in virtual machine monitors
    • ONGARO, D., COX, A. L., AND RIXNER, S. Scheduling I/O in virtual machine monitors. In VEE (2008), pp. 14-24.
    • (2008) VEE , pp. 14-24
    • Ongaro, D.1    Cox, A.L.2    Rixner, S.3
  • 28
    • 84875658216 scopus 로고    scopus 로고
    • Algorithms to take advantage of hardware prefetching
    • PAN, S., CHERNG, C., DICK, K., AND LADNER, R. E. Algorithms to take advantage of hardware prefetching. In ALENEX (2007).
    • (2007) ALENEX
    • Pan, S.1    Cherng, C.2    Dick, K.3    Ladner, R.E.4
  • 29
    • 84863929827 scopus 로고    scopus 로고
    • Page/slab cache control in a virtualized environment
    • SINGH, B. Page/slab cache control in a virtualized environment. In Linux Symposium (2010), pp. 252-262.
    • (2010) Linux Symposium , pp. 252-262
    • Singh, B.1
  • 30
    • 66749168716 scopus 로고    scopus 로고
    • Reducing the harmful effects of last-level cache polluters with an os-level, software-only pollute buffer
    • SOARES, L., TAM, D., AND STUMM, M. Reducing the harmful effects of last-level cache polluters with an os-level, software-only pollute buffer. In MICRO (2008), pp. 258-269.
    • (2008) MICRO , pp. 258-269
    • Soares, L.1    Tam, D.2    Stumm, M.3
  • 31
    • 67650085342 scopus 로고    scopus 로고
    • Adaptive set pinning: Managing shared caches in chip multiprocessors
    • SRIKANTAIAH, S., KANDEMIR, M., AND IRWIN, M. J. Adaptive set pinning: managing shared caches in chip multiprocessors. In ASPLOS (2008), pp. 135-144.
    • (2008) ASPLOS , pp. 135-144
    • Srikantaiah, S.1    Kandemir, M.2    Irwin, M.J.3
  • 32
    • 34547655822 scopus 로고    scopus 로고
    • Feedback directed prefetching: Improving the performance and bandwidthefficiency of hardware prefetchers
    • SRINATH, S., MUTLU, O., KIM, H., AND PATT, Y. N. Feedback directed prefetching: Improving the performance and bandwidthefficiency of hardware prefetchers. In HPCA (2007), pp. 63-74.
    • (2007) HPCA , pp. 63-74
    • Srinath, S.1    Mutlu, O.2    Kim, H.3    Patt, Y.N.4
  • 34
    • 57749176037 scopus 로고    scopus 로고
    • Managing shared L2 caches on multicore systems in software
    • TAM, D., AZIMI, R., SOARES, L., AND STUMM, M. Managing shared L2 caches on multicore systems in software. In WIOSCA (2007).
    • (2007) WIOSCA
    • Tam, D.1    Azimi, R.2    Soares, L.3    Stumm, M.4
  • 35
    • 80052535250 scopus 로고    scopus 로고
    • The impact of memory subsystem resource sharing on datacenter applications
    • TANG, L., MARS, J., VACHHARAJANI, N., HUNDT, R., AND SOFFA, M. L. The impact of memory subsystem resource sharing on datacenter applications. In ISCA (2011), pp. 283-294.
    • (2011) ISCA , pp. 283-294
    • Tang, L.1    Mars, J.2    Vachharajani, N.3    Hundt, R.4    Soffa, M.L.5
  • 36
    • 79551699363 scopus 로고    scopus 로고
    • Efficient prefetching with hybrid schemes and use of program feedback to adjust prefetcher aggressiveness
    • VERMA, S., KOPPELMAN, D.M., AND PENG, L. Efficient prefetching with hybrid schemes and use of program feedback to adjust prefetcher aggressiveness. Journal of Instruction-Level Parallelism, 13 (2011), 1-14.
    • (2011) Journal of Instruction-Level Parallelism , vol.13 , pp. 1-14
    • Verma, S.1    Koppelman, D.M.2    Peng, L.3
  • 37
    • 84875670701 scopus 로고    scopus 로고
    • VMware VMmark v1.0.0 Results - Dell PowerEdge R900
    • VMWARE. VMware VMmark v1.0.0 Results - Dell PowerEdge R900. Technical Report, 2008.
    • (2008) VMWARE, Technical Report
  • 38
    • 84875651434 scopus 로고    scopus 로고
    • Performance best practices for VMware vSphere 5.0
    • VMWARE. Performance best practices for VMware vSphere 5.0. Technical Report, 2011.
    • (2011) VMWARE, Technical Report
  • 39
    • 84978382687 scopus 로고    scopus 로고
    • Memory resource management in vmware esx server
    • WALDSPURGER, C. A. Memory resource management in vmware esx server. In SIGOPS Oper. Syst. Rev. (2002), pp. 181-194.
    • (2002) SIGOPS Oper. Syst. Rev. , pp. 181-194
    • Waldspurger, C.A.1
  • 40
    • 84863379287 scopus 로고    scopus 로고
    • PACMan: Prefetch-aware cache management for high performance caching
    • WU, C.-J., JALEEL, A., MARTONOSI, M., STEELY JR., S. C., AND EMER, J. PACMan: prefetch-aware cache management for high performance caching. In Micro (2011), pp. 442-453.
    • (2011) Micro , pp. 442-453
    • Wu, C.-J.1    Jaleel, A.2    Martonosi, M.3    Steely Jr., S.C.4    Emer, J.5
  • 41
    • 70450279102 scopus 로고    scopus 로고
    • PIPP: Promotion/insertion pseudopartitioning of multi-core shared caches
    • XIE, Y., AND LOH, G. H. PIPP: promotion/insertion pseudopartitioning of multi-core shared caches. In ISCA (2009), pp. 174-183.
    • (2009) ISCA , pp. 174-183
    • Xie, Y.1    Loh, G.H.2
  • 42
    • 77749340037 scopus 로고    scopus 로고
    • Does cache sharing on modern CMP matter to the performance of contemporary multithreaded programs?
    • ZHANG, E., JIANG, Y., AND SHEN, X. Does cache sharing on modern CMP matter to the performance of contemporary multithreaded programs? In PPoPP (2010), pp. 203-212.
    • (2010) PPoPP , pp. 203-212
    • Zhang, E.1    Jiang, Y.2    Shen, X.3
  • 43
    • 70349111334 scopus 로고    scopus 로고
    • Towards practical page coloring-based multicore cache management
    • ZHANG, X., DWARKADAS, S., AND SHEN, K. Towards practical page coloring-based multicore cache management. In Eurosys (2009), pp. 89-102.
    • (2009) Eurosys , pp. 89-102
    • Zhang, X.1    Dwarkadas, S.2    Shen, K.3
  • 44
    • 77952248898 scopus 로고    scopus 로고
    • Addressing shared resource contention in multicore processors via scheduling
    • ZHURAVLEV, S., BLAGODUROV, S., AND FEDOROVA, A. Addressing shared resource contention in multicore processors via scheduling. In ASPLOS (2010), pp. 129-142.
    • (2010) ASPLOS , pp. 129-142
    • Zhuravlev, S.1    Blagodurov, S.2    Fedorova, A.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.