SCOPUS Information Retrieval Platform

International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS

Volume , Issue , 2013, Pages 357-368

To hardware prefetch or not to prefetch? A virtualized environment study and core binding approach

(2) Kang, Hui (a); Wong, Jennifer L (a)

(a) Stony Brook University (United States)

Author keywords

Prefetching; Shared cache; Virtualization

Indexed keywords

APPLICATION PERFORMANCE; HARDWARE AND SOFTWARE; PREFETCHING; QUANTITATIVE MEASUREMENT; SHARED CACHE; SYSTEM ADMINISTRATORS; VIRTUALIZATIONS; VIRTUALIZED ENVIRONMENT;

OBJECT ORIENTED PROGRAMMING; VIRTUAL REALITY;

HARDWARE;

EID: 84875660340 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/2451116.2451155 Document Type: Conference Paper

Times cited : (7)

References (44)

1
- 34547461216
- A comparison of software and hardware techniques for x86 virtualization
- ADAMS, K., AND AGESEN, O. A comparison of software and hardware techniques for x86 virtualization. In ASPLOS (2006), pp. 2-13.
- (2006) ASPLOS , pp. 2-13
- Adams, K.¹ Agesen, O.²

2
- 84875671100
- BIOS and kernel developer's guide for AMD family 10h processors
- AMD. BIOS and kernel developer's guide for AMD family 10h processors. White Paper, 2010.
- (2010) AMD, White Paper

3
- 70649107128
- A communication characterisation of Splash-2 and Parsec
- BARROW-WILLIAMS, N., FENSCH, C., AND MOORE, S. A communication characterisation of Splash-2 and Parsec. In IISWC (2009), pp. 86-97.
- (2009) IISWC , pp. 86-97
- Barrow-Williams, N.¹ Fensch, C.² Moore, S.³

4
- 70449652917
- Characterizing the TLB behavior of emerging parallel workloads on chip multiprocessors
- BHATTACHARJEE, A., AND MARTONOSI, M. Characterizing the TLB behavior of emerging parallel workloads on chip multiprocessors. In PACT (2009), pp. 29-40.
- (2009) PACT , pp. 29-40
- Bhattacharjee, A.¹ Martonosi, M.²

5
- 79953093822
- PhD thesis, Princeton University, January
- BIENIA, C. Benchmarking Modern Multiprocessors. PhD thesis, Princeton University, January 2011.
- (2011) Benchmarking Modern Multiprocessors
- Bienia, C.¹

6
- 76749142994
- Coordinated control of multiple prefetchers in multi-core systems
- EBRAHIMI, E., MUTLU, O., LEE, C. J., AND PATT, Y. N. Coordinated control of multiple prefetchers in multi-core systems. In Micro (2009), pp. 316-326.
- (2009) Micro , pp. 316-326
- Ebrahimi, E.¹ Mutlu, O.² Lee, C.J.³ Patt, Y.N.⁴

7
- 64949179220
- Techniques for bandwidth-efficient prefetching of linked data structures in hybrid prefetching systems
- EBRAHIMI, E., MUTLU, O., AND PATT, Y. N. Techniques for bandwidth-efficient prefetching of linked data structures in hybrid prefetching systems. In HPCA (2009), pp. 7-17.
- (2009) HPCA , pp. 7-17
- Ebrahimi, E.¹ Mutlu, O.² Patt, Y.N.³

8
- 84858791438
- Clearing the clouds: A study of emerging scale-out workloads on modern hardware
- FERDMAN, M., ADILEH, A., KOCBERBER, O., VOLOS, S., ALISAFAEE, M., JEVDJIC, D., KAYNAK, C., POPESCU, A. D., AILAMAKI, A., AND FALSAFI, B. Clearing the clouds: a study of emerging scale-out workloads on modern hardware. In ASPLOS (2012), pp. 37-48.
- (2012) ASPLOS , pp. 37-48
- Ferdman, M.¹ Adileh, A.² Kocberber, O.³ Volos, S.⁴ Alisafaee, M.⁵ Jevdjic, D.⁶ Kaynak, C.⁷ Popescu, A.D.⁸ Ailamaki, A.⁹ Falsafi, B.¹⁰

9
- 84875675406
- FILEBENCH. Filebench
- FILEBENCH. Filebench. http://sourceforge.net/apps/mediawiki/filebench.

10
- 82155203098
- Cuanta: Quantifying effects of shared on-chip resource interference for consolidated virtual machines
- GOVINDAN, S., LIU, J., KANSAL, A., AND SIVASUBRAMANIAM, A. Cuanta: quantifying effects of shared on-chip resource interference for consolidated virtual machines. In SoCC (2011), pp. 22:1-22:14.
- (2011) SoCC , pp. 22:1-22:14
- Govindan, S.¹ Liu, J.² Kansal, A.³ Sivasubramaniam, A.⁴

11
- 84875648785
- IBM eServer xSeries 366 tuning tips
- IBM. IBM eServer xSeries 366 tuning tips. Technical Report, 2005.
- (2005) IBM, Technical Report

12
- 84875643805
- Virtualization on the IBM system x3950 server
- IBM. Virtualization on the IBM system x3950 server. Technical Report, 2006.
- (2006) IBM, Technical Report

13
- 84875665139
- Tuning IBM system x servers for performance
- IBM. Tuning IBM system x servers for performance. Technical Report, 2007.
- (2007) IBM, Technical Report

14
- 84875660564
- Achieving fast, scalable I/O for virtualized servers
- INTEL. Achieving fast, scalable I/O for virtualized servers. White Paper, 2009.
- (2009) INTEL, White Paper

15
- 84858767531
- CRUISE: Cache replacement and utility-aware scheduling
- JALEEL, A., NAJAF-ABADI, H. H., SUBRAMANIAM, S., STEELY, S. C., AND EMER, J. CRUISE: cache replacement and utility-aware scheduling. In ASPLOS (2012), pp. 249-260.
- (2012) ASPLOS , pp. 249-260
- Jaleel, A.¹ Najaf-Abadi, H.H.² Subramaniam, S.³ Steely, S.C.⁴ Emer, J.⁵

16
- 77954998134
- High performance cache replacement using re-reference interval prediction (RRIP)
- JALEEL, A., THEOBALD, K. B., STEELY JR., S. C., AND EMER, J. High performance cache replacement using re-reference interval prediction (RRIP). In ISCA (2010), pp. 60-71.
- (2010) ISCA , pp. 60-71
- Jaleel, A.¹ Theobald, K.B.² Steely Jr., S.C.³ Emer, J.⁴

17
- 33846554108
- Geiger: Monitoring the buffer cache in a virtual machine environment
- JONES, S. T., ARPACI-DUSSEAU, A. C., AND ARPACI-DUSSEAU, R. H. Geiger: monitoring the buffer cache in a virtual machine environment. In ASPLOS (2006), pp. 14-24.
- (2006) ASPLOS , pp. 14-24
- Jones, S.T.¹ Arpaci-Dusseau, A.C.² Arpaci-Dusseau, R.H.³

18
- 79951697650
- Sampling dead block prediction for last-level caches
- KHAN, S.M., TIAN, Y., AND JIMENEZ, D. A. Sampling dead block prediction for last-level caches. In Micro (2010), pp. 175-186.
- (2010) Micro , pp. 175-186
- Khan, S.M.¹ Tian, Y.² Jimenez, D.A.³

19
- 66749189125
- Prefetch-Aware DRAM controllers
- LEE, C. J., MUTLU, O., NARASIMAN, V., AND PATT, Y. N. Prefetch-Aware DRAM controllers. In Micro (2008), pp. 200-209.
- (2008) Micro , pp. 200-209
- Lee, C.J.¹ Mutlu, O.² Narasiman, V.³ Patt, Y.N.⁴

20
- 80052522711
- Prefetch-aware shared resource management for multi-core systems
- LEE, C. J., MUTLU, O., NARASIMAN, V., AND PATT, Y. N. Prefetch-aware shared resource management for multi-core systems. In ISCA (2011), pp. 141-152.
- (2011) ISCA , pp. 141-152
- Lee, C.J.¹ Mutlu, O.² Narasiman, V.³ Patt, Y.N.⁴

21
- 77952569781
- Understanding how off-chip memory bandwidth partitioning in chip multiprocessors affects system performance
- LIU, F., JIANG, X., AND SOLIHIN, Y. Understanding how off-chip memory bandwidth partitioning in chip multiprocessors affects system performance. In HPCA (2010), pp. 1-12.
- (2010) HPCA , pp. 1-12
- Liu, F.¹ Jiang, X.² Solihin, Y.³

22
- 79960188403
- Studying the impact of hardware prefetching and bandwidth partitioning in chip-multiprocessors
- LIU, F., AND SOLIHIN, Y. Studying the impact of hardware prefetching and bandwidth partitioning in chip-multiprocessors. In SIGMETRICS (2011), pp. 37-48.
- (2011) SIGMETRICS , pp. 37-48
- Liu, F.¹ Solihin, Y.²

23
- 0031594020
- An analysis of database workload performance on simultaneous multithreaded processors
- LO, J., BARROSO, L. A., EGGERS, S. J., GHARACHORLOO, K., LEVY, H. M., AND PAREKH, S. S. An analysis of database workload performance on simultaneous multithreaded processors. In ISCA (1998), pp. 39-50.
- (1998) ISCA , pp. 39-50
- Lo, J.¹ Barroso, L.A.² Eggers, S.J.³ Gharachorloo, K.⁴ Levy, H.M.⁵ Parekh, S.S.⁶

24
- 84863391679
- DVM: Towards a datacenter-scale virtual machine
- MA, Z., SHENG, Z., GU, L., WEN, L., AND ZHANG, G. DVM: towards a datacenter-scale virtual machine. In VEE (2012), pp. 39-50.
- (2012) VEE , pp. 39-50
- Ma, Z.¹ Sheng, Z.² Gu, L.³ Wen, L.⁴ Zhang, G.⁵

25
- 84858771269
- Reducing memory interference in multicore systems via application-aware memory channel partitioning
- MURALIDHARA, S. P., SUBRAMANIAN, L., MUTLU, O., KANDEMIR, M., AND MOSCIBRODA, T. Reducing memory interference in multicore systems via application-aware memory channel partitioning. In Micro (2011), pp. 374-385.
- (2011) Micro , pp. 374-385
- Muralidhara, S.P.¹ Subramanian, L.² Mutlu, O.³ Kandemir, M.⁴ Moscibroda, T.⁵

26
- 77954365569
- Scheduling I/O in virtual machine monitors
- ONGARO, D., COX, A. L., AND RIXNER, S. Scheduling I/O in virtual machine monitors. In VEE (2008), pp. 14-24.
- (2008) VEE , pp. 14-24
- Ongaro, D.¹ Cox, A.L.² Rixner, S.³

27
- 84875644496
- OPROFILE. A system profiler for Linux. http://oprofile.sourceforge.net.
- A System Profiler for Linux

28
- 84875658216
- Algorithms to take advantage of hardware prefetching
- PAN, S., CHERNG, C., DICK, K., AND LADNER, R. E. Algorithms to take advantage of hardware prefetching. In ALENEX (2007).
- (2007) ALENEX
- Pan, S.¹ Cherng, C.² Dick, K.³ Ladner, R.E.⁴

29
- 84863929827
- Page/slab cache control in a virtualized environment
- SINGH, B. Page/slab cache control in a virtualized environment. In Linux Symposium (2010), pp. 252-262.
- (2010) Linux Symposium , pp. 252-262
- Singh, B.¹

30
- 66749168716
- Reducing the harmful effects of last-level cache polluters with an OS-level, software-only pollute buffer
- SOARES, L., TAM, D., AND STUMM, M. Reducing the harmful effects of last-level cache polluters with an OS-level, software-only pollute buffer. In MICRO (2008), pp. 258-269.
- (2008) MICRO , pp. 258-269
- Soares, L.¹ Tam, D.² Stumm, M.³

31
- 67650085342
- Adaptive set pinning: Managing shared caches in chip multiprocessors
- SRIKANTAIAH, S., KANDEMIR, M., AND IRWIN, M. J. Adaptive set pinning: managing shared caches in chip multiprocessors. In ASPLOS (2008), pp. 135-144.
- (2008) ASPLOS , pp. 135-144
- Srikantaiah, S.¹ Kandemir, M.² Irwin, M.J.³

32
- 34547655822
- Feedback directed prefetching: Improving the performance and bandwidth-efficiency of hardware prefetchers
- SRINATH, S., MUTLU, O., KIM, H., AND PATT, Y. N. Feedback directed prefetching: Improving the performance and bandwidth-efficiency of hardware prefetchers. In HPCA (2007), pp. 63-74.
- (2007) HPCA , pp. 63-74
- Srinath, S.¹ Mutlu, O.² Kim, H.³ Patt, Y.N.⁴

33
- 84875656072
- SYSBENCH. Sysbench: a system performance benchmark. http://sysbench.sourceforge.net.
- Sysbench: A System Performance Benchmark

34
- 57749176037
- Managing shared L2 caches on multicore systems in software
- TAM, D., AZIMI, R., SOARES, L., AND STUMM, M. Managing shared L2 caches on multicore systems in software. In WIOSCA (2007).
- (2007) WIOSCA
- Tam, D.¹ Azimi, R.² Soares, L.³ Stumm, M.⁴

35
- 80052535250
- The impact of memory subsystem resource sharing on datacenter applications
- TANG, L., MARS, J., VACHHARAJANI, N., HUNDT, R., AND SOFFA, M. L. The impact of memory subsystem resource sharing on datacenter applications. In ISCA (2011), pp. 283-294.
- (2011) ISCA , pp. 283-294
- Tang, L.¹ Mars, J.² Vachharajani, N.³ Hundt, R.⁴ Soffa, M.L.⁵

36
- 79551699363
- Efficient prefetching with hybrid schemes and use of program feedback to adjust prefetcher aggressiveness
- VERMA, S., KOPPELMAN, D.M., AND PENG, L. Efficient prefetching with hybrid schemes and use of program feedback to adjust prefetcher aggressiveness. Journal of Instruction-Level Parallelism, 13 (2011), 1-14.
- (2011) Journal of Instruction-Level Parallelism , vol.13 , pp. 1-14
- Verma, S.¹ Koppelman, D.M.² Peng, L.³

37
- 84875670701
- VMware VMmark v1.0.0 Results - Dell PowerEdge R900
- VMWARE. VMware VMmark v1.0.0 Results - Dell PowerEdge R900. Technical Report, 2008.
- (2008) VMWARE, Technical Report

38
- 84875651434
- Performance best practices for VMware vSphere 5.0
- VMWARE. Performance best practices for VMware vSphere 5.0. Technical Report, 2011.
- (2011) VMWARE, Technical Report

39
- 84978382687
- Memory resource management in VMware ESX Server
- WALDSPURGER, C. A. Memory resource management in VMware ESX Server. In SIGOPS Oper. Syst. Rev. (2002), pp. 181-194.
- (2002) SIGOPS Oper. Syst. Rev. , pp. 181-194
- Waldspurger, C.A.¹

40
- 84863379287
- PACMan: Prefetch-aware cache management for high performance caching
- WU, C.-J., JALEEL, A., MARTONOSI, M., STEELY JR., S. C., AND EMER, J. PACMan: prefetch-aware cache management for high performance caching. In Micro (2011), pp. 442-453.
- (2011) Micro , pp. 442-453
- Wu, C.-J.¹ Jaleel, A.² Martonosi, M.³ Steely Jr., S.C.⁴ Emer, J.⁵

41
- 70450279102
- PIPP: Promotion/insertion pseudo-partitioning of multi-core shared caches
- XIE, Y., AND LOH, G. H. PIPP: promotion/insertion pseudo-partitioning of multi-core shared caches. In ISCA (2009), pp. 174-183.
- (2009) ISCA , pp. 174-183
- Xie, Y.¹ Loh, G.H.²

42
- 77749340037
- Does cache sharing on modern CMP matter to the performance of contemporary multithreaded programs?
- ZHANG, E., JIANG, Y., AND SHEN, X. Does cache sharing on modern CMP matter to the performance of contemporary multithreaded programs? In PPoPP (2010), pp. 203-212.
- (2010) PPoPP , pp. 203-212
- Zhang, E.¹ Jiang, Y.² Shen, X.³

43
- 70349111334
- Towards practical page coloring-based multicore cache management
- ZHANG, X., DWARKADAS, S., AND SHEN, K. Towards practical page coloring-based multicore cache management. In Eurosys (2009), pp. 89-102.
- (2009) Eurosys , pp. 89-102
- Zhang, X.¹ Dwarkadas, S.² Shen, K.³

44
- 77952248898
- Addressing shared resource contention in multicore processors via scheduling
- ZHURAVLEV, S., BLAGODUROV, S., AND FEDOROVA, A. Addressing shared resource contention in multicore processors via scheduling. In ASPLOS (2010), pp. 129-142.
- (2010) ASPLOS , pp. 129-142
- Zhuravlev, S.¹ Blagodurov, S.² Fedorova, A.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.