Volume , Issue , 2014, Pages 1-12

Performance analysis of the memory management unit under scale-out workloads

Author keywords

[No Author keywords available]

Indexed keywords

BUFFER STORAGE; PHYSICAL ADDRESSES;

EID: 84946036877     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IISWC.2014.6983034     Document Type: Conference Paper
Times cited : (42)

References (49)
  • 1
    • 84946029996
    • "Cloudsuite Overview," http://parsa.epfl.ch/cloudsuite/overview.html.
    • Cloudsuite Overview
  • 4
    • 84946076235
    • "Perf wiki," https://perf.wiki.kernel.org/index.php/Main_Page.
    • Perf Wiki
  • 5
    • 84894591387
    • "SPEC CPU 2006," https://www.spec.org/cpu2006/.
    • (2006) SPEC CPU
  • 6
    • 84904578672
    • "The /proc filesystem," www.kernel.org/doc/Documentation/filesystems/proc.txt.
    • The /proc Filesystem
  • 9
    • 0026140567
    • The interaction of architecture and operating system design
    • T. E. Anderson, H. M. Levy, B. N. Bershad, and E. D. Lazowska, "The interaction of architecture and operating system design," ASPLOS, 1991, pp. 108-120.
    • (1991) ASPLOS , pp. 108-120
    • Anderson, T.E.1    Levy, H.M.2    Bershad, B.N.3    Lazowska, E.D.4
  • 11
    • 77955012281
    • Translation caching: Skip, don't walk (the page table)
    • T. W. Barr, A. L. Cox, and S. Rixner, "Translation Caching: Skip, Don't Walk (the Page Table)," ISCA, 2010, pp. 48-59.
    • (2010) ISCA , pp. 48-59
    • Barr, T.W.1    Cox, A.L.2    Rixner, S.3
  • 12
    • 84883331358
    • The datacenter as a computer: An introduction to the design of warehouse-scale machines
    • second edition, ser
    • L. A. Barroso, J. Clidaras, and U. Hölzle, The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines, Second Edition, ser. Synthesis Lectures on Computer Architecture, 2013.
    • (2013) Synthesis Lectures on Computer Architecture
    • Barroso, L.A.1    Clidaras, J.2    Hölzle, U.3
  • 13
    • 84881179047
    • Efficient virtual memory for big memory servers
    • A. Basu, J. Gandhi, J. Chang, M. D. Hill, and M. M. Swift, "Efficient Virtual Memory for Big Memory Servers," ISCA, 2013, pp. 237-248.
    • (2013) ISCA , pp. 237-248
    • Basu, A.1    Gandhi, J.2    Chang, J.3    Hill, M.D.4    Swift, M.M.5
  • 14
    • 84864859089
    • Reducing memory reference energy with opportunistic virtual caching
    • A. Basu, M. D. Hill, and M. M. Swift, "Reducing memory reference energy with opportunistic virtual caching," ISCA, 2012, pp. 297-308.
    • (2012) ISCA , pp. 297-308
    • Basu, A.1    Hill, M.D.2    Swift, M.M.3
  • 15
    • 84892513543
    • Large-reach memory management unit caches
    • A. Bhattacharjee, "Large-reach Memory Management Unit Caches," MICRO, 2013, pp. 383-394.
    • (2013) MICRO , pp. 383-394
    • Bhattacharjee, A.1
  • 16
    • 77952252973
    • Inter-core cooperative TLB for chip multiprocessors
    • A. Bhattacharjee and M. Martonosi, "Inter-core Cooperative TLB for Chip Multiprocessors," ASPLOS, 2010, pp. 359-370.
    • (2010) ASPLOS , pp. 359-370
    • Bhattacharjee, A.1    Martonosi, M.2
  • 17
    • 70449652917
    • Characterizing the TLB behavior of emerging parallel workloads on chip multiprocessors
    • A. Bhattacharjee and M. Martonosi, "Characterizing the TLB Behavior of Emerging Parallel Workloads on Chip Multiprocessors," PACT, 2009, pp. 29-40.
    • (2009) PACT , pp. 29-40
    • Bhattacharjee, A.1    Martonosi, M.2
  • 19
    • 0031237070
    • Virtual-address caches part 1: Problems and solutions in uniprocessors
    • Sep.
    • M. Cekleov and M. Dubois, "Virtual-address caches part 1: Problems and solutions in uniprocessors," IEEE Micro, vol. 17, no. 5, pp. 64-71, Sep. 1997.
    • (1997) IEEE Micro , vol.17 , Issue.5 , pp. 64-71
    • Cekleov, M.1    Dubois, M.2
  • 20
    • 0031274147
    • Virtual-address caches, part 2: Multiprocessor issues
    • Nov.
    • M. Cekleov and M. Dubois, "Virtual-address caches, part 2: Multiprocessor issues," IEEE Micro, vol. 17, no. 6, pp. 69-74, Nov. 1997.
    • (1997) IEEE Micro , vol.17 , Issue.6 , pp. 69-74
    • Cekleov, M.1    Dubois, M.2
  • 21
    • 0026865575
    • A simulation based study of TLB performance
    • J. B. Chen, A. Borg, and N. P. Jouppi, "A Simulation Based Study of TLB Performance," ISCA, 1992, pp. 114-123.
    • (1992) ISCA , pp. 114-123
    • Chen, J.B.1    Borg, A.2    Jouppi, N.P.3
  • 22
    • 84862110045
    • Dynamically reconfigurable hybrid cache: An energy-efficient last-level cache design
    • Y.-T. Chen, J. Cong, H. Huang, B. Liu, C. Liu, M. Potkonjak, and G. Reinman, "Dynamically Reconfigurable Hybrid Cache: An Energy-efficient Last-level Cache Design," DATE, 2012, pp. 45-50.
    • (2012) DATE , pp. 45-50
    • Chen, Y.-T.1    Cong, J.2    Huang, H.3    Liu, B.4    Liu, C.5    Potkonjak, M.6    Reinman, G.7
  • 23
    • 0022020051
    • Performance of the VAX-11/780 translation buffer: Simulation and measurement
    • Feb.
    • D. W. Clark and J. S. Emer, "Performance of the VAX-11/780 translation buffer: simulation and measurement," ACM Trans. Comput. Syst., vol. 3, no. 1, pp. 31-62, Feb. 1985.
    • (1985) ACM Trans. Comput. Syst. , vol.3 , Issue.1 , pp. 31-62
    • Clark, D.W.1    Emer, J.S.2
  • 24
    • 0002634962
    • Optimizing the idle task and other MMU tricks
    • C. Dougan, P. Mackerras, and V. Yodaiken, "Optimizing the Idle Task and Other MMU Tricks," OSDI, 1999, pp. 229-237.
    • (1999) OSDI , pp. 229-237
    • Dougan, C.1    Mackerras, P.2    Yodaiken, V.3
  • 28
    • 0542404031
    • A look at several memory management units, TLB-refill mechanisms, and page table organizations
    • B. L. Jacob and T. N. Mudge, "A Look at Several Memory Management Units, TLB-refill Mechanisms, and Page Table Organizations," ASPLOS, 1998, pp. 295-306.
    • (1998) ASPLOS , pp. 295-306
    • Jacob, B.L.1    Mudge, T.N.2
  • 29
    • 84881191462
    • Die-stacked DRAM caches for servers: Hit ratio, latency, or bandwidth? Have it all with footprint cache
    • D. Jevdjic, S. Volos, and B. Falsafi, "Die-stacked DRAM Caches for Servers: Hit Ratio, Latency, or Bandwidth? Have It All with Footprint Cache," ISCA, 2013, pp. 404-415.
    • (2013) ISCA , pp. 404-415
    • Jevdjic, D.1    Volos, S.2    Falsafi, B.3
  • 30
    • 84893565625
    • Characterizing data analysis workloads in data centers
    • Z. Jia, L. Wang, J. Zhan, L. Zhang, and C. Luo, "Characterizing Data Analysis Workloads in Data Centers," IISWC, 2013, pp. 66-76.
    • (2013) IISWC , pp. 66-76
    • Jia, Z.1    Wang, L.2    Zhan, J.3    Zhang, L.4    Luo, C.5
  • 31
    • 0036287598
    • Going the distance for TLB prefetching: An application-driven study
    • G. B. Kandiraju and A. Sivasubramaniam, "Going the distance for TLB prefetching: an application-driven study," ISCA, 2002, pp. 195-206.
    • (2002) ISCA , pp. 195-206
    • Kandiraju, G.B.1    Sivasubramaniam, A.2
  • 32
    • 0036039466
    • Characterizing the d-TLB behavior of SPEC CPU2000 benchmarks
    • G. B. Kandiraju and A. Sivasubramaniam, "Characterizing the d-TLB Behavior of SPEC CPU2000 Benchmarks," SIGMETRICS, 2002, pp. 129-139.
    • (2002) SIGMETRICS , pp. 129-139
    • Kandiraju, G.B.1    Sivasubramaniam, A.2
  • 33
    • 84881178489
    • A new perspective for efficient virtual-cache coherence
    • S. Kaxiras and A. Ros, "A new perspective for efficient virtual-cache coherence," ISCA, 2013, pp. 535-546.
    • (2013) ISCA , pp. 535-546
    • Kaxiras, S.1    Ros, A.2
  • 34
    • 84881144734
    • Thin servers with smart pipes: Designing SoC accelerators for memcached
    • K. Lim, D. Meisner, A. G. Saidi, P. Ranganathan, and T. F. Wenisch, "Thin servers with smart pipes: designing SoC accelerators for memcached," ISCA, 2013, pp. 36-47.
    • (2013) ISCA , pp. 36-47
    • Lim, K.1    Meisner, D.2    Saidi, A.G.3    Ranganathan, P.4    Wenisch, T.F.5
  • 35
    • 84876533349
    • NOC-Out: Microarchitecting a scale-out processor
    • P. Lotfi-Kamran, B. Grot, and B. Falsafi, "NOC-Out: Microarchitecting a Scale-Out Processor," MICRO, 2012, pp. 177-187.
    • (2012) MICRO , pp. 177-187
    • Lotfi-Kamran, P.1    Grot, B.2    Falsafi, B.3
  • 37
    • 84878619560
    • TLB improvements for chip multiprocessors: Inter-core cooperative prefetchers and shared last-level TLBs
    • Apr.
    • D. Lustig, A. Bhattacharjee, and M. Martonosi, "TLB Improvements for Chip Multiprocessors: Inter-Core Cooperative Prefetchers and Shared Last-Level TLBs," ACM Trans. Archit. Code Optim., vol. 10, no. 1, pp. 2:1-2:38, Apr. 2013.
    • (2013) ACM Trans. Archit. Code Optim. , vol.10 , Issue.1 , pp. 2:1-2:38
    • Lustig, D.1    Bhattacharjee, A.2    Martonosi, M.3
  • 38
    • 52249092401
    • Investigating the TLB behavior of high-end scientific applications on commodity microprocessors
    • C. McCurdy, A. L. Cox, and J. Vetter, "Investigating the TLB Behavior of High-end Scientific Applications on Commodity Microprocessors," ISPASS, 2008, pp. 95-104.
    • (2008) ISPASS , pp. 95-104
    • McCurdy, C.1    Cox, A.L.2    Vetter, J.3
  • 41
    • 84903973894
    • Increasing TLB reach by exploiting clustering in page translations
    • B. Pham, A. Bhattacharjee, Y. Eckert, and G. H. Loh, "Increasing TLB reach by exploiting clustering in page translations," HPCA, 2014, pp. 558-567.
    • (2014) HPCA , pp. 558-567
    • Pham, B.1    Bhattacharjee, A.2    Eckert, Y.3    Loh, G.H.4
  • 43
    • 0034818158
    • Towards virtually-addressed memory hierarchies
    • X. Qiu and M. Dubois, "Towards Virtually-Addressed Memory Hierarchies," HPCA, 2001, pp. 51-62.
    • (2001) HPCA , pp. 51-62
    • Qiu, X.1    Dubois, M.2
  • 44
    • 84883540577
    • The impact of architectural trends on operating system performance
    • M. Rosenblum, E. Bugnion, S. A. Herrod, E. Witchel, and A. Gupta, "The Impact of Architectural Trends on Operating System Performance," SOSP, 1995, pp. 285-298.
    • (1995) SOSP , pp. 285-298
    • Rosenblum, M.1    Bugnion, E.2    Herrod, S.A.3    Witchel, E.4    Gupta, A.5
  • 45
    • 0033707299
    • Recency-based TLB preloading
    • A. Saulsbury, F. Dahlgren, and P. Stenström, "Recency-based TLB Preloading," ISCA, 2000, pp. 117-127.
    • (2000) ISCA , pp. 117-127
    • Saulsbury, A.1    Dahlgren, F.2    Stenström, P.3
  • 46
    • 0028305546
    • Surpassing the TLB performance of superpages with less operating system support
    • M. Talluri and M. D. Hill, "Surpassing the TLB Performance of Superpages with Less Operating System Support," ASPLOS, 1994, pp. 171-182.
    • (1994) ASPLOS , pp. 171-182
    • Talluri, M.1    Hill, M.D.2
  • 48
    • 79957470794
    • Characterization and dynamic mitigation of intra-application cache interference
    • C.-J. Wu and M. Martonosi, "Characterization and dynamic mitigation of intra-application cache interference," ISPASS, 2011, pp. 2-11.
    • (2011) ISPASS , pp. 2-11
    • Wu, C.-J.1    Martonosi, M.2
  • 49
    • 79952037020
    • Design of last-level on-chip cache using spin-torque transfer RAM (STT RAM)
    • Mar.
    • W. Xu, H. Sun, X. Wang, Y. Chen, and T. Zhang, "Design of Last-level On-chip Cache Using Spin-torque Transfer RAM (STT RAM)," IEEE Trans. Very Large Scale Integr. Syst., vol. 19, no. 3, pp. 483-493, Mar. 2011.
    • (2011) IEEE Trans. Very Large Scale Integr. Syst. , vol.19 , Issue.3 , pp. 483-493
    • Xu, W.1    Sun, H.2    Wang, X.3    Chen, Y.4    Zhang, T.5


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.