Volume 28, Issue 2, 2016, Pages 189-210

Exploring performance and power properties of modern multi-core chips via simple machine models

Author keywords

ECM model; multi core; performance modeling; power modeling

Indexed keywords

CACHE MEMORY; CODES (SYMBOLS); ELECTRIC LOSSES; ENERGY EFFICIENCY; ENERGY UTILIZATION; MACHINERY;

EID: 84956679644     PISSN: 15320626     EISSN: 15320634     Source Type: Journal    
DOI: 10.1002/cpe.3180     Document Type: Article
Times cited: 78

References (28)
  • 2
    • 77952562358
    • Roofline: an insightful visual performance model for floating-point programs and multicore architectures
    • EECS Department, University of California, Berkeley, Oct. Available from: [Accessed on 3 December 2012]
    • Williams SW, Waterman A, Patterson DA. Roofline: an insightful visual performance model for floating-point programs and multicore architectures. Technical Report UCB/EECS-2008-134, EECS Department, University of California, Berkeley, Oct 2008. Available from: http://www.eecs.berkeley.edu/Pubs/TechRpts/2008/EECS-2008-134.html [Accessed on 3 December 2012].
    • (2008) Technical Report UCB/EECS-2008-134
    • Williams, S.W.1    Waterman, A.2    Patterson, D.A.3
  • 3
    • 77955113636
    • Introducing a performance model for bandwidth-limited loop kernels
    • Wyrzykowski R. Dongarra J. Karczewski K. Wasniewski J. (eds). Springer: Berlin / Heidelberg
    • Treibig J, Hager G. Introducing a performance model for bandwidth-limited loop kernels. In Parallel Processing and Applied Mathematics, Lecture Notes in Computer Science, Vol. 6067, Wyrzykowski R, Dongarra J, Karczewski K, Wasniewski J (eds). Springer: Berlin / Heidelberg, 2010; 615-624. DOI: 10.1007/978-3-642-14390-8_64.
    • (2010) Parallel Processing and Applied Mathematics, Lecture Notes in Computer Science , vol.6067 , pp. 615-624
    • Treibig, J.1    Hager, G.2
  • 4
    • 67650784628
    • Feedback-driven threading: power-efficient and high-performance execution of multi-threaded workloads on CMPs
    • Suleman MA, Qureshi MK, Patt YN. Feedback-driven threading: power-efficient and high-performance execution of multi-threaded workloads on CMPs. SIGARCH Computer Architecture News 2008; 36 (1): 277-286. DOI: 10.1145/1353534.1346317.
    • (2008) SIGARCH - Computer Architecture News , vol.36 , Issue.1 , pp. 277-286
    • Suleman, M.A.1    Qureshi, M.K.2    Patt, Y.N.3
  • 5
    • 0034543848
    • Performance and scalability analysis of teraflop-scale parallel architectures using multidimensional wavefront applications
    • Hoisie A, Lubeck O, Wasserman HJ. Performance and scalability analysis of teraflop-scale parallel architectures using multidimensional wavefront applications. International Journal of High Performance Computing Applications 2000; 14: 330-346. DOI: 10.1177/109434200001400405.
    • (2000) International Journal of High Performance Computing Applications , vol.14 , pp. 330-346
    • Hoisie, A.1    Lubeck, O.2    Wasserman, H.J.3
  • 11
    • 84859729360
    • Power-management architecture of the Intel microarchitecture code-named Sandy Bridge
    • Rotem E, Naveh A, Ananthakrishnan A, Rajwan D, Weissmann E. Power-management architecture of the Intel microarchitecture code-named Sandy Bridge. IEEE Micro 2012; 32: 20-27. DOI: 10.1109/MM.2012.12.
    • (2012) IEEE Micro , vol.32 , pp. 20-27
    • Rotem, E.1    Naveh, A.2    Ananthakrishnan, A.3    Rajwan, D.4    Weissmann, E.5
  • 12
    • 84879408937
    • Measuring energy consumption for short code paths using RAPL
    • ACM: New York, NY, USA
    • Hähnel M, Döbel B, Völp M, Härtig H. Measuring energy consumption for short code paths using RAPL. In ACM SIGMETRICS Performance Evaluation Review, Vol. 40(3). ACM: New York, NY, USA, 2012; 13-17. DOI: 10.1145/2425248.2425252.
    • (2012) ACM SIGMETRICS Performance Evaluation Review , vol.40 , Issue.3 , pp. 13-17
    • Hähnel, M.1    Döbel, B.2    Völp, M.3    Härtig, H.4
  • 15
    • 84874415705
    • Accessed on 3 December 2012
    • LIKWID performance tools. Available from: http://code.google.com/p/likwid [Accessed on 3 December 2012].
    • LIKWID performance tools
  • 16
    • 84885229762
    • likwid-bench: an extensible microbenchmarking platform for x86 multicore environments
    • Resch M. et al. (eds). Springer: Berlin Heidelberg, To appear
    • Treibig J, Hager G, Wellein G. likwid-bench: an extensible microbenchmarking platform for x86 multicore environments. In Tools for High Performance Computing 2011, Resch M, et al. (eds). Springer: Berlin Heidelberg, 2012. To appear.
    • (2012) Tools for High Performance Computing 2011
    • Treibig, J.1    Hager, G.2    Wellein, G.3
  • 18
    • 84956639009
    • Accessed on 1 August 2012
    • Intel architecture code analyzer, version 1.1.3. Available from: http://software.intel.com/en-us/articles/intel-architecture-code-analyzer/ [Accessed on 1 August 2012].
    • Intel architecture code analyzer, version 1.1.3
  • 20
    • 0345025793
    • STREAM: sustainable memory bandwidth in high performance computers
    • University of Virginia, Charlottesville, VA 1991-2007 a continually updated technical report [Accessed on 1 August 2012]
    • McCalpin JD. STREAM: sustainable memory bandwidth in high performance computers. Technical Report, University of Virginia, Charlottesville, VA 1991-2007. Available from: http://www.cs.virginia.edu/stream/, a continually updated technical report [Accessed on 1 August 2012].
    • Technical Report
    • McCalpin, J.D.1
  • 25
    • 0000039130
    • Lattice BGK models for Navier-Stokes equation
    • Qian Y, d'Humières D, Lallemand P. Lattice BGK models for Navier-Stokes equation. Europhysics Letters 1992; 17 (6): 479-484.
    • (1992) Europhysics Letters , vol.17 , Issue.6 , pp. 479-484
    • Qian, Y.1    D'Humières, D.2    Lallemand, P.3
  • 26
    • 21144470454
    • Boundary conditions for lattice Boltzmann simulations
    • Ziegler D. Boundary conditions for lattice Boltzmann simulations. Journal of Statistical Physics 1993; 71 (5/6): 1171-1177.
    • (1993) Journal of Statistical Physics , vol.71 , Issue.5-6 , pp. 1171-1177
    • Ziegler, D.1
  • 27
    • 33646809359
    • On the single processor performance of simple lattice Boltzmann kernels
    • Wellein G, Zeiser T, Hager G, Donath S. On the single processor performance of simple lattice Boltzmann kernels. Computers & Fluids 2006; 35: 910-919.
    • (2006) Computers & Fluids , vol.35 , pp. 910-919
    • Wellein, G.1    Zeiser, T.2    Hager, G.3    Donath, S.4
  • 28
    • 73849092882
    • Benchmark analysis and application results for lattice Boltzmann simulations on NEC SX vector and Intel Nehalem systems
    • Zeiser T, Hager G, Wellein G. Benchmark analysis and application results for lattice Boltzmann simulations on NEC SX vector and Intel Nehalem systems. Parallel Processing Letters 2009; 19 (4): 491-511.
    • (2009) Parallel Processing Letters , vol.19 , Issue.4 , pp. 491-511
    • Zeiser, T.1    Hager, G.2    Wellein, G.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.