메뉴 건너뛰기




Volumn , Issue , 2012, Pages 199-211

Systematic energy characterization of CMP/SMT processor systems via automated micro-benchmarks

Author keywords

automated micro benchmarks; counter based power models; energy per instruction; max power stressmark

Indexed keywords

CHARACTERIZATION STUDIES; ENERGY CHARACTERIZATION; ENERGY PER INSTRUCTION; MAX-POWER STRESSMARK; MEASUREMENT BASED ANALYSIS; MICRO-BENCHMARKS; MULTITHREADED PROCESSORS; POWER MODEL;

EID: 84876569219     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/MICRO.2012.27     Document Type: Conference Paper
Times cited : (43)

References (49)
  • 1
    • 84876577123 scopus 로고    scopus 로고
    • WEST: Cloning data cache behavior using stochastic traces
    • Feb
    • G. Balakrishnan et al., "WEST: Cloning data cache behavior using stochastic traces," in Proc. of HPCA'12, pp. 1-12, Feb 2012.
    • (2012) Proc. of HPCA , vol.12 , pp. 1-12
    • Balakrishnan, G.1
  • 2
    • 33749068692 scopus 로고    scopus 로고
    • Efficient power analysis using synthetic testcases
    • Oct
    • R. H. Bell Jr. et al., "Efficient power analysis using synthetic testcases," in Proc. of IISWC'05, pp. 110-118, Oct 2005.
    • (2005) Proc. of IISWC , vol.5 , pp. 110-118
    • Bell Jr., R.H.1
  • 3
    • 33750806476 scopus 로고    scopus 로고
    • Automatic testcase synthesis and performance model validation for high performance PowerPC processors
    • Mar
    • R. H. Bell Jr. et al., "Automatic testcase synthesis and performance model validation for high performance PowerPC processors," in Proc. of IISWC'06, pp. 154-165, Mar 2006.
    • (2006) Proc. of IISWC , vol.6 , pp. 154-165
    • Bell Jr., R.H.1
  • 4
    • 32844467866 scopus 로고    scopus 로고
    • Improved automatic testcase synthesis for performance model validation
    • Jun
    • R. H. Bell Jr. et al., "Improved automatic testcase synthesis for performance model validation," in Proc. of ICS'05, pp. 111-120, Jun 2005.
    • (2005) Proc. of ICS , vol.5 , pp. 111-120
    • Bell Jr., R.H.1
  • 5
    • 84944384544 scopus 로고    scopus 로고
    • The benefits of event-driven energy accounting in powersensitive systems
    • Sep
    • F. Bellosa, "The benefits of event-driven energy accounting in powersensitive systems," in Proc. of EW'00, pp. 37-42, Sep 2000.
    • (2000) Proc. of EW'00 , pp. 37-42
    • Bellosa, F.1
  • 6
    • 84864714114 scopus 로고    scopus 로고
    • POTRA: A framework for building power models for next generation multicore architectures
    • Jun
    • R. Bertran et al., "POTRA: A framework for building power models for next generation multicore architectures," in Proc. of SIGMETRICS'12, pp. 427-428, Jun 2012.
    • (2012) Proc. of SIGMETRICS , vol.12 , pp. 427-428
    • Bertran, R.1
  • 7
    • 84876578919 scopus 로고    scopus 로고
    • Counter-based power modeling methods: Top-down vs bottom-up
    • Aug
    • R. Bertran et al., "Counter-based power modeling methods: Top-down vs bottom-up," The Computer Journal, vol. 99, pp. 1-16, Aug 2012.
    • (2012) The Computer Journal , vol.99 , pp. 1-16
    • Bertran, R.1
  • 8
    • 84873394363 scopus 로고    scopus 로고
    • A systematic methodology to generate decomposable and responsive power models for CMPs
    • Apr
    • R. Bertran et al., "A systematic methodology to generate decomposable and responsive power models for CMPs," IEEE Trans. on Comp., vol. 99, pp. 1-14, Apr 2012.
    • (2012) IEEE Trans. on Comp. , vol.99 , pp. 1-14
    • Bertran, R.1
  • 9
    • 77954716360 scopus 로고    scopus 로고
    • Decomposable and responsive power models for multicore processors using performance counters
    • Jun
    • R. Bertran et al., "Decomposable and responsive power models for multicore processors using performance counters," in Proc. of ICS'10, pp. 147-158, Jun 2010.
    • (2010) Proc. of ICS , vol.10 , pp. 147-158
    • Bertran, R.1
  • 10
    • 36949030007 scopus 로고    scopus 로고
    • Complete system power estimation: A trickle-down approach based on performance events
    • Apr
    • W. Bircher et al., "Complete system power estimation: A trickle-down approach based on performance events," in Proc. of ISPASS'07, pp. 158-168, Apr 2007.
    • (2007) Proc. of ISPASS , vol.7 , pp. 158-168
    • Bircher, W.1
  • 11
    • 85008569578 scopus 로고    scopus 로고
    • Complete system power estimation using processor performance events
    • Apr
    • W. Bircher et al., "Complete system power estimation using processor performance events," IEEE Trans. on Comp., vol. 61, no. 4, pp. 563-577, Apr 2011.
    • (2011) IEEE Trans. on Comp. , vol.61 , Issue.4 , pp. 563-577
    • Bircher, W.1
  • 12
    • 0032069891 scopus 로고    scopus 로고
    • Calibration of microprocessor performance models
    • May
    • B. Black et al., "Calibration of microprocessor performance models," Computer, vol. 31, no. 5, pp. 59-65, May 1998.
    • (1998) Computer , vol.31 , Issue.5 , pp. 59-65
    • Black, B.1
  • 13
    • 79952905401 scopus 로고    scopus 로고
    • SWEEP: Evaluating computer system energy efficiency using synthetic workloads
    • Jan
    • K. D. Bois et al., "SWEEP: Evaluating computer system energy efficiency using synthetic workloads," in Proc. of HIPEAC'11, pp. 159-166, Jan 2011.
    • (2011) Proc. of HIPEAC , vol.11 , pp. 159-166
    • Bois, K.D.1
  • 14
    • 0013035429 scopus 로고    scopus 로고
    • Bounds modelling and compiler optimizations for superscalar performance tuning
    • Jun
    • P. Bose et al., "Bounds modelling and compiler optimizations for superscalar performance tuning," J. Syst. Archit., vol. 45, no. 12-13, pp. 1111-1137, Jun 1999.
    • (1999) J. Syst. Archit. , vol.45 , Issue.12-13 , pp. 1111-1137
    • Bose, P.1
  • 15
    • 77952231026 scopus 로고    scopus 로고
    • A power-efficient 32b ARM ISA processor using timingerror detection and correction for transient-error tolerance and adaptation to PVT variation
    • Feb
    • D. Bull et al., "A power-efficient 32b ARM ISA processor using timingerror detection and correction for transient-error tolerance and adaptation to PVT variation," in Proc. of ISSCC'10, pp. 284-285, Feb 2010.
    • (2010) Proc. of ISSCC , vol.10 , pp. 284-285
    • Bull, D.1
  • 16
    • 0016923706 scopus 로고
    • A synthetic benchmark
    • Feb
    • H. J. Curnow et al., "A synthetic benchmark," The Computer Journal, vol. 19, no. 1, pp. 43-49, Feb 1976.
    • (1976) The Computer Journal , vol.19 , Issue.1 , pp. 43-49
    • Curnow, H.J.1
  • 17
    • 84876535462 scopus 로고    scopus 로고
    • Linux has a generic performance monitoring API
    • Jul
    • S. Eranian, "Linux has a generic performance monitoring API!" in Proc. of CSCADS'09, p. 1, Jul 2009.
    • (2009) Proc. of CSCADS , vol.9 , pp. 1
    • Eranian, S.1
  • 18
    • 78751549461 scopus 로고    scopus 로고
    • Benchmark synthesis for architecture and compiler exploration
    • Dec
    • L. V. Ertvelde et al., "Benchmark synthesis for architecture and compiler exploration," in Proc. of IISWC'10, pp. 1-11, Dec 2010.
    • (2010) Proc. of IISWC , vol.10 , pp. 1-11
    • Ertvelde, L.V.1
  • 19
    • 79959205818 scopus 로고    scopus 로고
    • Adaptive energy-management features of the IBM POWER7 chip
    • May
    • M. Floyd et al., "Adaptive energy-management features of the IBM POWER7 chip," IBM J. Res. & Dev., vol. 55, no. 3, pp. 276-293, May 2011.
    • (2011) IBM J. Res. & Dev. , vol.55 , Issue.3 , pp. 276-293
    • Floyd, M.1
  • 20
    • 78149243974 scopus 로고    scopus 로고
    • SYstem-level Max POwer (SYMPO): A systematic approach for escalating system-level power consumption using synthetic benchmarks
    • Sep
    • K. Ganesan et al., "SYstem-level Max POwer (SYMPO): A systematic approach for escalating system-level power consumption using synthetic benchmarks," in Proc. of PACT'10, pp. 19-28, Sep 2010.
    • (2010) Proc. of PACT , vol.10 , pp. 19-28
    • Ganesan, K.1
  • 21
    • 83155193246 scopus 로고    scopus 로고
    • MAximum Multicore POwer (MAMPO): An automatic multithreaded synthetic power virus generation framework for multicore systems
    • Nov
    • K. Ganesan et al., "MAximum Multicore POwer (MAMPO): an automatic multithreaded synthetic power virus generation framework for multicore systems," in Proc. of SC'11, pp. 1-12, Nov 2011.
    • (2011) Proc. of SC , vol.11 , pp. 1-12
    • Ganesan, K.1
  • 22
    • 58149242627 scopus 로고    scopus 로고
    • A sub-2W low power IA processor for mobile internet devices in 45nm high-k metal gate CMOS
    • G. Gerosa et al., "A sub-2W low power IA processor for mobile internet devices in 45nm high-k metal gate CMOS," J. of Solid-State Circ., 2009.
    • (2009) J. of Solid-State Circ.
    • Gerosa, G.1
  • 23
    • 78449303566 scopus 로고    scopus 로고
    • Portable, scalable, per-core power estimation for intelligent resource management
    • Aug
    • B. Goel et al., "Portable, scalable, per-core power estimation for intelligent resource management," in Proc. of GREEN'10, pp. 135-146, Aug 2010.
    • (2010) Proc. of GREEN , vol.10 , pp. 135-146
    • Goel, B.1
  • 24
    • 77954015280 scopus 로고    scopus 로고
    • Servet: A benchmark suite for autotuning on multicore clusters
    • Apr
    • J. Gonzalez-Dominguez et al., "Servet: A benchmark suite for autotuning on multicore clusters," in Proc. of IPDPS'10, pp. 1-9, Apr 2010.
    • (2010) Proc. of IPDPS , vol.10 , pp. 1-9
    • Gonzalez-Dominguez, J.1
  • 25
    • 36849034066 scopus 로고    scopus 로고
    • SPEC CPU2006 benchmark descriptions
    • Sep
    • J. L. Henning, "SPEC CPU2006 benchmark descriptions," ACM SIGARCH News, vol. 34, no. Sep, pp. 1-17, 4 2006.
    • (2006) ACM SIGARCH News , vol.34 , Issue.4 , pp. 1-17
    • Henning, J.L.1
  • 26
    • 0032204476 scopus 로고    scopus 로고
    • Microprocessor power estimation using profile-driven program synthesis
    • Nov
    • C. Hsieh et al., "Microprocessor power estimation using profile-driven program synthesis," IEEE Trans. on Comp.-Aided Design. of Integ. Cir. & Sys., vol. 17, no. 11, pp. 1080-1089, Nov 1998.
    • (1998) IEEE Trans. on Comp.-Aided Design. of Integ. Cir. & Sys. , vol.17 , Issue.11 , pp. 1080-1089
    • Hsieh, C.1
  • 27
    • 84944414165 scopus 로고    scopus 로고
    • Runtime power monitoring in high-end processors: Methodology and empirical data
    • Dec
    • C. Isci et al., "Runtime power monitoring in high-end processors: methodology and empirical data," in Proc. of MICRO'03, pp. 96-108, Dec 2003.
    • (2003) Proc. of MICRO , vol.3 , pp. 96-108
    • Isci, C.1
  • 28
    • 0032669612 scopus 로고    scopus 로고
    • Comparing the memory system performance of the HP V-class and SGI Origin 2000 multiprocessors using microbenchmarks and scientific applications
    • Jun
    • R. Iyer et al., "Comparing the memory system performance of the HP V-class and SGI Origin 2000 multiprocessors using microbenchmarks and scientific applications," in Proc. of ICS'99, pp. 339-347, Jun 1999.
    • (1999) Proc. of ICS , vol.99 , pp. 339-347
    • Iyer, R.1
  • 29
    • 78149241070 scopus 로고    scopus 로고
    • Power and thermal characterization of POWER6 system
    • Sep
    • V. Jiménez et al., "Power and thermal characterization of POWER6 system," in Proc. of PACT'10, pp. 7-18, Sep 2010.
    • (2010) Proc. of PACT , vol.10 , pp. 7-18
    • Jiménez, V.1
  • 30
    • 42549100582 scopus 로고    scopus 로고
    • Performance cloning: A technique for disseminating proprietary applications as benchmarks
    • Oct
    • A. Joshi et al., "Performance cloning: A technique for disseminating proprietary applications as benchmarks," in Proc. of IISWC'06, pp. 105-115, Oct 2006.
    • (2006) Proc. of IISWC , vol.6 , pp. 105-115
    • Joshi, A.1
  • 31
    • 70449090751 scopus 로고    scopus 로고
    • The return of synthetic benchmarks
    • Jan
    • A. Joshi et al., "The return of synthetic benchmarks," in Proc. of SPEC Benchmark Workshop, pp. 1-11, Jan 2008.
    • (2008) Proc. of SPEC Benchmark Workshop , pp. 1-11
    • Joshi, A.1
  • 32
    • 51149119096 scopus 로고    scopus 로고
    • Distilling the essence of proprietary workloads into miniature benchmarks
    • Sep
    • A. Joshi et al., "Distilling the essence of proprietary workloads into miniature benchmarks," ACM Trans. on Arch. & Code Opt., vol. 5, no. 2, pp. 1-33, Sep 2008.
    • (2008) ACM Trans. on Arch. & Code Opt. , vol.5 , Issue.2 , pp. 1-33
    • Joshi, A.1
  • 33
    • 57749210436 scopus 로고    scopus 로고
    • Automated microprocessor stressmark generation
    • Feb
    • A. Joshi et al., "Automated microprocessor stressmark generation," in Proc. of HPCA'08, pp. 229-239, Feb 2008.
    • (2008) Proc. of HPCA , vol.8 , pp. 229-239
    • Joshi, A.1
  • 34
    • 80052700255 scopus 로고    scopus 로고
    • Automated di/dt stressmark generation for microprocessor power delivery networks
    • Aug
    • Y. Kim et al., "Automated di/dt stressmark generation for microprocessor power delivery networks," in Proc. of ISLPED'11, pp. 253-258, Aug 2011.
    • (2011) Proc. of ISLPED , vol.11 , pp. 253-258
    • Kim, Y.1
  • 37
    • 1842765540 scopus 로고    scopus 로고
    • Microbenchmarks for determining branch predictor organization
    • Apr
    • M. Milenkovic et al., "Microbenchmarks for determining branch predictor organization," Softw. Pract. Exper., vol. 34, no. 5, pp. 465-487, Apr 2004.
    • (2004) Softw. Pract. Exper. , vol.34 , Issue.5 , pp. 465-487
    • Milenkovic, M.1
  • 38
    • 31344459067 scopus 로고    scopus 로고
    • The implementation of a 2-core, multi-threaded itanium family processor
    • Jan
    • S. Naffziger et al., "The implementation of a 2-core, multi-threaded itanium family processor," J. of Solid-State Circ., vol. 41, no. 1, pp. 197-209, Jan 2006.
    • (2006) J. of Solid-State Circ. , vol.41 , Issue.1 , pp. 197-209
    • Naffziger, S.1
  • 39
    • 79951714116 scopus 로고    scopus 로고
    • AVF stressmark: Towards an automated methodology for bounding the worst-case vulnerability to soft errors
    • Dec
    • A. Nair et al., "AVF stressmark: Towards an automated methodology for bounding the worst-case vulnerability to soft errors," in Proc. of MICRO'10, pp. 125-136, Dec 2010.
    • (2010) Proc. of MICRO , vol.10 , pp. 125-136
    • Nair, A.1
  • 40
    • 79959247096 scopus 로고    scopus 로고
    • Automated full-system power characterization
    • May
    • S. Polfliet et al., "Automated full-system power characterization," IEEE Micro, vol. 31, no. 3, pp. 46-59, May 2011.
    • (2011) IEEE Micro , vol.31 , Issue.3 , pp. 46-59
    • Polfliet, S.1
  • 41
    • 77954733934 scopus 로고    scopus 로고
    • Real time power estimation and thread scheduling via performance counters
    • Jul
    • K. Singh et al., "Real time power estimation and thread scheduling via performance counters," ACM SIGARCH News, vol. 37, no. 2, pp. 46-55, Jul 2008.
    • (2008) ACM SIGARCH News , vol.37 , Issue.2 , pp. 46-55
    • Singh, K.1
  • 42
    • 81255177633 scopus 로고    scopus 로고
    • IBM POWER7 multicore server processor
    • May
    • B. Sinharoy et al., "IBM POWER7 multicore server processor," IBM J. Res. & Dev., vol. 55, no. 3, pp. 1-29, May 2011.
    • (2011) IBM J. Res. & Dev. , vol.55 , Issue.3 , pp. 1-29
    • Sinharoy, B.1
  • 43
    • 38849184010 scopus 로고    scopus 로고
    • Accurate on-line prediction of processor and memory energy usage under voltage scaling
    • Oct
    • D. C. Snowdon et al., "Accurate on-line prediction of processor and memory energy usage under voltage scaling," in Proc. of EMSOFT'07, pp. 84-93, Oct 2007.
    • (2007) Proc. of EMSOFT , vol.7 , pp. 84-93
    • Snowdon, D.C.1
  • 44
    • 0029713995 scopus 로고    scopus 로고
    • Instruction level power analysis and optimization of software
    • Jan
    • V. Tiwari et al., "Instruction level power analysis and optimization of software," in Proc. of VLSI'96, pp. 326-328, Jan 1996.
    • (1996) Proc. of VLSI , vol.96 , pp. 326-328
    • Tiwari, V.1
  • 45
    • 70349165968 scopus 로고    scopus 로고
    • Experiment flows and microbenchmarks for reverse engineering of branch predictor structures
    • Apr
    • V. Uzelac et al., "Experiment flows and microbenchmarks for reverse engineering of branch predictor structures," in Proc. of ISPASS'09, pp. 207-217, Apr 2009.
    • (2009) Proc. of ISPASS , vol.9 , pp. 207-217
    • Uzelac, V.1
  • 46
    • 0021504618 scopus 로고
    • Dhrystone: A synthetic systems programming benchmark
    • Oct
    • R. P. Weicker, "Dhrystone: a synthetic systems programming benchmark," Comm. of ACM, vol. 27, no. 10, pp. 1013-1030, Oct 1984.
    • (1984) Comm. of ACM , vol.27 , Issue.10 , pp. 1013-1030
    • Weicker, R.P.1
  • 47
    • 84876569869 scopus 로고
    • Synthesizing benchmarks with appropriate instruction mix and locality
    • Jun
    • W. S. Wong et al., "Synthesizing benchmarks with appropriate instruction mix and locality," in Proc. of ICCA'87, pp. 1-12, Jun 1987.
    • (1987) Proc. of ICCA , vol.87 , pp. 1-12
    • Wong, W.S.1
  • 48
    • 0024032181 scopus 로고
    • Benchmark synthesis using the lru cache hit function
    • Jun
    • W. S. Wong et al., "Benchmark Synthesis Using the LRU Cache Hit Function," IEEE Trans. on Comp., vol. 37, no. 6, pp. 637-645, Jun 1988.
    • (1988) IEEE Trans. on Comp. , vol.37 , Issue.6 , pp. 637-645
    • Wong, W.S.1
  • 49
    • 34547206285 scopus 로고    scopus 로고
    • A systematic method for functional unit power estimation in microprocessors
    • Jul
    • W. Wu et al., "A systematic method for functional unit power estimation in microprocessors," in Proc. of DAC'06, pp. 554-557, Jul 2006.
    • (2006) Proc. of DAC , vol.6 , pp. 554-557
    • Wu, W.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.