



Volume , Issue , 2011, Pages 111-120

Improving throughput of power-constrained GPUs using dynamic voltage/frequency and core scaling

Author keywords

Dynamic voltage, frequency, and core scaling; GPU; Power constraint; Throughput

Indexed keywords

DYNAMIC VOLTAGE; GPU; GRAPHIC PROCESSING UNITS; INTEGRATED CORE; OFF-CHIP MEMORIES; ON-CHIP INTERCONNECTS; PARALLEL APPLICATION; POWER CONSTRAINTS; RUNTIMES;

EID: 84863037228     PISSN: 1089795X     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/PACT.2011.17     Document Type: Conference Paper
Times cited : (62)

References (21)
  • 1
    • 79957439119
    • "Era of Tera," http://www.intel.com/pressroom/archive/releases/2007/20070204comp.htm
    • Era of Tera
  • 3
    • 49249086142
    • Larrabee: A many-core x86 architecture for visual computing
    • L. Seiler et al., "Larrabee: A Many-Core x86 Architecture for Visual Computing," ACM Trans. Graph., vol. 27, no. 3, pp. 1-15, 2008.
    • (2008) ACM Trans. Graph. , vol.27 , Issue.3 , pp. 1-15
    • Seiler, L.1
  • 4
    • 44849137198
    • NVIDIA Tesla: A unified graphics and computing architecture
    • DOI 10.1109/MM.2008.31
    • E. Lindholm et al., "NVIDIA Tesla: A Unified Graphics and Computing Architecture," IEEE Micro, vol. 28, no. 2, pp. 39-55, 2008. (Pubitemid 351796170)
    • (2008) IEEE Micro , vol.28 , Issue.2 , pp. 39-55
    • Lindholm, E.1    Nickolls, J.2    Oberman, S.3    Montrym, J.4
  • 7
    • 56849102474
    • Efficient computation of sum-products on GPUs through software-managed cache
    • M. Silberstein et al., "Efficient Computation of Sum-Products on GPUs through Software-Managed Cache," in Proc. ACM Int. Conf. on Super Computing (ICS), pp. 309-318, 2008.
    • (2008) Proc. ACM Int. Conf. on Super Computing (ICS) , pp. 309-318
    • Silberstein, M.1
  • 10
    • 33750600861
    • New generation of predictive technology model for sub-45 nm early design exploration
    • DOI 10.1109/TED.2006.884077
    • W. Zhao and Y. Cao, "New Generation of Predictive Technology Model for Sub-45nm Early Design Exploration," IEEE Trans. on Electron Devices, vol. 53, no. 11, pp. 2816-2823, 2006. (Pubitemid 44680679)
    • (2006) IEEE Transactions on Electron Devices , vol.53 , Issue.11 , pp. 2816-2823
    • Zhao, W.1    Cao, Y.2
  • 11
    • 34548342439
    • Power delivery for high-performance microprocessor
    • K. Aygun et al., "Power Delivery for High-Performance Microprocessor," Intel Technology Journal, vol. 9, no. 4, pp. 273-283, 2005.
    • (2005) Intel Technology Journal , vol.9 , Issue.4 , pp. 273-283
    • Aygun, K.1
  • 12
    • 77952653698
    • Throughput analysis and optimization of power- and thermal-constrained multicore processors
    • J. Lee and N. Kim, "Throughput Analysis and Optimization of Power- and Thermal-Constrained Multicore Processors," in Proc. of IEEE Design Automation Conf. (DAC), pp. 47-50, 2009.
    • (2009) Proc. of IEEE Design Automation Conf. (DAC) , pp. 47-50
    • Lee, J.1    Kim, N.2
  • 15
    • 84863037801
    • http://ercbench.ece.wisc.edu/index.php
  • 17
    • 70449730913
    • Optimizing total power of many-core processors considering voltage scaling limit and process variations
    • J. Lee and N. Kim, "Optimizing Total Power of Many-Core Processors Considering Voltage Scaling Limit and Process Variations," in Proc. ACM/IEEE Int. Sym. on Low-Power Electronics and Design (ISLPED), pp. 201-206, 2009.
    • (2009) Proc. ACM/IEEE Int. Sym. on Low-Power Electronics and Design (ISLPED) , pp. 201-206
    • Lee, J.1    Kim, N.2
  • 18
    • 70350060187
    • ORION 2.0: A fast and accurate NoC power and area model for early-stage design space exploration
    • A. Kahng et al., "ORION 2.0: A Fast and Accurate NoC Power and Area Model for Early-Stage Design Space Exploration," in Proc. IEEE Conf. on Design Automation and Test in Europe (DATE), pp. 423-428, 2009.
    • (2009) Proc. IEEE Conf. on Design Automation and Test in Europe (DATE) , pp. 423-428
    • Kahng, A.1
  • 19
    • 57749178620
    • System level analysis of fast, per-core DVFS using on-chip switching regulators
    • W. Kim et al., "System Level Analysis of Fast, Per-Core DVFS using On-Chip Switching Regulators," in Proc. IEEE Int. Symp. on High-Perf. Comp. Arch. (HPCA), pp. 77-87, 2008.
    • (2008) Proc. IEEE Int. Symp. on High-Perf. Comp. Arch. (HPCA) , pp. 77-87
    • Kim, W.1
  • 20
    • 33748879741
    • Dynamic power-performance adaptation of parallel computation on chip multiprocessors
    • DOI 10.1109/HPCA.2006.1598114, 1598114, Proceedings - Twelfth International Symposium on High-Performance Computer Architecture, 2006
    • J. Li and J. Martinez, "Dynamic Power-Performance Adaptation of Parallel Computation on Chip Multiprocessors," in Proc. IEEE Int. Symp. on High-Perf. Comp. Arch. (HPCA), pp. 77-87, 2006. (Pubitemid 44418379)
    • (2006) Proceedings - International Symposium on High-Performance Computer Architecture , vol.2006 , pp. 77-87
    • Li, J.1    Martinez, J.F.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.