



2011, Pages 41-46

Deriving a near-optimal power management policy using model-free reinforcement learning and Bayesian classification

Author keywords

Bayes Classification; Dynamic Power Management; Reinforcement Learning

Indexed keywords

ADAPTIVE CONTROL SYSTEMS; COMPUTER AIDED DESIGN; DYNAMICAL SYSTEMS; MACHINE LEARNING; MARKOV PROCESSES; ONLINE SYSTEMS; POWER MANAGEMENT; STOCHASTIC MODELS; STOCHASTIC SYSTEMS

EID: 80052684654     PISSN: 0738100X     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2024724.2024735     Document Type: Conference Paper
Times cited: 50

References (14)
  • 1
    • 0033706197
    • A survey of design techniques for system level dynamic power management
    • L. Benini, A. Bogliolo and G. De Micheli, "A survey of design techniques for system level dynamic power management," IEEE Trans. on VLSI Systems, Vol. 8, Issue 3, pp. 299-316, 2000.
    • (2000) IEEE Trans. on VLSI Systems, vol. 8, issue 3, pp. 299-316
    • Benini, L.; Bogliolo, A.; De Micheli, G.
  • 2
    • 0030104176
    • Predictive system shutdown and other architectural techniques for energy efficient programmable computation
    • M. Srivastava, A. Chandrakasan and R. Brodersen, "Predictive system shutdown and other architectural techniques for energy efficient programmable computation," IEEE Trans. on VLSI, 1996.
    • (1996) IEEE Trans. on VLSI
    • Srivastava, M.; Chandrakasan, A.; Brodersen, R.
  • 3
    • 0031335029
    • A predictive system shutdown method for energy saving of event-driven computation
    • C. H. Hwang and A. C. Wu, "A predictive system shutdown method for energy saving of event-driven computation," in ICCAD '97.
    • ICCAD '97
    • Hwang, C.H.; Wu, A.C.
  • 5
    • 78651575831
    • Dynamic Power Management Based on Continuous-Time Markov Decision Processes
    • Q. Qiu and M. Pedram, "Dynamic Power Management Based on Continuous-Time Markov Decision Processes," in DAC '99.
    • DAC '99
    • Qiu, Q.; Pedram, M.
  • 7
    • 34548321133
    • Dynamic power management under uncertain information
    • H. Jung and M. Pedram, "Dynamic power management under uncertain information," in DATE '07, pp. 1060-1065, Apr. 2007.
    • (2007) DATE '07, pp. 1060-1065
    • Jung, H.; Pedram, M.
  • 8
    • 34548299487
    • Stochastic Modeling and Optimization for Robust Power Management in a Partially Observable System
    • Q. Qiu, Y. Tan and Q. Wu, "Stochastic Modeling and Optimization for Robust Power Management in a Partially Observable System," in DATE '07, pp. 779-784, Apr. 2007.
    • (2007) DATE '07, pp. 779-784
    • Qiu, Q.; Tan, Y.; Wu, Q.
  • 9
    • 46149096949
    • Dynamic power management using machine learning
    • G. Dhiman and T. Simunic Rosing, "Dynamic power management using machine learning," in ICCAD '06, pp. 747-754, Nov. 2006.
    • (2006) ICCAD '06, pp. 747-754
    • Dhiman, G.; Simunic Rosing, T.
  • 10
    • 76349083584
    • Adaptive power management using reinforcement learning
    • Y. Tan, W. Liu and Q. Qiu, "Adaptive power management using reinforcement learning," in ICCAD '09, pp. 461-467, Nov. 2009.
    • (2009) ICCAD '09, pp. 461-467
    • Tan, Y.; Liu, W.; Qiu, Q.
  • 11
    • 85150714688
    • Reinforcement learning methods for continuous-time Markov decision problems
    • S. Bradtke and M. Duff, "Reinforcement learning methods for continuous-time Markov decision problems," in Advances in Neural Information Processing Systems 7, pp. 393-400, MIT Press, 1995.
    • (1995) Advances in Neural Information Processing Systems, vol. 7, pp. 393-400
    • Bradtke, S.; Duff, M.
  • 13
    • 0004049893
    • C. Watkins, Learning from Delayed Rewards, PhD thesis, Cambridge University, Cambridge, England, 1989.
    • (1989) Learning from Delayed Rewards
    • Watkins, C.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.