Volume 3, Issue 1, 2005, Pages 1-35

Approximate value iteration in the reinforcement learning context. Application to electrical power system control.

Author keywords

Approximate value iteration; Electrical power oscillations damping; Power system control; Reinforcement learning; TCSC control

Indexed keywords

APPROXIMATION ALGORITHMS; DISCRETE TIME CONTROL SYSTEMS; ELECTRIC POWER SYSTEMS; INTELLIGENT AGENTS; ITERATIVE METHODS; LEARNING ALGORITHMS; OPTIMAL CONTROL SYSTEMS; POWER CONTROL; REINFORCEMENT LEARNING;

EID: 81355166317     PISSN: 2194-5756     EISSN: 1553-779X     Source Type: Journal
DOI: 10.2202/1553-779X.1066     Document Type: Review
Times cited: 41

References (41)
  • 2. R. Bellman, R. Kalaba, and B. Kotkin. Polynomial approximation - A new computational technique in dynamic programming: Allocation processes. Mathematical Computation, 17:155-161, 1973.
  • 6. D. Ernst. Selecting concise sets of samples for a reinforcement learning agent. Submitted, 2005.
  • 7. D. Ernst, P. Geurts, and L. Wehenkel. Iteratively extending time horizon reinforcement learning. In N. Lavrač, D. Gamberger, and L. Todorovski, editors, Proceedings of the 14th European Conference on Machine Learning, pages 96-107, Dubrovnik, Croatia, September 2003. Springer-Verlag Heidelberg.
  • 9. D. Ernst, M. Glavic, and L. Wehenkel. Power system stability control: Reinforcement learning framework. IEEE Transactions on Power Systems, 19:427-435, February 2004.
  • 14. M. Glavic, D. Ernst, and L. Wehenkel. Combining a stability and a performance oriented control in power systems. IEEE Transactions on Power Systems, 20(1):525, 2005.
  • 15. G.J. Gordon. Approximate Solutions to Markov Decision Processes. PhD thesis, Carnegie Mellon University, June 1999.
  • 22. B.H. Li and Q.H. Wu. Learning coordinated fuzzy logic control of dynamic quadrature boosters in multimachine power systems. IEE Part C - Generation, Transmission, and Distribution, 146(6):577-585, 1999.
  • 24. A.W. Moore and C.G. Atkeson. Prioritized sweeping: Reinforcement learning with less data and less real time. Machine Learning, 13:103-130, 1993.
  • 25. D. Ormoneit and P. Glynn. Kernel-based reinforcement learning in average-cost problems. IEEE Transactions on Automatic Control, 47(10):1624-1636, 2002.
  • 26. D. Ormoneit and S. Sen. Kernel-based reinforcement learning. Machine Learning, 49(2-3):161-178, 2002.
  • 27. M. Pavella and P.G. Murthy. Transient Stability of Power Systems: Theory and Practice. John Wiley & Sons, 1994.
  • 29. J. Rust. Using randomization to break the curse of dimensionality. Econometrica, 65(3):487-516, 1997.
  • 30. S. Singh and D. Bertsekas. Reinforcement learning for dynamical channel allocation in cellular telephone systems. In M.C. Mozer, M.I. Jordan, and T. Petsche, editors, Advances in Neural Information Processing Systems, volume 9, pages 974-980. The MIT Press, 1997.
  • 31. R.S. Sutton. Integrated architectures for learning, planning and reacting based on approximating dynamic programming. In Proceedings of the Seventh International Conference on Machine Learning, pages 216-224, San Mateo, CA, 1990. Morgan Kaufmann.
  • 34. G.J. Tesauro. TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation, 6(2):215-219, 1994.
  • 35. J.N. Tsitsiklis and B. Van Roy. Feature-based methods for large-scale dynamic programming. Machine Learning, 22:59-94, 1996.
  • 37. G.K. Venayagamoorthy, R.G. Harley, and D.C. Wunsch. Implementation of adaptive critic-based neurocontrollers for turbogenerators in a multimachine power system. IEEE Transactions on Neural Networks, 14(5):1047-1064, 2003.
  • 39. L. Wehenkel. Emergency control and its strategies. In Proceedings of the 13th PSCC, pages 35-48, Trondheim, Norway, 1999.
  • 41. R.J. Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8:229-256, 1992.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.