SCOPUS Information Retrieval Platform

2009 International Conference on Intelligent Human-Machine Systems and Cybernetics, IHMSC 2009

Volume 2, 2009, Pages 396-399

A survey of approximate dynamic programming

(4) Wang, Lin (a); Peng, Hui (a); Zhu, Hua Yong (a); Shen, Lin Cheng (a)

(a) NATIONAL UNIVERSITY OF DEFENSE TECHNOLOGY (China)

Author keywords

Approximate dynamic programming; Dynamic programming; Markov decision processes; Reinforcement learning

Indexed keywords

APPROXIMATE DYNAMIC PROGRAMMING; COMPUTATIONAL REQUIREMENTS; DECISION PROBLEMS; FUNCTION APPROXIMATION; IN-PROCESS; MARKOV DECISION PROCESSES; MATHEMATICAL FORMULATION; MULTI-STAGE; REAL APPLICATIONS; RESEARCH DIRECTIONS; STANDARD METHOD; STATE SPACE;

CYBERNETICS; LEARNING ALGORITHMS; MARKOV PROCESSES; REINFORCEMENT; REINFORCEMENT LEARNING; SURVEYS;

DYNAMIC PROGRAMMING;

EID: 73649096185 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/IHMSC.2009.222 Document Type: Conference Paper

Times cited : (13)

References (37)

1
- 84893393162
- ADP: Goals, Opportunities and Principles
- IEEE Press John Wiley & sons, Inc
- P. Werbos, "ADP: Goals, Opportunities and Principles," HANDBOOK of LEARNING and APPROXIMATE DYNAMIC PROGRAMMING, IEEE Press John Wiley & sons, Inc. 2004, pp.1-42.
- (2004) HANDBOOK of LEARNING and APPROXIMATE DYNAMIC PROGRAMMING , pp. 1-42
- Werbos, P.¹

2
- 84923005963
- Approximate Dynamic Programming for High-Dimensional Resource Allocation Problems
- IEEE Press John Wiley & sons, Inc
- W. B. Powell and B. Van Roy, "Approximate Dynamic Programming for High-Dimensional Resource Allocation Problems," HANDBOOK of LEARNING and APPROXIMATE DYNAMIC PROGRAMMING, IEEE Press John Wiley & sons, Inc. 2004, pp.261-284
- (2004) HANDBOOK of LEARNING and APPROXIMATE DYNAMIC PROGRAMMING , pp. 261-284
- Powell, W.B.¹ Van Roy, B.²

3
- 0003950434
- Stable Adaptive Control Using New Critic Designs
- ArXiv.org: adap-org/9810001
- P. Werbos, "Stable Adaptive Control Using New Critic Designs," 1998 (ArXiv.org: adap-org/9810001).
- (1998)
- Werbos, P.¹

4
- 0002557583
- Advanced forecasting for global crisis warning and models of intelligence
- P. Werbos, "Advanced forecasting for global crisis warning and models of intelligence," General Systems Yearbook, 1977.
- (1977) General Systems Yearbook
- Werbos, P.¹

5
- 0024888479
- Neural networks for control and system identification
- P. Werbos, "Neural networks for control and system identification," IEEE Proceeding of Conference on Decision and Control, 1989.
- (1989) IEEE Proceeding of Conference on Decision and Control
- Werbos, P.¹

6
- 0015667648
- Punish/reward: Learning with a Critic in adaptive threshold systems
- B. Widrow, N. Gupta and S. Maitra, "Punish/reward: learning with a Critic in adaptive threshold systems," IEEE Trans. SMC, vol. 5, 1973, pp.455-465.
- (1973) IEEE Trans. SMC , vol.5 , pp. 455-465
- Widrow, B.¹ Gupta, N.² Maitra, S.³

7
- 0020970738
- Neuronlike adaptive elements that can solve difficult learning control problems
- A. Barto, R. Sutton and C. Anderson, "Neuronlike adaptive elements that can solve difficult learning control problems," IEEE Trans. SMC, vol. 13, 1983, pp.834-846.
- (1983) IEEE Trans. SMC , vol.13 , pp. 834-846
- Barto, A.¹ Sutton, R.² Anderson, C.³

8
- 73649144578
- Dynamic Programming and Suboptimal Control: A Survey from ADP to MPC
- 2632
- D. P. Bertsekas, "Dynamic Programming and Suboptimal Control: A Survey from ADP to MPC," 2005, Report LIDS 2632.
- (2005) Report LIDS
- Bertsekas, D.P.¹

9
- 0041345290
- Efficient Reinforcement Learning Using Recursive Least-Squares Methods
- Xin Xu, Han-gen He and Dewen Hu, "Efficient Reinforcement Learning Using Recursive Least-Squares Methods," Journal of Artificial Intelligence Research , Vol.16 , 2002, pp.259-292.
- (2002) Journal of Artificial Intelligence Research , vol.16 , pp. 259-292
- Xu, X.¹ He, H.-G.² Hu, D.³

10
- 0000430514
- The Convergence of TD(λ) for General λ
- P. D. Dayan, "The Convergence of TD(λ) for General λ," Machine Learning, vol. 8, 1992, pp.341-362.
- (1992) Machine Learning , vol.8 , pp. 341-362
- Dayan, P.D.¹

11
- 73649094483
- An Analysis of Temporal-Difference Learning with Function Approximation
- John N. Tsitsiklis and Benjamin Van Roy, "An Analysis of Temporal-Difference Learning with Function Approximation" Van Roy's homepages , 1997.
- (1997) Van Roy's homepages
- Tsitsiklis, J.N.¹ Van Roy, B.²

12
- 0141704189
- Accelerating Critic Learning in Approximate Dynamic Programming Via Value Templates and Perceptual Learning
- T. T. Shannon, R. A. Santiago and G. Lendaris, "Accelerating Critic Learning in Approximate Dynamic Programming Via Value Templates and Perceptual Learning," IEEE 0-7803-7898-9/03, 2003, pp.2922-2927.
- (2003) IEEE 0-7803-7898-9/03 , pp. 2922-2927
- Shannon, T.T.¹ Santiago, R.A.² Lendaris, G.³

13
- 33847661590
- Adaptive Critic Design Based Neuro-Fuzzy Controller for a Static Compensator in a Multimachine Power System
- S. Mohagheghi and Ganesh K. Venayagamoorthy, "Adaptive Critic Design Based Neuro-Fuzzy Controller for a Static Compensator in a Multimachine Power System," IEEE Transactions on Power Systems, vol. 21, no. 4, 2006, pp.1744-1755.
- (2006) IEEE Transactions on Power Systems , vol.21 , Issue.4 , pp. 1744-1755
- Mohagheghi, S.¹ Venayagamoorthy, G.K.²

14
- 84986024692
- Learning and optimization - from a system theoretic perspective
- Wiley-IEEE Press
- X.-R. Cao, "Learning and optimization - from a system theoretic perspective," Handbook of Learning and Approximate Dynamic Programming: Scaling up to the Real World, Eds. Wiley-IEEE Press, 2004.
- (2004) Handbook of Learning and Approximate Dynamic Programming: Scaling up to the Real World, Eds
- Cao, X.-R.¹

15
- 33847112174
- An Approximate Dynamic Programming Approach to a Communication Constrained Sensor Management Problem
- J. L. Williams, J. W. Fisher and A. S. Willsky, "An Approximate Dynamic Programming Approach to a Communication Constrained Sensor Management Problem," 2005 7th International Conference on Information Fusion, 2005, pp.582-560.
- (2005) 2005 7th International Conference on Information Fusion , pp. 582-560
- Williams, J.L.¹ Fisher, J.W.² Willsky, A.S.³

16
- 40649113421
- Automotive Engine Torque and Air-Fuel Ratio Control Using Dual Heuristic Dynamic Programming
- H. Javaherian, D. Liu, and Olesia Kovalenko, "Automotive Engine Torque and Air-Fuel Ratio Control Using Dual Heuristic Dynamic Programming," 2006 International Joint Conference on Neural Networks, 2006, pp.518-526.
- (2006) 2006 International Joint Conference on Neural Networks , pp. 518-526
- Javaherian, H.¹ Liu, D.² Kovalenko, O.³

17
- 8744288519
- Adaptive critic learning techniques for automotive engine control
- H. Javaherian, D. Liu, Y. Zhang and O. Kovalenko, "Adaptive critic learning techniques for automotive engine control," Proceedings of the American Control Conference, 2004, pp.4066-4071.
- (2004) Proceedings of the American Control Conference , pp. 4066-4071
- Javaherian, H.¹ Liu, D.² Zhang, Y.³ Kovalenko, O.⁴

18
- 20344386215
- Neural network modeling and adaptive critic control of automotive fuel-injection systems
- O. Kovalenko, D. Liu, and H. Javaherian, "Neural network modeling and adaptive critic control of automotive fuel-injection systems," Proceedings of the IEEE International Symposium on Intelligent Control, 2004, pp.386-373.
- (2004) Proceedings of the IEEE International Symposium on Intelligent Control , pp. 386-373
- Kovalenko, O.¹ Liu, D.² Javaherian, H.³

19
- 34548237758
- Cellular SRN Trained by Extended Kalman Filter Shows Promise for ADP
- R. Ilin, R. Kozma and P. Werbos, "Cellular SRN Trained by Extended Kalman Filter Shows Promise for ADP," 2006 International Joint Conference on Neural Networks, 2006, pp.506-511.
- (2006) 2006 International Joint Conference on Neural Networks , pp. 506-511
- Ilin, R.¹ Kozma, R.² Werbos, P.³

20
- 34047211520
- Decentralized Approximate Dynamic Programming for Dynamic Networks of Agents
- H. Lakshmanan and D. P. Farias, "Decentralized Approximate Dynamic Programming for Dynamic Networks of Agents," Proceedings of the 2006 American Control Conference, 2006, pp.1648-1654.
- (2006) Proceedings of the 2006 American Control Conference , pp. 1648-1654
- Lakshmanan, H.¹ Farias, D.P.²

21
- 33847215503
- On Approximate Dynamic Programming in Switching Systems
- A. Rantzer, "On Approximate Dynamic Programming in Switching Systems," 44th IEEE Conference on Decision and Control, and the European Control Conference, 2006, pp.1391-1397.
- (2006) 44th IEEE Conference on Decision and Control, and the European Control Conference , pp. 1391-1397
- Rantzer, A.¹

22
- 33747862706
- Relaxing Dynamic Programming
- B. Lincoln and A. Rantzer, "Relaxing Dynamic Programming," IEEE Transactions on Automatic Control, vol.51, No. 8, 2006, pp.1249-1261.
- (2006) IEEE Transactions on Automatic Control , vol.51 , Issue.8 , pp. 1249-1261
- Lincoln, B.¹ Rantzer, A.²

23
- 40649089465
- Aggregation of Reinforcement Learning Algorithms
- Ju Jiang and M. S. Kamel, "Aggregation of Reinforcement Learning Algorithms," 2006 International Joint Conference on Neural Networks, 2006.
- (2006) International Joint Conference on Neural Networks
- Jiang, J.¹ Kamel, M.S.²

24
- 15744363553
- Reinforcement Learning and Aggregation
- Ju Jiang, M. Kamel and Lei Chen, "Reinforcement Learning and Aggregation," Proceedings of IEEE International Conference on Systems, Man, and Cybernetics 04, 2004, pp.1303-1308.

25
- 34249833101
- Q-learning
- C. J. C. H. Watkins and P. Dayan, "Q-learning," Machine Learning, vol. 8, no. 3, 1992, pp.279-292.
- (1992) Machine Learning , vol.8 , Issue.3 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

26
- 0004102479
- A Bradford Book, The MIT Press, Cambridge, Massachusetts, London, England, ISBN 0-262-19398-1
- R. S. Sutton and A. G. Barto, "Reinforcement Learning, An Introduction." A Bradford Book, The MIT Press, Cambridge, Massachusetts, London, England, ISBN 0-262-19398-1, 1998.
- (1998) Reinforcement Learning, An Introduction
- Sutton, R.S.¹ Barto, A.G.²

27
- 0029679044
- Reinforcement Learning: A Survey
- L. P. Kaelbling, M. L. Littman and A. W. Moore, "Reinforcement Learning: A Survey," Journal of Artificial Intelligence Research, no.4, 1996, pp.237-255.
- (1996) Journal of Artificial Intelligence Research , Issue.4 , pp. 237-255
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

28
- 34547365679
- An Extension of Genetic Network Programming with Reinforcement Learning Using Actor-Critic
- H. Hatakeyama and S. Mabu, "An Extension of Genetic Network Programming with Reinforcement Learning Using Actor-Critic," 2006 IEEE Congress on Evolutionary Computation, 2006.
- (2006) IEEE Congress on Evolutionary Computation
- Hatakeyama, H.¹ Mabu, S.²

29
- 35048854058
- Genetic Network Programming with Reinforcement Learning and its Performance Evaluation
- S. Mabu, K. Hirasawa and J. Hu, "Genetic Network Programming with Reinforcement Learning and its Performance Evaluation," 2004 Genetic and Evolutionary Computation Conference, part II, 2004, pp.710-711.
- (2004) 2004 Genetic and Evolutionary Computation Conference, part II , pp. 710-711
- Mabu, S.¹ Hirasawa, K.² Hu, J.³

30
- 0034867763
- Comparison between genetic network programming (GNP) and genetic programming (GP)
- K. Hirasawa, M. Okubo, H. Katagiri, J. Hu, and J. Murata, "Comparison between genetic network programming (GNP) and genetic programming (GP)," Proc. of 2001 Congress on Evolutionary Computation, 2001, pp.1276-1282.
- (2001) Proc. of 2001 Congress on Evolutionary Computation , pp. 1276-1282
- Hirasawa, K.¹ Okubo, M.² Katagiri, H.³ Hu, J.⁴ Murata, J.⁵

31
- 40649087190
- Opposition-Based Q(λ) Algorithm
- M. Shokri, H. R. Tizhoosh and M. Kamel, "Opposition-Based Q(λ) Algorithm," 2006 International Joint Conference on Neural Networks, 2006.
- (2006) International Joint Conference on Neural Networks
- Shokri, M.¹ Tizhoosh, H.R.² Kamel, M.³

32
- 33745951445
- Unified NDP method based on TD(0) learning for both average and discounted Markov decision processes
- TANG Hao, ZHOU Lei and YUAN Ji-bin, "Unified NDP method based on TD(0) learning for both average and discounted Markov decision processes," Control Theory & Application, vol.23, no.2, 2006, pp.292-297.

33
- 23444449149
- Performance Potential-based Neuro-dynamic Programming for SMDPs
- TANG Hao, YUAN Ji-Bin, LU Yang, and CHENG Wen-Juan, "Performance Potential-based Neuro-dynamic Programming for SMDPs," ACTA AUTOMATICA SINICA, vol. 31, no. 4, 2005, pp.642-646.

34
- 2942718962
- A Simulation Optimization Algorithm for CTMDPs Based on Randomized Stationary Policies
- TANG Hao, XI Hong-Sheng and YIN Bo-Qun, "A Simulation Optimization Algorithm for CTMDPs Based on Randomized Stationary Policies," ACTA AUTOMATICA SINICA, vol. 30, No.2, 2004, pp.229-235.
- (2004) ACTA AUTOMATICA SINICA , vol.30 , Issue.2 , pp. 229-235
- Tang, H.¹ Xi, H.-S.² Yin, B.-Q.³

35
- 33747872589
- Approximate dynamic programming based approach to process control and scheduling
- J. H. Lee and J. M. Lee, "Approximate dynamic programming based approach to process control and scheduling," Computers and Chemical Engineering, no. 30, 2006, pp.1603-1618.
- (2006) Computers and Chemical Engineering , Issue.30 , pp. 1603-1618
- Lee, J.H.¹ Lee, J.M.²

36
- 18444379381
- Approximate dynamic programming based approaches for input-output data-driven control of nonlinear processes
- J. M. Lee and J. H. Lee, "Approximate dynamic programming based approaches for input-output data-driven control of nonlinear processes," Automatica, vol. 41, no. 7, 2005, pp.1281-1288.
- (2005) Automatica , vol.41 , Issue.7 , pp. 1281-1288
- Lee, J.M.¹ Lee, J.H.²

37
- 27144544987
- Choice of approximator and design of penalty function for an approximate dynamic programming based control approach
- J. M. Lee, N. S. Kaisare and J. H. Lee, "Choice of approximator and design of penalty function for an approximate dynamic programming based control approach," Journal of Process Control, vol.16, no. 2, 2006, pp.135-156.
- (2006) Journal of Process Control , vol.16 , Issue.2 , pp. 135-156
- Lee, J.M.¹ Kaisare, N.S.² Lee, J.H.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.