-
1
-
-
84893393162
-
ADP: Goals, Opportunities and Principles
-
IEEE Press John Wiley & sons, Inc
-
P. Werbos, "ADP: Goals, Opportunities and Principles," HANDBOOK of LEARNING and APPROXIMATE DYNAMIC PROGRAMMING, IEEE Press John Wiley & sons, Inc. 2004, pp.1-42.
-
(2004)
HANDBOOK of LEARNING and APPROXIMATE DYNAMIC PROGRAMMING
, pp. 1-42
-
-
Werbos, P.1
-
2
-
-
84923005963
-
Approximate Dynamic Programming for High-Dimensional Resource Allocation Problems
-
IEEE Press John Wiley & sons, Inc
-
W. B. Powell and B. Van Roy, "Approximate Dynamic Programming for High-Dimensional Resource Allocation Problems," HANDBOOK of LEARNING and APPROXIMATE DYNAMIC PROGRAMMING, IEEE Press John Wiley & sons, Inc. 2004, pp.261-284
-
(2004)
HANDBOOK of LEARNING and APPROXIMATE DYNAMIC PROGRAMMING
, pp. 261-284
-
-
Powell, W.B.1
Van Roy, B.2
-
3
-
-
0003950434
-
Stable Adaptive Control Using New Critic Designs
-
ArXiv.org: adaporg/9810001
-
P. Werbos, "Stable Adaptive Control Using New Critic Designs," 1998 , (ArXiv.org: adaporg/9810001).
-
(1998)
-
-
Werbos, P.1
-
4
-
-
0002557583
-
Advanced forecasting for global crisis warning and models of intelligence
-
P. Werbos, "Advanced forecasting for global crisis warning and models of intelligence," General Systems Yearbook, 1977.
-
(1977)
General Systems Yearbook
-
-
Werbos, P.1
-
6
-
-
0015667648
-
Punish/reward: Learning with a Critic in adaptive threshold systems
-
B. Widrow, N, Gupta and S. Maitra, "Punish/reward: learning with a Critic in adaptive threshold systems," IEEE Trans. SMC, vol. 5, 1973, pp.455-465.
-
(1973)
IEEE Trans. SMC
, vol.5
, pp. 455-465
-
-
Widrow, B.1
Gupta, N.2
Maitra, S.3
-
7
-
-
0020970738
-
Neuronlike adaptive elements that can solve difficult learning control problems
-
A. Barto, R. Sutton and C. Anderson, "Neuronlike adaptive elements that can solve difficult learning control problems," IEEE Trans. SMC, vol. 13, 1983, pp.834-846.
-
(1983)
IEEE Trans. SMC
, vol.13
, pp. 834-846
-
-
Barto, A.1
Sutton, R.2
Anderson, C.3
-
8
-
-
73649144578
-
Dynamic Programming and Suboptimal Control: A Survey from ADP to MPC,
-
2632
-
D. Dimitri, P. Bertsekas, "Dynamic Programming and Suboptimal Control: A Survey from ADP to MPC," 2005, Report LIDS 2632.
-
(2005)
Report LIDS
-
-
Dimitri, D.1
Bertsekas, P.2
-
9
-
-
0041345290
-
Efficient Reinforcement Learning Using Recursive Least-Squares Methods
-
Xin Xu, Han-gen He and Dewen Hu, "Efficient Reinforcement Learning Using Recursive Least-Squares Methods," Journal of Artificial Intelligence Research , Vol.16 , 2002, pp.259-292.
-
(2002)
Journal of Artificial Intelligence Research
, vol.16
, pp. 259-292
-
-
Xu, X.1
He, H.-G.2
Hu, D.3
-
10
-
-
0000430514
-
The Convergence of TD(λ) for General λ
-
P. D. Dayan, "The Convergence of TD(λ) for General λ," Machine Learning, vol. 8, 1992, pp.341-362.
-
(1992)
Machine Learning
, vol.8
, pp. 341-362
-
-
Dayan, P.D.1
-
11
-
-
73649094483
-
An Analysis of Temporal-Difference Learning with Function Approximation
-
John N. Tsitsiklis and Benjamin Van Roy, "An Analysis of Temporal-Difference Learning with Function Approximation" Van Roy's homepages , 1997.
-
(1997)
Van Roy's homepages
-
-
Tsitsiklis, J.N.1
Van Roy, B.2
-
12
-
-
0141704189
-
Accelerating Critic Learning in Approximate Dynamic Programming Via Value Templates and Perceptual Learning
-
T. T. Shannon, R. A. Santiago and G. Lendaris, "Accelerating Critic Learning in Approximate Dynamic Programming Via Value Templates and Perceptual Learning," IEEEE 0-7803-7898-9/03, 2003, pp.2922-2927
-
(2003)
IEEEE 0-7803-7898-9/03
, pp. 2922-2927
-
-
Shannon, T.T.1
Santiago, R.A.2
Lendaris, G.3
-
13
-
-
33847661590
-
Adaptive Critic Design Based Neuro-Fuzzy Controller for a Static Compensator in a Multimachine Power System
-
S. Mohagheghi and Ganesh K. Venayagamoorthy, "Adaptive Critic Design Based Neuro-Fuzzy Controller for a Static Compensator in a Multimachine Power System," IEEE Transactions on Power Syatems, vol. 21, NO. 4, 2006 pp.1744-1755.
-
(2006)
IEEE Transactions on Power Syatems
, vol.21
, Issue.4
, pp. 1744-1755
-
-
Mohagheghi, S.1
Venayagamoorthy, G.K.2
-
16
-
-
40649113421
-
Automotive Engine Torque and Air-Fuel Ratio Control Using Dual Heuristic Dynamic Programming
-
H. Javaherian, D. Liu, and Olesia Kovalenko, "Automotive Engine Torque and Air-Fuel Ratio Control Using Dual Heuristic Dynamic Programming," 2006 International Joint Conference on Neural Networks, 2006, pp.518-526.
-
(2006)
2006 International Joint Conference on Neural Networks
, pp. 518-526
-
-
Javaherian, H.1
Liu, D.2
Kovalenko, O.3
-
17
-
-
8744288519
-
Adaptive critic learning techniques for automotive engine control
-
H. Javaherian, D. Liu, Y. Zhang and O. Kovalenko, "Adaptive critic learning techniques for automotive engine control," Proceedings of the American Control Conference, ,2004, pp.4066-4071.
-
(2004)
Proceedings of the American Control Conference
, pp. 4066-4071
-
-
Javaherian, H.1
Liu, D.2
Zhang, Y.3
Kovalenko, O.4
-
18
-
-
20344386215
-
Neural network modeling and adaptive critic control of automotive fuel-injection systems
-
O. Kovalenko, D. Liu, and H. Javaherian, "Neural network modeling and adaptive critic control of automotive fuel-injection systems," Proceedings of the IEEE International Symposium on Intelligent Control, 2004, pp.386-373.
-
(2004)
Proceedings of the IEEE International Symposium on Intelligent Control
, pp. 386-373
-
-
Kovalenko, O.1
Liu, D.2
Javaherian, H.3
-
22
-
-
33747862706
-
Relaxing Dynamic Programming
-
B. Lincoln and A. Rantzer, "Relaxing Dynamic Programming," IEEE Transactions on Automatic Control, vol.51, No. 8, 2006, pp.249-1261.
-
(2006)
IEEE Transactions on Automatic Control
, vol.51
, Issue.8
, pp. 249-1261
-
-
Lincoln, B.1
Rantzer, A.2
-
24
-
-
15744363553
-
-
Ju Jiang, M. Kamel and Lei Chen, Reinforcement Learning and Aggregation, Proceedings of IEEE International Conference on Systems, Man, and Cybernetics 04, 2004, pp.1303-1308.
-
Ju Jiang, M. Kamel and Lei Chen, "Reinforcement Learning and Aggregation," Proceedings of IEEE International Conference on Systems, Man, and Cybernetics 04, 2004, pp.1303-1308.
-
-
-
-
25
-
-
34249833101
-
Q-learning
-
C. J, C. H. Watkins and P. Dayan, " Q-learning ", Machine Learning, vol. 8, no. 3, 1992, pp.279-292.
-
(1992)
Machine Learning
, vol.8
, Issue.3
, pp. 279-292
-
-
Watkins, C.J.C.H.1
Dayan, P.2
-
26
-
-
0004102479
-
-
A Bradford Book, The MIT Press, Cambridge, Massachusetts, London, England, ISBN 0-262-19398-1
-
R. S. Sutton and A. G. Barto, "Reinforcement Learning, An Introduction." A Bradford Book, The MIT Press, Cambridge, Massachusetts, London, England, ISBN 0-262-19398-1, 1998.
-
(1998)
Reinforcement Learning, An Introduction
-
-
Sutton, R.S.1
Barto, A.G.2
-
27
-
-
0029679044
-
Reinforcement Learning: A Survey
-
L. P. Kaelbling, M. L. Littman and A. W. Moore, "Reinforcement Learning: A Survey ", Journal of Artificial Intelligence Research, no.4, 1996, pp.237-255.
-
(1996)
Journal of Artificial Intelligence Research
, Issue.4
, pp. 237-255
-
-
Kaelbling, L.P.1
Littman, M.L.2
Moore, A.W.3
-
28
-
-
34547365679
-
An Extension of Genetic Network Programming with Reinforcement Learning Using Actor-Critic
-
H. Hatakeyama and S. Mabu, "An Extension of Genetic Network Programming with Reinforcement Learning Using Actor-Critic," 2006 IEEE Congress on Evolutionary Computation, 2006.
-
(2006)
IEEE Congress on Evolutionary Computation
-
-
Hatakeyama, H.1
Mabu, S.2
-
29
-
-
35048854058
-
Genetic Network Programming with Reinforcement Learning and its Performance Evaluation
-
S. Mabu, K. Hirasawa and J. Hu, "Genetic Network Programming with Reinforcement Learning and its Performance Evaluation", 2004 Gnenetic and Evolutionary Computation Conference, part II, 2004, pp.710-711.
-
(2004)
2004 Gnenetic and Evolutionary Computation Conference, part II
, pp. 710-711
-
-
Mabu, S.1
Hirasawa, K.2
Hu, J.3
-
30
-
-
0034867763
-
Comparison between genetic network programming (GNP) and genetic programming (GP)
-
K. Hirasawa, M. Okubo, H. Katagiri, J. Hu, and J. Murata, "Comparison between genetic network programming (GNP) and genetic programming (GP)," Proc. of 2001 Congress on Evolutionary Computation, 2001, pp.1276-1282.
-
(2001)
Proc. of 2001 Congress on Evolutionary Computation
, pp. 1276-1282
-
-
Hirasawa, K.1
Okubo, M.2
Katagiri, H.3
Hu, J.4
Murata, J.5
-
32
-
-
33745951445
-
-
TANG Hao, ZHOU Lei and YUAN Ji-bin, Unified NDP method based on TD(0) learning for both average and discounted Markov decision processes, Control Theory& Application, vo1.23, no.2, 2006, pp.292-297.
-
TANG Hao, ZHOU Lei and YUAN Ji-bin, "Unified NDP method based on TD(0) learning for both average and discounted Markov decision processes," Control Theory& Application, vo1.23, no.2, 2006, pp.292-297.
-
-
-
-
33
-
-
23444449149
-
-
TANG Hao, YUAN Ji-Bin, LU Yang, and CHENG Wen-Juan, Performance Potential-based Neuro-dynamic Programming for SMDPs, ACTA AUTOMATICA SINICA, 31, no. 4, 2005, pp.642-646.
-
TANG Hao, YUAN Ji-Bin, LU Yang, and CHENG Wen-Juan, "Performance Potential-based Neuro-dynamic Programming for SMDPs," ACTA AUTOMATICA SINICA, vol. 31, no. 4, 2005, pp.642-646.
-
-
-
-
34
-
-
2942718962
-
A Simulation Optimization Algorithm for CTMDPs Based on Randomized Stationary Policies
-
TANG Hao, XI Hong-Sheng and YIN Bo-Qun, "A Simulation Optimization Algorithm for CTMDPs Based on Randomized Stationary Policies," ACTA AUTOMATICA SINICA, vol. 30, No.2, 2004, pp.229-235.
-
(2004)
ACTA AUTOMATICA SINICA
, vol.30
, Issue.2
, pp. 229-235
-
-
Hao, T.A.N.G.1
Hong-Sheng, X.I.2
Bo-Qun, Y.I.N.3
-
35
-
-
33747872589
-
Approximate dynamic programming based approach to process control and scheduling
-
J. H. Lee and J. M. Lee, "Approximate dynamic programming based approach to process control and scheduling," Computers and Chemical Engineering, no. 30, 2006, pp.1603-1618.
-
(2006)
Computers and Chemical Engineering
, Issue.30
, pp. 1603-1618
-
-
Lee, J.H.1
Lee, J.M.2
-
36
-
-
18444379381
-
Approximate dynamic programming based approaches for input-output data-driven control of nonlinear processes
-
J. M. Lee and J. H. Lee, "Approximate dynamic programming based approaches for input-output data-driven control of nonlinear processes," Automatica, vol. 41, no. 7, 2005, pp.281-1288.
-
(2005)
Automatica
, vol.41
, Issue.7
, pp. 281-1288
-
-
Lee, J.M.1
Lee, J.H.2
-
37
-
-
27144544987
-
Choice of approximator and design of penalty function for an approximate dynamic programming based control approach
-
J. M. Lee, N. S. Kaisare and J. H. Lee, "Choice of approximator and design of penalty function for an approximate dynamic programming based control approach," Journal of Process Control, vol.16, no. 2, 2006, pp.135-156.
-
(2006)
Journal of Process Control
, vol.16
, Issue.2
, pp. 135-156
-
-
Lee, J.M.1
Kaisare, N.S.2
Lee, J.H.3
|