-
1
-
-
85012688561
-
-
Princeton University Press New Jersey
-
R.E. Bellman Dynamic Programming 1957 Princeton University Press New Jersey
-
(1957)
Dynamic Programming
-
-
Bellman, R.E.1
-
2
-
-
0019537951
-
Toward a modern theory of adaptive networks: Expectation and prediction
-
R.S. Sutton, and A.G. Barto Toward a modern theory of adaptive networks: expectation and prediction Psychol. Rev. 88 2 1981 135 170
-
(1981)
Psychol. Rev.
, vol.88
, Issue.2
, pp. 135-170
-
-
Sutton, R.S.1
Barto, A.G.2
-
5
-
-
0037349318
-
Simulation based strategy for nonlinear optimal control: Application to a microbial cell reactor
-
N.S. Kaisare, J.M. Lee, and J.H. Lee Simulation based strategy for nonlinear optimal control: application to a microbial cell reactor Int. J. Robust Nonlinear Control 13 3-4 2002 347 363
-
(2002)
Int. J. Robust Nonlinear Control
, vol.13
, Issue.3-4
, pp. 347-363
-
-
Kaisare, N.S.1
Lee, J.M.2
Lee, J.H.3
-
6
-
-
2942655578
-
Simulation-based learning of cost-to-go for control of nonlinear processes
-
J.M. Lee, and J.H. Lee Simulation-based learning of cost-to-go for control of nonlinear processes Korean J. Chem. Eng. 21 2 2004 338 344
-
(2004)
Korean J. Chem. Eng.
, vol.21
, Issue.2
, pp. 338-344
-
-
Lee, J.M.1
Lee, J.H.2
-
7
-
-
27144554879
-
An approximate dynamic programming based approach to dual adaptive control
-
submitted
-
J.M. Lee, J.H. Lee, An approximate dynamic programming based approach to dual adaptive control, Automatica, submitted.
-
Automatica
-
-
Lee, J.M.1
Lee, J.H.2
-
9
-
-
0001046225
-
Practical issues in temporal-difference learning
-
G.J. Tesauro Practical issues in temporal-difference learning Mach. Learn. 8 1992 257 277
-
(1992)
Mach. Learn.
, vol.8
, pp. 257-277
-
-
Tesauro, G.J.1
-
10
-
-
0000859970
-
Reinforcement learning applied to linear quadratic regulation
-
S.J. Hanson J. Cowan C.L. Giles Morgan Kaufmann
-
S.J. Bradtke Reinforcement learning applied to linear quadratic regulation S.J. Hanson J. Cowan C.L. Giles Advances in Neural Information Processing Systems vol. 5 1993 Morgan Kaufmann 295 302
-
(1993)
Advances in Neural Information Processing Systems
, vol.5
, pp. 295-302
-
-
Bradtke, S.J.1
-
11
-
-
0000433333
-
Temporal difference learning of position evaluation in the game of Go
-
J.D. Cowan G. Tesauro J. Alspector Morgan Kaufmann San Mateo, CA
-
N.N. Schraudolph, P. Dayan, and T.J. Sejnowski Temporal difference learning of position evaluation in the game of Go J.D. Cowan G. Tesauro J. Alspector Advances in Neural Information Processing Systems vol. 6 1994 Morgan Kaufmann San Mateo, CA 817 824
-
(1994)
Advances in Neural Information Processing Systems
, vol.6
, pp. 817-824
-
-
Schraudolph, N.N.1
Dayan, P.2
Sejnowski, T.J.3
-
14
-
-
0001133021
-
Generalization in reinforcement learning: Safely approximating the value function
-
G. Tesauro D. Touretzky Morgan Kaufmann
-
J.A. Boyan, and A.W. Moore Generalization in reinforcement learning: safely approximating the value function G. Tesauro D. Touretzky Advances in Neural Information Processing Systems vol. 7 1995 Morgan Kaufmann
-
(1995)
Advances in Neural Information Processing Systems
, vol.7
-
-
Boyan, J.A.1
Moore, A.W.2
-
15
-
-
85156221438
-
Generalization in reinforcement learning: Successful examples using sparse coarse coding
-
D.S. Touretzky M.C. Mozer M.E. Hasselmo MIT Press
-
R.S. Sutton Generalization in reinforcement learning: successful examples using sparse coarse coding D.S. Touretzky M.C. Mozer M.E. Hasselmo Advances in Neural Information Processing Systems vol. 8 1996 MIT Press 1038 1044
-
(1996)
Advances in Neural Information Processing Systems
, vol.8
, pp. 1038-1044
-
-
Sutton, R.S.1
-
16
-
-
0038595393
-
Stable function approximation in dynamic programming
-
School of Computer Science, Carnegie Mellon University
-
G.J. Gordon, Stable function approximation in dynamic programming, Tech. Rep. CMU-CS-95-103, School of Computer Science, Carnegie Mellon University, 1995.
-
(1995)
Tech. Rep.
, vol.CMU-CS-95-103
-
-
Gordon, G.J.1
-
17
-
-
0031143730
-
An analysis of temporal-difference learning with function approximation
-
J.N. Tsitsiklis, and B. Van Roy An analysis of temporal-difference learning with function approximation IEEE Trans. Automat. Control 42 5 1997 674 690
-
(1997)
IEEE Trans. Automat. Control
, vol.42
, Issue.5
, pp. 674-690
-
-
Tsitsiklis, J.N.1
Van Roy, B.2
-
18
-
-
0025229247
-
Consistency of HDP applied to a simple reinforcement learning problem
-
P.J. Werbos Consistency of HDP applied to a simple reinforcement learning problem Neural Networks 3 1990 179 189
-
(1990)
Neural Networks
, vol.3
, pp. 179-189
-
-
Werbos, P.J.1
-
19
-
-
0036832956
-
Kernel-based reinforcement learning
-
D. Ormoneit, and S. Sen Kernel-based reinforcement learning Mach. Learn. 49 2002 161 178
-
(2002)
Mach. Learn.
, vol.49
, pp. 161-178
-
-
Ormoneit, D.1
Sen, S.2
-
20
-
-
0036804005
-
Kernel-based reinforcement learning in average-cost problems
-
D. Ormoneit, and P. Glynn Kernel-based reinforcement learning in average-cost problems IEEE Trans. Automat. Control 47 10 2002 1624 1636
-
(2002)
IEEE Trans. Automat. Control
, vol.47
, Issue.10
, pp. 1624-1636
-
-
Ormoneit, D.1
Glynn, P.2
-
22
-
-
0001473437
-
On estimation of a probability density function and mode
-
E. Parzen On estimation of a probability density function and mode Ann. Math. Statist. 33 1962 1065 1076
-
(1962)
Ann. Math. Statist.
, vol.33
, pp. 1065-1076
-
-
Parzen, E.1
-
24
-
-
0039225090
-
A convergent reinforcement learning algorithm in the continuous case based on a finite difference method
-
Morgan Kaufmann
-
R. Munos A convergent reinforcement learning algorithm in the continuous case based on a finite difference method Proceedings of the International Joint Conference on Artificial Intelligence 1997 Morgan Kaufmann 826 831
-
(1997)
Proceedings of the International Joint Conference on Artificial Intelligence
, pp. 826-831
-
-
Munos, R.1
-
27
-
-
84898972974
-
Reinforcement learning for dynamic channel allocation in cellular telephone systems
-
M.C. Mozer M.I. Jordan T. Petsche MIT Press
-
S. Singh, and D. Bertsekas Reinforcement learning for dynamic channel allocation in cellular telephone systems M.C. Mozer M.I. Jordan T. Petsche Advances in Neural Information Processing Systems vol. 9 1997 MIT Press 974 980
-
(1997)
Advances in Neural Information Processing Systems
, vol.9
, pp. 974-980
-
-
Singh, S.1
Bertsekas, D.2
-
28
-
-
0003396255
-
-
The MathWorks, Inc., Natick, MA
-
H. Demuth, M. Beale, Neural Network Toolbox User's Guide (MATLAB), The MathWorks, Inc., Natick, MA, 2002.
-
(2002)
Neural Network Toolbox User's Guide (MATLAB)
-
-
Demuth, H.1
Beale, M.2
-
29
-
-
0000072761
-
Extended Kalman filter based nonlinear model predictive control
-
J.H. Lee, and N.L. Ricker Extended Kalman filter based nonlinear model predictive control Ind. Eng. Chem. Res. 33 6 1994 1530 1541
-
(1994)
Ind. Eng. Chem. Res.
, vol.33
, Issue.6
, pp. 1530-1541
-
-
Lee, J.H.1
Ricker, N.L.2
-
30
-
-
0042494945
-
-
The MathWorks, Inc., Natick, MA
-
M. Morari, N.L. Ricker, Model Predictive Control Toolbox User's Guide (MATLAB), The MathWorks, Inc., Natick, MA, 1995.
-
(1995)
Model Predictive Control Toolbox User's Guide (MATLAB)
-
-
Morari, M.1
Ricker, N.L.2
-
31
-
-
0033495697
-
Extended Kalman filter-based nonlinear model predictive control for a continuous MMA polymerization reactor
-
S.-M. Ahn, M.-J. Park, and H.-K. Rhee Extended Kalman filter-based nonlinear model predictive control for a continuous MMA polymerization reactor Ind. Eng. Chem. Res. 38 1999 3942 3949
-
(1999)
Ind. Eng. Chem. Res.
, vol.38
, pp. 3942-3949
-
-
Ahn, S.-M.1
Park, M.-J.2
Rhee, H.-K.3
-
33
-
-
0343171028
-
Estimation of a multivariate density
-
T. Cacoullos Estimation of a multivariate density Ann. Inst. Statist. Math. (Tokyo) 18 2 1966 179 189
-
(1966)
Ann. Inst. Statist. Math. (Tokyo)
, vol.18
, Issue.2
, pp. 179-189
-
-
Cacoullos, T.1
-
35
-
-
18444379381
-
Approximate dynamic programming based approaches for input-output data-driven control of nonlinear processes
-
J.M. Lee, and J.H. Lee Approximate dynamic programming based approaches for input-output data-driven control of nonlinear processes Automatica 41 7 2005 1281 1288
-
(2005)
Automatica
, vol.41
, Issue.7
, pp. 1281-1288
-
-
Lee, J.M.1
Lee, J.H.2