메뉴 건너뛰기




Volumn 16, Issue 2, 2006, Pages 135-156

Choice of approximator and design of penalty function for an approximate dynamic programming based control approach

Author keywords

Approximate dynamic programming; k nearest neighbor; Neural network

Indexed keywords

APPROXIMATION THEORY; CONTROL SYSTEMS; EXTRAPOLATION; ITERATIVE METHODS; NEURAL NETWORKS;

EID: 27144544987     PISSN: 09591524     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.jprocont.2005.04.010     Document Type: Article
Times cited : (59)

References (35)
  • 1
  • 2
    • 0019537951 scopus 로고
    • Toward a modern theory of adaptive networks: Expectation and prediction
    • R.S. Sutton, and A.G. Barto Toward a modern theory of adaptive networks: expectation and prediction Psycol. Rev. 88 2 1981 135 170
    • (1981) Psycol. Rev. , vol.88 , Issue.2 , pp. 135-170
    • Sutton, R.S.1    Barto, A.G.2
  • 5
    • 0037349318 scopus 로고    scopus 로고
    • Simulation based strategy for nonlinear optimal control: Application to a microbial cell reactor
    • N.S. Kaisare, J.M. Lee, and J.H. Lee Simulation based strategy for nonlinear optimal control: application to a microbial cell reactor Int. J. Robust Nonlinear Control 13 3-4 2002 347 363
    • (2002) Int. J. Robust Nonlinear Control , vol.13 , Issue.3-4 , pp. 347-363
    • Kaisare, N.S.1    Lee, J.M.2    Lee, J.H.3
  • 6
    • 2942655578 scopus 로고    scopus 로고
    • Simulation-based learning of cost-to-go for control of nonlinear processes
    • J.M. Lee, and J.H. Lee Simulation-based learning of cost-to-go for control of nonlinear processes Korean J. Chem. Eng. 21 2 2004 338 344
    • (2004) Korean J. Chem. Eng. , vol.21 , Issue.2 , pp. 338-344
    • Lee, J.M.1    Lee, J.H.2
  • 7
    • 27144554879 scopus 로고    scopus 로고
    • An approximate dynamic programming based approach to dual adaptive control
    • submitted
    • J.M. Lee, J.H. Lee, An approximate dynamic programming based approach to dual adaptive control, Automatica, submitted.
    • Automatica
    • Lee, J.M.1    Lee, J.H.2
  • 9
    • 0001046225 scopus 로고
    • Practical issues in temporal-difference learning
    • G.J. Tesauro Practical issues in temporal-difference learning Mach. Learn. 8 1992 257 277
    • (1992) Mach. Learn. , vol.8 , pp. 257-277
    • Tesauro, G.J.1
  • 10
    • 0000859970 scopus 로고
    • Reinforcement learning applied to linear quadratic regulation
    • S.J. Hanson J. Cowan C.L. Giles Morgan Kaufmann
    • S.J. Bradtke Reinforcement learning applied to linear quadratic regulation S.J. Hanson J. Cowan C.L. Giles Advances in Neural Information Processing Systems vol. 5 1993 Morgan Kaufmann 295 302
    • (1993) Advances in Neural Information Processing Systems , vol.5 , pp. 295-302
    • Bradtke, S.J.1
  • 11
    • 0000433333 scopus 로고
    • Temporal difference learning of position evaluation in the game of Go
    • J.D. Cowan G. Tesauro J. Alspector Morgan Kaufmann San Mateo, CA
    • N.N. Schraudolph, P. Dayan, and T.J. Sejnowski Temporal difference learning of position evaluation in the game of Go J.D. Cowan G. Tesauro J. Alspector Advances in Neural Information Processing Systems vol. 6 1994 Morgan Kaufmann San Mateo, CA 817 824
    • (1994) Advances in Neural Information Processing Systems , vol.6 , pp. 817-824
    • Schraudolph, N.N.1    Dayan, P.2    Sejnowski, T.J.3
  • 12
  • 14
    • 0001133021 scopus 로고
    • Generalization in reinforcement learning: Safely approximating the value function
    • G. Tesauro D. Touretzky Morgan Kaufmann
    • J.A. Boyan, and A.W. Moore Generalization in reinforcement learning: safely approximating the value function G. Tesauro D. Touretzky Advances in Neural Information Processing Systems vol. 7 1995 Morgan Kaufmann
    • (1995) Advances in Neural Information Processing Systems , vol.7
    • Boyan, J.A.1    Moore, A.W.2
  • 15
    • 85156221438 scopus 로고    scopus 로고
    • Generalization in reinforcement learning: Successful examples using sparse coarse coding
    • D.S. Touretzky M.C. Mozer M.E. Hasselmo MIT Press
    • R.S. Sutton Generalization in reinforcement learning: successful examples using sparse coarse coding D.S. Touretzky M.C. Mozer M.E. Hasselmo Advances in Neural Information Processing Systems vol. 8 1996 MIT Press 1038 1044
    • (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1038-1044
    • Sutton, R.S.1
  • 16
    • 0038595393 scopus 로고
    • Stable function approximation in dynamic programming
    • School of Computer Science, Carnegie Mellon University
    • G.J. Gordon, Stable function approximation in dynamic programming, Tech. Rep. CMU-CS-95-103, School of Computer Science, Carnegie Mellon University, 1995.
    • (1995) Tech. Rep. , vol.CMU-CS-95-103
    • Gordon, G.J.1
  • 17
    • 0031143730 scopus 로고    scopus 로고
    • An analysis of temporal-difference learning with function approximation
    • J.N. Tsitsiklis, and B. Van Roy An analysis of temporal-difference learning with function approximation IEEE Trans. Automat. Control 42 5 1997 674 690
    • (1997) IEEE Trans. Automat. Control , vol.42 , Issue.5 , pp. 674-690
    • Tsitsiklis, J.N.1    Van Roy, B.2
  • 18
    • 0025229247 scopus 로고
    • Consistency of HDP applied to a simple reinforcement learning problem
    • P.J. Werbos Consistency of HDP applied to a simple reinforcement learning problem Neural Networks 3 1990 179 189
    • (1990) Neural Networks , vol.3 , pp. 179-189
    • Werbos, P.J.1
  • 19
    • 0036832956 scopus 로고    scopus 로고
    • Kernel-based reinforcement learning
    • D. Ormoneit, and S. Sen Kernel-based reinforcement learning Mach. Learn. 49 2002 161 178
    • (2002) Mach. Learn. , vol.49 , pp. 161-178
    • Ormoneit, D.1    Sen, S.2
  • 20
    • 0036804005 scopus 로고    scopus 로고
    • Kernel-based reinforcement learning in average-cost problems
    • D. Ormoneit, and P. Glynn Kernel-based reinforcement learning in average-cost problems IEEE Trans. Automat. Control 47 10 2002 1624 1636
    • (2002) IEEE Trans. Automat. Control , vol.47 , Issue.10 , pp. 1624-1636
    • Ormoneit, D.1    Glynn, P.2
  • 22
    • 0001473437 scopus 로고
    • On estimation of a probability density function and mode
    • E. Parzen On estimation of a probability density function and mode Ann. Math. Statist. 33 1962 1065 1076
    • (1962) Ann. Math. Statist. , vol.33 , pp. 1065-1076
    • Parzen, E.1
  • 24
    • 0039225090 scopus 로고    scopus 로고
    • A convergent reinforcement learning algorithm in the continuous case based on a finite difference method
    • Morgan Kaufmann
    • R. Munos A convergent reinforcement learning algorithm in the continuous case based on a finite difference method Proceedings of the International Joint Conference on Artificial Intelligence 1997 Morgan Kaufmann 826 831
    • (1997) Proceedings of the International Joint Conference on Artificial Intelligence , pp. 826-831
    • Munos, R.1
  • 27
    • 84898972974 scopus 로고    scopus 로고
    • Reinforcement learning for dynamic channel allocation in cellular telephone systems
    • M.C. Mozer M.I. Jordan T. Petsche MIT Press
    • S. Singh, and D. Bertsekas Reinforcement learning for dynamic channel allocation in cellular telephone systems M.C. Mozer M.I. Jordan T. Petsche Advances in Neural Information Processing Systems vol. 9 1997 MIT Press 974 980
    • (1997) Advances in Neural Information Processing Systems , vol.9 , pp. 974-980
    • Singh, S.1    Bertsekas, D.2
  • 29
    • 0000072761 scopus 로고
    • Extended Kalman filter based nonlinear model predictive control
    • J.H. Lee, and N.L. Ricker Extended Kalman filter based nonlinear model predictive control Ind. Eng. Chem. Res. 33 6 1994 1530 1541
    • (1994) Ind. Eng. Chem. Res. , vol.33 , Issue.6 , pp. 1530-1541
    • Lee, J.H.1    Ricker, N.L.2
  • 31
    • 0033495697 scopus 로고    scopus 로고
    • Extended Kalman filter-based nonlinear model predictive control for a continuous MMA polymerization reactor
    • S.-M. Ahn, M.-J. Park, and H.-K. Rhee Extended Kalman filter-based nonlinear model predictive control for a continuous MMA polymerization reactor Indust. Eng. Chem. Res. 38 1999 3942 3949
    • (1999) Indust. Eng. Chem. Res. , vol.38 , pp. 3942-3949
    • Ahn, S.-M.1    Park, M.-J.2    Rhee, H.-K.3
  • 33
    • 0343171028 scopus 로고
    • Estimation of a multivariate density
    • T. Cacoullos Estimation of a multivariate density Ann. Inst. Statist. Math. (Tokyo) 18 2 1966 179 189
    • (1966) Ann. Inst. Statist. Math. (Tokyo) , vol.18 , Issue.2 , pp. 179-189
    • Cacoullos, T.1
  • 35
    • 18444379381 scopus 로고    scopus 로고
    • Approximate dynamic programming based approaches for input-output data-driven control of nonlinear processes
    • J.M. Lee, and J.H. Lee Approximate dynamic programming based approaches for input-output data-driven control of nonlinear processes Automatica 41 7 2005 1281 1288
    • (2005) Automatica , vol.41 , Issue.7 , pp. 1281-1288
    • Lee, J.M.1    Lee, J.H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.