SCOPUS 정보 검색 플랫폼

Journal of Process Control

Volumn 16, Issue 2, 2006, Pages 135-156

Choice of approximator and design of penalty function for an approximate dynamic programming based control approach

(3) Lee, Jong Min a Kaisare, Niket S a Lee, Jay H a

a Georgia Institute of Technology (United States)

Author keywords

Approximate dynamic programming; k nearest neighbor; Neural network

Indexed keywords

APPROXIMATION THEORY; CONTROL SYSTEMS; EXTRAPOLATION; ITERATIVE METHODS; NEURAL NETWORKS;

APPROXIMATE DYNAMIC PROGRAMMING (ADP); FUNCTION APPROXIMATORS;

DYNAMIC PROGRAMMING;

EID: 27144544987 PISSN: 09591524 EISSN: None Source Type: Journal
DOI: 10.1016/j.jprocont.2005.04.010 Document Type: Article

Times cited : (59)

References (35)

1
- 85012688561
- Princeton University Press New Jersey
- R.E. Bellman Dynamic Programming 1957 Princeton University Press New Jersey
- (1957) Dynamic Programming
- Bellman, R.E.¹

2
- 0019537951
- Toward a modern theory of adaptive networks: Expectation and prediction
- R.S. Sutton, and A.G. Barto Toward a modern theory of adaptive networks: expectation and prediction Psycol. Rev. 88 2 1981 135 170
- (1981) Psycol. Rev. , vol.88 , Issue.2 , pp. 135-170
- Sutton, R.S.¹ Barto, A.G.²

3
- 0004102479
- MIT Press Cambridge, MA
- R.S. Sutton, and A.G. Barto Reinforcement Learning: An Introduction 1998 MIT Press Cambridge, MA
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

4
- 0003487482
- Athena Scientific Belmont, MA
- D.P. Bertsekas, and J.N. Tsitsiklis Neuro-dynamic Programming 1996 Athena Scientific Belmont, MA
- (1996) Neuro-dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

5
- 0037349318
- Simulation based strategy for nonlinear optimal control: Application to a microbial cell reactor
- N.S. Kaisare, J.M. Lee, and J.H. Lee Simulation based strategy for nonlinear optimal control: application to a microbial cell reactor Int. J. Robust Nonlinear Control 13 3-4 2002 347 363
- (2002) Int. J. Robust Nonlinear Control , vol.13 , Issue.3-4 , pp. 347-363
- Kaisare, N.S.¹ Lee, J.M.² Lee, J.H.³

6
- 2942655578
- Simulation-based learning of cost-to-go for control of nonlinear processes
- J.M. Lee, and J.H. Lee Simulation-based learning of cost-to-go for control of nonlinear processes Korean J. Chem. Eng. 21 2 2004 338 344
- (2004) Korean J. Chem. Eng. , vol.21 , Issue.2 , pp. 338-344
- Lee, J.M.¹ Lee, J.H.²

7
- 27144554879
- An approximate dynamic programming based approach to dual adaptive control
- submitted
- J.M. Lee, J.H. Lee, An approximate dynamic programming based approach to dual adaptive control, Automatica, submitted.
- Automatica
- Lee, J.M.¹ Lee, J.H.²

8
- 8744223007
- Simulation-based dual mode controller for nonlinear processes
- Preprints of IFAC
- J.M. Lee, J.H. Lee, Simulation-based dual mode controller for nonlinear processes, in: Preprints of 7th International Symposium on Advanced Control of Chemical Processes, IFAC, 2003, pp. 225-230.
- (2003) 7th International Symposium on Advanced Control of Chemical Processes , pp. 225-230
- Lee, J.M.¹ Lee, J.H.²

9
- 0001046225
- Practical issues in temporal-difference learning
- G.J. Tesauro Practical issues in temporal-difference learning Mach. Learn. 8 1992 257 277
- (1992) Mach. Learn. , vol.8 , pp. 257-277
- Tesauro, G.J.¹

10
- 0000859970
- Reinforcement learning applied to linear quadratic regulation
- S.J. Hanson J. Cowan C.L. Giles Morgan Kaufmann
- S.J. Bradtke Reinforcement learning applied to linear quadratic regulation S.J. Hanson J. Cowan C.L. Giles Advances in Neural Information Processing Systems vol. 5 1993 Morgan Kaufmann 295 302
- (1993) Advances in Neural Information Processing Systems , vol.5 , pp. 295-302
- Bradtke, S.J.¹

11
- 0000433333
- Temporal difference learning of position evaluation in the game of Go
- J.D. Cowan G. Tesauro J. Alspector Morgan Kaufmann San Mateo, CA
- N.N. Schraudolph, P. Dayan, and T.J. Sejnowski Temporal difference learning of position evaluation in the game of Go J.D. Cowan G. Tesauro J. Alspector Advances in Neural Information Processing Systems vol. 6 1994 Morgan Kaufmann San Mateo, CA 817 824
- (1994) Advances in Neural Information Processing Systems , vol.6 , pp. 817-824
- Schraudolph, N.N.¹ Dayan, P.² Sejnowski, T.J.³

12
- 0003270924
- Issues in using function approximation for reinforcement learning
- Lawrence Erlbaum Hillsdale, NJ
- S. Thrun, and A. Schwartz Issues in using function approximation for reinforcement learning Proceedings of the Fourth Connectionist Models Summer School 1993 Lawrence Erlbaum Hillsdale, NJ 255 263
- (1993) Proceedings of the Fourth Connectionist Models Summer School , pp. 255-263
- Thrun, S.¹ Schwartz, A.²

13
- 4544257178
- Approximating Q-values with basis function representations
- Lawrence Erlbaum Hillsdale, NJ
- P. Sabes Approximating Q-values with basis function representations Proceedings of the Fourth Connectionist Models Summer School 1993 Lawrence Erlbaum Hillsdale, NJ
- (1993) Proceedings of the Fourth Connectionist Models Summer School
- Sabes, P.¹

14
- 0001133021
- Generalization in reinforcement learning: Safely approximating the value function
- G. Tesauro D. Touretzky Morgan Kaufmann
- J.A. Boyan, and A.W. Moore Generalization in reinforcement learning: safely approximating the value function G. Tesauro D. Touretzky Advances in Neural Information Processing Systems vol. 7 1995 Morgan Kaufmann
- (1995) Advances in Neural Information Processing Systems , vol.7
- Boyan, J.A.¹ Moore, A.W.²

15
- 85156221438
- Generalization in reinforcement learning: Successful examples using sparse coarse coding
- D.S. Touretzky M.C. Mozer M.E. Hasselmo MIT Press
- R.S. Sutton Generalization in reinforcement learning: successful examples using sparse coarse coding D.S. Touretzky M.C. Mozer M.E. Hasselmo Advances in Neural Information Processing Systems vol. 8 1996 MIT Press 1038 1044
- (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1038-1044
- Sutton, R.S.¹

16
- 0038595393
- Stable function approximation in dynamic programming
- School of Computer Science, Carnegie Mellon University
- G.J. Gordon, Stable function approximation in dynamic programming, Tech. Rep. CMU-CS-95-103, School of Computer Science, Carnegie Mellon University, 1995.
- (1995) Tech. Rep. , vol.CMU-CS-95-103
- Gordon, G.J.¹

17
- 0031143730
- An analysis of temporal-difference learning with function approximation
- J.N. Tsitsiklis, and B. Van Roy An analysis of temporal-difference learning with function approximation IEEE Trans. Automat. Control 42 5 1997 674 690
- (1997) IEEE Trans. Automat. Control , vol.42 , Issue.5 , pp. 674-690
- Tsitsiklis, J.N.¹ Van Roy, B.²

18
- 0025229247
- Consistency of HDP applied to a simple reinforcement learning problem
- P.J. Werbos Consistency of HDP applied to a simple reinforcement learning problem Neural Networks 3 1990 179 189
- (1990) Neural Networks , vol.3 , pp. 179-189
- Werbos, P.J.¹

19
- 0036832956
- Kernel-based reinforcement learning
- D. Ormoneit, and S. Sen Kernel-based reinforcement learning Mach. Learn. 49 2002 161 178
- (2002) Mach. Learn. , vol.49 , pp. 161-178
- Ormoneit, D.¹ Sen, S.²

20
- 0036804005
- Kernel-based reinforcement learning in average-cost problems
- D. Ormoneit, and P. Glynn Kernel-based reinforcement learning in average-cost problems IEEE Trans. Automat. Control 47 10 2002 1624 1636
- (2002) IEEE Trans. Automat. Control , vol.47 , Issue.10 , pp. 1624-1636
- Ormoneit, D.¹ Glynn, P.²

21
- 0001898381
- Practical reinforcement learning in continuous spaces
- Morgan Kaufmann San Francisco, CA
- W.D. Smart, and L.P. Kaelbling Practical reinforcement learning in continuous spaces Proceedings of 17th International Conference on Machine Learning 2000 Morgan Kaufmann San Francisco, CA 903 910
- (2000) Proceedings of 17th International Conference on Machine Learning , pp. 903-910
- Smart, W.D.¹ Kaelbling, L.P.²

22
- 0001473437
- On estimation of a probability density function and mode
- E. Parzen On estimation of a probability density function and mode Ann. Math. Statist. 33 1962 1065 1076
- (1962) Ann. Math. Statist. , vol.33 , pp. 1065-1076
- Parzen, E.¹

23
- 0003998452
- Wiley New York
- M.L. Puterman Markovian Decision Problems 1994 Wiley New York
- (1994) Markovian Decision Problems
- Puterman, M.L.¹

24
- 0039225090
- A convergent reinforcement learning algorithm in the continuous case based on a finite difference method
- Morgan Kaufmann
- R. Munos A convergent reinforcement learning algorithm in the continuous case based on a finite difference method Proceedings of the International Joint Conference on Artificial Intelligence 1997 Morgan Kaufmann 826 831
- (1997) Proceedings of the International Joint Conference on Artificial Intelligence , pp. 826-831
- Munos, R.¹

25
- 0003644124
- MIT Press Cambridge, MA
- R.A. Howard Dynamic Programming and Markov Processes 1960 MIT Press Cambridge, MA
- (1960) Dynamic Programming and Markov Processes
- Howard, R.A.¹

26
- 84918834208
- A reinforcement learning approach to job-shop scheduling
- Morgan Kaufmann San Francisco, CA
- W. Zhang, and T.G. Dietterich A reinforcement learning approach to job-shop scheduling Proceedings of the Twelfth International Conference on Machine Learning (ICML'95) 1995 Morgan Kaufmann San Francisco, CA 1114 1120
- (1995) Proceedings of the Twelfth International Conference on Machine Learning (ICML'95) , pp. 1114-1120
- Zhang, W.¹ Dietterich, T.G.²

27
- 84898972974
- Reinforcement learning for dynamic channel allocation in cellular telephone systems
- M.C. Mozer M.I. Jordan T. Petsche MIT Press
- S. Singh, and D. Bertsekas Reinforcement learning for dynamic channel allocation in cellular telephone systems M.C. Mozer M.I. Jordan T. Petsche Advances in Neural Information Processing Systems vol. 9 1997 MIT Press 974 980
- (1997) Advances in Neural Information Processing Systems , vol.9 , pp. 974-980
- Singh, S.¹ Bertsekas, D.²

28
- 0003396255
- The MathWorks, Inc., Natick, MA
- H. Demuth, M. Beale, Neural Network Toolbox User's Guide (MATLAB), The MathWorks, Inc., Natick, MA, 2002.
- (2002) Neural Network Toolbox User's Guide (MATLAB)
- Demuth, H.¹ Beale, M.²

29
- 0000072761
- Extended Kalman filter based nonlinear model predictive control
- J.H. Lee, and N.L. Ricker Extended Kalman filter based nonlinear model predictive control Ind. Eng. Chem. Res. 33 6 1994 1530 1541
- (1994) Ind. Eng. Chem. Res. , vol.33 , Issue.6 , pp. 1530-1541
- Lee, J.H.¹ Ricker, N.L.²

30
- 0042494945
- The MathWorks, Inc., Natick, MA
- M. Morari, N.L. Ricker, Model Predictive Control Toolbox User's Guide (MATLAB), The MathWorks, Inc., Natick, MA, 1995.
- (1995) Model Predictive Control Toolbox User's Guide (MATLAB)
- Morari, M.¹ Ricker, N.L.²

31
- 0033495697
- Extended Kalman filter-based nonlinear model predictive control for a continuous MMA polymerization reactor
- S.-M. Ahn, M.-J. Park, and H.-K. Rhee Extended Kalman filter-based nonlinear model predictive control for a continuous MMA polymerization reactor Indust. Eng. Chem. Res. 38 1999 3942 3949
- (1999) Indust. Eng. Chem. Res. , vol.38 , pp. 3942-3949
- Ahn, S.-M.¹ Park, M.-J.² Rhee, H.-K.³

32
- 0003443397
- Chapman and Hall
- B.W. Silverman Density Estimation for Statistics and Data Analysis 1986 Chapman and Hall
- (1986) Density Estimation for Statistics and Data Analysis
- Silverman, B.W.¹

33
- 0343171028
- Estimation of a multivariate density
- T. Cacoullos Estimation of a multivariate density Ann. Inst. Statist. Math. (Tokyo) 18 2 1966 179 189
- (1966) Ann. Inst. Statist. Math. (Tokyo) , vol.18 , Issue.2 , pp. 179-189
- Cacoullos, T.¹

34
- 0003982971
- Springer-Verlag New York
- J. Nocedal, and S.J. Wright Numerical Optimization 1999 Springer-Verlag New York
- (1999) Numerical Optimization
- Nocedal, J.¹ Wright, S.J.²

35
- 18444379381
- Approximate dynamic programming based approaches for input-output data-driven control of nonlinear processes
- J.M. Lee, and J.H. Lee Approximate dynamic programming based approaches for input-output data-driven control of nonlinear processes Automatica 41 7 2005 1281 1288
- (2005) Automatica , vol.41 , Issue.7 , pp. 1281-1288
- Lee, J.M.¹ Lee, J.H.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.