-
2
-
-
0035977495
-
Robust reinforcement learning control with static and dynamic stability
-
C. Anderson, R. M. Kretchner, R. M. Young, and D. C. Hittle, Robust reinforcement learning control with static and dynamic stability, International Journal of Robust and Nonlinear Control, vol. 11, 2001.
-
(2001)
International Journal of Robust and Nonlinear Control
, vol.11
-
-
Anderson, C.1
Kretchner, R.M.2
Young, R.M.3
Hittle, D.C.4
-
4
-
-
0346686091
-
A new method for stability analysis of nonlinear discrete-time systems
-
N. Barabanov and D. Prokhorov, A new method for stability analysis of nonlinear discrete-time systems, IEEE Trans. Automatic Control, vol. 48, no. 12, 2003.
-
(2003)
IEEE Trans. Automatic Control
, vol.48
, Issue.12
-
-
Barabanov, N.1
Prokhorov, D.2
-
5
-
-
0003544743
-
-
chapter Reinforcement Learning and Adaptive Critic Methods, Van Nostrand-Reinhold, New York
-
A. G. Barto, Handbook of Intelligent Control, chapter Reinforcement Learning and Adaptive Critic Methods, pp. 469-491, Van Nostrand-Reinhold, New York, 1992.
-
(1992)
Handbook of Intelligent Control
, pp. 469-491
-
-
Barto, A.G.1
-
7
-
-
85012688561
-
-
Princeton University Press, Princeton, NJ
-
R. E. Bellman, Dynamic Programming, Princeton University Press, Princeton, NJ, 1957.
-
(1957)
Dynamic Programming
-
-
Bellman, R.E.1
-
9
-
-
0033750123
-
Neurocontroller for fuzzy ball-and-beam systems with nonlinear, nonuniform friction
-
P. Eaton, D. Prokhorov, and D. Wunsch, Neurocontroller for fuzzy ball-and-beam systems with nonlinear, nonuniform friction, IEEE Trans. Neural Networks, pp. 423-435, 2000.
-
(2000)
IEEE Trans. Neural Networks
, pp. 423-435
-
-
Eaton, P.1
Prokhorov, D.2
Wunsch, D.3
-
10
-
-
85088722092
-
Robust adaptive critic based neurocontrollers for missiles with model uncertainties
-
Montreal, Canada
-
Z. Huang and S. N. Balakrishnan, Robust adaptive critic based neurocontrollers for missiles with model uncertainties, 2001 AIAA Guidance, Navigation and Control Conference, Montreal, Canada, 2001.
-
(2001)
2001 AIAA Guidance, Navigation and Control Conference
-
-
Huang, Z.1
Balakrishnan, S.N.2
-
11
-
-
0033717755
-
Robust adaptive critic based neurocontrollers for systems with input uncertainties
-
Z. Huang and S. N. Balakrishnan, Robust adaptive critic based neurocontrollers for systems with input uncertainties, Proc. of IJCNN’2000, pp. B-263, 2000.
-
(2000)
Proc. of IJCNN’2000
-
-
Huang, Z.1
Balakrishnan, S.N.2
-
12
-
-
0029679044
-
Reinforcement learning: A survey
-
L. P. Kaelbling, M. L. Littman, and A. W. Moore, Reinforcement learning: A survey, Journal of Artificial Intelligence Research, vol. 4, pp. 237-285, 1996.
-
(1996)
Journal of Artificial Intelligence Research
, vol.4
, pp. 237-285
-
-
Kaelbling, L.P.1
Littman, M.L.2
Moore, A.W.3
-
13
-
-
0030677953
-
Immunized adaptive critics
-
Houston, A version of this was presented at ANNIE’96, November, St. Louis, MO
-
K. Krishna Kumar and J. Neidhoefer, Immunized adaptive critics, invited session on Adaptive Critics, ICNN’97, Houston, 1997. A version of this was presented at ANNIE’96, November, St. Louis, MO.
-
(1997)
Adaptive Critics, ICNN’97
-
-
Krishna Kumar, K.1
Neidhoefer, J.2
-
15
-
-
0031377386
-
More on training strategies for critic and action neural networks in dual heuristic programming method (invited paper)
-
Orlando, FL
-
G. G. Lendaris, T. T. Shannon, and C. Paintz, More on training strategies for critic and action neural networks in dual heuristic programming method (invited paper), Proc. of Systems, Man and Cybernetics Society International Conference’97, Orlando, FL, 1997.
-
(1997)
Proc. of Systems, Man and Cybernetics Society International Conference’97
-
-
Lendaris, G.G.1
Shannon, T.T.2
Paintz, C.3
-
17
-
-
70449456752
-
Dual heuristic programming for fuzzy control
-
Vancouver, B.C
-
G. G. Lendaris, R. A. Santiago, and M. S. Carroll, Dual heuristic programming for fuzzy control, Proceedings of IFSA/NAFIPS Conference, Vancouver, B.C., 2002.
-
(2002)
Proceedings of IFSA/NAFIPS Conference
-
-
Lendaris, G.G.1
Santiago, R.A.2
Carroll, M.S.3
-
19
-
-
0141480188
-
Controller design via adaptive critic and model reference methods
-
Portland, OR
-
G. G. Lendaris, R. A. Santiago, J. McCarthy, and M. S. Carroll, Controller design via adaptive critic and model reference methods, Proc. of International Conference on Neural Networks’03 (IJCNN’2003), Portland, OR, 2003.
-
(2003)
Proc. of International Conference on Neural Networks’03 (IJCNN’2003)
-
-
Lendaris, G.G.1
Santiago, R.A.2
McCarthy, J.3
Carroll, M.S.4
-
20
-
-
0034480936
-
Controller design (from scratch) using approximate dynamic programming
-
Patras, Greece
-
G. G. Lendaris and L. J. Schultz, Controller design (from scratch) using approximate dynamic programming, Proc. of IEEE International Symposium on Intelligent Control’2000, (IEEE-ISIC’2000), Patras, Greece, 2000.
-
(2000)
Proc. of IEEE International Symposium on Intelligent Control’2000, (IEEE-ISIC’2000)
-
-
Lendaris, G.G.1
Schultz, L.J.2
-
23
-
-
0033325645
-
A comparison of training algorithms for DHP adaptive critic neuro-control
-
Washington, DC
-
G. G. Lendaris, T. T. Shannon, and A. Rustan, A comparison of training algorithms for DHP adaptive critic neuro-control, Proc. of International Conference on Neural Networks’99 (IJCNN’99), Washington, DC, 1999.
-
(1999)
Proc. of International Conference on Neural Networks’99 (IJCNN’99)
-
-
Lendaris, G.G.1
Shannon, T.T.2
Rustan, A.3
-
24
-
-
0035792262
-
Dual heuristic programming for fuzzy control
-
Vancouver, B.C
-
G. G. Lendaris, T. T. Shannon, L. J. Schultz, S. Hutsell, and A. Rogers, Dual heuristic programming for fuzzy control, Proceedings of IFSA/NAFIPS Conference, Vancouver, B.C., 2001.
-
(2001)
Proceedings of IFSA/NAFIPS Conference
-
-
Lendaris, G.G.1
Shannon, T.T.2
Schultz, L.J.3
Hutsell, S.4
Rogers, A.5
-
28
-
-
0003988402
-
System analysis via integral quadratic constraints: Part II, Technical Report ISRN LUTFD2/TFRT-7559-SE
-
A. Megretski and A. Rantzer, System analysis via integral quadratic constraints: Part II, Technical Report ISRN LUTFD2/TFRT-7559-SE, Lund Institute of Technology, 1997.
-
(1997)
Lund Institute of Technology
-
-
Megretski, A.1
Rantzer, A.2
-
29
-
-
0036588686
-
Adaptive dynamic programming
-
J. J. Murray, C. Cox, G. G. Lendaris, and R. Saeks, Adaptive dynamic programming, IEEE Trans. on Systems, Man, and Cybernetics, Part C: Applications and Reviews, vol. 32, no. 2, pp. 140-153, 2002.
-
(2002)
IEEE Trans. on Systems, Man, and Cybernetics, Part C: Applications and Reviews
, vol.32
, Issue.2
, pp. 140-153
-
-
Murray, J.J.1
Cox, C.2
Lendaris, G.G.3
Saeks, R.4
-
30
-
-
0028137961
-
Adaptive control of nonlinear multivariable systems using neural networks
-
K. S. Narendra and S. Mukhopadhyay, Adaptive control of nonlinear multivariable systems using neural networks, Neural Networks, vol. 7, no. 5, pp. 737-752, 1994.
-
(1994)
Neural Networks
, vol.7
, Issue.5
, pp. 737-752
-
-
Narendra, K.S.1
Mukhopadhyay, S.2
-
33
-
-
0035078487
-
Intelligent control for autonomous aircraft missions
-
J. C. Neidhoefer and K. Krishnakumar, Intelligent control for autonomous aircraft missions, IEEE Trans. on Systems, Man, and Cybernetics, Part A, 2001.
-
(2001)
IEEE Trans. on Systems, Man, and Cybernetics, Part A
-
-
Neidhoefer, J.C.1
Krishnakumar, K.2
-
34
-
-
0042474292
-
The Truck Backer-Upper: An Example of Self Learning in Neural Networks
-
MIT Press, Cambridge, MA
-
D. Nguyen and B. Widrow, The Truck Backer-Upper: An Example of Self Learning in Neural Networks, Neural Networks for Control, MIT Press, Cambridge, MA, 1990.
-
(1990)
Neural Networks for Control
-
-
Nguyen, D.1
Widrow, B.2
-
36
-
-
0034852189
-
A systematic synthesis of optimal process control with neural networks
-
Washington, DC
-
R. Padhi and S. N. Balakrishnan, A systematic synthesis of optimal process control with neural networks, Proc. American Control Conference, Washington, DC, 2001.
-
(2001)
Proc. American Control Conference
-
-
Padhi, R.1
Balakrishnan, S.N.2
-
37
-
-
0035427378
-
Adaptive critic based optimal neuro control synthesis for distributed parameter systems
-
R. Padhi, S. N. Balakrishnan, and T. Randolph, Adaptive critic based optimal neuro control synthesis for distributed parameter systems, Automatica, vol. 37, pp. 1223-1234, 2001.
-
(2001)
Automatica
, vol.37
, pp. 1223-1234
-
-
Padhi, R.1
Balakrishnan, S.N.2
Randolph, T.3
-
42
-
-
0029592634
-
Adaptive critic designs: A case study for neurocontrol
-
D. Prokhorov, R. Santiago, and D. Wunsch, Adaptive critic designs: A case study for neurocontrol, Neural Networks, vol. 8, pp. 1367-1372, 1995.
-
(1995)
Neural Networks
, vol.8
, pp. 1367-1372
-
-
Prokhorov, D.1
Santiago, R.2
Wunsch, D.3
-
43
-
-
0031236002
-
Adaptive critic designs
-
D. Prokhorov and D. Wunsch, Adaptive critic designs, IEEE Trans. Neural Networks, vol. 8, no. 5, pp. 997-1007, 1997.
-
(1997)
IEEE Trans. Neural Networks
, vol.8
, Issue.5
, pp. 997-1007
-
-
Prokhorov, D.1
Wunsch, D.2
-
44
-
-
0035792648
-
A comparison of DHP based antecedent parameter tuning strategies for fuzzy control
-
Vancouver, B.C
-
A. Rogers, T. T. Shannon, and G. G. Lendaris, A comparison of DHP based antecedent parameter tuning strategies for fuzzy control, Proc. of IFSA/NAFIPS Conference, Vancouver, B.C., 2001.
-
(2001)
Proc. of IFSA/NAFIPS Conference
-
-
Rogers, A.1
Shannon, T.T.2
Lendaris, G.G.3
-
45
-
-
85036510264
-
Adaptive critic control of the power train in a hybrid electric vehicle
-
R. Saeks, C. Cox, J. Neidhoefer, and D. Escher, Adaptive critic control of the power train in a hybrid electric vehicle, Proc. SMCia Workshop, 1999.
-
(1999)
Proc. SMCia Workshop
-
-
Saeks, R.1
Cox, C.2
Neidhoefer, J.3
Escher, D.4
-
46
-
-
0347371777
-
Adaptive critic control of a hybrid electric vehicle
-
R. Saeks, C. Cox, J. Neidhoefer, P. Mays, and J. Murray, Adaptive critic control of a hybrid electric vehicle, IEEE Trans. on Intelligent Transportation Systems, vol. 3, no. 4, 2002.
-
(2002)
IEEE Trans. on Intelligent Transportation Systems
, vol.3
, Issue.4
-
-
Saeks, R.1
Cox, C.2
Neidhoefer, J.3
Mays, P.4
Murray, J.5
-
47
-
-
0007783847
-
New progress towards truly brain-like intelligent control
-
Erlbaum, Hillsdale, NJ
-
R. Santiago and P. Werbos, New progress towards truly brain-like intelligent control, Proc. WCNN’94, pp. 12-133, Erlbaum, Hillsdale, NJ, 1994.
-
(1994)
Proc. WCNN’94
, pp. 12-133
-
-
Santiago, R.1
Werbos, P.2
-
48
-
-
0035791958
-
Using DHP adaptive critic methods to tune a fuzzy automobile steering controller
-
Vancouver, B.C
-
L. J. Schultz, T. T. Shannon, and G. G. Lendaris, Using DHP adaptive critic methods to tune a fuzzy automobile steering controller, Proc. of IFSA/NAFIPS Conference, Vancouver, B.C., 2001.
-
(2001)
Proc. of IFSA/NAFIPS Conference
-
-
Schultz, L.J.1
Shannon, T.T.2
Lendaris, G.G.3
-
51
-
-
0033720311
-
Adaptive critic based approximate dynamic programming for tuning fuzzy controllers
-
T. T. Shannon and G. G. Lendaris, Adaptive critic based approximate dynamic programming for tuning fuzzy controllers, Proc. of IEEE-FUZZ 2000, 2000.
-
(2000)
Proc. of IEEE-FUZZ 2000
-
-
Shannon, T.T.1
Lendaris, G.G.2
-
53
-
-
70449391472
-
Adaptive critic based design of a fuzzy motor speed controller
-
Mexico City
-
T. T. Shannon and G. G. Lendaris, Adaptive critic based design of a fuzzy motor speed controller, Proc. of ISIC 2001, Mexico City, 2001.
-
(2001)
Proc. of ISIC 2001
-
-
Shannon, T.T.1
Lendaris, G.G.2
-
54
-
-
0141704189
-
Accelerated critic learning in approximate dynamic programming via value templates and perceptual learning
-
Portland, OR
-
T. T. Shannon, R. A. Santiago, and G. G. Lendaris, Accelerated critic learning in approximate dynamic programming via value templates and perceptual learning, Proc. of IJCNN’03, Portland, OR, 2003.
-
(2003)
Proc. of IJCNN’03
-
-
Shannon, T.T.1
Santiago, R.A.2
Lendaris, G.G.3
-
55
-
-
0035792268
-
Adaptive critic based adaptation of a fuzzy policy manager for a logistic system
-
Vancouver, B.C
-
S. Shervais and T. T. Shannon, Adaptive critic based adaptation of a fuzzy policy manager for a logistic system, Proc. of IFSA/NAFIPS Conference, Vancouver, B.C., 2001.
-
(2001)
Proc. of IFSA/NAFIPS Conference
-
-
Shervais, S.1
Shannon, T.T.2
-
56
-
-
0026385066
-
Reinforcement learning is direct adaptive optimal control
-
Boston
-
R. S. Sutton, A. G. Barto, and R. J. Williams, Reinforcement learning is direct adaptive optimal control, Proc. of the American Control Conference, Boston, pp. 2143-2146, 1991.
-
(1991)
Proc. of the American Control Conference
, pp. 2143-2146
-
-
Sutton, R.S.1
Barto, A.G.2
Williams, R.J.3
-
59
-
-
0002011091
-
A Menu of Designs for Reinforcement Learning Over Time
-
MIT Press, Cambridge, MA
-
P. J. Werbos, A Menu of Designs for Reinforcement Learning Over Time, Neural Networks for Control, pp. 67-95, MIT Press, Cambridge, MA, 1990.
-
(1990)
Neural Networks for Control
, pp. 67-95
-
-
Werbos, P.J.1
-
60
-
-
0002031779
-
Approximate Dynamic Programming for Real-Time Control and Neural Modeling
-
Van Nostrand Reinhold, New York
-
P. J. Werbos, Approximate Dynamic Programming for Real-Time Control and Neural Modeling, Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches, pp. 493-525, Van Nostrand Reinhold, New York, 1994.
-
(1994)
Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches
, pp. 493-525
-
-
Werbos, P.J.1
-
61
-
-
0015667648
-
Punish/reward: Learning with a critic in adaptive threshold systems
-
B. Widrow, N. Gupta, and S. Maitra, Punish/reward: Learning with a critic in adaptive threshold systems, IEEE Trans. on Systems, Man and Cybernetics, vol. 3, no. 5, pp. 455-465, 1973.
-
(1973)
IEEE Trans. on Systems, Man and Cybernetics
, vol.3
, Issue.5
, pp. 455-465
-
-
Widrow, B.1
Gupta, N.2
Maitra, S.3
-
62
-
-
0003515853
-
-
Prentice-Hall, Englewood Cliffs, NJ
-
J. Yen and R. Langari, Fuzzy Logic: Intelligence, Control and Information, Prentice-Hall, Englewood Cliffs, NJ, 1999.
-
(1999)
Fuzzy Logic: Intelligence, Control and Information
-
-
Yen, J.1
Langari, R.2