메뉴 건너뛰기




Volumn , Issue , 2004, Pages 97-124

Guidance in the use of adaptive critics for control

Author keywords

Adaptation model; Dynamic programming; Equations; Function approximation; Learning; Training

Indexed keywords

PERSONNEL TRAINING;

EID: 80053055883     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1109/9780470544785.ch4     Document Type: Chapter
Times cited : (37)

References (62)
  • 4
    • 0346686091 scopus 로고    scopus 로고
    • A new method for stability analyis of nonlinear discrete-time systems
    • N. Barabanov and D. Prokhorov, A new method for stability analyis of nonlinear discrete-time systems, IEEE Trans. Automatic Control, vol. 48, no. 12, 2003.
    • (2003) IEEE Trans. Automatic Control , vol.48 , Issue.12
    • Barabanov, N.1    Prokhorov, D.2
  • 5
    • 0003544743 scopus 로고
    • chapter Reinforcement Learning and Adaptive Critic Methods, Van Nostrand-Reinhold, New York
    • A. G. Barto, Handbook of Intelligent Control, chapter Reinforcement Learning and Adaptive Critic Methods, pp. 469-491, Van Nostrand-Reinhold, New York, 1992.
    • (1992) Handbook of Intelligent Control , pp. 469-491
    • Barto, A.G.1
  • 7
    • 85012688561 scopus 로고
    • Princeton University Press, Princeton, NJ
    • R. E. Bellman, Dynamic Programming, Princeton University Press, Princeton, NJ, 1957.
    • (1957) Dynamic Programming
    • Bellman, R.E.1
  • 9
    • 0033750123 scopus 로고    scopus 로고
    • Neurocontroller for fuzzy ball-and-beam systems with nonlinear, nonuniform friction
    • P. Eaton, D. Prokhorov, and D. Wunsch, Neurocontroller for fuzzy ball-and-beam systems with nonlinear, nonuniform friction, IEEE Trans. Neural Networks, pp. 423-435,2000.
    • (2000) IEEE Trans. Neural Networks , pp. 423-435
    • Eaton, P.1    Prokhorov, D.2    Wunsch, D.3
  • 10
  • 11
    • 0033717755 scopus 로고    scopus 로고
    • Robust adaptive critic based neurocontrollers for systems with input uncertainties
    • Z. Huang and S.N. Balakrishnan, Robust adaptive critic based neurocontrollers for systems with input uncertainties, Proc. OfIJCNN’2000, pp. B-263, 2000.
    • (2000) Proc. OfIJCNN’2000
    • Huang, Z.1    Balakrishnan, S.N.2
  • 13
    • 0030677953 scopus 로고    scopus 로고
    • Immunized adaptive critics
    • Houston, A version of this was presented at ANNIE’96, November, St. Louis, MO
    • K. Krishna Kumar and J. Neidhoefer, Immunized adaptive critics, invited session on Adaptive Critics, ICNN’97, Houston, 1997. A version of this was presented at ANNIE’96, November, St. Louis, MO.
    • (1997) Adaptive Critics, ICNN’97
    • Krishna Kumar, K.1    Neidhoefer, J.2
  • 28
    • 0003988402 scopus 로고    scopus 로고
    • System analysis via integral quadratic constraints: Part II, Technical Report ISRN LUTFD2/TFRT-7559-SE
    • A. Megretski and A. Rantzer, System analysis via integral quadratic constraints: Part II, Technical Report ISRN LUTFD2/TFRT-7559-SE, Lund Institute of Technology, 1997.
    • (1997) Lund Institute of Technology
    • Megretski, A.1    Rantzer, A.2
  • 30
    • 0028137961 scopus 로고
    • Adaptive control of nonlinear multivariable systems using neural networks
    • K. S. Narendra and S. Mukhopadhyay, Adaptive control of nonlinear multivariable systems using neural networks, Neural Networks, vol. 7, no. 5, pp. 737-752, 1994.
    • (1994) Neural Networks , vol.7 , Issue.5 , pp. 737-752
    • Narendra, K.S.1    Mukhopadhyay, S.2
  • 34
    • 0042474292 scopus 로고
    • The Truck Backer-Upper: An Example of Self Learning in Neural Networks
    • MIT Press, Cambridge, MA
    • D. Nguyen and B. Widrow, The Truck Backer-Upper: An Example of Self Learning in Neural Networks, Neural Networks for Control, MIT Press, Cambridge, MA, 1957.
    • (1957) Neural Networks for Control
    • Nguyen, D.1    Widrow, B.2
  • 36
    • 0034852189 scopus 로고    scopus 로고
    • A systematic synthesis of optimal process control with neural networks
    • Washington, DC
    • R. Padhi and S. N. Balakrishnan, A systematic synthesis of optimal process control with neural networks, Proc. American Control Conference, Washington, DC, 2001.
    • (2001) Proc. American Control Conference
    • Padhi, R.1    Balakrishnan, S.N.2
  • 37
    • 0035427378 scopus 로고    scopus 로고
    • Adaptive critic based optimal neuro control synthesis for distributed parameter systems
    • R. Padhi, S. N. Balakrishnan, and T. Randolph, Adaptive critic based optimal neuro control synthesis for distributed parameter systems, Automatica, vol. 37, pp.1223-1234,2001.
    • (2001) Automatica , vol.37 , pp. 1223-1234
    • Padhi, R.1    Balakrishnan, S.N.2    Randolph, T.3
  • 42
    • 0029592634 scopus 로고
    • Adaptive critic designs: A case study for neurocontrol
    • D. Prokhorov, R. Santiago, and D. Wunsch, Adaptive critic designs: A case study for neurocontrol, Neural Networks, vol. 8, pp. 1367-1372,1995.
    • (1995) Neural Networks , vol.8 , pp. 1367-1372
    • Prokhorov, D.1    Santiago, R.2    Wunsch, D.3
  • 44
    • 0035792648 scopus 로고    scopus 로고
    • A comparison of dhp based antecedent parameter tuning strategies for fuzzy control
    • Vancouver, B.C
    • A. Rogers, T. T. Shannon, and G. G. Lendaris, A comparison of dhp based antecedent parameter tuning strategies for fuzzy control, Proc. Of IFSA/NAFIPS Conference, Vancouver, B.C., 2001.
    • (2001) Proc. Of IFSA/NAFIPS Conference
    • Rogers, A.1    Shannon, T.T.2    Lendaris, G.G.3
  • 45
    • 85036510264 scopus 로고    scopus 로고
    • Adaptive critic control of the power train in a hybrid electric vehicle
    • R. Saeks, C. Cox, J. Neidhoefer, and D. Escher, Adaptive critic control of the power train in a hybrid electric vehicle, Proc. SMCia Workshop, 1999.
    • (1999) Proc. Smcia Workshop
    • Saeks, R.1    Cox, C.2    Neidhoefer, J.3    Escher, D.4
  • 47
    • 0007783847 scopus 로고
    • New progress towards truly brain-like intelligent control
    • Erlbaum, Hillsdale, NJ
    • R. Santiago and P. Werbos, New progress towards truly brain-like intelligent control, Proc. WCNN’94, pp. 12-133, Erlbaum, Hillsdale, NJ, 1994.
    • (1994) Proc. WCNN’94 , pp. 12-133
    • Santiago, R.1    Werbos, P.2
  • 48
    • 0035791958 scopus 로고    scopus 로고
    • Using dhp adaptive critic methods to tune a fuzzy automobile steering controller
    • Vancouver, B.C
    • L. J. Schultz, T. T. Shannon, and G. G. Lendaris, Using dhp adaptive critic methods to tune a fuzzy automobile steering controller, Proc. Of IFSA/NAFIPS Conference, Vancouver, B.C., 2001.
    • (2001) Proc. Of IFSA/NAFIPS Conference
    • Schultz, L.J.1    Shannon, T.T.2    Lendaris, G.G.3
  • 51
    • 0033720311 scopus 로고    scopus 로고
    • Adaptive critic based approximate dynamic programming for tuning fuzzy controllers
    • T. T. Shannon and G. G. Lendaris, Adaptive critic based approximate dynamic programming for tuning fuzzy controllers, Proc. Of IEEE-FUZZ 2000, 2000.
    • (2000) Proc. Of IEEE-FUZZ 2000
    • Shannon, T.T.1    Lendaris, G.G.2
  • 53
    • 70449391472 scopus 로고    scopus 로고
    • Adaptive critic based design of a fuzzy motor speed controller
    • Mexico City
    • T. T. Shannon and G. G. Lendaris, Adaptive critic based design of a fuzzy motor speed controller, Proc. OfISIC2001, Mexico City, 2001.
    • (2001) Proc. Ofisic2001
    • Shannon, T.T.1    Lendaris, G.G.2
  • 54
    • 0141704189 scopus 로고    scopus 로고
    • Accelerated critic learning in approximate dynamic programming via value templates and perceptual learning
    • Portland, OR
    • T. T. Shannon, R. A. Santiago, and G. G. Lendaris, Accelerated critic learning in approximate dynamic programming via value templates and perceptual learning, Proa of IJCNN’03, Portland, OR, 2003.
    • (2003) Proa of IJCNN’03
    • Shannon, T.T.1    Santiago, R.A.2    Lendaris, G.G.3
  • 55
    • 0035792268 scopus 로고    scopus 로고
    • Adaptive critic based adaptation of a fuzzy policy manager for a logistic system
    • Vancouver, B.C
    • S. Shervais and T. T. Shannon, Adaptive critic based adaptation of a fuzzy policy manager for a logistic system, Proa of IFSA /NAFIPS Conference, Vancouver, B.C., 2001.
    • (2001) Proa of IFSA /NAFIPS Conference
    • Shervais, S.1    Shannon, T.T.2
  • 59
    • 0002011091 scopus 로고
    • A Menu of Designs for Reinforcement Learning Over Time
    • MIT Press, Cambridge, MA
    • P. J. Werbos, A Menu of Designs for Reinforcement Learning Over Time, Neural Networks for Control, pp. 67-95, MIT Press, Cambridge, MA, 1990.
    • (1990) Neural Networks for Control , pp. 67-95
    • Werbos, P.J.1
  • 60
    • 0002031779 scopus 로고
    • Approximate Dynamic Programming for Real-Time Control and Neural Modeling
    • Van Nostrand Reinhold, New York
    • P. J. Werbos, Approximate Dynamic Programming for Real-Time Control and Neural Modeling, Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches, pp. 493-525, Van Nostrand Reinhold, New York, 1994.
    • (1994) Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches , pp. 493-525
    • Werbos, P.J.1
  • 61
    • 0015667648 scopus 로고
    • Punish/reward: Learning with a critic in adaptive threshhold systems
    • B. Widrow, N. Gupta, and S. Maitra, Punish/reward: Learning with a critic in adaptive threshhold systems, IEEE Trans, on Systems, Man and Cybernetics, vol. 3, no. 5, pp. 455-465,1973.
    • (1973) IEEE Trans, on Systems, Man and Cybernetics , vol.3 , Issue.5 , pp. 455-465
    • Widrow, B.1    Gupta, N.2    Maitra, S.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.