메뉴 건너뛰기




Volumn , Issue , 2004, Pages 65-95

Model-based adaptive critic designs

Author keywords

Adaptation model; Dynamic programming; Equations; Heuristic algorithms; Mathematical model; Optimal control; Trajectory

Indexed keywords

BACKPROPAGATION; HEURISTIC ALGORITHMS; HEURISTIC PROGRAMMING; MATHEMATICAL MODELS; TRAJECTORIES;

EID: 85032189594     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1109/9780470544785.ch3     Document Type: Chapter
Times cited : (47)

References (41)
  • 2
    • 0016115973 scopus 로고
    • Dual effect, certainty equivalence, and separation in stochastic control
    • Y. Bar-Shalom and E. Tse, Dual effect, certainty equivalence, and separation in stochastic control, IEEE Trans. Automatic Control, vol. 19, no. 5, pp. 494-500, 1974.
    • (1974) IEEE Trans. Automatic Control , vol.19 , Issue.5 , pp. 494-500
    • Bar-Shalom, Y.1    Tse, E.2
  • 3
    • 85012688561 scopus 로고
    • Princeton University Press, Princeton, NJ
    • R. Bellman, Dynamic Programming, Princeton University Press, Princeton, NJ 1957.
    • (1957) Dynamic Programming
    • Bellman, R.1
  • 7
    • 0015615562 scopus 로고
    • Wide-sense adaptive dual control for nonlinear stochastic systems
    • E. Tse, Y. Bar-Shalom, and L. Meier, III, Wide-sense adaptive dual control for nonlinear stochastic systems, IEEE Trans. Automatic Control, vol. 18, no. 2, pp. 98-108, 1973.
    • (1973) IEEE Trans. Automatic Control , vol.18 , Issue.2 , pp. 98-108
    • Tse, E.1    Bar-Shalom, Y.2    Meier, L.3
  • 8
    • 0019624645 scopus 로고
    • Stochastic dynamic programming: Caution and probing
    • Y. Bar-Shalom, Stochastic dynamic programming: caution and probing, IEEE Trans. Automatic Control, vol. 26, no. 5, pp. 1184-1195,1981.
    • (1981) IEEE Trans. Automatic Control , vol.26 , Issue.5 , pp. 1184-1195
    • Bar-Shalom, Y.1
  • 9
    • 0025462720 scopus 로고
    • Receding horizon control of non-linear systems
    • D. Q. Mayne and H. Michalska, Receding horizon control of non-linear systems, IEEE Trans. Automatic Control, vol. 35, no. 5, pp. 814-824, 1990.
    • (1990) IEEE Trans. Automatic Control , vol.35 , Issue.5 , pp. 814-824
    • Mayne, D.Q.1    Michalska, H.2
  • 11
    • 0036563963 scopus 로고    scopus 로고
    • Classical/neural synthesis of nonlinear control systems, Journal of Guidance
    • S. Ferrari and R. F. Stengel, Classical/neural synthesis of nonlinear control systems, Journal of Guidance, Control and Dynamics, vol. 25, no. 3, pp. 442-448, 2002.
    • (2002) Control and Dynamics , vol.25 , Issue.3 , pp. 442-448
    • Ferrari, S.1    Stengel, R.F.2
  • 16
    • 0023169119 scopus 로고
    • Building and understanding adaptive systems: A statisti-cal/numerical approach for factory automation and brain research
    • P. J. Werbos, Building and understanding adaptive systems: a statisti-cal/numerical approach for factory automation and brain research, IEEE Trans. Syst., Man, Cybern., vol. 17, no. 1, pp. 7-20, 1987.
    • (1987) IEEE Trans. Syst., Man, Cybern. , vol.17 , Issue.1 , pp. 7-20
    • Werbos, P.J.1
  • 17
  • 19
    • 0002437599 scopus 로고
    • Neurocontrol and Supervised Learning: An Overview and Evaluation
    • D. A. White and D. A. Sofge (eds.), Van Nostrand Reinhold, New York
    • P. J. Werbos, Neurocontrol and Supervised Learning: An Overview and Evaluation, Handbook of Intelligent Control, D. A. White and D. A. Sofge (eds.), pp. 65-86, Van Nostrand Reinhold, New York, 1992.
    • (1992) Handbook of Intelligent Control , pp. 65-86
    • Werbos, P.J.1
  • 21
    • 0002011091 scopus 로고
    • A Menu of Designs for Reinforcement Learning Over Time
    • W. T. Miller, R. S. Sutton, and P. J. Werbos (eds.), MIT Press, Cambridge, MA
    • P. J. Werbos, A Menu of Designs for Reinforcement Learning Over Time, Neural Networks for Control, W. T. Miller, R. S. Sutton, and P. J. Werbos (eds.), pp. 6796, MIT Press, Cambridge, MA, 1990.
    • (1990) Neural Networks for Control , pp. 6796
    • Werbos, P.J.1
  • 22
    • 0002557583 scopus 로고    scopus 로고
    • Advanced Forecasting Methods for Global Crisis Warning and Models of Intelligence
    • P. J. Werbos, Advanced Forecasting Methods for Global Crisis Warning and Models of Intelligence, General Systems Yearbook, 1997.
    • (1997) General Systems Yearbook
    • Werbos, P.J.1
  • 24
    • 0036565019 scopus 로고    scopus 로고
    • Comparison of heuristic dynamic programming and dual heuristic programming adaptive critics for neurocontrol of a turbogenerator
    • G. K. Venayagamoorthy, R. G. Harley, and D. C. Wunsch, Comparison of heuristic dynamic programming and dual heuristic programming adaptive critics for neurocontrol of a turbogenerator, IEEE Trans. Neural Networks, vol. 13, no. 3, pp. 764-773, 2002.
    • (2002) IEEE Trans. Neural Networks , vol.13 , Issue.3 , pp. 764-773
    • Venayagamoorthy, G.K.1    Harley, R.G.2    Wunsch, D.C.3
  • 25
    • 0020970738 scopus 로고
    • Neuronlike elements that can solve difficult learning control problems
    • A. Barto, R. Sutton, and C. Anderson, Neuronlike elements that can solve difficult learning control problems, IEEE Trans. Systems, Man, and Cybernetics, vol. 3, no. 5, pp. 834-846,1983.
    • (1983) IEEE Trans. Systems, Man, and Cybernetics , vol.3 , Issue.5 , pp. 834-846
    • Barto, A.1    Sutton, R.2    Anderson, C.3
  • 26
    • 0001773535 scopus 로고
    • Applications of advances in nonlinear sensitivity analysis
    • R. F. Drenick and F. Kozin (eds.), Springer-Verlag, New York
    • P. J. Werbos, Applications of advances in nonlinear sensitivity analysis, System Modeling and Optimization: Proc. Of the I Oth IFIP Conference, R. F. Drenick and F. Kozin (eds.), Springer-Verlag, New York, 1982.
    • (1982) System Modeling and Optimization: Proc. Of the I Oth IFIP Conference
    • Werbos, P.J.1
  • 27
    • 0004049893 scopus 로고
    • Ph.D. Thesis, Cambridge University, Cambridge, England
    • C. Watkins, Learning from Delayed Rewards, Ph.D. Thesis, Cambridge University, Cambridge, England, 1989.
    • (1989) Learning from Delayed Rewards
    • Watkins, C.1
  • 29
    • 85036554893 scopus 로고
    • Functional Approximation and Dynamic Programming
    • Athena Scientific, Belmont, MA
    • R. E. Bellman and S. E. Dreyfus, Functional Approximation and Dynamic Programming, Math. Tables and Other Aids Comp., Athena Scientific, Belmont, MA, 1995.
    • (1995) Math. Tables and Other Aids Comp.
    • Bellman, R.E.1    Dreyfus, S.E.2
  • 33
    • 2342498922 scopus 로고    scopus 로고
    • Version 5, September
    • The MathWorks, Inc., Getting Started with MATLAB, http://www.mathworks.com, Version 5, September 1998.
    • (1998) Getting Started with MATLAB
  • 34
    • 0031143730 scopus 로고    scopus 로고
    • An analysis of temp oral-difference learning with function approximation
    • J. N. Tsitsiklis and B. Van Roy, An analysis of temp oral-difference learning with function approximation, IEEE Trans. Automatic Control, vol. 42, no. 5, pp. 674-690,1997.
    • (1997) IEEE Trans. Automatic Control , vol.42 , Issue.5 , pp. 674-690
    • Tsitsiklis, J.N.1    Van Roy, B.2
  • 35
    • 0025503558 scopus 로고
    • Backpropagation through time: What it does and how to do it
    • P. J. Werbos, Backpropagation through time: what it does and how to do it, Proc. Of the IEEE, vol. 78, no. 10, pp. 1550-1560, 1990.
    • (1990) Proc. Of the IEEE , vol.78 , Issue.10 , pp. 1550-1560
    • Werbos, P.J.1
  • 37
    • 0033283685 scopus 로고    scopus 로고
    • Adaptive critic based neural networks for control-constrained agile missile control
    • San Diego
    • D. Han and S. N. Balakrishnan, Adaptive critic based neural networks for control-constrained agile missile control, Proc. American Control Conference, San Diego, pp. 2600-2604,1999.
    • (1999) Proc. American Control Conference , pp. 2600-2604
    • Han, D.1    Balakrishnan, S.N.2
  • 38
    • 0030703479 scopus 로고    scopus 로고
    • Adaptive critic based neurocontroller for autolanding of aircraft
    • Albuquerque, NM
    • G. Saini and S. N. Balakrishnan, Adaptive critic based neurocontroller for autolanding of aircraft, Proc. American Control Conference, Albuquerque, NM, pp. 1081-1085,1997.
    • (1997) Proc. American Control Conference , pp. 1081-1085
    • Saini, G.1    Balakrishnan, S.N.2
  • 40
    • 0030196717 scopus 로고    scopus 로고
    • Adaptive-critic-based neural networks for aircraft optimal control
    • S. N. Balakrishnan and V. Biega, Adaptive-critic-based neural networks for aircraft optimal control, Journal of Guidance, Control, and Dynamics, vol. 19, no. 4, pp. 893-898,1996.
    • (1996) Journal of Guidance, Control, and Dynamics , vol.19 , Issue.4 , pp. 893-898
    • Balakrishnan, S.N.1    Biega, V.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.