2004, Pages 3-44

ADP: Goals, opportunities and principles

Author keywords

Books; Communities; Conferences; Dynamic programming; Humans; Optimization; Proposals

Indexed keywords

ECOSYSTEMS; OPTIMIZATION

EID: 84893393162     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1109/9780470544785.ch1     Document Type: Chapter
Times cited : (44)

References (60)
  • 2
    • 47949095751
    • Optimization: A Foundation for understanding consciousness
    • D. Levine and W. Elsberry (eds.), Erlbaum
    • P. Werbos, “Optimization: A Foundation for understanding consciousness,” in D. Levine and W. Elsberry (eds.), Optimality in Biological and Artificial Networks, Erlbaum, 1997.
    • (1997) Optimality in Biological and Artificial Networks
    • Werbos, P.1
  • 6
    • 18644385222
    • Advanced Technology Paths to Global Climate Stability: Energy For a Greenhouse Planet
    • M. Hoffert et al., Advanced Technology Paths to Global Climate Stability: Energy For a Greenhouse Planet, Science, 2002.
    • (2002) Science
    • Hoffert, M.1
  • 7
    • 0003529238
    • Ph.D. Thesis, Committee on Applied Mathematics, Harvard U. Reprinted in its entirety in P. Werbos, The Roots of Backpropagation: From Ordered Derivatives to Neural Networks and Political Forecasting, Wiley, New York, 1994
    • P. Werbos, Beyond Regression: New Tools for Prediction and Analysis in the Behavioral Sciences, Ph.D. Thesis, Committee on Applied Mathematics, Harvard U., 1974. Reprinted in its entirety in P. Werbos, The Roots of Backpropagation: From Ordered Derivatives to Neural Networks and Political Forecasting, Wiley, New York, 1994.
    • (1974) Beyond Regression: New Tools for Prediction and Analysis in the Behavioral Sciences
    • Werbos, P.1
  • 8
    • 0025503558
    • Backpropagation through time: What it does and how to do it
    • Updated version reprinted as chapter 8 of [6]
    • P. Werbos, Backpropagation through time: what it does and how to do it, Proc. IEEE, vol. 78, no. 10, 1990. Updated version reprinted as chapter 8 of [6].
    • (1990) Proc. IEEE , vol.78 , Issue.10
    • Werbos, P.1
  • 9
    • 0031170885
    • NLq theory: A neural control framework with global asymptotic stability criteria
    • J. A. Suykens, B. DeMoor and J. Vandewalle, NLq theory: a neural control framework with global asymptotic stability criteria, Neural Networks, vol. 10, no. 4, pp. 615-637, 1997.
    • (1997) Neural Networks , vol.10 , Issue.4 , pp. 615-637
    • Suykens, J.A.1    Demoor, B.2    Vandewalle, J.3
  • 17
    • 0025389734
    • Rational approaches to identifying policy objectives
    • P. Werbos, Rational approaches to identifying policy objectives, Energy: The International Journal, vol. 15, no. 3/4, pp. 171-185, 1990.
    • (1990) Energy: The International Journal , vol.15 , Issue.3-4 , pp. 171-185
    • Werbos, P.1
  • 20
    • 0003754075
    • Ph.D. thesis and Report No. 469, Department of Electrical Engineering, Linköping U., 58183, Linköping, Sweden
    • T. Landelius, Reinforcement Learning and Distributed Local Model Synthesis, Ph.D. thesis and Report No. 469, Department of Electrical Engineering, Linköping U., 58183, Linköping, Sweden.
    • Reinforcement Learning and Distributed Local Model Synthesis
    • Landelius, T.1
  • 22
    • 0003583154
    • Prentice-Hall, Englewood Cliffs, NJ, 1989; Hemisphere, Washington, DC
    • K. Narendra and A. Annaswamy, Stable Adaptive Systems, Prentice-Hall, Englewood Cliffs, NJ, 1989; Hemisphere, Washington, DC, 1982.
    • (1982) Stable Adaptive Systems
    • Narendra, K.1    Annaswamy, A.2
  • 25
    • 34249833101
    • Technical note: Q-learning
    • Watkins W. and Dayan D., Technical note: Q-learning, Machine Learning, vol. 8, no. 3/4, pp. 279-292, 1992.
    • (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 279-292
    • Watkins, W.1    Dayan, D.2
  • 26
    • 0024888479
    • Neural networks for control and system identification
    • IEEE
    • P. Werbos, Neural networks for control and system identification, IEEE Proc. CDC89, IEEE, 1989.
    • (1989) IEEE Proc. CDC89
    • Werbos, P.1
  • 28
    • 0020970738
    • Neuronlike adaptive elements that can solve difficult learning control problems
    • A. Barto, R. Sutton and C. Anderson, Neuronlike adaptive elements that can solve difficult learning control problems, IEEE Trans. SMC, vol. 13, no. 5, pp. 834-846, 1983.
    • (1983) IEEE Trans. SMC , vol.13 , Issue.5 , pp. 834-846
    • Barto, A.1    Sutton, R.2    Anderson, C.3
  • 29
    • 0040921652
    • The elements of intelligence
    • P. Werbos, The elements of intelligence, Cybernetica (Namur), no. 3, 1968.
    • (1968) Cybernetica (Namur) , Issue.3
    • Werbos, P.1
  • 31
    • 0029515159
    • Information state for robust control of set-valued discrete time systems
    • IEEE
    • J. S. Baras and N. S. Patel, Information state for robust control of set-valued discrete time systems, Proc. 34th Conf. Decision and Control (CDC), IEEE, pp. 2302, 1995.
    • (1995) Proc. 34th Conf. Decision and Control (CDC) , pp. 2302
    • Baras, J.S.1    Patel, N.S.2
  • 33
    • 0003410791
    • New York: Springer, Second Edition. Also see H. Ritter, T. Martinetz, and K. Schulten, Neural Computation and Self-Organizing Maps, Addison-Wesley, Reading, MA, 1992
    • T. Kohonen, Self-Organizing Maps, New York: Springer, 1997, Second Edition. Also see H. Ritter, T. Martinetz, and K. Schulten, Neural Computation and Self-Organizing Maps, Addison-Wesley, Reading, MA, 1992.
    • (1997) Self-Organizing Maps
    • Kohonen, T.1
  • 35
    • 34548729060
    • Changes in global policy analysis procedures suggested by new methods of optimization
    • P. Werbos, Changes in global policy analysis procedures suggested by new methods of optimization, Policy Analysis and Information Systems, vol. 3, no. 1, 1979.
    • (1979) Policy Analysis and Information Systems , vol.3 , Issue.1
    • Werbos, P.1
  • 36
    • 85036510816
    • Backpropagation: General Principles and Issues for Biology
    • D. Fogel and C. Robinson (eds.), IEEE
    • P. Werbos, Backpropagation: General Principles and Issues for Biology, in D. Fogel and C. Robinson (eds.), Computational Intelligence: The Experts Speak, IEEE, 2003.
    • (2003) Computational Intelligence: The Experts Speak
    • Werbos, P.1
  • 37
    • 0027599793
    • Universal approximation bounds for superpositions of a sigmoidal function
    • A. R. Barron, Universal approximation bounds for superpositions of a sigmoidal function, IEEE Trans. Info. Theory, vol. 39, no. 3, pp. 930-945, 1993.
    • (1993) IEEE Trans. Info. Theory , vol.39 , Issue.3 , pp. 930-945
    • Barron, A.R.1
  • 38
    • 84974765089
    • Elastic fuzzy logic: A better fit to neurocontrol and true intelligence
    • P. Werbos, Elastic fuzzy logic: a better fit to neurocontrol and true intelligence, J. Intelligent & Fuzzy Systems, vol. 1, no. 4, 1993.
    • (1993) J. Intelligent & Fuzzy Systems , vol.1 , Issue.4
    • Werbos, P.1
  • 39
    • 0005942467
    • Neural network design for J function approximation in dynamic programming
    • physically, as adap-org/9806001 at arXiv.org
    • X. Z. Pang and P. Werbos, Neural network design for J function approximation in dynamic programming, Math. Modelling and Scientific Computing, vol. 5, no. 2/3, 1996 (physically 1998). Available also as adap-org/9806001 at arXiv.org.
    • (1996) Math. Modelling and Scientific Computing , vol.5 , Issue.2-3
    • Pang, X.Z.1    Werbos, P.2
  • 40
    • 0030421566
    • Generalized maze navigation: SRN critics solve what feedforward or Hebbian nets cannot
    • Beijing, IEEE
    • P. Werbos and X. Z. Pang, “Generalized maze navigation: SRN critics solve what feedforward or Hebbian nets cannot,” Proc. Conf. Systems, Man and Cybernetics (SMC), Beijing, IEEE, 1996.
    • (1996) Proc. Conf. Systems, Man and Cybernetics (SMC)
    • Werbos, P.1    Pang, X.Z.2
  • 41
    • 0040715861
    • Implementing Back-Propagation-Through-Time Learning Algorithm Using Cellular Neural Networks
    • T. Yang and L. O. Chua, Implementing Back-Propagation-Through-Time Learning Algorithm Using Cellular Neural Networks, Int'l J. Bifurcation and Chaos, vol. 9, no. 9, pp. 1041-1074, 1999.
    • (1999) Int'l J. Bifurcation and Chaos , vol.9 , Issue.9 , pp. 1041-1074
    • Yang, T.1    Chua, L.O.2
  • 42
    • 85036532852
    • P. Werbos posted at http://www.iamcm.org, and [47].
    • Werbos, P.1
  • 43
    • 0025229247
    • Consistency of HDP applied to a simple reinforcement learning problem
    • P. Werbos, Consistency of HDP applied to a simple reinforcement learning problem, Neural Networks, 1990.
    • (1990) Neural Networks
    • Werbos, P.1
  • 44
    • 0002557583
    • Advanced forecasting for global crisis warning and models of intelligence
    • P. Werbos, Advanced forecasting for global crisis warning and models of intelligence, General Systems Yearbook, 1977.
    • (1977) General Systems Yearbook
    • Werbos, P.1
  • 45
    • 0015667648
    • Punish/reward: Learning with a Critic in adaptive threshold systems
    • B. Widrow, N. Gupta and S. Maitra, Punish/reward: learning with a Critic in adaptive threshold systems, IEEE Trans. SMC, vol. 5, pp. 455-465, 1973.
    • (1973) IEEE Trans. SMC , vol.5 , pp. 455-465
    • Widrow, B.1    Gupta, N.2    Maitra, S.3
  • 46
    • 0023169119
    • Building and understanding adaptive systems: A statistical/numerical approach to factory automation and brain research
    • P. Werbos, Building and understanding adaptive systems: A statistical/numerical approach to factory automation and brain research, IEEE Trans. SMC, 1987.
    • (1987) IEEE Trans. SMC
    • Werbos, P.1
  • 47
    • 0031143730
    • An analysis of temporal-difference learning with function approximation
    • J. Tsitsiklis and B. Van Roy, An analysis of temporal-difference learning with function approximation, IEEE Trans. Auto. Control, vol. 42, no. 5, 1997.
    • (1997) IEEE Trans. Auto. Control , vol.42 , Issue.5
    • Tsitsiklis, J.1    Van Roy, B.2
  • 48
    • 85036564722
    • Artificial neural networks in optimization and applications
    • P. M. Pardalos and M. G. C. Resende (eds.), Cambridge University Press
    • T. B. Trafalis and S. Kasap, Artificial neural networks in optimization and applications, Handbook of Applied Optimization, in P. M. Pardalos and M. G. C. Resende (eds.), Cambridge University Press, 2000.
    • (2000) Handbook of Applied Optimization
    • Trafalis, T.B.1    Kasap, S.2
  • 49
    • 84882243223
    • Intelligent control using neural networks
    • M. Gupta and N. Sinha (eds.), IEEE Press
    • K. Narendra and S. Mukhopadhyay, Intelligent control using neural networks, in M. Gupta and N. Sinha (eds.), Intelligent Control Systems, IEEE Press, 1996.
    • (1996) Intelligent Control Systems
    • Narendra, K.1    Mukhopadhyay, S.2
  • 50
    • 34548205411
    • A Brain-Like Design To Learn Optimal Decision Strategies in Complex Environments
    • M. Karny, K. Warwick and V. Kurkova (eds.), Springer, London
    • P. Werbos, A Brain-Like Design To Learn Optimal Decision Strategies in Complex Environments, in M. Karny, K. Warwick and V. Kurkova (eds.), Dealing with Complexity: A Neural Networks Approach, Springer, London, 1998.
    • (1998) Dealing with Complexity: A Neural Networks Approach
    • Werbos, P.1
  • 53
    • 33749957677
    • Master’s Thesis, chapter 5, Dept. of Electronic Mechanical Engineering, Nagoya University, Japan
    • T. Shibata, Hierarchical Intelligent Control of Robotic Motion, Master’s Thesis, chapter 5, Dept. of Electronic Mechanical Engineering, Nagoya University, Japan, 1992.
    • (1992) Hierarchical Intelligent Control of Robotic Motion
    • Shibata, T.1
  • 54
    • 0001773535
    • Applications of advances in nonlinear sensitivity analysis
    • R. Drenick and F. Kozin (eds.), Springer, reprinted as chapter 7 in [6]
    • P. Werbos, Applications of advances in nonlinear sensitivity analysis, in R. Drenick and F. Kozin (eds.), System Modeling and Optimization: Proc. IFIP Conf. (1981), Springer 1982; reprinted as chapter 7 in [6].
    • (1982) System Modeling and Optimization: Proc. IFIP Conf. (1981)
    • Werbos, P.1
  • 57
    • 85032941146
    • Erlbaum: Hillsdale, NJ, See also earlier books edited by Pribram in the same series from Erlbaum
    • K. H. Pribram, (ed.), Brain and Values, Erlbaum: Hillsdale, NJ, 1998. (See also earlier books edited by Pribram in the same series from Erlbaum.)
    • (1998) Brain and Values
    • Pribram, K.H.1
  • 58
    • 84869422138
    • Multiple Models for Approximate Dynamic Programming and True Intelligent Control: Why and How
    • K. Narendra (ed.), New Haven: K. Narendra, EE Dept., Yale University
    • P. Werbos, Multiple Models for Approximate Dynamic Programming and True Intelligent Control: Why and How, in K. Narendra (ed.), Proc. 10th Yale Conf. on Learning and Adaptive Systems, New Haven: K. Narendra, EE Dept., Yale University, 1998.
    • (1998) Proc. 10th Yale Conf. on Learning and Adaptive Systems
    • Werbos, P.1
  • 59
    • 85036506254
    • TD Models: Modeling the World at a Mixture of Time Scales
    • University of Massachusetts at Amherst, December, later published in Proc. 12th Int. Conf. Machine Learning, pp. 531-539, Morgan Kaufmann, 1995
    • R. Sutton, TD Models: Modeling the World at a Mixture of Time Scales, CMP-SCI Technical Report, pp. 95-114, University of Massachusetts at Amherst, December 1995, later published in Proc. 12th Int. Conf. Machine Learning, pp. 531-539, Morgan Kaufmann, 1995.
    • (1995) CMP-SCI Technical Report , pp. 95-114
    • Sutton, R.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.