SCOPUS 정보 검색 플랫폼

Handbook of Learning and Approximate Dynamic Programming

Volumn , Issue , 2004, Pages 3-44

ADP: Goals, opportunities and principles

(1) Werbos, Paul a

a NATIONAL SCIENCE FOUNDATION (United States)

Author keywords

Books; Communities; Conferences; Dynamic programming; Humans; Optimization; Proposals

Indexed keywords

ECOSYSTEMS; OPTIMIZATION;

BOOKS; CONFERENCES; HUMANS; PROPOSALS;

DYNAMIC PROGRAMMING;

EID: 84893393162 PISSN: None EISSN: None Source Type: Book
DOI: 10.1109/9780470544785.ch1 Document Type: Chapter

Times cited : (44)

References (60)

1
- 79959415574
- What do neural nets and quantum theory tell us about mind and reality
- K. Yasue, M. Jibu and T Della Senta (eds.), John Benjamins Pub Co
- P. Werbos, What do neural nets and quantum theory tell us about mind and reality, in K. Yasue, M. Jibu and T Della Senta (eds.), No Matter, Never Mind: Proc. Of Toward a Science of Consciousness: Fundamental Approaches (Tokyo’99), John Benjamins Pub Co, 2002.
- (2002) No Matter, Never Mind: Proc. Of Toward a Science of Consciousness: Fundamental Approaches (Tokyo’99)
- Werbos, P.¹

2
- 47949095751
- Optimization: A Foundation for understanding consciousness
- D. Levine and W. Elsberry (eds.), Erlbaum
- P. Werbos, “Optimization: A Foundation for understanding consciousness,” in D. Levine and W. Elsberry (eds.), Optimality in Biological and Artificial Networks, Erlbaum, 1997.
- (1997) Optimality in Biological and Artificial Networks
- Werbos, P.¹

3
- 0004059199
- MIT Press, Cambridge, MA, now in paper
- W. T. Miller, R. Sutton and P. Werbos (eds.), Neural Networks for Control, MIT Press, Cambridge, MA, 1990, now in paper.
- (1990) Neural Networks for Control
- Miller, W.T.¹ Sutton, R.² Werbos, P.³

4
- 0003544743
- Van Nostrand, New York
- D. White and D. Sofge (eds.), Handbook of Intelligent Control, Van Nostrand, New York, 1992.
- (1992) Handbook of Intelligent Control
- White, D.¹ Sofge, D.²

5
- 0003756026
- Neurocontrollers
- J. Webster (eds.), Wiley, New York
- P. Werbos, Neurocontrollers, in J. Webster (eds.), Encyclopedia of Electrical and Electronics Engineering, Wiley, New York, 1999.
- (1999) Encyclopedia of Electrical and Electronics Engineering
- Werbos, P.¹

6
- 18644385222
- Advanced Technology Paths to Global Climate Stability: Energy For a Greenhouse Planet
- M. Hoffert et al, Advanced Technology Paths to Global Climate Stability: Energy For a Greenhouse Planet, Science, 2002.
- (2002) Science
- Hoffert, M.¹

7
- 0003529238
- Ph.D. Thesis, Committee on Applied Mathematics, Harvard U., . Reprinted in its entirety in P. Werbos, The Roots of Backpropagation: From Ordered Derivatives to Neural Networks and Political Forecasting, Wiley, New York, 1994
- P. Werbos, Beyond Regression: New Tools for Prediction and Analysis in the Behavioral Sciences, Ph.D. Thesis, Committee on Applied Mathematics, Harvard U., 1974. Reprinted in its entirety in P. Werbos, The Roots of Backpropagation: From Ordered Derivatives to Neural Networks and Political Forecasting, Wiley, New York, 1994.
- (1974) Beyond Regression: New Tools for Prediction and Analysis in the Behavioral Sciences
- Werbos, P.¹

8
- 0025503558
- Backpropagation through time: What it does and how to do it
- Updated version reprinted as chapter 8 of [6]
- P. Werbos, Backpropagation through time: what it does and how to do it, Proc. IEEE, vol. 78, no. 10, 1990. Updated version reprinted as chapter 8 of [6].
- (1990) Proc. IEEE , vol.78 , Issue.10
- Werbos, P.¹

9
- 0031170885
- Nlq theory: A neural control framework with global asymptotic stability criteria
- J. A. Suykens, B. DeMoor and J. Vandewalle, Nlq theory: a neural control framework with global asymptotic stability criteria, Neural Networks, vol. 10, no. 4, pp. 615-637,1997.
- (1997) Neural Networks , vol.10 , Issue.4 , pp. 615-637
- Suykens, J.A.¹ Demoor, B.² Vandewalle, J.³

10
- 0003950434
- ArXiv.org: adap-org/9810001
- P. Werbos, Stable Adaptive Control Using New Critic Designs. ArXiv.org: adap-org/9810001 (1998)
- (1998) Stable Adaptive Control Using New Critic Designs
- Werbos, P.¹

11
- 0004291983
- American Elsevier
- D. Jacobson and D. Mayne, Differential Dynamic Programming, American Elsevier, 1970.
- (1970) Differential Dynamic Programming
- Jacobson, D.¹ Mayne, D.²

12
- 85036535701
- TNN submitted, Condensed versions are in press in IJCNN 2003 Proc. (IEEE) and CCA 2003 Proc. (IEEE)
- P. He and J. Sarangapani, Neuro Emission Controller for Minimizing Cyclic Dispersion in Spark Ignition Engines, TNN submitted, 2003. Condensed versions are in press in IJCNN 2003 Proc. (IEEE) and CCA 2003 Proc. (IEEE).
- (2003) Neuro Emission Controller for Minimizing Cyclic Dispersion in Spark Ignition Engines
- He, P.¹ Sarangapani, J.²

13
- 0042032055
- SIAM, Philadelphia
- F. Lewis, J. Campos and R. Selmic, Neuro-Fuzzy Control of Industrial Systems with Actuator Nonlinearities, SIAM, Philadelphia, 2002.
- (2002) Neuro-Fuzzy Control of Industrial Systems with Actuator Nonlinearities
- Lewis, F.¹ Campos, J.² Selmic, R.³

14
- 0004242550
- McGraw-Hill
- E. A. Feigenbaum and J. Feldman, Computers and Thought, McGraw-Hill, 1963.
- (1963) Computers and Thought
- Feigenbaum, E.A.¹ Feldman, J.²

15
- 84884079276
- Princeton University Press, Princeton, NJ
- J. Von Neumann and O. Morgenstem, The Theory of Games and Economic Behavior, Princeton University Press, Princeton, NJ, 1953.
- (1953) The Theory of Games and Economic Behavior
- Von Neumann, J.¹ Morgenstem, O.²

16
- 0003507478
- Addison-Wesley, Reading, MA
- H. Raiffa, Decision Analysis, Addison-Wesley, Reading, MA 1968.
- (1968) Decision Analysis
- Raiffa, H.¹

17
- 0025389734
- Rational approaches to identifying policy objectives
- P. Werbos, Rational approaches to identifying policy objectives, Energy: The International Journal, vol. 15, no. 3/4, pp. 171-185, 1990.
- (1990) Energy: The International Journal , vol.15 , Issue.3-4 , pp. 171-185
- Werbos, P.¹

18
- 0004114841
- Springer, New York
- D. F. Walls and G. F. Milbum, Quantum Optics, Springer, New York, 1994.
- (1994) Quantum Optics
- Walls, D.F.¹ Milbum, G.F.²

19
- 0345407827
- Morgan-Kauffman, San Francisco
- D. B. Fogel, Blondie24: Playing at the Edge of AI, Morgan-Kauffman, San Francisco, 2001.
- (2001) Blondie24: Playing at the Edge of AI
- Fogel, D.B.¹

20
- 0003754075
- Ph.D. thesis and Report No. 469, Department of Electrical Engineering, Linköping U., 58183, Linköping, Sweden
- T. Landelius, Reinforcement Learning and Distributed Local Model Synthesis, Ph.D. thesis and Report No. 469, Department of Electrical Engineering, Linköping U., 58183, Linköping, Sweden.
- Reinforcement Learning and Distributed Local Model Synthesis
- Landelius, T.¹

21
- 0004294973
- Dover edition
- R. F. Stengel, Optimal Control and Estimation, Dover edition, 1994.
- (1994) Optimal Control and Estimation
- Stengel, R.F.¹

22
- 0003583154
- Prentice-Hall, Englewood Cliffs, NJ, 1989; Hemisphere, Washington, DC
- K. Narendra and A. Annaswamy, Stable Adaptive Systems, Prentice-Hall, Englewood Cliffs, NJ, 1989; Hemisphere, Washington, DC, 1982.
- (1982) Stable Adaptive Systems
- Narendra, K.¹ Annaswamy, A.²

23
- 0003644124
- MIT Press, Cambridge, MA
- R. Howard, Dynamic Programming and Markhov Processes, MIT Press, Cambridge, MA 1960.
- (1960) Dynamic Programming and Markhov Processes
- Howard, R.¹

24
- 0004049893
- Ph.D. thesis, University of Cambridge, England
- C. J. C. H. Watkins, Learning From Delayed Rewards, Ph.D. thesis, University of Cambridge, England, 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.J.C.H.¹

25
- 34249833101
- Technical note: Q-leaming
- Watkins W. and Dayan D., Technical note: Q-leaming, Machine Learning, vol. 8, no. 3/4, pp. 279-292, 1992.
- (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 279-292
- Watkins, W.¹ Dayan, D.²

26
- 0024888479
- Neural networks for control and system identification
- IEEE
- P. Werbos, Neural networks for control and system identification, IEEE Proc. CDC89, IEEE, 1989.
- (1989) IEEE Proc. CDC89
- Werbos, P.¹

27
- 0003487482
- Athena Scientific, Belmont, MA
- D. P. Bertsekas and J. N. Tsisiklis, Neuro-Dynamic Programming, Athena Scientific, Belmont, MA, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsisiklis, J.N.²

28
- 0020970738
- Neuronlike adaptive elements that can solve difficult learning control problems
- A. Barto, R. Sutton and C. Anderson, Neuronlike adaptive elements that can solve difficult learning control problems, IEEE Trans. SMC, vol. 13, no. 5, pp. 834-846,1983.
- (1983) IEEE Trans. SMC , vol.13 , Issue.5 , pp. 834-846
- Barto, A.¹ Sutton, R.² Anderson, C.³

29
- 0040921652
- The elements of intelligence
- P. Werbos, The elements of intelligence, Cybernetica (Namur), no. 3,1968.
- (1968) Cybernetica (Namur) , Issue.3
- Werbos, P.¹

30
- 0003507691
- Ginn
- A. Biyson and Y. C. Ho, Applied Optimal Control, Ginn, 1969.
- (1969) Applied Optimal Control
- Biyson, A.¹ Ho, Y.C.²

31
- 0029515159
- Information state for robust control of set-valued discrete time systems
- IEEE
- J. S. Baras and N. S. Patel, Information state for robust control of set-valued discrete time systems, Proc. 34th Conf. Decision and Control (CDC), IEEE, pp. 2302,1995.
- (1995) Proc. 34Th Conf. Decision and Control (CDC) , pp. 2302
- Baras, J.S.¹ Patel, N.S.²

32
- 0035680418
- Multi-agent Markhov decision processes with limited agent communication
- IEEE
- S. Mukhopadhyay and B. Jain, Multi-agent Markhov decision processes with limited agent communication, Proc. Of the Inti Joint Conf on Control Applications and Inti Symposium on Intelligent Control (IEEE CCA/ISIC01), IEEE: 2001.
- (2001) Proc. Of the Inti Joint Conf on Control Applications and Inti Symposium on Intelligent Control (IEEE CCA/ISIC01)
- Mukhopadhyay, S.¹ Jain, B.²

33
- 0003410791
- New York: Spinger, Second Edition. Also see H. Ritter, T. Martinetz, and K. Schulten, Neural Computation and Self-Organizing Maps, Addison-Wesley, Reading, MA, 1992
- T. Kohonen, Self-Organizing Maps, New York: Spinger, 1997, Second Edition. Also see H. Ritter, T. Martinetz, and K. Schulten, Neural Computation and Self-Organizing Maps, Addison-Wesley, Reading, MA, 1992.
- (1997) Self-Organizing Maps
- Kohonen, T.¹

34
- 34548218785
- Outline of Intelligence
- J. Albus, Outline of Intelligence, IEEE Trans. Systems, Man and Cybernetics, vol. 21, no. 2,1991.
- (1991) IEEE Trans. Systems, Man and Cybernetics , vol.21 , Issue.2
- Albus, J.¹

35
- 34548729060
- Changes in global policy analysis procedures suggested by new methods of optimization
- P. Werbos, Changes in global policy analysis procedures suggested by new methods of optimization, Policy Analysis and Information Systems, vol. 3, no. 1, 1979.
- (1979) Policy Analysis and Information Systems , vol.3 , Issue.1
- Werbos, P.¹

36
- 85036510816
- Backpropagation: General Principles and Issues for Biology
- D. Fogel and C. Robinson (eds.), IEEE
- P. Werbos, Backpropagation: General Principles and Issues for Biology, in D. Fogel and C. Robinson (eds.), Computational Intelligence: The Experts Speak, IEEE, 2003.
- (2003) Computational Intelligence: The Experts Speak
- Werbos, P.¹

37
- 0027599793
- Universal approximation bounds for superpositions of a sigmoidal function
- A. R. Barron, Universal approximation bounds for superpositions of a sigmoidal function, IEEE Trans. Info. Theory, vol. 39, no. 3, pp. 930-945,1993.
- (1993) IEEE Trans. Info. Theory , vol.39 , Issue.3 , pp. 930-945
- Barron, A.R.¹

38
- 84974765089
- Elastic fuzzy logic: A better fit to neurocontrol and true intelligence
- P. Werbos, Elastic fuzzy logic: a better fit to neurocontrol and true intelligence, J. Intelligent & Fuzzy Systems, vol. 1, no. 4,1993.
- (1993) J. Intelligent & Fuzzy Systems , vol.1 , Issue.4
- Werbos, P.¹

39
- 0005942467
- Neural network design for J function approximation in dynamic programming
- physically, asadap-org/9806001 atarXiv.org
- X. Z. Pang and P. Werbos, Neural network design for J function approximation in dynamic programming, Math. Modelling and Scientific Computing, vol. 5, no. 2/3,1996 (physically 1998). Available also asadap-org/9806001 atarXiv.org.
- (1996) Math. Modelling and Scientific Computing , vol.5 , Issue.2-3
- Pang, X.Z.¹ Werbos, P.²

40
- 0030421566
- Generalized maze navigation: SRN critics solve what feedforward or Hebbian nets cannot
- Beijing, IEEE
- P. Werbos and X. Z. Pang, “Generalized maze navigation: SRN critics solve what feedforward or Hebbian nets cannot,” Proc. Conf. Systems, Man and Cybernetics (SMC), Beijing, IEEE, 1996.
- (1996) Proc. Conf. Systems, Man and Cybernetics (SMC)
- Werbos, P.¹ Pang, X.Z.²

41
- 0040715861
- Implementing Back-Propagation-Through-Time Learning Algorithm Using Cellular Neural Networks
- T. Yang and L. O. Chua, Implementing Back-Propagation-Through-Time Learning Algorithm Using Cellular Neural Networks, Int 1 J. Bifurcation and Chaos, vol. 9, no. 9, pp. 1041-1074,1999.
- (1999) Int 1 J. Bifurcation and Chaos , vol.9 , Issue.9 , pp. 1041-1074
- Yang, T.¹ Chua, L.O.²

42
- 85036532852
- P. Werbos posted at http://www.iamcm.org, and [47].
- Werbos, P.¹

43
- 0025229247
- Consistency of HDP applied to a simple reinforcement learning problem
- P. Werbos, Consistency of HDP applied to a simple reinforcement learning problem, Neural Networks, 1990.
- (1990) Neural Networks
- Werbos, P.¹

44
- 0002557583
- Advanced forecasting for global crisis warning and models of intelligence
- P. Werbos, Advanced forecasting for global crisis warning and models of intelligence, General Systems Yearbook, 1977.
- (1977) General Systems Yearbook
- Werbos, P.¹

45
- 0015667648
- Punish/reward: Learning with a Critic in adaptive threshold systems
- B. Widrow, N. Gupta and S. Maitra, Punish/reward: learning with a Critic in adaptive threshold systems, IEEE Trans. SMC, vol. 5, pp. 455-465,1973.
- (1973) IEEE Trans. SMC , vol.5 , pp. 455-465
- Widrow, B.¹ Gupta, N.² Maitra, S.³

46
- 0023169119
- Building and understanding adaptive systems: A statistical/numerical approach to factory automation and brain research
- P. Werbos, Building and understanding adaptive systems: A statistical/numerical approach to factory automation and brain research, IEEE Trans. SMC, 1987.
- (1987) IEEE Trans. SMC
- Werbos, P.¹

47
- 0031143730
- An analysis of temporal-difference learning with function approximation
- J. Tsitsiklis and B. Van Roy, An analysis of temporal-difference learning with function approximation, IEEE Trans. Auto. Control, vol. 42, no. 5, 1997.
- (1997) IEEE Trans. Auto. Control , vol.42 , Issue.5
- Tsitsiklis, J.¹ Van Roy, B.²

48
- 85036564722
- Artificial neural networks in optimization and applications
- P. M. Pardalos and M. G. C. Re-sende (eds.), Cambridge University Press
- T. B. Trafalis and S. Kasap, Artificial neural networks in optimization and applications, Handbook of Applied Optimization, in P. M. Pardalos and M. G. C. Re-sende (eds.), Cambridge University Press, 2000.
- (2000) Handbook of Applied Optimization
- Trafalis, T.B.¹ Kasap, S.²

49
- 84882243223
- Intelligent control using neural networks
- M. Gupta andN. Sinha (eds.), IEEE Press
- K. Narendra and S. Mukhopadhyay, Intelligent control using neural networks, in M. Gupta andN. Sinha (eds.), Intelligent Control Systems, IEEE Press, 1996.
- (1996) Intelligent Control Systems
- Narendra, K.¹ Mukhopadhyay, S.²

50
- 34548205411
- A Brain-Like Design To Learn Optimal Decision Strategies in Complex Environments
- M. Kamy, K. Warwick and V. Kurkova (eds.), Springer, London
- P. Werbos, A Brain-Like Design To Learn Optimal Decision Strategies in Complex Environments, in M. Kamy, K. Warwick and V. Kurkova (eds.), Dealing with Complexity: A Neural Networks Approach, Springer, London, 1998.
- (1998) Dealing with Complexity: A Neural Networks Approach
- Werbos, P.¹

51
- 0003778946
- Springer
- S. Amari and N. Kasabov, Brain-Like Computing and Intelligent Information Systems, Springer, 1998.
- (1998) Brain-Like Computing and Intelligent Information Systems
- Amari, S.¹ Kasabov, N.²

52
- 33747609825
- A simple solution to the bioreactor benchmark problem by application of Q-leaming
- Erlbaum, New York
- F. Yuan, L. Feldkamp, G. Puskorius and L. Davis, A simple solution to the bioreactor benchmark problem by application of Q-leaming, Proc. World Congress on Neural Networks, Erlbaum, New York, 1995.
- (1995) Proc. World Congress on Neural Networks
- Yuan, F.¹ Feldkamp, L.² Puskorius, G.³ Davis, L.⁴

53
- 33749957677
- Master’s Thesis, chapter 5, Dept, of Electronic Mechanical Engineering, Nagoya University, Japan
- T. Shibata, Hierarchical Intelligent Control of Robotic Motion, Master’s Thesis, chapter 5, Dept, of Electronic Mechanical Engineering, Nagoya University, Japan, 1992.
- (1992) Hierarchical Intelligent Control of Robotic Motion
- Shibata, T.¹

54
- 0001773535
- Applications of advances in nonlinear sensitivity analysis
- R. Drenick and F. Kozin (eds.), Springer, reprinted as chapter 7 in [6]
- P. Werbos, Applications of advances in nonlinear sensitivity analysis, in R. Drenick and F. Kozin (eds.), System Modeling and Optimization: Proc. IFIP Conf. (1981), Springer 1982; reprinted as chapter 7 in [6].
- (1982) System Modeling and Optimization: Proc. IFIP Conf. (1981)
- Werbos, P.¹

55
- 0004230131
- Wiley, New York
- D. O. Hebb, The Organization of Behavior, Wiley, New York, 1949.
- (1949) The Organization of Behavior
- Hebb, D.O.¹

56
- 0003894363
- MIT Press, Cambridge, MA
- J. C. Houk, J. L. Davis and D. G. Beiser (eds.), Models of Information Processing in the Basal Ganglia, MIT Press, Cambridge, MA, 1995.
- (1995) Models of Information Processing in the Basal Ganglia
- Houk, J.C.¹ Davis, J.L.² Beiser, D.G.³

57
- 85032941146
- Erlbaum: Hillsdale, NJ, See also earlier books edited by Pribram in the same series from Erlbaum
- K. H. Pribram, (ed.), Brain and Values, Erlbaum: Hillsdale, NJ, 1998. (See also earlier books edited by Pribram in the same series from Erlbaum.)
- (1998) Brain and Values
- Pribram, K.H.¹

58
- 84869422138
- Multiple Models for Approximate Dynamic Programming and True Intelligent Control: Why and How
- K. Narendra (ed.), New Haven: K. Narendra, EE Dept., Yale University
- P. Werbos, Multiple Models for Approximate Dynamic Programming and True Intelligent Control: Why and How, in K. Narendra (ed.), Proc. 10th Yale Conf. on Learning and Adaptive Systems, New Haven: K. Narendra, EE Dept., Yale University, 1998.
- (1998) Proc. 10Th Yale Conf. On Learning and Adaptive Systems
- Werbos, P.¹

59
- 85036506254
- Modeling the World at a Mixture of Time Scales
- University of Massachusetts at Amherst, December, later published in Proa 12th Int. Conf. Macjine Learning, pp. 531-539, Morgan Kauhnann, 1995
- R. Sutton, TD Models: Modeling the World at a Mixture of Time Scales, CMP-SCI Technical Report, pp. 95-114, University of Massachusetts at Amherst, December 1995, later published in Proa 12th Int. Conf. Macjine Learning, pp. 531-539, Morgan Kauhnann, 1995.
- (1995) CMP-SCI Technical Report , pp. 95-114
- Sutton, R.¹ Models, T.D.²

60
- 85036583671
- Routing networks in visual cortex
- M. Arbib (ed.), First Edition, MIT Press, Cambridge, MA
- C. H. Anderson, B. Olshausen and D. Van Essen, Routing networks in visual cortex, in M. Arbib (ed.), The Handbook of Brain Theory and Neural Networks, First Edition, pp. 823-826, MIT Press, Cambridge, MA, 1995.
- (1995) The Handbook of Brain Theory and Neural Networks , pp. 823-826
- Anderson, C.H.¹ Olshausen, B.² Van Essen, D.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.