메뉴 건너뛰기




Volumn 161, Issue 1-2, 2004, Pages 37-55

A generic architecture for adaptive agents based on reinforcement learning

Author keywords

Adaptive systems; Dynamics of behavior; Modeling; Reinforcement learning

Indexed keywords

COMPUTER SIMULATION; DYNAMICS; ENGINEERING RESEARCH; LEARNING SYSTEMS; MATHEMATICAL MODELS;

EID: 1642333557     PISSN: 00200255     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.ins.2003.03.005     Document Type: Article
Times cited : (20)

References (35)
  • 2
    • 0019891981 scopus 로고
    • Selection by consequences
    • Skinner B. Selection by consequences. Science. 213:1981;501-514.
    • (1981) Science , vol.213 , pp. 501-514
    • Skinner, B.1
  • 3
    • 0002621983 scopus 로고
    • Animal intelligence: An experimental study of the associative process in animals
    • Thorndike E. Animal intelligence: an experimental study of the associative process in animals. Psychology Monographs. 2:1911.
    • (1911) Psychology Monographs , vol.2
    • Thorndike, E.1
  • 4
    • 0343167665 scopus 로고
    • What is cognitive and what is not cognitive
    • D. Cliff, P. Husbands, J.-A. Meyer, & S.W. Wilson. From Animals to Animats 3. MIT Press
    • Toates F. What is cognitive and what is not cognitive. Cliff D., Husbands P., Meyer J.-A., Wilson S.W. From Animals to Animats 3, Proceedings of the Third International Conference on Simulation of Adaptive Behavior. 1994;102-107 MIT Press.
    • (1994) Proceedings of the Third International Conference on Simulation of Adaptive Behavior , pp. 102-107
    • Toates, F.1
  • 6
    • 0011195029 scopus 로고
    • Synthetic neural modelling: Comparisons of population and connectionist approaches
    • R. Pfeifer, Z. Schreter, F. Fogelman-Soulié, & L. Steels. Elsevier Science Publishers
    • Reeke G., Sporns O., Edelman G. Synthetic neural modelling: comparisons of population and connectionist approaches. Pfeifer R., Schreter Z., Fogelman-Soulié F., Steels L. Connectionism in Perspective. 1989;Elsevier Science Publishers.
    • (1989) Connectionism in Perspective
    • Reeke, G.1    Sporns, O.2    Edelman, G.3
  • 7
    • 0036646485 scopus 로고    scopus 로고
    • From a biological to a computational model for the autonomous behavior of ana animat
    • Frezza-Buet H., Alexandre F. From a biological to a computational model for the autonomous behavior of ana animat. Information Sciences. 144:2002;1-43.
    • (2002) Information Sciences , vol.144 , pp. 1-43
    • Frezza-Buet, H.1    Alexandre, F.2
  • 11
    • 0001201756 scopus 로고
    • Some studies in machine learning using the game of checkers
    • Samuel A. Some studies in machine learning using the game of checkers. IBM Journal of Research and Development. 3:1959;211-229. (reprinted in E.A. Feigenbaum, J. Feldman (Eds), Computers and Thought, pp. 71-105, Mc Graw-Hill, New York, 1963).
    • (1959) IBM Journal of Research and Development , vol.3 , pp. 211-229
    • Samuel, A.1
  • 12
    • 0004242550 scopus 로고
    • reprinted, Mc Graw-Hill, New York
    • Samuel A. Some studies in machine learning using the game of checkers. IBM Journal of Research and Development. 3:1959;211-229. (reprinted in E.A. Feigenbaum, J. Feldman (Eds), Computers and Thought, pp. 71-105, Mc Graw-Hill, New York, 1963).
    • (1963) Computers and Thought , pp. 71-105
    • Feigenbaum, E.A.1    Feldman, J.2
  • 14
    • 0000133751 scopus 로고    scopus 로고
    • Using reinforcement learning to spider the web efficiently
    • J. Rennie, A. McCallum, Using reinforcement learning to spider the web efficiently, in: Proceedings of ECML, 1999.
    • (1999) Proceedings of ECML
    • Rennie, J.1    McCallum, A.2
  • 15
    • 0029276036 scopus 로고
    • Temporal difference learning and TD-Gammon
    • Tesauro G. Temporal difference learning and TD-Gammon. Communications of the ACM. 38:1995;58-68.
    • (1995) Communications of the ACM , vol.38 , pp. 58-68
    • Tesauro, G.1
  • 18
    • 0030896968 scopus 로고    scopus 로고
    • A neural substrate of prediction and reward
    • Schultz W., Dayan P., Montague P.R. A neural substrate of prediction and reward. Science. 275:1997;1593-1599.
    • (1997) Science , vol.275 , pp. 1593-1599
    • Schultz, W.1    Dayan, P.2    Montague, P.R.3
  • 19
    • 0035315989 scopus 로고    scopus 로고
    • Temporal difference model reproduces anticipatory neural activity
    • Suri R., Schultz W. Temporal difference model reproduces anticipatory neural activity. Neural Computation. 13(4):2001;487-494.
    • (2001) Neural Computation , vol.13 , Issue.4 , pp. 487-494
    • Suri, R.1    Schultz, W.2
  • 26
    • 0032165064 scopus 로고    scopus 로고
    • Evolution and development of neural controllers for locomotion, gradient-following, and obstacle avoidance in artificial insects
    • Kodjobachian J., Meyer J. Evolution and development of neural controllers for locomotion, gradient-following, and obstacle avoidance in artificial insects. IEEE Transactions in Neural Networks. 9:1998;796-812.
    • (1998) IEEE Transactions in Neural Networks , vol.9 , pp. 796-812
    • Kodjobachian, J.1    Meyer, J.2
  • 27
    • 1642279019 scopus 로고    scopus 로고
    • Development: Is it the right way towards humanoid robotics?
    • G. Metta, R. Manzotti, F. Panerai, G. Sandini, Development: is it the right way towards humanoid robotics? in: IAS-6, 2000.
    • (2000) IAS-6
    • Metta, G.1    Manzotti, R.2    Panerai, F.3    Sandini, G.4
  • 33
    • 0038637209 scopus 로고    scopus 로고
    • Multi-agent reinforcement learning: Independent vs. cooperative learning
    • M.N. Huhns, & M.P. Singh. San Francisco, CA, USA: Morgan Kaufmann
    • Tan M. Multi-agent reinforcement learning: Independent vs. cooperative learning. Huhns M.N., Singh M.P. Readings in Agents. 1997;487-494 Morgan Kaufmann, San Francisco, CA, USA.
    • (1997) Readings in Agents , pp. 487-494
    • Tan, M.1
  • 34
    • 58149409359 scopus 로고
    • Eye-hand coordination in newborns
    • von Hofsten C. Eye-hand coordination in newborns. Developmental Psychology. 18:1982;450-461.
    • (1982) Developmental Psychology , vol.18 , pp. 450-461
    • Von Hofsten, C.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.