SCOPUS Information Retrieval Platform

Information Sciences

Volume 161, Issue 1-2, 2004, Pages 37-55

A generic architecture for adaptive agents based on reinforcement learning

(3) Preux, Philippe a,b Delepoulle, Samuel a Darcheville, Jean Claude b

a UNIVERSITÉ DU LITTORAL CÔTE D'OPALE (France)

b UNIV LILLE (France)

Author keywords

Adaptive systems; Dynamics of behavior; Modeling; Reinforcement learning

Indexed keywords

COMPUTER SIMULATION; DYNAMICS; ENGINEERING RESEARCH; LEARNING SYSTEMS; MATHEMATICAL MODELS;

ADAPTIVE AGENTS; REINFORCEMENT LEARNING;

INTELLIGENT AGENTS;

EID: 1642333557 PISSN: 00200255 EISSN: None Source Type: Journal
DOI: 10.1016/j.ins.2003.03.005 Document Type: Article

Times cited: (20)

References (35)

1
- 0003744205
- Addison-Wesley
- Ferber J. Multi-Agent Systems. 1999;Addison-Wesley.
- (1999) Multi-agent Systems
- Ferber, J.¹

2
- 0019891981
- Selection by consequences
- Skinner B. Selection by consequences. Science. 213:1981;501-514.
- (1981) Science , vol.213 , pp. 501-514
- Skinner, B.¹

3
- 0002621983
- Animal intelligence: An experimental study of the associative process in animals
- Thorndike E. Animal intelligence: an experimental study of the associative process in animals. Psychology Monographs. 2:1911.
- (1911) Psychology Monographs , vol.2
- Thorndike, E.¹

4
- 0343167665
- What is cognitive and what is not cognitive
- D. Cliff, P. Husbands, J.-A. Meyer, & S.W. Wilson. From Animals to Animats 3. MIT Press
- Toates F. What is cognitive and what is not cognitive. Cliff D., Husbands P., Meyer J.-A., Wilson S.W. From Animals to Animats 3, Proceedings of the Third International Conference on Simulation of Adaptive Behavior. 1994;102-107 MIT Press.
- (1994) Proceedings of the Third International Conference on Simulation of Adaptive Behavior , pp. 102-107
- Toates, F.¹

5
- 0001854989
- Learning to do without cognition
- R. Pfeifer, B. Blumberg, J.-A. Meyer, & S. Wilson. MIT Press
- Spier E., McFarland D. Learning to do without cognition. Pfeifer R., Blumberg B., Meyer J.-A., Wilson S. Proceedings of Fifth International Conference on Simulation of Adaptive Behavior (SAB 5). 1998;38-47 MIT Press.
- (1998) Proceedings of Fifth International Conference on Simulation of Adaptive Behavior (SAB 5) , pp. 38-47
- Spier, E.¹ McFarland, D.²

6
- 0011195029
- Synthetic neural modelling: Comparisons of population and connectionist approaches
- R. Pfeifer, Z. Schreter, F. Fogelman-Soulié, & L. Steels. Elsevier Science Publishers
- Reeke G., Sporns O., Edelman G. Synthetic neural modelling: comparisons of population and connectionist approaches. Pfeifer R., Schreter Z., Fogelman-Soulié F., Steels L. Connectionism in Perspective. 1989;Elsevier Science Publishers.
- (1989) Connectionism in Perspective
- Reeke, G.¹ Sporns, O.² Edelman, G.³

7
- 0036646485
- From a biological to a computational model for the autonomous behavior of an animat
- Frezza-Buet H., Alexandre F. From a biological to a computational model for the autonomous behavior of an animat. Information Sciences. 144:2002;1-43.
- (2002) Information Sciences , vol.144 , pp. 1-43
- Frezza-Buet, H.¹ Alexandre, F.²

8
- 84862349945
- Ph.D. Thesis, Université de Lille 3, URECA, Villeneuve d'Ascq, thèse de doctorat de Psychologie, October
- S. Delepoulle, Coopération entre agents adaptatifs ; étude de la sélection des comportements sociaux, expérimentations et simulations, Ph.D. Thesis, Université de Lille 3, URECA, Villeneuve d'Ascq, thèse de doctorat de Psychologie, October 2000.
- (2000) Coopération Entre Agents Adaptatifs; Étude de la Sélection des Comportements Sociaux, Expérimentations et Simulations
- Delepoulle, S.¹

9
- 1642399289
- Dynamique de l'interaction
- B. Chaib-draa, P. Enjalbert (Eds.), Toulouse
- S. Delepoulle, P. Preux, J.-C. Darcheville, Dynamique de l'interaction, in: B. Chaib-draa, P. Enjalbert (Eds.), Proc. Modèles Formels de l'Interaction, Toulouse, 2001, pp. 141-150.
- (2001) Proc. Modèles Formels de l'Interaction , pp. 141-150
- Delepoulle, S.¹ Preux, P.² Darcheville, J.-C.³

10
- 0004102479
- MIT Press
- Sutton R., Barto A. Reinforcement Learning: An Introduction. 1998;MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

11
- 0001201756
- Some studies in machine learning using the game of checkers
- Samuel A. Some studies in machine learning using the game of checkers. IBM Journal of Research and Development. 3:1959;211-229. (reprinted in E.A. Feigenbaum, J. Feldman (Eds), Computers and Thought, pp. 71-105, McGraw-Hill, New York, 1963).
- (1959) IBM Journal of Research and Development , vol.3 , pp. 211-229
- Samuel, A.¹

12
- 0004242550
- reprinted, McGraw-Hill, New York
- Samuel A. Some studies in machine learning using the game of checkers. IBM Journal of Research and Development. 3:1959;211-229. (reprinted in E.A. Feigenbaum, J. Feldman (Eds), Computers and Thought, pp. 71-105, McGraw-Hill, New York, 1963).
- (1963) Computers and Thought , pp. 71-105
- Feigenbaum, E.A.¹ Feldman, J.²

13
- 0002769452
- W. Zhang, T. Dietterich, A reinforcement learning approach to job-shop scheduling.
- A Reinforcement Learning Approach to Job-shop Scheduling
- Zhang, W.¹ Dietterich, T.²

14
- 0000133751
- Using reinforcement learning to spider the web efficiently
- J. Rennie, A. McCallum, Using reinforcement learning to spider the web efficiently, in: Proceedings of ECML, 1999.
- (1999) Proceedings of ECML
- Rennie, J.¹ McCallum, A.²

15
- 0029276036
- Temporal difference learning and TD-Gammon
- Tesauro G. Temporal difference learning and TD-Gammon. Communications of the ACM. 38:1995;58-68.
- (1995) Communications of the ACM , vol.38 , pp. 58-68
- Tesauro, G.¹

16
- 0003487482
- Athena Scientific
- Bertsekas D., Tsitsiklis J. Neuro-Dynamic Programming. 1996;Athena Scientific.
- (1996) Neuro-dynamic Programming
- Bertsekas, D.¹ Tsitsiklis, J.²

17
- 0004049893
- Ph.D. Thesis, King's College, Cambridge, UK
- C. Watkins, Learning from delayed rewards, Ph.D. Thesis, King's College, Cambridge, UK, 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.¹

18
- 0030896968
- A neural substrate of prediction and reward
- Schultz W., Dayan P., Montague P.R. A neural substrate of prediction and reward. Science. 275:1997;1593-1599.
- (1997) Science , vol.275 , pp. 1593-1599
- Schultz, W.¹ Dayan, P.² Montague, P.R.³

19
- 0035315989
- Temporal difference model reproduces anticipatory neural activity
- Suri R., Schultz W. Temporal difference model reproduces anticipatory neural activity. Neural Computation. 13(4):2001;487-494.
- (2001) Neural Computation , vol.13 , Issue.4 , pp. 487-494
- Suri, R.¹ Schultz, W.²

20
- 0003785662
- MIT Press
- Thelen E., Smith L. A Dynamic Systems Approach to the Development of Cognition and Action. 1994;MIT Press.
- (1994) A Dynamic Systems Approach to the Development of Cognition and Action
- Thelen, E.¹ Smith, L.²

21
- 0003782780
- MIT Press
- Dorigo M., Colombetti M. Robot Shaping: An Experiment in Behavior Engineering. 1998;MIT Press.
- (1998) Robot Shaping: An Experiment in Behavior Engineering
- Dorigo, M.¹ Colombetti, M.²

22
- 26744452092
- Master's thesis, Université du Littoral Côte d'Opale, Computer Science Dpt, France
- C. Cassagnabère, Modeling and simulation of the adaptive behavior of a virtual arm during a reaching movement, Master's thesis, Université du Littoral Côte d'Opale, Computer Science Dpt, France, 2001.
- (2001) Modeling and Simulation of the Adaptive Behavior of a Virtual Arm during a Reaching Movement
- Cassagnabère, C.¹

23
- 0003785662
- MIT Press
- Thelen E., Smith L. A Dynamic Systems Approach to the Development of Cognition and Action. 1994;MIT Press.
- (1994) A Dynamic Systems Approach to the Development of Cognition and Action
- Thelen, E.¹ Smith, L.²

24
- 1642355478
- Interaction organisation-environnement dans l'émergence de la saisie chez le nourrisson
- C. Boyer, J.-C. Darcheville, Interaction organisation-environnement dans l'émergence de la saisie chez le nourrisson, in: Congrès National de la Société Française de Psychologie, 1999.
- (1999) Congrès National de la Société Française de Psychologie
- Boyer, C.¹ Darcheville, J.-C.²

25
- 0003620001
- MIT Press
- Brooks R. Cambrian Intelligence: The Early History of the New AI. 1999;MIT Press.
- (1999) Cambrian Intelligence: The Early History of the New AI
- Brooks, R.¹

26
- 0032165064
- Evolution and development of neural controllers for locomotion, gradient-following, and obstacle avoidance in artificial insects
- Kodjabachian J., Meyer J. Evolution and development of neural controllers for locomotion, gradient-following, and obstacle avoidance in artificial insects. IEEE Transactions on Neural Networks. 9:1998;796-812.
- (1998) IEEE Transactions on Neural Networks , vol.9 , pp. 796-812
- Kodjabachian, J.¹ Meyer, J.²

27
- 1642279019
- Development: Is it the right way towards humanoid robotics?
- G. Metta, R. Manzotti, F. Panerai, G. Sandini, Development: is it the right way towards humanoid robotics? in: IAS-6, 2000.
- (2000) IAS-6
- Metta, G.¹ Manzotti, R.² Panerai, F.³ Sandini, G.⁴

28
- 0003325546
- Imitation: Learning and communication
- J.-A. Meyer, A. Berthoz, D. Floreano, H. Roitblatt, & S. Wilson. MIT Press
- Andry P., Moga S., Gaussier P., Revel A., Nadel J. Imitation: learning and communication. Meyer J.-A., Berthoz A., Floreano D., Roitblatt H., Wilson S. Proceedings of From Animals to Animats Conference: Simulated Adaptive Behavior. 2000;353-362 MIT Press.
- (2000) Proceedings of From Animals to Animats Conference: Simulated Adaptive Behavior , pp. 353-362
- Andry, P.¹ Moga, S.² Gaussier, P.³ Revel, A.⁴ Nadel, J.⁵

29
- 77956755176
- Analysis of reaching for stationary and moving objects in the human infant
- Elsevier
- Berthier N. Analysis of reaching for stationary and moving objects in the human infant, Neural Networks Models of Complex Behavior - Behavioral Foundations. 1997;Elsevier. pp. 283-301.
- (1997) Neural Networks Models of Complex Behavior - Behavioral Foundations , pp. 283-301
- Berthier, N.¹

30
- 85149834820
- Markov games as a framework for multi-agent reinforcement learning
- New Brunswick, NJ: Morgan Kaufmann
- Littman M. Markov games as a framework for multi-agent reinforcement learning. Proceedings of the 11th International Conference on Machine Learning (ICML-94). 1994;157-163 Morgan Kaufmann, New Brunswick, NJ.
- (1994) Proceedings of the 11th International Conference on Machine Learning (ICML-94) , pp. 157-163
- Littman, M.¹

31
- 0010623451
- On multiagent Q-learning in a semi-competitive domain
- Montreal, Canada
- T. Sandholm, R. Crites, On multiagent Q-learning in a semi-competitive domain, in: Workshop on Adaptation and Learning in Multiagent Systems at the 14th International Joint Conference on Artificial Intelligence (IJCAI-95), Montreal, Canada, 1995, pp. 71-77.
- (1995) Workshop on Adaptation and Learning in Multiagent Systems at the 14th International Joint Conference on Artificial Intelligence (IJCAI-95) , pp. 71-77
- Sandholm, T.¹ Crites, R.²

32
- 1642321450
- submitted for publication
- J. Hu, M. Wellman, Multiagent reinforcement learning in stochastic games, submitted for publication.
- Multiagent Reinforcement Learning in Stochastic Games
- Hu, J.¹ Wellman, M.²

33
- 0038637209
- Multi-agent reinforcement learning: Independent vs. cooperative learning
- M.N. Huhns, & M.P. Singh. San Francisco, CA, USA: Morgan Kaufmann
- Tan M. Multi-agent reinforcement learning: Independent vs. cooperative learning. Huhns M.N., Singh M.P. Readings in Agents. 1997;487-494 Morgan Kaufmann, San Francisco, CA, USA.
- (1997) Readings in Agents , pp. 487-494
- Tan, M.¹

34
- 58149409359
- Eye-hand coordination in newborns
- von Hofsten C. Eye-hand coordination in newborns. Developmental Psychology. 18:1982;450-461.
- (1982) Developmental Psychology , vol.18 , pp. 450-461
- Von Hofsten, C.¹

35
- 1642407311
- Visual control upon hand reaching movements in neonates
- Ennouri K., Dubon K., Notides C., Bloch H. Visual control upon hand reaching movements in neonates. Infant Behavior and Development. 17:1994.
- (1994) Infant Behavior and Development , vol.17
- Ennouri, K.¹ Dubon, K.² Notides, C.³ Bloch, H.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.