SCOPUS 정보 검색 플랫폼

International Journal of Intelligent Systems

Volumn 12, Issue 10, 1997, Pages 695-724

Training and delayed reinforcements in Q-learning agents

(2) Caironi, Pierguido V C a Dorigo, Marco b

a POLITECNICO DI MILANO (Italy)

b UNIVERSITÉ LIBRE DE BRUXELLES (Belgium)

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; COMPUTER SIMULATION; LEARNING SYSTEMS; ROBOTS;

DELAYED REINFORCEMENTS; Q LEARNING ALGORITHMS; TRAINING REINFORCEMENTS;

LEARNING ALGORITHMS;

EID: 0031257934 PISSN: 08848173 EISSN: None Source Type: Journal
DOI: 10.1002/(SICI)1098-111X(199710)12:10<695::AID-INT1>3.0.CO;2-T Document Type: Article

Times cited : (16)

References (23)

1
- 0003812851
- Computer Science Dept., University of Rochester, NY
- S.D. Whitehead, A Study of Cooperative Mechanisms for Faster Reinforcement Learning, TR-365, Computer Science Dept., University of Rochester, NY, 1991.
- (1991) A Study of Cooperative Mechanisms for Faster Reinforcement Learning, TR-365
- Whitehead, S.D.¹

2
- 33847202724
- Learning to predict by the methods of temporal differences
- R.S. Sutton, "Learning to predict by the methods of temporal differences," Mach. Learn., 3, 9-44 (1988).
- (1988) Mach. Learn. , vol.3 , pp. 9-44
- Sutton, R.S.¹

3
- 0003617454
- Ph.D. Thesis, Department of Computer and Information Science, University of Massachusetts, Amherst, MA
- R.S. Sutton, "Temporal credit assignment in reinforcement learning," Ph.D. Thesis, Department of Computer and Information Science, University of Massachusetts, Amherst, MA, 1984.
- (1984) Temporal Credit Assignment in Reinforcement Learning
- Sutton, R.S.¹

4
- 0002201501
- Learning and sequential decision making
- M. Gabriel and J. W. Moore, Eds. MIT Press, Bradford Books, Cambridge, MA
- A.G. Barto, R.S. Sutton, and C.J.C.H. Watkins, "Learning and sequential decision making," in Learning and Computational Neuroscience: Foundations of Adaptive Network, M. Gabriel and J. W. Moore, Eds. MIT Press, Bradford Books, Cambridge, MA, 1990.
- (1990) Learning and Computational Neuroscience: Foundations of Adaptive Network
- Barto, A.G.¹ Sutton, R.S.² Watkins, C.J.C.H.³

5
- 0024735689
- Classifier systems and genetic algorithms
- L. Booker, D.E. Goldberg, and J.H. Holland, "Classifier systems and genetic algorithms," Artif. Intell., 40, 235-282 (1989).
- (1989) Artif. Intell. , vol.40 , pp. 235-282
- Booker, L.¹ Goldberg, D.E.² Holland, J.H.³

6
- 0004049895
- Ph.D. Dissertation, Psychology Department, University of Cambridge, England
- C.J.C.H. Watkins, "Learning with delayed rewards," Ph.D. Dissertation, Psychology Department, University of Cambridge, England, 1989.
- (1989) Learning with Delayed Rewards
- Watkins, C.J.C.H.¹

7
- 85158158334
- A complexity analysis of cooperative mechanisms in reinforcement learning
- S.D. Whitehead, "A complexity analysis of cooperative mechanisms in reinforcement learning," Proceeding of the Ninth National Conference on Artificial Intelligence (AAAI-91), 1991, pp. 607-613.
- (1991) Proceeding of the Ninth National Conference on Artificial Intelligence (AAAI-91) , pp. 607-613
- Whitehead, S.D.¹

8
- 0003411271
- Efficient Exploration in Reinforcement Learning
- Carnegie Mellon University, Pittsburgh, PA
- S.B. Thrun, Efficient Exploration in Reinforcement Learning, Technical Report CMU-CS-92-102, Carnegie Mellon University, Pittsburgh, PA, 1992.
- (1992) Technical Report CMU-CS-92-102
- Thrun, S.B.¹

9
- 0003782780
- MIT Press, Bradford Books, Cambridge, MA
- M. Dorigo and M. Colombetti, Robot Shaping: An Experiment in Behavior Engineering, MIT Press, Bradford Books, Cambridge, MA, 1997.
- (1997) Robot Shaping: An Experiment in Behavior Engineering
- Dorigo, M.¹ Colombetti, M.²

10
- 0029326107
- ALECSYS and the autonoMouse: Learning to control a real robot by distributed classifier systems
- M. Dorigo, "ALECSYS and the autonoMouse: Learning to control a real robot by distributed classifier systems," Mach. Learn., 19, 209-240 (1995).
- (1995) Mach. Learn. , vol.19 , pp. 209-240
- Dorigo, M.¹

11
- 0028739953
- Robot shaping: Developing autonomous agents through learning
- M. Dorigo and M. Colombetti, "Robot shaping: Developing autonomous agents through learning," Artif. Intell., 71, 321-370 (1994).
- (1994) Artif. Intell. , vol.71 , pp. 321-370
- Dorigo, M.¹ Colombetti, M.²

12
- 0001963114
- The role of the trainer in reinforcement learning
- New Brunswick, NJ
- M. Dorigo and M. Colombetti, "The role of the trainer in reinforcement learning," Proceedings of the MLC-COLT '94 Workshop on Robot Learning, New Brunswick, NJ, 1994, pp. 37-45.
- (1994) Proceedings of the MLC-COLT '94 Workshop on Robot Learning , pp. 37-45
- Dorigo, M.¹ Colombetti, M.²

13
- 85132026293
- Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
- Morgan Kaufmann, San Mateo, CA
- R.S. Sutton, "Integrated architectures for learning, planning, and reacting based on approximating dynamic programming," Proceedings of the Seventh International Conference on Machine Learning, Morgan Kaufmann, San Mateo, CA, 1990, pp. 216-224.
- (1990) Proceedings of the Seventh International Conference on Machine Learning , pp. 216-224
- Sutton, R.S.¹

14
- 0003673017
- Ph.D. Thesis, Carnegie Mellon University, Pittsburgh, PA
- L-J. Lin, "Reinforcement learning for robots using neural networks," Ph.D. Thesis, Carnegie Mellon University, Pittsburgh, PA, 1993.
- (1993) Reinforcement Learning for Robots Using Neural Networks
- Lin, L.-J.¹

15
- 0000123778
- Self-improving reactive agents based on reinforcement learning, planning and teaching
- L-J. Lin, "Self-improving reactive agents based on reinforcement learning, planning and teaching," Mach. Learn., 8, 293-322 (1992).
- (1992) Mach. Learn. , vol.8 , pp. 293-322
- Lin, L.-J.¹

16
- 34248963252
- McGraw-Hill
- S. Siegel and N.J. Castellan, Nonparametric Statistics for the Behavioral Sciences, McGraw-Hill, 1956.
- (1956) Nonparametric Statistics for the Behavioral Sciences
- Siegel, S.¹ Castellan, N.J.²

17
- 0030149709
- Purposive behavior acquisition for a real robot by vision-based reinforcement learning
- M. Asada, S. Noda, S. Tawaratsumida, and K. Hosoda, "Purposive behavior acquisition for a real robot by vision-based reinforcement learning," Mach. Learn., 23, 279-303 (1996).
- (1996) Mach. Learn. , vol.23 , pp. 279-303
- Asada, M.¹ Noda, S.² Tawaratsumida, S.³ Hosoda, K.⁴

18
- 0026880130
- Automatic programming of behavior-based robots using reinforcement learning
- S. Mahadevan and J. Connell, "Automatic programming of behavior-based robots using reinforcement learning," Artif. Intell., 55, 311-365 (1992).
- (1992) Artif. Intell. , vol.55 , pp. 311-365
- Mahadevan, S.¹ Connell, J.²

19
- 0007908166
- Experiments with reinforcement learning in problems with continuous state and action spaces
- Department of Computer Science, University of Massachusetts, Amherst, MA
- J.C. Santamaria, R.S. Sutton, and A. Ram, "Experiments with reinforcement learning in problems with continuous state and action spaces," Technical Report UM-CS-1966-088, Department of Computer Science, University of Massachusetts, Amherst, MA, 1996.
- (1996) Technical Report UM-CS-1966-088
- Santamaria, J.C.¹ Sutton, R.S.² Ram, A.³

20
- 0028730301
- Fuzzy Q-learning and dynamical fuzzy Q-learning
- IEEE Press, Piscataway, NJ
- P.-Y. Glorennec, "Fuzzy Q-learning and dynamical fuzzy Q-learning," Proceedings of the Third IEEE International Conference on Fuzzy Systems, IEEE Press, Piscataway, NJ, 1994, pp. 474-479.
- (1994) Proceedings of the Third IEEE International Conference on Fuzzy Systems , pp. 474-479
- Glorennec, P.-Y.¹

21
- 0030395785
- Refining linear fuzzy rules by reinforcement learning
- IEEE Press, Piscataway, NJ
- H.R. Berenji, P.S. Khedkar, and A. Malkani, "Refining linear fuzzy rules by reinforcement learning," Proceedings of the Fifth IEEE International Conference on Fuzzy Systems, IEEE Press, Piscataway, NJ, 1996, pp. 1750-1756.
- (1996) Proceedings of the Fifth IEEE International Conference on Fuzzy Systems , pp. 1750-1756
- Berenji, H.R.¹ Khedkar, P.S.² Malkani, A.³

22
- 0023540098
- Manual training techniques of autonomous systems based on artificial neural networks
- IEEE Press, Piscataway, NJ
- J.F. Shepanski and S.A. Macy, "Manual training techniques of autonomous systems based on artificial neural networks," Proceedings of the IEEE First Annual International Conference on Neural Networks, IEEE Press, Piscataway, NJ, 1987, pp. 697-704.
- (1987) Proceedings of the IEEE First Annual International Conference on Neural Networks , pp. 697-704
- Shepanski, J.F.¹ Macy, S.A.²

23
- 0345843391
- Achieving rapid adaptations in robots by means of external tuition
- MIT Press, Cambridge, MA
- U. Nehmzow and B. McGonigle, "Achieving rapid adaptations in robots by means of external tuition," Proceedings of From Animal to Animats, Third International Conference on Simulation of Adaptive Behaviour (SAB94), MIT Press, Cambridge, MA, 1994, pp. 301-308.
- (1994) Proceedings of from Animal to Animats, Third International Conference on Simulation of Adaptive Behaviour (SAB94) , pp. 301-308
- Nehmzow, U.¹ McGonigle, B.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.