SCOPUS Information Retrieval Platform

Volume 3, 2003, Pages 1108-1113

Multitask reinforcement learning on the distribution of MDPs

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; AUTOMATION; ROBOTICS;

LEARNING EXPERIENCES; LEARNING PERFORMANCE; LEARNING TASKS;

REINFORCEMENT LEARNING;

EID: 84863336191 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/CIRA.2003.1222152 Document Type: Conference Paper

Times cited: 63

References (11)

1
- 0030171602
- Rapid, safe, and incremental learning of navigation strategies
- J. del R. Millán. Rapid, Safe, and Incremental Learning of Navigation Strategies. IEEE Transactions on Systems, Man and Cybernetics-Part B, 26:408-420, 1996.
- (1996) IEEE Transactions on Systems, Man and Cybernetics-Part B, vol.26, pp. 408-420
- Millán, J.R.¹

2
- 0029679044
- Reinforcement learning: A survey
- L. P. Kaelbling, M. L. Littman, and A. W. Moore. Reinforcement Learning: A Survey. Journal of Artificial Intelligence Research, 4:237-285, 1996.
- (1996) Journal of Artificial Intelligence Research, vol.4, pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

3
- 0027684215
- Prioritized sweeping: Reinforcement learning with less data and less real time
- A. W. Moore and C. G. Atkeson. Prioritized Sweeping: Reinforcement Learning with Less Data and Less Real Time. Machine Learning, 13:103-130, 1993.
- (1993) Machine Learning, vol.13, pp. 103-130
- Moore, A.W.¹ Atkeson, C.G.²

4
- 84977063352
- Efficient learning and planning within the Dyna framework
- J. Peng and R. J. Williams. Efficient Learning and Planning Within the Dyna Framework. Adaptive Behavior, 1(4):437-454, 1993.
- (1993) Adaptive Behavior, vol.1, Issue.4, pp. 437-454
- Peng, J.¹ Williams, R.J.²

5
- 0001027894
- Transfer of learning by composing solutions of elemental sequential tasks
- S. P. Singh. Transfer of Learning by Composing Solutions of Elemental Sequential Tasks. Machine Learning, 8:323-339, 1992.
- (1992) Machine Learning, vol.8, pp. 323-339
- Singh, S.P.¹

6
- 85132026293
- Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
- R. S. Sutton. Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming. In Proceedings of the 7th International Conference on Machine Learning, pages 216-224, 1990.
- (1990) Proceedings of the 7th International Conference on Machine Learning, pp. 216-224
- Sutton, R.S.¹

7
- 0004102479
- MIT Press
- R. S. Sutton and A. G. Barto. Reinforcement Learning: An Introduction. MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

8
- 0003381055
- An approach to lifelong reinforcement learning through multiple environments
- F. Tanaka and M. Yamamura. An Approach to Lifelong Reinforcement Learning through Multiple Environments. In Proceedings of the 6th European Workshop on Learning Robots, pages 93-99, 1997.
- (1997) Proceedings of the 6th European Workshop on Learning Robots, pp. 93-99
- Tanaka, F.¹ Yamamura, M.²

9
- 0001546350
- Active exploration in dynamic environments
- S. Thrun and K. Möller. Active Exploration in Dynamic Environments. In Advances in Neural Information Processing Systems 4, pages 531-538, 1992.
- (1992) Advances in Neural Information Processing Systems, vol.4, pp. 531-538
- Thrun, S.¹ Möller, K.²

10
- 0003901612
- Kluwer Academic Publishers
- S. Thrun and L. Pratt. Learning to Learn. Kluwer Academic Publishers, 1998.
- (1998) Learning to Learn
- Thrun, S.¹ Pratt, L.²

11
- 34249833101
- Q-learning
- C. J. C. H. Watkins and P. Dayan. Q-Learning. Machine Learning, 8:279-292, 1992.
- (1992) Machine Learning, vol.8, pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.