SCOPUS Information Search Platform

IJCAI International Joint Conference on Artificial Intelligence

Volume , Issue , 2011, Pages 1211-1217

Using cases as heuristics in reinforcement learning: A transfer learning application

(4) Celiberto Jr., Luiz A. (a); Matsuura, Jackson P. (a); De Mantaras, Ramon Lopez (b); Bianchi, Reinaldo A. C. (c)

a INSTITUTO TECNOLÓGICO DE AERONÁUTICA (Brazil)

b UNIVERSITAT AUTÒNOMA DE BARCELONA (Spain)

c CENTRO UNIVERSITÁRIO DA FEI (Brazil)

Author keywords

[No Author keywords available]

Indexed keywords

AI TECHNIQUES; EMPIRICAL EVALUATIONS; HUMANOID ROBOT; LEARNING PERFORMANCE; OPTIMAL POLICIES; THIRD PHASE; TRANSFER LEARNING; TWO DOMAINS;

ANTHROPOMORPHIC ROBOTS; COMPUTER AIDED INSTRUCTION; LEARNING ALGORITHMS; NEURAL NETWORKS; THREE DIMENSIONAL;

REINFORCEMENT LEARNING;

EID: 84871606206; PISSN: 10450823; EISSN: None; Source Type: Conference Proceeding
DOI: 10.5591/978-1-57735-516-8/IJCAI11-206; Document Type: Conference Paper

Times cited: 26

References (24)

1
- 70350350697
- Case-based reasoning in transfer learning
- Lorraine McGinty and David C. Wilson, editors, 8th International Conference on Case-Based Reasoning. Springer
- David W. Aha, Matthew Molineaux, and Gita Sukthankar. Case-based reasoning in transfer learning. In Lorraine McGinty and David C. Wilson, editors, 8th International Conference on Case-Based Reasoning, volume 5650 of Lecture Notes in Computer Science, pages 29-44. Springer, 2009.
- (2009) Lecture Notes in Computer Science , vol.5650 , pp. 29-44
- Aha, D.W.¹ Molineaux, M.² Sukthankar, G.³

2
- 52149099878
- Recognizing the enemy: Combining reinforcement learning with strategy selection using case-based reasoning
- Klaus-Dieter Althoff, Ralph Bergmann, Mirjam Minor, and Alexandre Hanft, editors, 9th European Conference on Case-Based Reasoning, Springer
- Bryan Auslander, Stephen Lee-Urban, Chad Hogg, and Héctor Muñoz-Avila. Recognizing the enemy: Combining reinforcement learning with strategy selection using case-based reasoning. In Klaus-Dieter Althoff, Ralph Bergmann, Mirjam Minor, and Alexandre Hanft, editors, 9th European Conference on Case-Based Reasoning, volume 5239 of Lecture Notes in Computer Science, pages 59-73. Springer, 2008.
- (2008) Lecture Notes in Computer Science , vol.5239 , pp. 59-73
- Auslander, B.¹ Lee-Urban, S.² Hogg, C.³ Muñoz-Avila, H.⁴

3
- 84880904080
- General game learning using knowledge transfer
- Manuela M. Veloso, editor, AAAI Press
- Bikramjit Banerjee and Peter Stone. General game learning using knowledge transfer. In Manuela M. Veloso, editor, Proceedings of the 20th International Joint Conference on Artificial Intelligence, pages 672-677. AAAI Press, 2007.
- (2007) Proceedings of the 20th International Joint Conference on Artificial Intelligence , pp. 672-677
- Banerjee, B.¹ Stone, P.²

4
- 33751369840
- Heuristically Accelerated Q-learning: A new approach to speed up reinforcement learning
- Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, and Anna H. R. Costa. Heuristically Accelerated Q-learning: a new approach to speed up reinforcement learning. Lecture Notes in Artificial Intelligence, 3171:245-254, 2004.
- (2004) Lecture Notes in Artificial Intelligence , vol.3171 , pp. 245-254
- Bianchi, R.A.C.¹ Ribeiro, C.H.C.² Costa, A.H.R.³

5
- 41249102188
- Accelerating autonomous learning by using heuristic selection of actions
- Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, and Anna H. R. Costa. Accelerating autonomous learning by using heuristic selection of actions. Journal of Heuristics, 14(2):135-168, 2008.
- (2008) Journal of Heuristics , vol.14 , Issue.2 , pp. 135-168
- Bianchi, R.A.C.¹ Ribeiro, C.H.C.² Costa, A.H.R.³

6
- 70350352555
- Improving reinforcement learning by using case based heuristics
- Lorraine McGinty and David C. Wilson, editors, 8th International Conference on Case-Based Reasoning
- Reinaldo A. C. Bianchi, Raquel Ros, and Ramon López de Mántaras. Improving reinforcement learning by using case based heuristics. In Lorraine McGinty and David C. Wilson, editors, 8th International Conference on Case-Based Reasoning, volume 5650 of Lecture Notes in Computer Science, pages 75-89. Springer, 2009.
- (2009) Lecture Notes in Computer Science , vol.5650 , pp. 75-89
- Bianchi, R.A.C.¹ Ros, R.² López De Mántaras, R.³

7
- 84881081224
- Joschka Boedecker, Klaus Dorer, Markus Rollmann, Yuan Xu, Feng Xue, Marian Buchta, and Hedayat Vatankhah. Spark 3D Simulation System. 2010.
- (2010) Spark 3D Simulation System
- Boedecker, J.¹ Dorer, K.² Rollmann, M.³ Xu, Y.⁴ Xue, F.⁵ Buchta, M.⁶ Vatankhah, H.⁷

8
- 33646537794
- Retrieval, reuse, revision and retention in case-based reasoning
- Ramon López de Mántaras, David McSherry, Derek Bridge, David Leake, Barry Smyth, Susan Craw, Boi Faltings, Mary Lou Maher, Michael T. Cox, Kenneth Forbus, Mark Keane, Agnar Aamodt, and Ian Watson. Retrieval, reuse, revision and retention in case-based reasoning. Knowl. Eng. Rev., 20(3):215-240, 2005.
- (2005) Knowl. Eng. Rev. , vol.20 , Issue.3 , pp. 215-240
- López De Mántaras, R.¹ McSherry, D.² Bridge, D.³ Leake, D.⁴ Smyth, B.⁵ Craw, S.⁶ Faltings, B.⁷ Maher, M.L.⁸ Cox, M.T.⁹ Forbus, K.¹⁰ Keane, M.¹¹ Aamodt, A.¹² Watson, I.¹³

9
- 0043247546
- Accelerating reinforcement learning by composing solutions of automatically identified subtasks
- Chris Drummond. Accelerating reinforcement learning by composing solutions of automatically identified subtasks. Journal of Artificial Intelligence Research, 16:59-104, 2002.
- (2002) Journal of Artificial Intelligence Research , vol.16 , pp. 59-104
- Drummond, C.¹

10
- 34247199512
- Probabilistic policy reuse in a reinforcement learning agent
- Hideyuki Nakashima, Michael P. Wellman, Gerhard Weiss and Peter Stone, editors, ACM
- Fernando Fernández and Manuela Veloso. Probabilistic policy reuse in a reinforcement learning agent. In Hideyuki Nakashima, Michael P. Wellman, Gerhard Weiss and Peter Stone, editors, Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems, pages 720-727. ACM, 2006.
- (2006) Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems , pp. 720-727
- Fernández, F.¹ Veloso, M.²

11
- 26944491842
- CBR for state value function approximation in reinforcement learning
- Héctor Muñoz-Avila and Francesco Ricci, editors, 6th International Conference on Case-Based Reasoning, Springer
- Thomas Gabel and Martin A. Riedmiller. CBR for state value function approximation in reinforcement learning. In Héctor Muñoz-Avila and Francesco Ricci, editors, 6th International Conference on Case-Based Reasoning, volume 3620 of Lecture Notes in Computer Science, pages 206-221. Springer, 2005.
- (2005) Lecture Notes in Computer Science , vol.3620 , pp. 206-221
- Gabel, T.¹ Riedmiller, M.A.²

12
- 0141829831
- Using reinforcement learning for similarity assessment in case-based systems
- Paul Juell and Patrick Paulson. Using reinforcement learning for similarity assessment in case-based systems. IEEE Intelligent Systems, 18(4):60-67, 2003.
- (2003) IEEE Intelligent Systems , vol.18 , Issue.4 , pp. 60-67
- Juell, P.¹ Paulson, P.²

13
- 56049125072
- Transfer of samples in batch reinforcement learning
- William W. Cohen, Andrew McCallum and Sam T. Roweis, editors, ACM
- Alessandro Lazaric, Marcello Restelli, and Andrea Bonarini. Transfer of samples in batch reinforcement learning. In William W. Cohen, Andrew McCallum and Sam T. Roweis, editors, 25th International Conference on Machine Learning, pages 544-551. ACM, 2008.
- (2008) 25th International Conference on Machine Learning , pp. 544-551
- Lazaric, A.¹ Restelli, M.² Bonarini, A.³

14
- 0036951208
- A case-based reinforcement learning for probe robot path planning
- Yang Li, Chen Zonghai, and Chen Feng. A case-based reinforcement learning for probe robot path planning. In 4th World Congress on Intelligent Control and Automation, pages 1161-1165. 2002.
- (2002) 4th World Congress on Intelligent Control and Automation , pp. 1161-1165
- Li, Y.¹ Zonghai, C.² Feng, C.³

15
- 64549151983
- A case-based approach for coordinated action selection in robot soccer
- Raquel Ros, Josep Lluis Arcos, Ramon López de Mántaras, and Manuela Veloso. A case-based approach for coordinated action selection in robot soccer. Artificial Intelligence, 173(9-10):1014-1039, 2009.
- (2009) Artificial Intelligence , vol.173 , Issue.9-10 , pp. 1014-1039
- Ros, R.¹ Arcos, J.L.² López De Mántaras, R.³ Veloso, M.⁴

16
- 80054035256
- Transfer learning in real-time strategy games using hybrid CBR/RL
- Manuela M. Veloso, editor, AAAI Press
- Manu Sharma, Michael Holmes, Juan Carlos Santamaría, Arya Irani, Charles Lee Isbell Jr., and Ashwin Ram. Transfer learning in real-time strategy games using hybrid CBR/RL. In Manuela M. Veloso, editor, Proceedings of the 20th International Joint Conference on Artificial Intelligence, pages 1041-1046. AAAI Press, 2007.
- (2007) Proceedings of the 20th International Joint Conference on Artificial Intelligence , pp. 1041-1046
- Sharma, M.¹ Holmes, M.² Santamaría, J.C.³ Irani, A.⁴ Isbell Jr., C.L.⁵ Ram, A.⁶

17
- 33750690679
- Using homomorphisms to transfer options across continuous reinforcement learning domains
- AAAI Press
- Vishal Soni and Satinder Singh. Using homomorphisms to transfer options across continuous reinforcement learning domains. In Proceedings of the 21st National Conference on Artificial Intelligence, volume 1, pages 494-499. AAAI Press, 2006.
- (2006) Proceedings of the 21st National Conference on Artificial Intelligence , vol.1 , pp. 494-499
- Soni, V.¹ Singh, S.²

18
- 0004102479
- MIT Press, Cambridge, MA
- R. S. Sutton and A. G. Barto. Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

19
- 68949157375
- Transfer learning for reinforcement learning domains: A survey
- Matthew E. Taylor and Peter Stone. Transfer learning for reinforcement learning domains: A survey. Journal of Machine Learning Research, 10(1):1633-1685, 2009.
- (2009) Journal of Machine Learning Research , vol.10 , Issue.1 , pp. 1633-1685
- Taylor, M.E.¹ Stone, P.²

20
- 56049086452
- Transferring instances for model-based reinforcement learning
- Walter Daelemans, Bart Goethals and Katharina Morik, editors, 19th European Conference on Machine Learning, Springer
- Matthew E. Taylor, Nicholas K. Jong, and Peter Stone. Transferring instances for model-based reinforcement learning. In Walter Daelemans, Bart Goethals and Katharina Morik, editors, 19th European Conference on Machine Learning, volume 5212 of Lecture Notes in Artificial Intelligence, pages 488-505. Springer, 2008.
- (2008) Lecture Notes in Artificial Intelligence , vol.5212 , pp. 488-505
- Taylor, M.E.¹ Jong, N.K.² Stone, P.³

21
- 33751551663
- The influence of improvement in one mental function upon the efficiency of other functions
- E. L. Thorndike and R. S. Woodworth. The influence of improvement in one mental function upon the efficiency of other functions. Psychological Review, 8:247-261, 1901.
- (1901) Psychological Review , vol.8 , pp. 247-261
- Thorndike, E.L.¹ Woodworth, R.S.²

22
- 33646413134
- Using advice to transfer knowledge acquired in one reinforcement learning task to another
- João Gama, Rui Camacho, Alípio Jorge, and Luís Torgo, editors, 16th European Conference on Machine Learning, Springer
- Lisa Torrey, Trevor Walker, Jude W. Shavlik, and Richard Maclin. Using advice to transfer knowledge acquired in one reinforcement learning task to another. In João Gama, Rui Camacho, Alípio Jorge, and Luís Torgo, editors, 16th European Conference on Machine Learning, volume 3720 of Lecture Notes in Computer Science, pages 412-424. Springer, 2005.
- (2005) Lecture Notes in Computer Science , vol.3720 , pp. 412-424
- Torrey, L.¹ Walker, T.² Shavlik, J.W.³ Maclin, R.⁴

23
- 33846010083
- Abstracting reusable cases from reinforcement learning
- Stefanie Brüninghaus, editor
- Andreas von Hessling and Ashok K. Goel. Abstracting reusable cases from reinforcement learning. In Stefanie Brüninghaus, editor, 6th International Conference on Case-Based Reasoning, Workshop Proceedings, pages 227-236, 2005.
- (2005) 6th International Conference on Case-Based Reasoning, Workshop Proceedings , pp. 227-236
- Von Hessling, A.¹ Goel, A.K.²

24
- 0004049893
- PhD thesis, University of Cambridge
- Christopher J. C. H. Watkins. Learning from Delayed Rewards. PhD thesis, University of Cambridge, 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.J.C.H.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.