Volume , Issue , 2011, Pages 1211-1217

Using cases as heuristics in reinforcement learning: A transfer learning application

Author keywords

[No Author keywords available]

Indexed keywords

AI TECHNIQUES; EMPIRICAL EVALUATIONS; HUMANOID ROBOT; LEARNING PERFORMANCE; OPTIMAL POLICIES; THIRD PHASE; TRANSFER LEARNING; TWO DOMAINS;

EID: 84871606206     PISSN: 10450823     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.5591/978-1-57735-516-8/IJCAI11-206     Document Type: Conference Paper
Times cited : (26)

References (24)
  • 1
    • 70350350697
    • Case-based reasoning in transfer learning
    • Lorraine McGinty and David C. Wilson, editors, 8th International Conference on Case-Based Reasoning. Springer
    • David W. Aha, Matthew Molineaux, and Gita Sukthankar. Case-based reasoning in transfer learning. In Lorraine McGinty and David C. Wilson, editors, 8th International Conference on Case-Based Reasoning, volume 5650 of Lecture Notes in Computer Science, pages 29-44. Springer, 2009.
    • (2009) Lecture Notes in Computer Science , vol.5650 , pp. 29-44
    • Aha, D.W.1    Molineaux, M.2    Sukthankar, G.3
  • 2
    • 52149099878
    • Recognizing the enemy: Combining reinforcement learning with strategy selection using case-based reasoning
    • Klaus-Dieter Althoff, Ralph Bergmann, Mirjam Minor, and Alexandre Hanft, editors, 9th European Conference on Case-Based Reasoning, Springer
    • Bryan Auslander, Stephen Lee-Urban, Chad Hogg, and Héctor Muñoz-Avila. Recognizing the enemy: Combining reinforcement learning with strategy selection using case-based reasoning. In Klaus-Dieter Althoff, Ralph Bergmann, Mirjam Minor, and Alexandre Hanft, editors, 9th European Conference on Case-Based Reasoning, volume 5239 of Lecture Notes in Computer Science, pages 59-73. Springer, 2008.
    • (2008) Lecture Notes in Computer Science , vol.5239 , pp. 59-73
    • Auslander, B.1    Lee-Urban, S.2    Hogg, C.3    Muñoz-Avila, H.4
  • 4
    • 33751369840
    • Heuristically Accelerated Q-learning: A new approach to speed up reinforcement learning
    • Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, and Anna H. R. Costa. Heuristically Accelerated Q-learning: a new approach to speed up reinforcement learning. Lecture Notes in Artificial Intelligence, 3171:245-254, 2004.
    • (2004) Lecture Notes in Artificial Intelligence , vol.3171 , pp. 245-254
    • Bianchi, R.A.C.1    Ribeiro, C.H.C.2    Costa, A.H.R.3
  • 5
    • 41249102188
    • Accelerating autonomous learning by using heuristic selection of actions
    • Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, and Anna H. R. Costa. Accelerating autonomous learning by using heuristic selection of actions. Journal of Heuristics, 14(2):135-168, 2008.
    • (2008) Journal of Heuristics , vol.14 , Issue.2 , pp. 135-168
    • Bianchi, R.A.C.1    Ribeiro, C.H.C.2    Costa, A.H.R.3
  • 6
    • 70350352555
    • Improving reinforcement learning by using case based heuristics
    • Lorraine McGinty and David C. Wilson, editors, 8th International Conference on Case-Based Reasoning
    • Reinaldo A. C. Bianchi, Raquel Ros, and Ramon López de Mántaras. Improving reinforcement learning by using case based heuristics. In Lorraine McGinty and David C. Wilson, editors, 8th International Conference on Case-Based Reasoning, volume 5650 of Lecture Notes in Computer Science, pages 75-89. Springer, 2009.
    • (2009) Lecture Notes in Computer Science , vol.5650 , pp. 75-89
    • Bianchi, R.A.C.1    Ros, R.2    López De Mántaras, R.3
  • 9
    • 0043247546
    • Accelerating reinforcement learning by composing solutions of automatically identified subtasks
    • Chris Drummond. Accelerating reinforcement learning by composing solutions of automatically identified subtasks. Journal of Artificial Intelligence Research, 16:59-104, 2002.
    • (2002) Journal of Artificial Intelligence Research , vol.16 , pp. 59-104
    • Drummond, C.1
  • 10
    • 34247199512
    • Probabilistic policy reuse in a reinforcement learning agent
    • Hideyuki Nakashima, Michael P. Wellman, Gerhard Weiss and Peter Stone, editors, ACM
    • Fernando Fernández and Manuela Veloso. Probabilistic policy reuse in a reinforcement learning agent. In Hideyuki Nakashima, Michael P. Wellman, Gerhard Weiss and Peter Stone, editors, Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems, pages 720-727. ACM, 2006.
    • (2006) Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems , pp. 720-727
    • Fernández, F.1    Veloso, M.2
  • 11
    • 26944491842
    • CBR for state value function approximation in reinforcement learning
    • Héctor Muñoz-Avila and Francesco Ricci, editors, 6th International Conference on Case-Based Reasoning, Springer
    • Thomas Gabel and Martin A. Riedmiller. CBR for state value function approximation in reinforcement learning. In Héctor Muñoz-Avila and Francesco Ricci, editors, 6th International Conference on Case-Based Reasoning, volume 3620 of Lecture Notes in Computer Science, pages 206-221. Springer, 2005.
    • (2005) Lecture Notes in Computer Science , vol.3620 , pp. 206-221
    • Gabel, T.1    Riedmiller, M.A.2
  • 12
    • 0141829831
    • Using reinforcement learning for similarity assessment in case-based systems
    • Paul Juell and Patrick Paulson. Using reinforcement learning for similarity assessment in case-based systems. IEEE Intelligent Systems, 18(4):60-67, 2003.
    • (2003) IEEE Intelligent Systems , vol.18 , Issue.4 , pp. 60-67
    • Juell, P.1    Paulson, P.2
  • 13
    • 56049125072
    • Transfer of samples in batch reinforcement learning
    • William W. Cohen, Andrew McCallum and Sam T. Roweis, editors, ACM
    • Alessandro Lazaric, Marcello Restelli, and Andrea Bonarini. Transfer of samples in batch reinforcement learning. In William W. Cohen, Andrew McCallum and Sam T. Roweis, editors, 25th International Conference on Machine Learning, pages 544-551. ACM, 2008.
    • (2008) 25th International Conference on Machine Learning , pp. 544-551
    • Lazaric, A.1    Restelli, M.2    Bonarini, A.3
  • 15
    • 64549151983
    • A case-based approach for coordinated action selection in robot soccer
    • Raquel Ros, Josep Lluis Arcos, Ramon López de Mántaras, and Manuela Veloso. A case-based approach for coordinated action selection in robot soccer. Artificial Intelligence, 173(9-10):1014-1039, 2009.
    • (2009) Artificial Intelligence , vol.173 , Issue.9-10 , pp. 1014-1039
    • Ros, R.1    Arcos, J.L.2    López De Mántaras, R.3    Veloso, M.4
  • 17
    • 33750690679
    • Using homomorphisms to transfer options across continuous reinforcement learning domains
    • AAAI Press
    • Vishal Soni and Satinder Singh. Using homomorphisms to transfer options across continuous reinforcement learning domains. In Proceedings of the 21st National Conference on Artificial Intelligence, volume 1, pages 494-499. AAAI Press, 2006.
    • (2006) Proceedings of the 21st National Conference on Artificial Intelligence , vol.1 , pp. 494-499
    • Soni, V.1    Singh, S.2
  • 19
    • 68949157375
    • Transfer learning for reinforcement learning domains: A survey
    • Matthew E. Taylor and Peter Stone. Transfer learning for reinforcement learning domains: A survey. Journal of Machine Learning Research, 10(1):1633-1685, 2009.
    • (2009) Journal of Machine Learning Research , vol.10 , Issue.1 , pp. 1633-1685
    • Taylor, M.E.1    Stone, P.2
  • 20
    • 56049086452
    • Transferring instances for model-based reinforcement learning
    • Walter Daelemans, Bart Goethals and Katharina Morik, editors, 19th European Conference on Machine Learning, Springer
    • Matthew E. Taylor, Nicholas K. Jong, and Peter Stone. Transferring instances for model-based reinforcement learning. In Walter Daelemans, Bart Goethals and Katharina Morik, editors, 19th European Conference on Machine Learning, volume 5212 of Lecture Notes in Artificial Intelligence, pages 488-505. Springer, 2008.
    • (2008) Lecture Notes in Artificial Intelligence , vol.5212 , pp. 488-505
    • Taylor, M.E.1    Jong, N.K.2    Stone, P.3
  • 21
    • 33751551663
    • The influence of improvement in one mental function upon the efficiency of other functions
    • E. L. Thorndike and R. S. Woodworth. The influence of improvement in one mental function upon the efficiency of other functions. Psychological Review, 8:247-261, 1901.
    • (1901) Psychological Review , vol.8 , pp. 247-261
    • Thorndike, E.L.1    Woodworth, R.S.2
  • 22
    • 33646413134
    • Using advice to transfer knowledge acquired in one reinforcement learning task to another
    • João Gama, Rui Camacho, Alípio Jorge, and Luís Torgo, editors, 16th European Conference on Machine Learning, Springer
    • Lisa Torrey, Trevor Walker, Jude W. Shavlik, and Richard Maclin. Using advice to transfer knowledge acquired in one reinforcement learning task to another. In João Gama, Rui Camacho, Alípio Jorge, and Luís Torgo, editors, 16th European Conference on Machine Learning, volume 3720 of Lecture Notes in Computer Science, pages 412-424. Springer, 2005.
    • (2005) Lecture Notes in Computer Science , vol.3720 , pp. 412-424
    • Torrey, L.1    Walker, T.2    Shavlik, J.W.3    Maclin, R.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.