Volume 5650 LNAI, 2009, Pages 75-89

Improving reinforcement learning by using case based heuristics

Author keywords

[No Author keywords available]

Indexed keywords

CASE BASE; CASE BASED; CBR; EMPIRICAL EVALUATIONS; HEURISTIC FUNCTIONS; HEURISTIC INFORMATION; NEW APPROACHES; Q-LEARNING; REINFORCEMENT LEARNING TECHNIQUES; ROBOCUP; SPEED-UPS;

EID: 70350352555     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-02998-1_7     Document Type: Conference Paper
Times cited : (38)

References (28)
  • 1
    • 0028401306
    • Case-based reasoning: Foundational issues, methodological variations, and system approaches
    • Aamodt, A., Plaza, E.: Case-based reasoning: foundational issues, methodological variations, and system approaches. AI Commun. 7(1), 39-59 (1994)
    • (1994) AI Commun , vol.7 , Issue.1 , pp. 39-59
    • Aamodt, A.1    Plaza, E.2
  • 6
    • 0003629453
    • Generalized markov decision processes: Dynamic-programming and reinforcement-learning algorithms
    • Technical report, Brown University, CS-96-11
    • Szepesvári, C., Littman, M.L.: Generalized markov decision processes: Dynamic-programming and reinforcement-learning algorithms. Technical report, Brown University, CS-96-11 (1996)
    • (1996)
    • Szepesvári, C.1    Littman, M.L.2
  • 8
    • 41249102188
    • Accelerating autonomous learning by using heuristic selection of actions
    • Bianchi, R.A.C., Ribeiro, C.H.C., Costa, A.H.R.: Accelerating autonomous learning by using heuristic selection of actions. Journal of Heuristics 14(2), 135-168 (2008)
    • (2008) Journal of Heuristics , vol.14 , Issue.2 , pp. 135-168
    • Bianchi, R.A.C.1    Ribeiro, C.H.C.2    Costa, A.H.R.3
  • 10
    • 70350415252
    • RoboCup Technical Committee: homepage 2009
    • RoboCup Technical Committee: Standard platform league homepage (2009), http://www.tzi.de/spl
    • Standard platform league
  • 12
    • 50249177133
    • Heuristic reinforcement learning applied to robocup simulation agents
    • Visser, U, Ribeiro, F, Ohashi, T, Dellaert, F, eds, RoboCup 2007: Robot Soccer World Cup XI, Springer, Heidelberg
    • Celiberto, L.A., Ribeiro, C.H.C., Costa, A.H.R., Bianchi, R.A.C.: Heuristic reinforcement learning applied to robocup simulation agents. In: Visser, U., Ribeiro, F., Ohashi, T., Dellaert, F. (eds.) RoboCup 2007: Robot Soccer World Cup XI. LNCS, vol. 5001, pp. 220-227. Springer, Heidelberg (2008)
    • (2008) LNCS , vol.5001 , pp. 220-227
    • Celiberto, L.A.1    Ribeiro, C.H.C.2    Costa, A.H.R.3    Bianchi, R.A.C.4
  • 13
    • 64549151983
    • A case-based approach for coordinated action selection in robot soccer
    • Ros, R., Arcos, J.L., de Mantaras, R.L., Veloso, M.: A case-based approach for coordinated action selection in robot soccer. Artificial Intelligence 173(9-10), 1014-1039 (2009)
    • (2009) Artificial Intelligence , vol.173 , Issue.9-10 , pp. 1014-1039
    • Ros, R.1    Arcos, J.L.2    de Mantaras, R.L.3    Veloso, M.4
  • 14
    • 38049029338
    • Team playing behavior in robot soccer: A case-based approach
    • Weber, R.O, Richter, M.M, eds, ICCBR 2007, Springer, Heidelberg
    • Ros, R., de Mántaras, R.L., Arcos, J.L., Veloso, M.: Team playing behavior in robot soccer: A case-based approach. In: Weber, R.O., Richter, M.M. (eds.) ICCBR 2007. LNCS (LNAI), vol. 4626, pp. 46-60. Springer, Heidelberg (2007)
    • (2007) LNCS (LNAI) , vol.4626 , pp. 46-60
    • Ros, R.1    de Mántaras, R.L.2    Arcos, J.L.3    Veloso, M.4
  • 18
    • 1642357599
    • Ahmadi, M., Lamjiri, A.K., Nevisi, M.M., Habibi, J., Badie, K.: Using a two-layered case-based reasoning for prediction in soccer coach. In: Arabnia, H.R., Kozerenko, E.B. (eds.) MLMTA, pp. 181-185. CSREA Press (2003)
  • 19
    • 84943263781
    • Karol, A., Nebel, B., Stanton, C., Williams, M.A.: Case based game play in the robocup four-legged league part i the theoretical model. In: Polani, D., Browning, B., Bonarini, A., Yoshida, K. (eds.) RoboCup 2003. LNCS, vol. 3020, pp. 739-747. Springer, Heidelberg (2004)
  • 20
    • 0043247546
    • Accelerating reinforcement learning by composing solutions of automatically identified subtasks
    • Drummond, C.: Accelerating reinforcement learning by composing solutions of automatically identified subtasks. Journal of Artificial Intelligence Research 16, 59-104 (2002)
    • (2002) Journal of Artificial Intelligence Research , vol.16 , pp. 59-104
    • Drummond, C.1
  • 22
    • 26944491842
    • CBR for state value function approximation in reinforcement learning
    • Muñoz-Avila, H, Ricci, F, eds, ICCBR 2005, Springer, Heidelberg
    • Gabel, T., Riedmiller, M.A.: CBR for state value function approximation in reinforcement learning. In: Muñoz-Avila, H., Ricci, F. (eds.) ICCBR 2005. LNCS, vol. 3620, pp. 206-221. Springer, Heidelberg (2005)
    • (2005) LNCS , vol.3620 , pp. 206-221
    • Gabel, T.1    Riedmiller, M.A.2
  • 23
    • 0141829831
    • Using reinforcement learning for similarity assessment in case-based systems
    • Juell, P., Paulson, P.: Using reinforcement learning for similarity assessment in case-based systems. IEEE Intelligent Systems 18(4), 60-67 (2003)
    • (2003) IEEE Intelligent Systems , vol.18 , Issue.4 , pp. 60-67
    • Juell, P.1    Paulson, P.2
  • 24
    • 52149099878
    • Recognizing the enemy: Combining reinforcement learning with strategy selection using case-based reasoning
    • Althoff, K.D, Bergmann, R, Minor, M, Hanft, A, eds, ECCBR 2008, Springer, Heidelberg
    • Auslander, B., Lee-Urban, S., Hogg, C., Muñoz-Avila, H.: Recognizing the enemy: Combining reinforcement learning with strategy selection using case-based reasoning. In: Althoff, K.D., Bergmann, R., Minor, M., Hanft, A. (eds.) ECCBR 2008. LNCS, vol. 5239, pp. 59-73. Springer, Heidelberg (2008)
    • (2008) LNCS , vol.5239 , pp. 59-73
    • Auslander, B.1    Lee-Urban, S.2    Hogg, C.3    Muñoz-Avila, H.4
  • 26
    • 33846010083
    • Abstracting reusable cases from reinforcement learning
    • Brüninghaus, S, ed
    • von Hessling, A., Goel, A.K.: Abstracting reusable cases from reinforcement learning. In: Brüninghaus, S. (ed.) ICCBR Workshops, pp. 227-236 (2005)
    • (2005) ICCBR Workshops , pp. 227-236
    • von Hessling, A.1    Goel, A.K.2
  • 27
    • 70350415251
    • Veloso, M., Rybski, P.E., Chernova, S., McMillen, C., Fasola, J., von Hundelshausen, F., Vail, D., Trevor, A., Hauert, S., Ros, R.: Cmdash 2005: Team report. Technical report, School of Computer Science, Carnegie Mellon University (2005)


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.