메뉴 건너뛰기




Volumn 148, Issue , 2006, Pages 489-496

Autonomous shaping: Knowledge transfer in reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

AUTONOMOUS AGENTS; PREDICTIVE CONTROL SYSTEMS; REINFORCEMENT LEARNING;

EID: 34250719248     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1143844.1143906     Document Type: Conference Paper
Times cited : (94)

References (24)
  • 1
    • 0029210635 scopus 로고
    • Learning to act using real-time dynamic programming
    • Barto, A., Bradtke, S., & Singh, S. (1995). Learning to act using real-time dynamic programming. Artificial Intelligence, 72, 81-138.
    • (1995) Artificial Intelligence , vol.72 , pp. 81-138
    • Barto, A.1    Bradtke, S.2    Singh, S.3
  • 2
    • 33749244036 scopus 로고    scopus 로고
    • Reusing old policies to accelerate learning on new MDPs
    • UM-CS-1999-026, Department of Computer Science, University of Massachusetts at Amherst
    • Bernstein, D. (1999). Reusing old policies to accelerate learning on new MDPs (Technical Report UM-CS-1999-026). Department of Computer Science, University of Massachusetts at Amherst.
    • (1999) Technical Report
    • Bernstein, D.1
  • 4
    • 0002479021 scopus 로고    scopus 로고
    • Exploring unknown environments with real-time search or reinforcement learning
    • Koenig, S. (1999). Exploring unknown environments with real-time search or reinforcement learning. Advances in Neural Information Processing Systems (NIPS) 12 (pp. 1003-1009).
    • (1999) Advances in Neural Information Processing Systems (NIPS) , vol.12 , pp. 1003-1009
    • Koenig, S.1
  • 5
    • 0029751419 scopus 로고    scopus 로고
    • The effect of representation and knowledge on goal-directed exploration with reinforcement-learning algorithms
    • Koenig, S., & Simmons, R. (1996). The effect of representation and knowledge on goal-directed exploration with reinforcement-learning algorithms. Machine Learning, 22, 227-250.
    • (1996) Machine Learning , vol.22 , pp. 227-250
    • Koenig, S.1    Simmons, R.2
  • 7
    • 0025400088 scopus 로고
    • Real-time heuristic search
    • Korf, R. (1990). Real-time heuristic search. Artificial Intelligence, 42, 189-211.
    • (1990) Artificial Intelligence , vol.42 , pp. 189-211
    • Korf, R.1
  • 9
    • 0030647149 scopus 로고    scopus 로고
    • Reinforcement learning in the multi-robot domain
    • Matarić, M. (1997). Reinforcement learning in the multi-robot domain. Autonomous Robots, 4, 73-83.
    • (1997) Autonomous Robots , vol.4 , pp. 73-83
    • Matarić, M.1
  • 10
    • 0027684215 scopus 로고
    • Prioritized sweeping: Reinforcement learning with less data and less time
    • Moore, A., & Atkeson, C. (1993). Prioritized sweeping: Reinforcement learning with less data and less time. Machine Learning, 13, 103-130.
    • (1993) Machine Learning , vol.13 , pp. 103-130
    • Moore, A.1    Atkeson, C.2
  • 20
    • 0003411271 scopus 로고
    • Efficient exploration in reinforcement learning
    • CS-92-102, Carnegie Mellon University
    • Thrun, S. (1992). Efficient exploration in reinforcement learning (Technical Report CS-92-102). Carnegie Mellon University.
    • (1992) Technical Report
    • Thrun, S.1
  • 24
    • 27344453198 scopus 로고    scopus 로고
    • Potential-based shaping and Q-value initialization are equivalent
    • Wiewiora, E. (2003). Potential-based shaping and Q-value initialization are equivalent. Journal of Artificial Intelligence Research, 19, 205-208.
    • (2003) Journal of Artificial Intelligence Research , vol.19 , pp. 205-208
    • Wiewiora, E.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.