Volume 1, 2012, Pages 528-535

Reinforcement learning from simultaneous human and MDP reward

Author keywords

Human teachers; Human agent interaction; Interactive learning; Reinforcement learning; Shaping

Indexed keywords

AUTONOMOUS AGENTS; LEARNING ALGORITHMS; MARKOV PROCESSES; MULTI AGENT SYSTEMS; REINFORCEMENT LEARNING

EID: 84899434166     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 111

References (20)
  • 1
    • 84899465277
    • Heuristically accelerated Q-learning: A new approach to speed up reinforcement learning
    • R. Bianchi, C. Ribeiro, and A. Costa. Heuristically Accelerated Q-Learning: a new approach to speed up Reinforcement Learning. Advances in AI - SBIA, 2004.
    • (2004) Advances in AI - SBIA
    • Bianchi, R.1    Ribeiro, C.2    Costa, A.3
  • 2
  • 3
    • 0028739953
    • Robot shaping: Developing situated agents through learning
    • M. Dorigo and M. Colombetti. Robot shaping: Developing situated agents through learning. Artificial Intelligence, 1994.
    • (1994) Artificial Intelligence
    • Dorigo, M.1    Colombetti, M.2
  • 4
    • 77955023200
    • Probabilistic policy reuse in a reinforcement learning agent
    • F. Fernández and M. Veloso. Probabilistic policy reuse in a reinforcement learning agent. AAMAS, 2006.
    • (2006) AAMAS
    • Fernández, F.1    Veloso, M.2
  • 6
    • 80053038345
    • Reinforcement learning via practice and critique advice
    • K. Judah, S. Roy, A. Fern, and T. Dietterich. Reinforcement Learning Via Practice and Critique Advice. AAAI, 2010.
    • (2010) AAAI
    • Judah, K.1    Roy, S.2    Fern, A.3    Dietterich, T.4
  • 7
    • 70449629717
    • Interactively shaping agents via human reinforcement: The TAMER framework
    • W. Knox and P. Stone. Interactively shaping agents via human reinforcement: The TAMER framework. K-CAP, 2009.
    • (2009) K-CAP
    • Knox, W.1    Stone, P.2
  • 8
    • 84884357468
    • Combining manual feedback with subsequent MDP reward signals for reinforcement learning
    • W. Knox and P. Stone. Combining manual feedback with subsequent MDP reward signals for reinforcement learning. AAMAS, 2010.
    • (2010) AAMAS
    • Knox, W.1    Stone, P.2
  • 10
    • 84957895797
    • Reward functions for accelerated learning
    • M. Mataric. Reward functions for accelerated learning. ICML, 1994.
    • (1994) ICML
    • Mataric, M.1
  • 11
    • 27344432348
    • Accelerating reinforcement learning through implicit imitation
    • B. Price and C. Boutilier. Accelerating reinforcement learning through implicit imitation. JAIR, 19:569-629, 2003.
    • (2003) JAIR, vol.19, pp. 569-629
    • Price, B.1    Boutilier, C.2
  • 12
    • 0001898381
    • Practical reinforcement learning in continuous spaces
    • W. Smart and L. Kaelbling. Practical reinforcement learning in continuous spaces. ICML, 2000.
    • (2000) ICML
    • Smart, W.1    Kaelbling, L.2
  • 16
    • 70449370276
    • RL-glue: Language-independent software for reinforcement-learning experiments
    • B. Tanner and A. White. RL-Glue: Language-independent software for reinforcement-learning experiments. JMLR, 10, 2009.
    • (2009) JMLR, vol.10
    • Tanner, B.1    White, A.2
  • 17
    • 84872375255
    • Integrating reinforcement learning with human demonstrations of varying ability
    • M. Taylor, H. Suay, and S. Chernova. Integrating reinforcement learning with human demonstrations of varying ability. AAMAS, 2011.
    • (2011) AAMAS
    • Taylor, M.1    Suay, H.2    Chernova, S.3
  • 19
    • 70350460438
    • Reinforcement learning with human teachers: Evidence of feedback and guidance with implications for learning performance
    • A. Thomaz and C. Breazeal. Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance. AAAI, 2006.
    • (2006) AAAI
    • Thomaz, A.1    Breazeal, C.2
  • 20
    • 1942484477
    • Principled methods for advising reinforcement learning agents
    • E. Wiewiora, G. Cottrell, and C. Elkan. Principled methods for advising reinforcement learning agents. ICML, 2003.
    • (2003) ICML
    • Wiewiora, E.1    Cottrell, G.2    Elkan, C.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.