메뉴 건너뛰기




Volumn , Issue , 2018, Pages 3199-3206

Optiongan: Learning joint reward-policy options using generative adversarial inverse reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; INVERSE PROBLEMS;

EID: 85060430951     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (73)

References (32)
  • 3
    • 85030457046 scopus 로고    scopus 로고
    • The option-critic architecture
    • Bacon, P.-L.; Harb, J.; and Precup, D. 2017. The option-critic architecture. In AAAI, 1726-1734.
    • (2017) AAAI , pp. 1726-1734
    • Bacon, P.-L.1    Harb, J.2    Precup, D.3
  • 6
    • 84877772241 scopus 로고    scopus 로고
    • Nonparametric Bayesian inverse reinforcement learning for multiple reward functions
    • Pereira, F.; Burges, C. J. C.; Bottou, L.; and Wein-berger, K. Q., eds, Curran Associates, Inc
    • Choi, J., and eung Kim, K. 2012. Nonparametric bayesian inverse reinforcement learning for multiple reward functions. In Pereira, F.; Burges, C. J. C.; Bottou, L.; and Wein-berger, K. Q., eds., Advances in Neural Information Processing Systems 25. Curran Associates, Inc. 305-313.
    • (2012) Advances in Neural Information Processing Systems , vol.25 , pp. 305-313
    • Choi, J.1    Eung Kim, K.2
  • 10
    • 0002278788 scopus 로고    scopus 로고
    • Hierarchical reinforcement learning with the MAXQ value function decomposition
    • Dietterich, T. G. 2000. Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research 13:227-303.
    • (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
    • Dietterich, T.G.1
  • 28
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • Sutton, R. S.; Precup, D.; and Singh, S. 1999. between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial intelligence 112(1-2):181-211.
    • (1999) Artificial Intelligence , vol.112 , Issue.1-2 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.