메뉴 건너뛰기




Volumn , Issue , 2007, Pages 2586-2591

Bayesian inverse reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

APPRENTICESHIP LEARNING; BAYESIAN; INVERSE REINFORCEMENT LEARNING; LEARNING POLICY; MARKOV DECISION PROCESSES; PREFERENCE ELICITATION; PRIOR KNOWLEDGE; REWARD FUNCTION;

EID: 77956052826     PISSN: 10450823     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (661)

References (14)
  • 1
    • 14344251217 scopus 로고    scopus 로고
    • Apprenticeship learning via inverse reinforcement learning
    • P. Abbeel and A. Y. Ng. Apprenticeship learning via inverse reinforcement learning. In ICML, 2004.
    • (2004) ICML
    • Abbeel, P.1    Ng, A.Y.2
  • 2
    • 84880884776 scopus 로고
    • Sampling and integration of near log-concave functions
    • D. Applegate and R. Kannan. Sampling and integration of near log-concave functions. In STOC, 1993.
    • (1993) STOC
    • Applegate, D.1    Kannan, R.2
  • 3
    • 0002130986 scopus 로고    scopus 로고
    • Robot learning from demonstration
    • C. Atkeson and S. Schaal. Robot learning from demonstration. In ICML, 1997.
    • (1997) ICML
    • Atkeson, C.1    Schaal, S.2
  • 5
    • 0031635794 scopus 로고    scopus 로고
    • Opponent modeling in Poker
    • Madison, WI, AAAI Press
    • D. Billings, D. Papp, J. Schaeffer, and D. Szafron. Opponent modeling in Poker. In AAAI, pages 493-498, Madison, WI, 1998. AAAI Press.
    • (1998) AAAI , pp. 493-498
    • Billings, D.1    Papp, D.2    Schaeffer, J.3    Szafron, D.4
  • 6
    • 0037856528 scopus 로고
    • Linear matrix inequalities in system and control theory
    • S. Boyd, L. E. Ghaoui, E. Feron, and V. Balakrishnan. Linear matrix inequalities in system and control theory. SIAM, 1994.
    • (1994) SIAM
    • Boyd, S.1    Ghaoui, L.E.2    Feron, E.3    Balakrishnan, V.4
  • 7
    • 0040628914 scopus 로고
    • An introduction to the Ising model
    • B. A. Cipra. An introduction to the Ising model. Am. Math. Monthly, 94(10):937-959, 1987.
    • (1987) Am. Math. Monthly , vol.94 , Issue.10 , pp. 937-959
    • Cipra, B.A.1
  • 8
    • 0042547347 scopus 로고    scopus 로고
    • Algorithms for inverse reinforcement learning
    • Morgan Kaufmann, San Francisco, CA
    • A. Y. Ng and S. Russell. Algorithms for inverse reinforcement learning. In ICML, pages 663-670. Morgan Kaufmann, San Francisco, CA, 2000.
    • (2000) ICML , pp. 663-670
    • Ng, A.Y.1    Russell, S.2
  • 9
    • 84880768440 scopus 로고    scopus 로고
    • A Bayesian approach to imitation in reinforcement learning
    • B. Price and C. Boutilier. A Bayesian approach to imitation in reinforcement learning. In IJCAI, 2003.
    • (2003) IJCAI
    • Price, B.1    Boutilier, C.2
  • 10
    • 84860604462 scopus 로고    scopus 로고
    • Learning agents for uncertain environments (extended abstract)
    • ACM Press
    • S. Russell. Learning agents for uncertain environments (extended abstract). In COLT. ACM Press, 1998.
    • (1998) COLT
    • Russell, S.1
  • 11
    • 84880892644 scopus 로고
    • Do people behave according to Bellman's principle of optimality?
    • T J Sargent. Do people behave according to Bellman's principle of optimality? JEP, 1994.
    • (1994) JEP
    • Sargent, T.J.1
  • 13
    • 33646717660 scopus 로고    scopus 로고
    • Inverse Problem Theory and Methods for Model Parameter Estimation
    • 2nd edition
    • A. Tarantola. Inverse Problem Theory and Methods for Model Parameter Estimation. SIAM, 2nd edition, 2005.
    • (2005) SIAM
    • Tarantola, A.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.