메뉴 건너뛰기




Volumn 6913 LNAI, Issue PART 3, 2011, Pages 34-48

Preference elicitation and inverse reinforcement learning

Author keywords

Bayesian inference; decision theory; Inverse reinforcement learning; preference elicitation

Indexed keywords

BAYESIAN INFERENCE; INVERSE REINFORCEMENT LEARNING; POSTERIOR DISTRIBUTIONS; PREFERENCE ELICITATION; STATISTICAL FORMULATION;

EID: 80052420104     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-23808-6_3     Document Type: Conference Paper
Times cited : (62)

References (22)
  • 2
    • 85162048506 scopus 로고    scopus 로고
    • Gaussian process preference elicitation
    • Bonilla, E.V., Guo, S., Sanner, S.: Gaussian process preference elicitation. In: NIPS 2010 (2010)
    • (2010) NIPS 2010
    • Bonilla, E.V.1    Guo, S.2    Sanner, S.3
  • 3
    • 0036930295 scopus 로고    scopus 로고
    • A POMDP formulation of preference elicitation problems
    • Boutilier, C.: A POMDP formulation of preference elicitation problems. In: AAAI 2002, pp. 239-246 (2002)
    • (2002) AAAI 2002 , pp. 239-246
    • Boutilier, C.1
  • 4
    • 66149172589 scopus 로고    scopus 로고
    • Preference elicitation and generalized additive utility
    • Braziunas, D., Boutilier, C.: Preference elicitation and generalized additive utility. In: AAAI 2006 (2006)
    • (2006) AAAI 2006
    • Braziunas, D.1    Boutilier, C.2
  • 10
    • 0000030684 scopus 로고
    • The expected-utility hypothesis and the measurability of utility
    • Friedman, M., Savage, L.J.: The expected-utility hypothesis and the measurability of utility. The Journal of Political Economy 60(6), 463 (1952)
    • (1952) The Journal of Political Economy , vol.60 , Issue.6 , pp. 463
    • Friedman, M.1    Savage, L.J.2
  • 11
    • 84862273812 scopus 로고    scopus 로고
    • Variational methods for reinforcement learning
    • Furmston, T., Barber, D.: Variational methods for reinforcement learning. In: AISTATS, pp. 241-248 (2010)
    • (2010) AISTATS , pp. 241-248
    • Furmston, T.1    Barber, D.2
  • 12
    • 6344274901 scopus 로고    scopus 로고
    • Game theory, maximum entropy, minimum discrepancy, and robust bayesian decision theory
    • Grünwald, P.D., Philip Dawid, A.: Game theory, maximum entropy, minimum discrepancy, and robust bayesian decision theory. Annals of Statistics 32(4), 1367-1433 (2004)
    • (2004) Annals of Statistics , vol.32 , Issue.4 , pp. 1367-1433
    • Grünwald, P.D.1    Philip Dawid, A.2
  • 13
    • 80052402433 scopus 로고    scopus 로고
    • Real-time multiattribute bayesian preference elicitation with pairwise comparison queries
    • Guo, S., Sanner, S.: Real-time multiattribute bayesian preference elicitation with pairwise comparison queries. In: AISTATS 2010 (2010)
    • (2010) AISTATS 2010
    • Guo, S.1    Sanner, S.2
  • 14
    • 0042547347 scopus 로고    scopus 로고
    • Algorithms for inverse reinforcement learning
    • Morgan Kaufmann, San Francisco
    • Ng, A.Y., Russell, S.: Algorithms for inverse reinforcement learning. In: Proc. 17th International Conf. on Machine Learning, pp. 663-670. Morgan Kaufmann, San Francisco (2000)
    • (2000) Proc. 17th International Conf. on Machine Learning , pp. 663-670
    • Ng, A.Y.1    Russell, S.2
  • 15
    • 33749251297 scopus 로고    scopus 로고
    • An analytic solution to discrete Bayesian reinforcement learning
    • ACM Press, New York
    • Poupart, P., Vlassis, N., Hoey, J., Regan, K.: An analytic solution to discrete Bayesian reinforcement learning. In: ICML 2006, pp. 697-704. ACM Press, New York (2006)
    • (2006) ICML 2006 , pp. 697-704
    • Poupart, P.1    Vlassis, N.2    Hoey, J.3    Regan, K.4
  • 17
    • 80052401906 scopus 로고    scopus 로고
    • Personal communication
    • Ramachandran, D.: Personal communication (2010)
    • (2010)
    • Ramachandran, D.1
  • 19
    • 80052419808 scopus 로고    scopus 로고
    • PhD thesis, Department of Brain and Cognitive Sciences, Department of Computer Science, University of Rochester
    • Rothkopf, C.A.: Modular models of task based visually guided behavior. PhD thesis, Department of Brain and Cognitive Sciences, Department of Computer Science, University of Rochester (2008)
    • (2008) Modular Models of Task Based Visually Guided Behavior
    • Rothkopf, C.A.1
  • 21
    • 85162059109 scopus 로고    scopus 로고
    • A reduction from apprenticeship learning to classification
    • Syed, U., Schapire, R.E.: A reduction from apprenticeship learning to classification. In: NIPS 2010 (2010)
    • (2010) NIPS 2010
    • Syed, U.1    Schapire, R.E.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.