SCOPUS 정보 검색 플랫폼

Volumn 6913 LNAI, Issue PART 3, 2011, Pages 34-48

Preference elicitation and inverse reinforcement learning

Author keywords

Bayesian inference; decision theory; Inverse reinforcement learning; preference elicitation

Indexed keywords

BAYESIAN INFERENCE; INVERSE REINFORCEMENT LEARNING; POSTERIOR DISTRIBUTIONS; PREFERENCE ELICITATION; STATISTICAL FORMULATION;

BAYESIAN NETWORKS; DECISION THEORY; INFERENCE ENGINES; REINFORCEMENT LEARNING;

INVERSE PROBLEMS;

EID: 80052420104 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-642-23808-6_3 Document Type: Conference Paper

Times cited : (62)

References (22)

1
- 14344251217
- Apprenticeship learning via inverse reinforcement learning
- Abbeel, P., Ng, A.Y.: Apprenticeship learning via inverse reinforcement learning. In: Proceedings of the 21st International Conference on Machine Learning, ICML 2004 (2004)
- (2004) Proceedings of the 21st International Conference on Machine Learning, ICML 2004
- Abbeel, P.¹ Ng, A.Y.²

2
- 85162048506
- Gaussian process preference elicitation
- Bonilla, E.V., Guo, S., Sanner, S.: Gaussian process preference elicitation. In: NIPS 2010 (2010)
- (2010) NIPS 2010
- Bonilla, E.V.¹ Guo, S.² Sanner, S.³

3
- 0036930295
- A POMDP formulation of preference elicitation problems
- Boutilier, C.: A POMDP formulation of preference elicitation problems. In: AAAI 2002, pp. 239-246 (2002)
- (2002) AAAI 2002 , pp. 239-246
- Boutilier, C.¹

4
- 66149172589
- Preference elicitation and generalized additive utility
- Braziunas, D., Boutilier, C.: Preference elicitation and generalized additive utility. In: AAAI 2006 (2006)
- (2006) AAAI 2006
- Braziunas, D.¹ Boutilier, C.²

5
- 0003919677
- Springer Texts in Statistics. Springer, Heidelberg
- Casella, G., Fienberg, S., Olkin, I. (eds.): Monte Carlo Statistical Methods. Springer Texts in Statistics. Springer, Heidelberg (1999)
- (1999) Monte Carlo Statistical Methods
- Casella, G.¹ Fienberg, S.² Olkin, I.³

7
- 0003759417
- John Wiley & Sons, Chichester
- DeGroot, M.H.: Optimal Statistical Decisions. John Wiley & Sons, Chichester (1970)
- (1970) Optimal Statistical Decisions
- DeGroot, M.H.¹

8
- 84877776997
- under review
- Dimitrakakis, C., Rothkopf, C.A.: Bayesian multitask inverse reinforcement learning (2011), under review
- (2011) Bayesian Multitask Inverse Reinforcement Learning
- Dimitrakakis, C.¹ Rothkopf, C.A.²

9
- 1942450858
- PhD thesis, University of Massachusetts at Amherst
- Duff, M.O.: Optimal Learning Computational Procedures for Bayes-adaptive Markov Decision Processes. PhD thesis, University of Massachusetts at Amherst (2002)
- (2002) Optimal Learning Computational Procedures for Bayes-adaptive Markov Decision Processes
- Duff, M.O.¹

10
- 0000030684
- The expected-utility hypothesis and the measurability of utility
- Friedman, M., Savage, L.J.: The expected-utility hypothesis and the measurability of utility. The Journal of Political Economy 60(6), 463 (1952)
- (1952) The Journal of Political Economy , vol.60 , Issue.6 , pp. 463
- Friedman, M.¹ Savage, L.J.²

11
- 84862273812
- Variational methods for reinforcement learning
- Furmston, T., Barber, D.: Variational methods for reinforcement learning. In: AISTATS, pp. 241-248 (2010)
- (2010) AISTATS , pp. 241-248
- Furmston, T.¹ Barber, D.²

12
- 6344274901
- Game theory, maximum entropy, minimum discrepancy, and robust bayesian decision theory
- Grünwald, P.D., Philip Dawid, A.: Game theory, maximum entropy, minimum discrepancy, and robust bayesian decision theory. Annals of Statistics 32(4), 1367-1433 (2004)
- (2004) Annals of Statistics , vol.32 , Issue.4 , pp. 1367-1433
- Grünwald, P.D.¹ Philip Dawid, A.²

13
- 80052402433
- Real-time multiattribute bayesian preference elicitation with pairwise comparison queries
- Guo, S., Sanner, S.: Real-time multiattribute bayesian preference elicitation with pairwise comparison queries. In: AISTATS 2010 (2010)
- (2010) AISTATS 2010
- Guo, S.¹ Sanner, S.²

16
- 85102627959
- John Wiley & Sons, New Jersey
- Puterman, M.L.: Markov Decision Processes: Discrete Stochastic Dynamic Programming. John Wiley & Sons, New Jersey (2005)
- (2005) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

17
- 80052401906
- Personal communication
- Ramachandran, D.: Personal communication (2010)
- (2010)
- Ramachandran, D.¹

18
- 77956052826
- Bayesian inverse reinforcement learning
- Ramachandran, D., Amir, E.: Bayesian inverse reinforcement learning. In: 20th Int. Joint Conf. Artificial Intelligence, vol. 51, pp. 2856-2591 (2007)
- (2007) 20th Int. Joint Conf. Artificial Intelligence , vol.51 , pp. 2856-12591
- Ramachandran, D.¹ Amir, E.²

19
- 80052419808
- PhD thesis, Department of Brain and Cognitive Sciences, Department of Computer Science, University of Rochester
- Rothkopf, C.A.: Modular models of task based visually guided behavior. PhD thesis, Department of Brain and Cognitive Sciences, Department of Computer Science, University of Rochester (2008)
- (2008) Modular Models of Task Based Visually Guided Behavior
- Rothkopf, C.A.¹

20
- 85162012324
- A game-theoretic approach to apprenticeship learning
- Syed, U., Schapire, R.E.: A game-theoretic approach to apprenticeship learning. In: Advances in Neural Information Processing Systems, vol. 10 (2008)
- (2008) Advances in Neural Information Processing Systems , vol.10
- Syed, U.¹ Schapire, R.E.²

21
- 85162059109
- A reduction from apprenticeship learning to classification
- Syed, U., Schapire, R.E.: A reduction from apprenticeship learning to classification. In: NIPS 2010 (2010)
- (2010) NIPS 2010
- Syed, U.¹ Schapire, R.E.²

22
- 77956500986
- Modelling interaction via the principle of maximum causal entropy
- Ziebart, B.D., Andrew Bagnell, J., Dey, A.K.: Modelling interaction via the principle of maximum causal entropy. In: Proceedings of the 27th International Conference on Machine Learning (ICML 2010), Haifa, Israel (2010)
- Proceedings of the 27th International Conference on Machine Learning (ICML 2010), Haifa, Israel (2010)
- Ziebart, B.D.¹ Andrew Bagnell, J.² Dey, A.K.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.