SCOPUS 정보 검색 플랫폼

IJCAI International Joint Conference on Artificial Intelligence

Volumn , Issue , 2011, Pages 2159-2164

Eliciting additive reward functions for Markov decision processes

(2) Regan, Kevin a Boutilier, Craig a

a UNIVERSITY OF TORONTO (Canada)

Author keywords

[No Author keywords available]

Indexed keywords

ASSISTIVE TECHNOLOGY; AUTONOMIC COMPUTING; HUMAN ASSESSMENT; MARKOV DECISION PROCESSES; REWARD FUNCTION; ROBUST OPTIMIZATION; TWO DOMAINS;

ARTIFICIAL INTELLIGENCE; MARKOV PROCESSES; SPECIFICATIONS;

LEARNING ALGORITHMS;

EID: 84881054930 PISSN: 10450823 EISSN: None Source Type: Conference Proceeding
DOI: 10.5591/978-1-57735-516-8/IJCAI11-360 Document Type: Conference Paper

Times cited : (29)

References (12)

1
- 84880705310
- A decision-theoretic approach to task assistance for persons with dementia
- Edinburgh
- J. Boger, P. Poupart, J. Hoey, C. Boutilier, G. Fernie, and A. Mihailidis. A decision-theoretic approach to task assistance for persons with dementia. IJCAI-05, pp.1293-1299, Edinburgh, 2005.
- (2005) IJCAI-05 , pp. 1293-1299
- Boger, J.¹ Poupart, P.² Hoey, J.³ Boutilier, C.⁴ Fernie, G.⁵ Mihailidis, A.⁶

2
- 0346942368
- Decision-Theoretic Planning: Structural Assumptions and Computational Leverage
- C. Boutilier, T. Dean, and S. Hanks. Decision theoretic planning: Structural assumptions and computational leverage. JAIR, 11:1-94, 1999. (Pubitemid 129628760)
- (1999) Journal of Artificial Intelligence Research , vol.11 , pp. 1-94
- Boutilier, C.¹ Dean, T.² Hanks, S.³

3
- 33646096015
- Constraint-based optimization and utility elicitation using the minimax decision criterion
- C. Boutilier, R. Patrascu, P. Poupart, and D. Schuurmans. Constraint-based optimization and utility elicitation using the minimax decision criterion. Art. Intel., 170:686-713, 2006.
- (2006) Art. Intel. , vol.170 , pp. 686-713
- Boutilier, C.¹ Patrascu, R.² Poupart, P.³ Schuurmans, D.⁴

4
- 58649112502
- Minimax regret-based elicitation of generalized additive utilities
- Vancouver
- D. Braziunas and C. Boutilier. Minimax regret-based elicitation of generalized additive utilities. UAI-07, pp.25-32, Vancouver, 2007.
- (2007) UAI-07 , pp. 25-32
- Braziunas, D.¹ Boutilier, C.²

5
- 33646107784
- Interdependence and additivity in multivariate, unidimensional expected utility theory
- P. C. Fishburn. Interdependence and additivity in multivariate, unidimensional expected utility theory. Int. Econ. Rev., 8:335-342, 1967.
- (1967) Int. Econ. Rev. , vol.8 , pp. 335-342
- Fishburn, P.C.¹

6
- 0004001439
- Wiley, NY
- R. L. Keeney and H. Raiffa. Decisions with Multiple Objectives: Preferences and Value Trade-offs. Wiley, NY, 1976.
- (1976) Decisions with Multiple Objectives: Preferences and Value Trade-offs
- Keeney, R.L.¹ Raiffa, H.²

7
- 0037253062
- The vision of autonomic computing
- J. O. Kephart and D. M. Chess. The vision of autonomic computing. Computer, 36:41-52, 2003.
- (2003) Computer , vol.36 , pp. 41-52
- Kephart, J.O.¹ Chess, D.M.²

8
- 85102627959
- Wiley, NY
- M. L. Puterman. Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley, NY, 1994.
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

9
- 80052425037
- Regret-based reward elicitation for Markov decision processes
- Montreal
- K. Regan and C. Boutilier. Regret-based reward elicitation for Markov decision processes. UAI-09, pp.454-451, Montreal, 2009.
- (2009) UAI-09 , pp. 454-1451
- Regan, K.¹ Boutilier, C.²

10
- 77958520196
- Robust policy computation in reward-uncertain MDPs using nondominated policies
- Atlanta
- K. Regan and C. Boutilier. Robust policy computation in reward-uncertain MDPs using nondominated policies. AAAI-10, pp.1127-1133, Atlanta, 2010.
- (2010) AAAI-10 , pp. 1127-1133
- Regan, K.¹ Boutilier, C.²

11
- 34249831859
- A new reformulation-linearization technique for bilinear programming problems
- H. Sherali and A. Alameddine. A new reformulation-linearization technique for bilinear programming problems. J. Global Optim., 2:379-410, 1992.
- (1992) J. Global Optim. , vol.2 , pp. 379-410
- Sherali, H.¹ Alameddine, A.²

12
- 77950823530
- Parametric regret in uncertain Markov decision processes
- Shanghai
- H. Xu and S. Mannor. Parametric regret in uncertain Markov decision processes. CDC-09, pp.3606-3613, Shanghai, 2009.
- (2009) CDC-09 , pp. 3606-3613
- Xu, H.¹ Mannor, S.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.