메뉴 건너뛰기




Volumn , Issue , 2011, Pages 2159-2164

Eliciting additive reward functions for Markov decision processes

Author keywords

[No Author keywords available]

Indexed keywords

ASSISTIVE TECHNOLOGY; AUTONOMIC COMPUTING; HUMAN ASSESSMENT; MARKOV DECISION PROCESSES; REWARD FUNCTION; ROBUST OPTIMIZATION; TWO DOMAINS;

EID: 84881054930     PISSN: 10450823     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.5591/978-1-57735-516-8/IJCAI11-360     Document Type: Conference Paper
Times cited : (29)

References (12)
  • 1
    • 84880705310 scopus 로고    scopus 로고
    • A decision-theoretic approach to task assistance for persons with dementia
    • Edinburgh
    • J. Boger, P. Poupart, J. Hoey, C. Boutilier, G. Fernie, and A. Mihailidis. A decision-theoretic approach to task assistance for persons with dementia. IJCAI-05, pp.1293-1299, Edinburgh, 2005.
    • (2005) IJCAI-05 , pp. 1293-1299
    • Boger, J.1    Poupart, P.2    Hoey, J.3    Boutilier, C.4    Fernie, G.5    Mihailidis, A.6
  • 3
    • 33646096015 scopus 로고    scopus 로고
    • Constraint-based optimization and utility elicitation using the minimax decision criterion
    • C. Boutilier, R. Patrascu, P. Poupart, and D. Schuurmans. Constraint-based optimization and utility elicitation using the minimax decision criterion. Art. Intel., 170:686-713, 2006.
    • (2006) Art. Intel. , vol.170 , pp. 686-713
    • Boutilier, C.1    Patrascu, R.2    Poupart, P.3    Schuurmans, D.4
  • 4
    • 58649112502 scopus 로고    scopus 로고
    • Minimax regret-based elicitation of generalized additive utilities
    • Vancouver
    • D. Braziunas and C. Boutilier. Minimax regret-based elicitation of generalized additive utilities. UAI-07, pp.25-32, Vancouver, 2007.
    • (2007) UAI-07 , pp. 25-32
    • Braziunas, D.1    Boutilier, C.2
  • 5
    • 33646107784 scopus 로고
    • Interdependence and additivity in multivariate, unidimensional expected utility theory
    • P. C. Fishburn. Interdependence and additivity in multivariate, unidimensional expected utility theory. Int. Econ. Rev., 8:335-342, 1967.
    • (1967) Int. Econ. Rev. , vol.8 , pp. 335-342
    • Fishburn, P.C.1
  • 7
    • 0037253062 scopus 로고    scopus 로고
    • The vision of autonomic computing
    • J. O. Kephart and D. M. Chess. The vision of autonomic computing. Computer, 36:41-52, 2003.
    • (2003) Computer , vol.36 , pp. 41-52
    • Kephart, J.O.1    Chess, D.M.2
  • 9
    • 80052425037 scopus 로고    scopus 로고
    • Regret-based reward elicitation for Markov decision processes
    • Montreal
    • K. Regan and C. Boutilier. Regret-based reward elicitation for Markov decision processes. UAI-09, pp.454-451, Montreal, 2009.
    • (2009) UAI-09 , pp. 454-1451
    • Regan, K.1    Boutilier, C.2
  • 10
    • 77958520196 scopus 로고    scopus 로고
    • Robust policy computation in reward-uncertain MDPs using nondominated policies
    • Atlanta
    • K. Regan and C. Boutilier. Robust policy computation in reward-uncertain MDPs using nondominated policies. AAAI-10, pp.1127-1133, Atlanta, 2010.
    • (2010) AAAI-10 , pp. 1127-1133
    • Regan, K.1    Boutilier, C.2
  • 11
    • 34249831859 scopus 로고
    • A new reformulation-linearization technique for bilinear programming problems
    • H. Sherali and A. Alameddine. A new reformulation-linearization technique for bilinear programming problems. J. Global Optim., 2:379-410, 1992.
    • (1992) J. Global Optim. , vol.2 , pp. 379-410
    • Sherali, H.1    Alameddine, A.2
  • 12
    • 77950823530 scopus 로고    scopus 로고
    • Parametric regret in uncertain Markov decision processes
    • Shanghai
    • H. Xu and S. Mannor. Parametric regret in uncertain Markov decision processes. CDC-09, pp.3606-3613, Shanghai, 2009.
    • (2009) CDC-09 , pp. 3606-3613
    • Xu, H.1    Mannor, S.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.