2008, Pages 1433-1438

Maximum Entropy Inverse Reinforcement Learning

Author keywords

[No Author keywords available]

Indexed keywords

ENTROPY; INVERSE PROBLEMS;

EID: 85148975703     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (1062)

References (11)
  • 1. Abbeel, P., and Ng, A. Y. 2004. Apprenticeship learning via inverse reinforcement learning. In Proc. ICML, 1–8.
  • 2. Dudík, M., and Schapire, R. E. 2006. Maximum entropy distribution estimation with generalized regularization. In Proc. COLT, 123–138.
  • 3. Jaynes, E. T. 1957. Information theory and statistical mechanics. Physical Review 106:620–630.
  • 4. Krumm, J., and Horvitz, E. 2006. Predestination: Inferring destinations from partial trajectories. In Proc. Ubicomp, 243–260.
  • 5. Lafferty, J.; McCallum, A.; and Pereira, F. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proc. ICML, 282–289.
  • 6. Letchner, J.; Krumm, J.; and Horvitz, E. 2006. Trip router with individualized preferences (TRIP): Incorporating personalization into route planning. In Proc. IAAI, 1795–1800.
  • 7. Liao, L.; Patterson, D. J.; Fox, D.; and Kautz, H. 2007. Learning and inferring transportation routines. Artificial Intelligence 171(5-6):311–331.
  • 8. Neu, G., and Szepesvári, C. 2007. Apprenticeship learning using inverse reinforcement learning and gradient methods. In Proc. UAI, 295–302.
  • 9. Ng, A. Y., and Russell, S. 2000. Algorithms for inverse reinforcement learning. In Proc. ICML, 663–670.
  • 10. Ramachandran, D., and Amir, E. 2007. Bayesian inverse reinforcement learning. In Proc. IJCAI, 2586–2591.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.