SCOPUS 정보 검색 플랫폼

IJCAI International Joint Conference on Artificial Intelligence

Volumn , Issue , 2011, Pages 1414-1420

Imitation learning in relational domains: A functional-gradient boosting approach

(5) Natarajan, Sriraam a Joshi, Saket b Tadepalli, Prasad b Kersting, Kristian c Shavlik, Jude d

a WAKE FOREST SCHOOL OF MEDICINE (United States)

b OREGON STATE UNIVERSITY (United States)

c FRAUNHOFER IAIS (Germany)

d UNIVERSITY OF WISCONSIN MADISON (United States)

Author keywords

[No Author keywords available]

Indexed keywords

BOOSTING APPROACH; DIFFERENT DOMAINS; FUNCTIONAL GRADIENT; HUMAN TEACHERS; IMITATION LEARNING; RELATIONAL REGRESSION TREE; RELATIONAL REPRESENTATIONS;

FORESTRY; GRADIENT METHODS;

ARTIFICIAL INTELLIGENCE;

FORESTRY; REGRESSION ANALYSIS; TREES;

EID: 84872856015 PISSN: 10450823 EISSN: None Source Type: Conference Proceeding
DOI: 10.5591/978-1-57735-516-8/IJCAI11-239 Document Type: Conference Paper

Times cited : (54)

References (36)

1
- 14344251217
- Apprenticeship learning via inverse reinforcement learning
- P. Abbeel and A. Ng. Apprenticeship learning via inverse reinforcement learning. In ICML, 2004.
- (2004) ICML
- Abbeel, P.¹ Ng, A.²

2
- 63149159130
- A survey of robot learning from demonstration
- B. Argall, S. Chernova, M. Veloso, and B. Browning. A survey of robot learning from demonstration. Robotics and Autonomous Systems, 57:469-483, 2009.
- (2009) Robotics and Autonomous Systems , vol.57 , pp. 469-483
- Argall, B.¹ Chernova, S.² Veloso, M.³ Browning, B.⁴

3
- 0032069371
- Top-down induction of first order logical decision trees
- H. Blockeel. Top-down induction of first order logical decision trees. AI Commun., 12(1-2), 1999.
- (1999) AI Commun. , vol.12 , Issue.1-2
- Blockeel, H.¹

4
- 84880891360
- Symbolic dynamic programming for first-order mdps
- C. Boutilier. Symbolic dynamic programming for first-order mdps. In IJCAI, 2001.
- (2001) IJCAI
- Boutilier, C.¹

5
- 70449601159
- EPFL Press
- S. Calinon. Robot Programming By Demonstration: A probabilistic approach. EPFL Press, 2009.
- (2009) Robot Programming by Demonstration: A Probabilistic Approach
- Calinon, S.¹

6
- 33749249600
- The relationship between precision-recall and ROC curves
- J. Davis and M. Goadrich. The relationship between precision-recall and ROC curves. In ICML, 2006.
- (2006) ICML
- Davis, J.¹ Goadrich, M.²

7
- 14344252373
- Training conditional random fields via gradient tree boosting
- T.G. Dietterich, A. Ashenfelter, and Y. Bulatov. Training conditional random fields via gradient tree boosting. In ICML, 2004.
- (2004) ICML
- Dietterich, T.G.¹ Ashenfelter, A.² Bulatov, Y.³

8
- 2842560201
- STRIPS: A new approach to the application of theorem proving to problem solving
- R. Fikes and N. Nilsson. STRIPS: A new approach to the application of theorem proving to problem solving. Artificial Intelligence, 2:189-208, 1971.
- (1971) Artificial Intelligence , vol.2 , pp. 189-208
- Fikes, R.¹ Nilsson, N.²

9
- 0002978642
- Experiments with a new boosting algorithm
- Y. Freund and R. Schapire. Experiments with a new boosting algorithm. In ICML, 1996.
- (1996) ICML
- Freund, Y.¹ Schapire, R.²

10
- 0035470889
- Greedy function approximation: A gradient boosting machine
- J.H. Friedman. Greedy function approximation: A gradient boosting machine. Annals of Statistics, 29, 2001.
- (2001) Annals of Statistics , pp. 29
- Friedman, J.H.¹

11
- 34547980383
- MIT Press
- L. Getoor and B. Taskar. Introduction to Statistical Relational Learning. MIT Press, 2007.
- (2007) Introduction to Statistical Relational Learning
- Getoor, L.¹ Taskar, B.²

12
- 44449170889
- Exploiting first-order regression in inductive policy selection
- C. Gretton and S. Thibaux. Exploiting first-order regression in inductive policy selection. In UAI, 2004.
- (2004) UAI
- Gretton, C.¹ Thibaux, S.²

13
- 84855674138
- Tilde-CRF: Conditional random fields for logical sequences
- B. Gutmann and K. Kersting. Tilde-CRF: Conditional random fields for logical sequences. In ECML, 2006.
- (2006) ECML
- Gutmann, B.¹ Kersting, K.²

14
- 58849123822
- Stochastic planning with first order decision diagrams
- S. Joshi and R. Khardon. Stochastic planning with first order decision diagrams. In ICAPS, 2008.
- (2008) ICAPS
- Joshi, S.¹ Khardon, R.²

15
- 67049169348
- Boosting relational sequence alignments
- A. Karwath, K. Kersting, and N. Landwehr. Boosting relational sequence alignments. In ICDM, 2008.
- (2008) ICDM
- Karwath, A.¹ Kersting, K.² Landwehr, N.³

16
- 56449088242
- Non-parametric policy gradients: A unified treatment of propositional and relational domains
- K. Kersting and K. Driessens. Non-parametric policy gradients: A unified treatment of propositional and relational domains. In ICML, 2008.
- (2008) ICML
- Kersting, K.¹ Driessens, K.²

17
- 14344249892
- Bellman goes relational
- K. Kersting, M. Van Otterlo, and L. De Raedt. Bellman goes relational. In In ICML, 2004.
- (2004) In ICML
- Kersting, K.¹ Van Otterlo, M.² De Raedt, L.³

18
- 0033189384
- Learning action strategies for planning domains
- DOI 10.1016/S0004-3702(99)00060-0
- R. Khardon. Learning action strategies for planning domains. Artificial Intelligence, 113:125-148, 1999. (Pubitemid 30542740)
- (1999) Artificial Intelligence , vol.113 , Issue.1 , pp. 125-148
- Khardon, R.¹

19
- 84881073978
- Batch reinforcement learning with state importance. Poster
- L. Li, V. Bulitko, and R. Greiner. Batch reinforcement learning with state importance. In ECML - Poster, 2004.
- (2004) ECML
- Li, L.¹ Bulitko, V.² Greiner, R.³

20
- 85011529894
- Programming by example (introduction)
- H. Lieberman. Programming by example (introduction). Communications of the ACM, 43:72-74, 2000.
- (2000) Communications of the ACM , vol.43 , pp. 72-74
- Lieberman, H.¹

21
- 0003543674
- Kluwer Publishers
- S. Minton. Learning Search Control Knowledge: An Explanation-based Approach. Kluwer Publishers, 1988.
- (1988) Learning Search Control Knowledge: An Explanation-based Approach
- Minton, S.¹

22
- 84889753643
- Gradient-based boosting for Statistical Relational Learning: The Relational Dependency Network Case
- S. Natarajan, T. Khot, K. Kersting, B. Guttmann, and J. Shavlik. Gradient-based boosting for Statistical Relational Learning: The Relational Dependency Network Case. MLJ, 2011.
- (2011) MLJ
- Natarajan, S.¹ Khot, T.² Kersting, K.³ Guttmann, B.⁴ Shavlik, J.⁵

23
- 80053212134
- Apprenticeship learning using inverse reinforcement learning and gradient methods
- G. Neu and C. Szepesvari. Apprenticeship learning using inverse reinforcement learning and gradient methods. In Proceedings of UAI, pages 295-302, 2007.
- (2007) Proceedings of UAI , pp. 295-302
- Neu, G.¹ Szepesvari, C.²

24
- 0042547347
- Algorithms for inverse reinforcement learning
- A. Ng and S. Russell. Algorithms for inverse reinforcement learning. In ICML, 2000.
- (2000) ICML
- Ng, A.¹ Russell, S.²

25
- 33749252753
- Maximum margin planning
- N. Ratliff, A. Bagnell, and M. Zinkevich. Maximum margin planning. In ICML, 2006.
- (2006) ICML
- Ratliff, N.¹ Bagnell, A.² Zinkevich, M.³

26
- 67650957592
- Learning to search: Functional gradient techniques for imitation learning
- N. Ratliff, D. Silver, and A. Bagnell. Learning to search: Functional gradient techniques for imitation learning. Autonomous Robots, pages 25-53, 2009.
- (2009) Autonomous Robots , pp. 25-53
- Ratliff, N.¹ Silver, D.² Bagnell, A.³

27
- 84886681621
- Learning to fly
- C. Sammut, S. Hurst, D. Kedzier, and D. Michie. Learning to fly. In ICML, 1992.
- (1992) ICML
- Sammut, C.¹ Hurst, S.² Kedzier, D.³ Michie, D.⁴

28
- 84894671097
- Explanation-based manipulator learning: Acquisition of planning ability through observation
- A. Segre and G. DeJong. Explanation-based manipulator learning: Acquisition of planning ability through observation. In Conf on Robotics and Automation, 1985.
- Conf on Robotics and Automation, 1985
- Segre, A.¹ DeJong, G.²

29
- 1542788921
- BAGGER: An EBL system that extends and generalizes explanations
- J. Shavlik and G. DeJong. BAGGER: An EBL system that extends and generalizes explanations. In AAAI, 1987.
- (1987) AAAI
- Shavlik, J.¹ DeJong, G.²

30
- 2542454374
- A. Srinivasan. The Aleph Manual, 2004.
- (2004) The Aleph Manual
- Srinivasan, A.¹

31
- 0013528313
- Scaling reinforcement learning toward RoboCup soccer
- P. Stone and R. Sutton. Scaling reinforcement learning toward RoboCup soccer. In ICML, 2001.
- (2001) ICML
- Stone, P.¹ Sutton, R.²

32
- 84898939480
- Policy gradient methods for reinforcement learning with function approximation
- R. Sutton, D. McAllester, S. Singh, and Y. Mansour. Policy gradient methods for reinforcement learning with function approximation. In NIPS, 2000.
- (2000) NIPS
- Sutton, R.¹ McAllester, D.² Singh, S.³ Mansour, Y.⁴

33
- 56449122183
- A gametheoretic approach to apprenticeship learning
- U. Syed and R. Schapire. A gametheoretic approach to apprenticeship learning. In NIPS, 2007.
- (2007) NIPS
- Syed, U.¹ Schapire, R.²

34
- 57749085102
- Relational macros for transfer in reinforcement learning
- L. Torrey, J. Shavlik, T. Walker, and R. Maclin. Relational macros for transfer in reinforcement learning. In ILP, 2007.
- (2007) ILP
- Torrey, L.¹ Shavlik, J.² Walker, T.³ Maclin, R.⁴

35
- 13444310066
- Inductive policy selection for first-order mdps
- S. Yoon, A. Fern, and R. Givan. Inductive policy selection for first-order mdps. In UAI, 2002.
- (2002) UAI
- Yoon, S.¹ Fern, A.² Givan, R.³

36
- 31144453572
- The first probabilistic track of the International Planning Competition
- H. Younes, M. Littman, D. Weissman, and J. Asmuth. The first probabilistic track of the international planning competition. JAIR, 24:851-887, 2005. (Pubitemid 43130951)
- (2005) Journal of Artificial Intelligence Research , vol.24 , pp. 851-887
- Younes, H.L.S.¹ Littman, M.L.² Weissman, D.³ Asmuth, J.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.