SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 6322 LNAI, Issue PART 2, 2010, Pages 385-401

Learning from demonstration using MDP induced metrics

(2) Melo, Francisco S a Lopes, Manuel b

a INSTITUTO SUPERIOR TÉCNICO (Portugal)

b UNIVERSITY OF PLYMOUTH (United Kingdom)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL COSTS; GENERALIZATION PERFORMANCE; INVERSE REINFORCEMENT LEARNING; KERNEL BASED APPROACH; LEARNING FROM DEMONSTRATION; OPTIMAL POLICIES; STATE-SPACE; SUPERVISED LEARNING METHODS;

LEARNING SYSTEMS;

DEMONSTRATIONS;

EID: 78049399307 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-642-15883-4_25 Document Type: Conference Paper

Times cited : (16)

References (29)

1
- 70349460167
- Ph. D. thesis, Dep. Computer Science, Stanford Univ
- Abbeel, P.: Apprenticeship learning and reinforcement learning with application to robotic control. Ph. D. thesis, Dep. Computer Science, Stanford Univ (2008)
- (2008) Apprenticeship Learning and Reinforcement Learning with Application to Robotic Control
- Abbeel, P.¹

2
- 14344251217
- Apprenticeship learning via inverse reinforcement learning
- Abbeel, P., Ng, A.: Apprenticeship learning via inverse reinforcement learning. In: Proc. 21st Int. Conf. Machine Learning, pp. 1-8 (2004)
- (2004) Proc. 21st Int. Conf. Machine Learning , pp. 1-8
- Abbeel, P.¹ Ng, A.²

3
- 63149159130
- A survey of robot learning from demonstration
- Argall, B., Chernova, S., Veloso, M.: A survey of robot learning from demonstration. Robotics and Autonomous Systems 57(5), 469-483 (2009)
- (2009) Robotics and Autonomous Systems , vol.57 , Issue.5 , pp. 469-483
- Argall, B.¹ Chernova, S.² Veloso, M.³

4
- 65349173646
- Interactive policy learning through confidence-based autonomy
- Chernova, S., Veloso, M.: Interactive policy learning through confidence-based autonomy. J. Artificial Intelligence Research 34, 1-25 (2009)
- (2009) J. Artificial Intelligence Research , vol.34 , pp. 1-25
- Chernova, S.¹ Veloso, M.²

5
- 58349113822
- Approximate policy iteration with a policy language bias
- Fern, A., Yoon, S., Givan, R.: Approximate policy iteration with a policy language bias. In: Adv. Neural Information Proc. Systems 16 (2003)
- (2003) Adv. Neural. Information Proc. Systems , vol.16
- Fern, A.¹ Yoon, S.² Givan, R.³

6
- 33744466799
- Approximate policy iteration with a policy language bias: Solving relational Markov decision processes
- Fern, A., Yoon, S., Givan, R.: Approximate policy iteration with a policy language bias: Solving relational Markov decision processes. J. Artificial Intelligence Research 25, 75-118 (2006)
- (2006) J. Artificial Intelligence Research , vol.25 , pp. 75-118
- Fern, A.¹ Yoon, S.² Givan, R.³

7
- 38149108415
- Metrics for finite Markov decision processes
- Ferns, N., Panangaden, P., Precup, D.: Metrics for finite Markov decision processes. In: Proc. 20th Conf. Uncertainty in Artificial Intelligence, pp. 162-169 (2004)
- (2004) Proc. 20th Conf. Uncertainty in Artificial Intelligence , pp. 162-169
- Ferns, N.¹ Panangaden, P.² Precup, D.³

8
- 47249139892
- Metrics for Markov decision processes with infinite state-spaces
- Ferns, N., Panangaden, P., Precup, D.: Metrics for Markov decision processes with infinite state-spaces. In: Proc. 21st Conf. Uncertainty in Artificial Intelligence, pp. 201-208 (2005)
- (2005) Proc. 21st Conf. Uncertainty in Artificial Intelligence , pp. 201-208
- Ferns, N.¹ Panangaden, P.² Precup, D.³

9
- 0038517214
- Equivalence notions and model minimization in Markov Decision Processes
- Givan, R., Dean, T., Greig, M.: Equivalence notions and model minimization in Markov Decision Processes. Artificial Intelligence 147, 163-223 (2003)
- (2003) Artificial Intelligence , vol.147 , pp. 163-223
- Givan, R.¹ Dean, T.² Greig, M.³

10
- 1942420814
- Reinforcement learning as classification: Leveraging modern classifiers
- Lagoudakis, M., Parr, R.: Reinforcement learning as classification: Leveraging modern classifiers. In: Proc. 20th Int. Conf. Machine Learning, pp. D424-D431 (2003)
- (2003) Proc. 20th Int. Conf. Machine Learning
- Lagoudakis, M.¹ Parr, R.²

11
- 31844448029
- Relating reinforcement learning performance to classification performance
- Langford, J., Zadrozny, B.: Relating reinforcement learning performance to classification performance. In: Proc. 22nd Int. Conf. Machine Learning, pp. D473-D480 (2005)
- (2005) Proc. 22nd Int. Conf. Machine Learning
- Langford, J.¹ Zadrozny, B.²

12
- 77956523230
- Analysis of a classification-based policy iteration algorithm
- to appear
- Lazaric, A., Ghavamzadeh, M., Munos, R.: Analysis of a classification-based policy iteration algorithm. In: Proc. 27th Int. Conf. Machine Learning (to appear, 2010)
- (2010) Proc. 27th Int. Conf. Machine Learning
- Lazaric, A.¹ Ghavamzadeh, M.² Munos, R.³

13
- 74049086730
- Abstraction levels for robotic imitation: Overview and computational approaches
- Lopes, M., Melo, F., Montesano, L., Santos-Victor, J.: Abstraction levels for robotic imitation: Overview and computational approaches. In: From Motor Learning to Interaction Learning in Robots, pp. 313-355 (2010)
- (2010) From Motor Learning to Interaction Learning in Robots , pp. 313-355
- Lopes, M.¹ Melo, F.² Montesano, L.³ Santos-Victor, J.⁴

14
- 71049179753
- Learning grasping affordances from local visual descriptors
- Montesano, L., Lopes, M.: Learning grasping affordances from local visual descriptors. In: Proc. 8th Int. Conf. Development and Learning, pp. 1-6 (2009)
- (2009) Proc. 8th Int. Conf. Development and Learning , pp. 1-6
- Montesano, L.¹ Lopes, M.²

15
- 80053212134
- Apprenticeship learning using inverse reinforcement learning and gradient methods
- Neu, G., Szepesvári, C.: Apprenticeship learning using inverse reinforcement learning and gradient methods. In: Proc. 23rd Conf. Uncertainty in Artificial Intelligence, pp. 295-302 (2007)
- (2007) Proc. 23rd Conf. Uncertainty in Artificial Intelligence , pp. 295-302
- Neu, G.¹ Szepesvári, C.²

16
- 72449199041
- Training parsers by inverse reinforcement learning
- accepted
- Neu, G., Szepesvári, C.: Training parsers by inverse reinforcement learning. Machine Learning (2009) (accepted)
- (2009) Machine Learning
- Neu, G.¹ Szepesvári, C.²

17
- 0042547347
- Algorithms for inverse reinforcement learning
- Ng, A., Russel, S.: Algorithms for inverse reinforcement learning. In: Proc. 17th Int. Conf. Machine Learning, pp. 663-670 (2000)
- (2000) Proc. 17th Int. Conf. Machine Learning , pp. 663-670
- Ng, A.¹ Russel, S.²

18
- 0003212629
- Efficient training of artificial neural networks for autonomous navigation
- Pomerleau, D.: Efficient training of artificial neural networks for autonomous navigation. Neural Computation 3(1), 88-97 (1991)
- (1991) Neural. Computation , vol.3 , Issue.1 , pp. 88-97
- Pomerleau, D.¹

19
- 77956052826
- Bayesian inverse reinforcement learning
- Ramachandran, D., Amir, E.: Bayesian inverse reinforcement learning. In: Proc. 20th Int. Joint Conf. Artificial Intelligence, pp. 2586-2591 (2007)
- (2007) Proc. 20th Int. Joint Conf. Artificial Intelligence , pp. 2586-2591
- Ramachandran, D.¹ Amir, E.²

20
- 33749252753
- Maximum margin planning
- Ratliff, N., Bagnell, J., Zinkevich, M.: Maximum margin planning. In: Proc. 23rd Int. Conf. Machine Learning, pp. 729-736 (2006)
- (2006) Proc. 23rd Int. Conf. Machine Learning , pp. 729-736
- Ratliff, N.¹ Bagnell, J.² Zinkevich, M.³

21
- 84907329856
- Approximate homomorphisms: A framework for nonexact minimization in Markov decision processes
- Ravindran, B., Barto, A.: Approximate homomorphisms: A framework for nonexact minimization in Markov decision processes. In: Proc. 5th Int. Conf. Knowledge-Based Computer Systems (2004)
- (2004) Proc. 5th Int. Conf. Knowledge-based Computer Systems
- Ravindran, B.¹ Barto, A.²

22
- 33745869793
- Teaching robots by moulding behavior and scaffolding the environment
- Saunders, J., Nehaniv, C., Dautenhahn, K.: Teaching robots by moulding behavior and scaffolding the environment. In: Proc. 1st Annual Conf. Human-Robot Interaction (2006)
- (2006) Proc. 1st Annual Conf. Human-robot Interaction
- Saunders, J.¹ Nehaniv, C.² Dautenhahn, K.³

23
- 0003408420
- MIT Press, Cambridge
- Schölkopf, B., Smola, A.: Learning with kernels: Support vector machines, regularization, optimization and beyond. MIT Press, Cambridge (2002)
- (2002) Learning with Kernels: Support Vector Machines, Regularization, Optimization and Beyond
- Schölkopf, B.¹ Smola, A.²

24
- 68949137209
- Active learning literature survey
- Univ. Wisconsin-Maddison
- Settles, B.: Active learning literature survey. Tech. Rep. CS Tech. Rep. 1648, Univ. Wisconsin-Maddison (2009)
- (2009) Tech. Rep. CS Tech. Rep. , vol.1648
- Settles, B.¹

25
- 85162012324
- A game-theoretic approach to apprenticeship learning
- Syed, U., Schapire, R.: A game-theoretic approach to apprenticeship learning. In: Adv. Neural Information Proc. Systems, vol. 20, pp. 1449-1456 (2008)
- (2008) Adv. Neural. Information Proc. Systems , vol.20 , pp. 1449-1456
- Syed, U.¹ Schapire, R.²

26
- 56449119102
- Apprenticeship learning using linear programming
- Syed, U., Schapire, R., Bowling, M.: Apprenticeship learning using linear programming. In: Proc. 25th Int. Conf. Machine Learning, pp. 1032-1039 (2008)
- (2008) Proc. 25th Int. Conf. Machine Learning , pp. 1032-1039
- Syed, U.¹ Schapire, R.² Bowling, M.³

27
- 78049390608
- Bounding performance loss in approximate MDP homomorphisms
- Taylor, J., Precup, D., Panangaden, P.: Bounding performance loss in approximate MDP homomorphisms. In: Adv. Neural Information Proc. Systems, pp. 1649-1656 (2008)
- (2008) Adv. Neural. Information Proc. Systems , pp. 1649-1656
- Taylor, J.¹ Precup, D.² Panangaden, P.³

28
- 84898974832
- Kernel logistic regression and the import vector machine
- Zhu, J., Hastie, T.: Kernel logistic regression and the import vector machine. In: Adv. Neural Information Proc. Systems. pp. 1081-1088 (2002)
- (2002) Adv. Neural. Information Proc. Systems , pp. 1081-1088
- Zhu, J.¹ Hastie, T.²

29
- 57749097473
- Maximum entropy inverse reinforcement learning
- Ziebart, B., Maas, A., Bagnell, J., Dey, A.: Maximum entropy inverse reinforcement learning. In: Proc. 23rd AAAI Conf. Artificial Intelligence, pp. 1433-1438 (2008)
- (2008) Proc. 23rd AAAI Conf. Artificial Intelligence , pp. 1433-1438
- Ziebart, B.¹ Maas, A.² Bagnell, J.³ Dey, A.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.