SCOPUS 정보 검색 플랫폼

Volumn 27, Issue 1, 2009, Pages 25-53

Learning to search: Functional gradient techniques for imitation learning

(3) Ratliff, Nathan D a Silver, David a Bagnell, J Andrew b

a CARNEGIE MELLON UNIVERSITY (United States)

b CARNEGIE MELLON UNIVERSITY (United States)

Author keywords

Autonomous navigation; Functional gradient techniques; Grasping; Imitation learning; Nonparametric optimization; Planning; Quadrupedal locomotion; Robotics; Structured prediction; Subgradient methods

Indexed keywords

AUTONOMOUS NAVIGATION; FUNCTIONAL GRADIENT TECHNIQUES; GRASPING; IMITATION LEARNING; NONPARAMETRIC OPTIMIZATION; QUADRUPEDAL LOCOMOTION; STRUCTURED PREDICTION; SUBGRADIENT METHODS;

BIPED LOCOMOTION; CLONING; COST FUNCTIONS; COSTS; DATA PROCESSING; DECISION TREES; DEMONSTRATIONS; NAVIGATION; NAVIGATION SYSTEMS; OPTIMIZATION; ROBOT LEARNING; ROBOT PROGRAMMING; ROBOTICS; ROBOTS; SUPERVISED LEARNING;

LEARNING ALGORITHMS;

EID: 67650957592 PISSN: 09295593 EISSN: None Source Type: Journal
DOI: 10.1007/s10514-009-9121-3 Document Type: Article

Times cited : (201)

References (49)

1
- 31844444663
- Apprenticeship learning via inverse reinforcement learning
- Abbeel, P., & Ng, A. Y. (2004). Apprenticeship learning via inverse reinforcement learning. In ICML '04: Proceedings of the twenty-first international conference on machine learning.
- (2004) ICML '04: Proceedings of the Twenty-first International Conference on Machine Learning
- Abbeel, P.¹ Ng, A.Y.²

2
- 0003650411
- Prentice Hall Englewood Cliffs
- Anderson, B. D. O., & Moore, J. B. (1990). Optimal control: linear quadratic methods. Englewood Cliffs: Prentice Hall.
- (1990) Optimal Control: Linear Quadratic Methods
- Anderson, B.D.O.¹ Moore, J.B.²

3
- 63149159130
- A survey of robot learning from demonstration
- Argall, B., Chernova, S., Veloso, M., & Browning, B. (2009). A survey of robot learning from demonstration. Robotics and Autonomous Systems.
- (2009) Robotics and Autonomous Systems
- Argall, B.¹ Chernova, S.² Veloso, M.³ Browning, B.⁴

4
- 67650979920
- Locally weighted learning
- Atkeson, C., Schaal, S., & Moore, A. (1995). Locally weighted learning. AI Review.
- (1995) AI Review
- Atkeson, C.¹ Schaal, S.² Moore, A.³

5
- 1942450674
- A framework for behavioral cloning
- Oxford University Press London
- Bain, M., & Sammut, C. (1995). A framework for behavioral cloning. In Machine intelligence agents. London: Oxford University Press.
- (1995) Machine Intelligence Agents
- Bain, M.¹ Sammut, C.²

6
- 0003004622
- Linear matrix inequalities in system and control theory
- Boyd, S., Ghaoui, L. E., Feron, E., & Balakrishnan, V. (1994). Linear matrix inequalities in system and control theory. Society for Industrial and Applied Mathematics (SIAM).
- (1994) Society for Industrial and Applied Mathematics (SIAM)
- Boyd, S.¹ Ghaoui, L.E.² Feron, E.³ Balakrishnan, V.⁴

7
- 34047173490
- On learning, representing, and generalizing a task in a humanoid robot
- DOI 10.1109/TSMCB.2006.886952, Special Issue on Robot Learning by Observation, Demonstration and Imitation
- Calinon, S., Guenter, F., & Billard, A. (2007). On learning, representing and generalizing a task in a humanoid robot. In IEEE Transactions on Systems, Man and Cybernetics, Part B. Special issue on robot learning by observation, demonstration and imitation, 37, 286-298. (Pubitemid 46523219)
- (2007) IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics , vol.37 , Issue.2 , pp. 286-298
- Calinon, S.¹ Guenter, F.² Billard, A.³

8
- 84926078662
- Cambridge University Press New York
- Cesa-Bianchi, N., & Lugosi, G. (2006). Prediction, learning, and games. New York: Cambridge University Press.
- (2006) Prediction, Learning, and Games
- Cesa-Bianchi, N.¹ Lugosi, G.²

9
- 3042523593
- Planning biped navigation strategies in complex environments
- Karlsruhe, Germany
- Chestnutt, J., Kuffner, J., Nishiwaki, K., & Kagami, S. (2003). Planning biped navigation strategies in complex environments. In Proceedings of the IEEE-RAS, international conference on humanoid robots. Karlsruhe, Germany.
- (2003) Proceedings of the IEEE-RAS, International Conference on Humanoid Robots
- Chestnutt, J.¹ Kuffner, J.² Nishiwaki, K.³ Kagami, S.⁴

10
- 33745134003
- Footstep planning for the Honda ASIMO humanoid
- Chestnutt, J., Lau, M., Cheng, G., Kuffner, J., Hodgins, J., & Kanade, T. (2005). Footstep planning for the Honda ASIMO humanoid. In Proceedings of the IEEE, international conference on robotics and automation.
- (2005) Proceedings of the IEEE, International Conference on Robotics and Automation
- Chestnutt, J.¹ Lau, M.² Cheng, G.³ Kuffner, J.⁴ Hodgins, J.⁵ Kanade, T.⁶

11
- 0037418225
- Maximal sparsity representation via l1 minimization
- D. L. Donoho M. Elad 2003 Maximal sparsity representation via l1 minimization Proceedings of the National Academy Sciences 100 2197 2202
- (2003) Proceedings of the National Academy Sciences , vol.100 , pp. 2197-2202
- Donoho, D.L.¹ Elad, M.²

12
- 33745686540
- Using interpolation to improve path planning: The field D*algorithm
- D. Ferguson A. Stentz 2006 Using interpolation to improve path planning: The field D*algorithm Journal of Field Robotics 23 79 101
- (2006) Journal of Field Robotics , vol.23 , pp. 79-101
- Ferguson, D.¹ Stentz, A.²

13
- 0003591748
- Greedy function approximation: A gradient boosting machine
- Friedman, J. H. (1999a). Greedy function approximation: A gradient boosting machine. Annals of Statistics.
- (1999) Annals of Statistics
- Friedman, J.H.¹

14
- 0003989207
- Doctoral dissertation, Robotics Institute, Carnegie Mellon University
- Gordon, G. (1999). Approximate solutions to Markov decision processes. Doctoral dissertation, Robotics Institute, Carnegie Mellon University.
- (1999) Approximate Solutions to Markov Decision Processes
- Gordon, G.¹

15
- 58249138517
- Dynamical system modulation for robot learning via kinesthetic demonstrations
- M. Hersch F. Guenter S. Calinon A. Billard 2008 Dynamical system modulation for robot learning via kinesthetic demonstrations IEEE Transactions on Robotics 24 1463 1467
- (2008) IEEE Transactions on Robotics , vol.24 , pp. 1463-1467
- Hersch, M.¹ Guenter, F.² Calinon, S.³ Billard, A.⁴

16
- 0003998828
- Cambridge University Press Cambridge
- Jaynes, E. (2003). Probability: The logic of science. Cambridge: Cambridge University Press.
- (2003) Probability: The Logic of Science
- Jaynes, E.¹

17
- 84998095483
- When is a linear control system optimal?
- R. Kalman 1964 When is a linear control system optimal? Transaction ASME, Journal Basic Engineering 86 51 60
- (1964) Transaction ASME, Journal Basic Engineering , vol.86 , pp. 51-60
- Kalman, R.¹

18
- 44349170318
- Toward reliable autonomous vehicles operating in challenging environments
- Singapore
- Kelly, A., Amidi, O., Happold, M., Herman, H., Pilarski, T., Rander, P., Stentz, A., Vallidis, N., & Warner, R. (2004). Toward reliable autonomous vehicles operating in challenging environments. In Proceedings of the international symposium on experimental robotics (ISER). Singapore.
- (2004) Proceedings of the International Symposium on Experimental Robotics (ISER)
- Kelly, A.¹ Amidi, O.² Happold, M.³ Herman, H.⁴ Pilarski, T.⁵ Rander, P.⁶ Stentz, A.⁷ Vallidis, N.⁸ Warner, R.⁹

19
- 0008815681
- Exponentiated gradient versus gradient descent for linear predictors
- Kivinen, J., & Warmuth, M. K. (1997). Exponentiated gradient versus gradient descent for linear predictors. Information and Computation, 132.
- (1997) Information and Computation , pp. 132
- Kivinen, J.¹ Warmuth, M.K.²

20
- 85162069513
- Hierarchical apprenticeship learning with application to quadruped locomotion
- Kolter, J. Z., Abbeel, P., & Ng, A. Y. (2008). Hierarchical apprenticeship learning with application to quadruped locomotion. Neural Information Processing Systems, 20.
- (2008) Neural Information Processing Systems , pp. 20
- Kolter, J.Z.¹ Abbeel, P.² Ng, A.Y.³

21
- 85162067389
- Structured learning with approximate inference
- MIT Cambridge
- Kulesza, A., & Pereira, F. (2008). Structured learning with approximate inference. In Advances in neural information processing systems. Cambridge: MIT.
- (2008) Advances in Neural Information Processing Systems
- Kulesza, A.¹ Pereira, F.²

22
- 33749267006
- Off-road obstacle avoidance through end-to-end learning
- MIT Cambridge
- LeCun, Y., Muller, U., Ben, J., Cosatto, E., & Flepp, B. (2006). Off-road obstacle avoidance through end-to-end learning. In Advances in neural information processing systems (Vol. 18). Cambridge: MIT.
- (2006) Advances in Neural Information Processing Systems
- Lecun, Y.¹ Muller, U.² Ben, J.³ Cosatto, E.⁴ Flepp, B.⁵

23
- 0002550596
- Functional gradient techniques for combining hypotheses
- MIT Cambridge
- Mason, L., Baxter, J., Bartlett, P., & Frean, M. (1999). Functional gradient techniques for combining hypotheses. In Advances in large margin classifiers. Cambridge: MIT.
- (1999) Advances in Large Margin Classifiers
- Mason, L.¹ Baxter, J.² Bartlett, P.³ Frean, M.⁴

24
- 0345308464
- Automatic grasp planning using shape primitives
- Miller, A. T., Knoop, S., Allen, P. K., & Christensen, H. I. (2003). Automatic grasp planning using shape primitives. In Proceedings of the IEEE, International conference on robotics and automation.
- (2003) Proceedings of the IEEE, International Conference on Robotics and Automation
- Miller, A.T.¹ Knoop, S.² Allen, P.K.³ Christensen, H.I.⁴

25
- 70450162267
- Contextual classification with functional max-margin Markov networks
- Munoz, D., Bagnell, J. A. D., Vandapel, N., & Hebert, M. (2009). Contextual classification with functional max-margin Markov networks. In IEEE computer society conference on computer vision and pattern recognition (CVPR).
- (2009) IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR)
- Munoz, D.¹ Bagnell, J.A.D.² Vandapel, N.³ Hebert, M.⁴

26
- 84923094225
- Directional associative Markov network for 3-d point cloud classification
- Munoz, D., Vandapel, N., & Hebert, M. (2008). Directional associative Markov network for 3-d point cloud classification. In Fourth international symposium on 3D data processing, visualization and transmission.
- (2008) Fourth International Symposium on 3D Data Processing, Visualization and Transmission
- Munoz, D.¹ Vandapel, N.² Hebert, M.³

27
- 80053212134
- Apprenticeship learning using inverse reinforcement learning and gradient methods
- Neu, G., & Szepesvari, C. (2007). Apprenticeship learning using inverse reinforcement learning and gradient methods. In Uncertainty in artificial intelligence (UAI).
- (2007) Uncertainty in Artificial Intelligence (UAI)
- Neu, G.¹ Szepesvari, C.²

28
- 0042547347
- Algorithms for inverse reinforcement learning
- Ng, A. Y., & Russell, S. (2000). Algorithms for inverse reinforcement learning. In Proc. 17th international conf. on machine learning.
- (2000) Proc. 17th International Conf. on Machine Learning
- Ng, A.Y.¹ Russell, S.²

29
- 0000796434
- ALVINN: An autonomous land vehicle in a neural network
- Pomerleau, D. (1989). ALVINN: An autonomous land vehicle in a neural network. In Advances in neural information processing systems (Vol. 1).
- (1989) Advances in Neural Information Processing Systems , vol.1
- Pomerleau, D.¹

30
- 85102627959
- Wiley New York
- Puterman, M. (1994). Markov decision processes: Discrete stochastic dynamic programming. New York: Wiley.
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.¹

31
- 67650964417
- Functional bundle methods
- Clearwater Beach, Florida
- Ratliff, N., & Bagnell, J. A. (2009). Functional bundle methods. In The Learning workshop. Clearwater Beach, Florida.
- (2009) The Learning Workshop
- Ratliff, N.¹ Bagnell, J.A.²

32
- 33749252753
- Maximum margin planning
- Ratliff, N., Bagnell, J. A., & Zinkevich, M. (2006a). Maximum margin planning. In Twenty second international conference on machine learning (ICML06).
- (2006) Twenty Second International Conference on Machine Learning (ICML06)
- Ratliff, N.¹ Bagnell, J.A.² Zinkevich, M.³

33
- 72449164388
- (Online) subgradient methods for structured prediction
- San Juan, Puerto Rico
- Ratliff, N., Bagnell, J. A., & Zinkevich, M. (2007a). (Online) subgradient methods for structured prediction. In Artificial intelligence and statistics. San Juan, Puerto Rico.
- (2007) Artificial Intelligence and Statistics
- Ratliff, N.¹ Bagnell, J.A.² Zinkevich, M.³

34
- 84863455430
- NIPS. Vancouver, B.C.
- Ratliff, N., Bradley, D., Bagnell, J. A., & Chestnutt, J. (2006b). Boosting structured prediction for imitation learning. In NIPS. Vancouver, B.C.
- (2006) Boosting Structured Prediction for Imitation Learning
- Ratliff, N.¹ Bradley, D.² Bagnell, J.A.³ Chestnutt, J.⁴

35
- 67649736232
- Imitation learning for locomotion and manipulation
- Ratliff, N., Srinivasa, S., & Bagnell, J. A. (2007b). Imitation learning for locomotion and manipulation. In IEEE-RAS international conference on humanoid robots.
- (2007) IEEE-RAS International Conference on Humanoid Robots
- Ratliff, N.¹ Srinivasa, S.² Bagnell, J.A.³

36
- 9444250658
- Regularized least squares classification
- IOS Press Amsterdam
- Rifkin, Y., Poggio (2003). Regularized least squares classification. In Advances in learning theory: methods, models and applications. Amsterdam: IOS Press.
- (2003) Advances in Learning Theory: Methods, Models and Applications
- Rifkin, Y.¹ Poggio²

37
- 12844274244
- Boosting as a regularized path to a maximum margin classifier
- S. Rosset J. Zhu T. Hastie 2004 Boosting as a regularized path to a maximum margin classifier Journal Machine Learning Research 5 941 973
- (2004) Journal Machine Learning Research , vol.5 , pp. 941-973
- Rosset, S.¹ Zhu, J.² Hastie, T.³

38
- 0028374275
- Robot juggling: An implementation of memory-based learning
- Schaal, S., & Atkeson, C. (1994). Robot juggling: An implementation of memory-based learning. IEEE Control Systems Magazine, 14.
- (1994) IEEE Control Systems Magazine , pp. 14
- Schaal, S.¹ Atkeson, C.²

39
- 0003995427
- Springer Berlin
- Shor, N. Z. (1985). Minimization methods for non-differentiable functions. Berlin: Springer.
- (1985) Minimization Methods for Non-differentiable Functions
- Shor, N.Z.¹

40
- 70450148833
- High performance outdoor navigation from overhead data using imitation learning
- Silver, D., Bagnell, J. A., & Stentz, A. (2008). High performance outdoor navigation from overhead data using imitation learning. In Proceedings of Robotics Science and Systems.
- (2008) Proceedings of Robotics Science and Systems
- Silver, D.¹ Bagnell, J.A.² Stentz, A.³

41
- 34250682969
- Experimental analysis of overhead data processing to support long range navigation
- Silver, D., Sofman, B., Vandapel, N., Bagnell, J. A., & Stentz, A. (2006). Experimental analysis of overhead data processing to support long range navigation. In Proceedings of the IEEE/JRS international conference on intelligent robots and systems.
- (2006) Proceedings of the IEEE/JRS International Conference on Intelligent Robots and Systems
- Silver, D.¹ Sofman, B.² Vandapel, N.³ Bagnell, J.A.⁴ Stentz, A.⁵

42
- 70449109089
- The crusher system for autonomous navigation
- Stentz, A., Bares, J., Pilarski, T., & Stager, D. (2007). The crusher system for autonomous navigation. In AUVSI's unmanned systems.
- (2007) AUVSI's Unmanned Systems
- Stentz, A.¹ Bares, J.² Pilarski, T.³ Stager, D.⁴

43
- 31844442382
- Learning structured prediction models: A large margin approach
- Taskar, B., Chatalbashev, V., Guestrin, C., & Koller, D. (2005). Learning structured prediction models: A large margin approach. In Twenty second international conference on machine learning (ICML05).
- (2005) Twenty Second International Conference on Machine Learning (ICML05)
- Taskar, B.¹ Chatalbashev, V.² Guestrin, C.³ Koller, D.⁴

44
- 14344253870
- Max margin Markov networks
- Taskar, B., Guestrin, C., & Koller, D. (2003). Max margin Markov networks. In Advances in neural information processing systems (NIPS-14).
- (2003) Advances in Neural Information Processing Systems (NIPS-14)
- Taskar, B.¹ Guestrin, C.² Koller, D.³

45
- 33745798352
- Structured prediction via the extragradient method
- MIT Cambridge
- Taskar, B., Lacoste-Julien, S., & Jordan, M. (2006). Structured prediction via the extragradient method. In Advances in neural information processing systems (Vol. 18). Cambridge: MIT.
- (2006) Advances in Neural Information Processing Systems
- Taskar, B.¹ Lacoste-Julien, S.² Jordan, M.³

46
- 5444237123
- Greed is good: Algorithmic results for sparse approximation
- J. A. Tropp 2004 Greed is good: Algorithmic results for sparse approximation IEEE Transactions on Information Theory 50 2231 2242
- (2004) IEEE Transactions on Information Theory , vol.50 , pp. 2231-2242
- Tropp, J.A.¹

47
- 0347410600
- Quality assessment of traversability maps from aerial lidar data for an unmanned ground vehicle
- Vandapel, N., Donamukkala, R. R., & Hebert, M. (2003). Quality assessment of traversability maps from aerial lidar data for an unmanned ground vehicle. In Proceedings of the IEEE/JRS international conference on intelligent robots and systems.
- (2003) Proceedings of the IEEE/JRS International Conference on Intelligent Robots and Systems
- Vandapel, N.¹ Donamukkala, R.R.² Hebert, M.³

48
- 85148975703
- Maximum entropy inverse reinforcement learning
- Ziebart, B., Bagnell, J. A., Mass, A., & Dey, A. (2008). Maximum entropy inverse reinforcement learning. In Twenty-third AAAI conference.
- (2008) Twenty-third AAAI Conference
- Ziebart, B.¹ Bagnell, J.A.² Mass, A.³ Dey, A.⁴

49
- 1942484421
- Online convex programming and generalized infinitesimal gradient ascent
- Zinkevich, M. (2003). Online convex programming and generalized infinitesimal gradient ascent. In Proceedings of the twentieth international conference on machine learning.
- (2003) Proceedings of the Twentieth International Conference on Machine Learning
- Zinkevich, M.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.