Volume 27, Issue 1, 2009, Pages 25-53

Learning to search: Functional gradient techniques for imitation learning

Author keywords

Autonomous navigation; Functional gradient techniques; Grasping; Imitation learning; Nonparametric optimization; Planning; Quadrupedal locomotion; Robotics; Structured prediction; Subgradient methods

Indexed keywords

AUTONOMOUS NAVIGATION; FUNCTIONAL GRADIENT TECHNIQUES; GRASPING; IMITATION LEARNING; NONPARAMETRIC OPTIMIZATION; QUADRUPEDAL LOCOMOTION; STRUCTURED PREDICTION; SUBGRADIENT METHODS;

EID: 67650957592     PISSN: 09295593     EISSN: None     Source Type: Journal    
DOI: 10.1007/s10514-009-9121-3     Document Type: Article
Times cited: 201

References (49)
  • 5
    • 1942450674
    • A framework for behavioral cloning
    • Oxford University Press London
    • Bain, M., & Sammut, C. (1995). A framework for behavioral cloning. In Machine intelligence agents. London: Oxford University Press.
    • (1995) Machine Intelligence Agents
    • Bain, M.1    Sammut, C.2
  • 7
    • 34047173490
    • On learning, representing, and generalizing a task in a humanoid robot
    • DOI 10.1109/TSMCB.2006.886952, Special Issue on Robot Learning by Observation, Demonstration and Imitation
    • Calinon, S., Guenter, F., & Billard, A. (2007). On learning, representing and generalizing a task in a humanoid robot. In IEEE Transactions on Systems, Man and Cybernetics, Part B. Special issue on robot learning by observation, demonstration and imitation, 37, 286-298. (Pubitemid 46523219)
    • (2007) IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics , vol.37 , Issue.2 , pp. 286-298
    • Calinon, S.1    Guenter, F.2    Billard, A.3
  • 12
    • 33745686540
    • Using interpolation to improve path planning: The field D* algorithm
    • Ferguson, D., & Stentz, A. (2006). Using interpolation to improve path planning: The field D* algorithm. Journal of Field Robotics, 23, 79-101.
    • (2006) Journal of Field Robotics , vol.23 , pp. 79-101
    • Ferguson, D.1    Stentz, A.2
  • 13
    • 0003591748
    • Greedy function approximation: A gradient boosting machine
    • Friedman, J. H. (1999a). Greedy function approximation: A gradient boosting machine. Annals of Statistics.
    • (1999) Annals of Statistics
    • Friedman, J.H.1
  • 15
    • 58249138517
    • Dynamical system modulation for robot learning via kinesthetic demonstrations
    • Hersch, M., Guenter, F., Calinon, S., & Billard, A. (2008). Dynamical system modulation for robot learning via kinesthetic demonstrations. IEEE Transactions on Robotics, 24, 1463-1467.
    • (2008) IEEE Transactions on Robotics , vol.24 , pp. 1463-1467
    • Hersch, M.1    Guenter, F.2    Calinon, S.3    Billard, A.4
  • 19
    • 0008815681
    • Exponentiated gradient versus gradient descent for linear predictors
    • Kivinen, J., & Warmuth, M. K. (1997). Exponentiated gradient versus gradient descent for linear predictors. Information and Computation, 132.
    • (1997) Information and Computation , vol.132
    • Kivinen, J.1    Warmuth, M.K.2
  • 20
    • 85162069513
    • Hierarchical apprenticeship learning with application to quadruped locomotion
    • Kolter, J. Z., Abbeel, P., & Ng, A. Y. (2008). Hierarchical apprenticeship learning with application to quadruped locomotion. Neural Information Processing Systems, 20.
    • (2008) Neural Information Processing Systems , vol.20
    • Kolter, J.Z.1    Abbeel, P.2    Ng, A.Y.3
  • 27
    • 80053212134
    • Apprenticeship learning using inverse reinforcement learning and gradient methods
    • Neu, G., & Szepesvari, C. (2007). Apprenticeship learning using inverse reinforcement learning and gradient methods. In Uncertainty in artificial intelligence (UAI).
    • (2007) Uncertainty in Artificial Intelligence (UAI)
    • Neu, G.1    Szepesvari, C.2
  • 31
    • 67650964417
    • Functional bundle methods
    • Clearwater Beach, Florida
    • Ratliff, N., & Bagnell, J. A. (2009). Functional bundle methods. In The Learning workshop. Clearwater Beach, Florida.
    • (2009) The Learning Workshop
    • Ratliff, N.1    Bagnell, J.A.2
  • 37
    • 12844274244
    • Boosting as a regularized path to a maximum margin classifier
    • Rosset, S., Zhu, J., & Hastie, T. (2004). Boosting as a regularized path to a maximum margin classifier. Journal of Machine Learning Research, 5, 941-973.
    • (2004) Journal of Machine Learning Research , vol.5 , pp. 941-973
    • Rosset, S.1    Zhu, J.2    Hastie, T.3
  • 38
    • 0028374275
    • Robot juggling: An implementation of memory-based learning
    • Schaal, S., & Atkeson, C. (1994). Robot juggling: An implementation of memory-based learning. IEEE Control Systems Magazine, 14.
    • (1994) IEEE Control Systems Magazine , vol.14
    • Schaal, S.1    Atkeson, C.2
  • 46
    • 5444237123
    • Greed is good: Algorithmic results for sparse approximation
    • Tropp, J. A. (2004). Greed is good: Algorithmic results for sparse approximation. IEEE Transactions on Information Theory, 50, 2231-2242.
    • (2004) IEEE Transactions on Information Theory , vol.50 , pp. 2231-2242
    • Tropp, J.A.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.