1997, Pages 1040-1046

Learning from demonstration

Author keywords

[No Author keywords available]

Indexed keywords

POLES; REINFORCEMENT LEARNING; SIGNAL PROCESSING;

EID: 84898995067     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (505)

References (19)
  • 1
    • 0039816976
    • Using local trajectory optimizers to speed up global optimization in dynamic programming
    • Moody, Hanson, & Lippmann (Eds.) Morgan Kaufmann
    • Atkeson, C. G. (1994). "Using local trajectory optimizers to speed up global optimization in dynamic programming." In: Moody, Hanson, & Lippmann (Eds.), Adv. in Neural Inf. Proc. Sys. 6. Morgan Kaufmann.
    • (1994) Adv. in Neural Inf. Proc. Sys. , vol.6
    • Atkeson, C.G.1
  • 2
    • 84898973104
    • Robot see, robot do: An overview of robot imitation
    • Electrotechnical Laboratory, Tsukuba Science City, Japan
    • Bakker, P., & Kuniyoshi, Y. (1996). "Robot see, robot do: An overview of robot imitation." Autonomous Systems Section, Electrotechnical Laboratory, Tsukuba Science City, Japan.
    • (1996) Autonomous Systems Section
    • Bakker, P.1    Kuniyoshi, Y.2
  • 4
    • 0000859970
    • Reinforcement learning applied to linear quadratic regulation
    • Hanson, J. S., Cowan, J. D., & Giles, C. L. (Eds.) Morgan Kaufmann
    • Bradtke, S. J. (1993). "Reinforcement learning applied to linear quadratic regulation." In: Hanson, J. S., Cowan, J. D., & Giles, C. L. (Eds.), Advances in Neural Inf. Processing Systems 5, pp. 295-302. Morgan Kaufmann.
    • (1993) Advances in Neural Inf. Processing Systems , vol.5 , pp. 295-302
    • Bradtke, S.J.1
  • 6
    • 0004671869
    • Temporal difference learning in continuous time and space
    • Touretzky, D. S., Mozer, M. C., & Hasselmo, M. E. (Eds.) MIT Press
    • Doya, K. (1996). "Temporal difference learning in continuous time and space." In: Touretzky, D. S., Mozer, M. C., & Hasselmo, M. E. (Eds.), Advances in Neural Information Processing Systems 8. MIT Press.
    • (1996) Advances in Neural Information Processing Systems , vol.8
    • Doya, K.1
  • 7
    • 0021291468
    • An approach to automatic robot programming based on inductive learning
    • Brady, M., & Paul, R. (Eds.) Cambridge, MA: MIT Press
    • Dufay, B., & Latombe, J.-C. (1984). "An approach to automatic robot programming based on inductive learning." In: Brady, M., & Paul, R. (Eds.), Robotics Research, pp. 97-115. Cambridge, MA: MIT Press.
    • (1984) Robotics Research , pp. 97-115
    • Dufay, B.1    Latombe, J.-C.2
  • 9
    • 84899002203
    • School of Computer Science, Carnegie Mellon University, Pittsburgh, PA
    • Ikeuchi, K. (1993b). "Assembly plan from observation." School of Computer Science, Carnegie Mellon University, Pittsburgh, PA.
    • (1993) Assembly Plan from Observation
    • Ikeuchi, K.1
  • 11
    • 0002001532
    • Brady, M., Hollerbach, J. M., Johnson, T. L., Lozano-Perez, T., & Mason, M. T. (Eds.) MIT Press
    • Lozano-Perez, T. (1982). "Task-Planning." In: Brady, M., Hollerbach, J. M., Johnson, T. L., Lozano-Perez, T., & Mason, M. T. (Eds.), pp. 473-498. MIT Press.
    • (1982) Task-planning , pp. 473-498
    • Lozano-Perez, T.1
  • 13
    • 0003971885
    • Fast, robust adaptive control by learning only forward models
    • Moody, J. E., Hanson, S. J., & Lippmann, R. P. (Eds.) Morgan Kaufmann
    • Moore, A. (1991a). "Fast, robust adaptive control by learning only forward models." In: Moody, J. E., Hanson, S. J., & Lippmann, R. P. (Eds.), Advances in Neural Inf. Proc. Systems 4. Morgan Kaufmann.
    • (1991) Advances in Neural Inf. Proc. Systems , vol.4
    • Moore, A.1
  • 14
    • 0038501238
    • From isolation to cooperation: An alternative view of a system of experts
    • Touretzky, D. S., Mozer, M. C., & Hasselmo, M. E. (Eds.) Cambridge, MA: MIT Press
    • Schaal, S., & Atkeson, C. G. (1996). "From isolation to cooperation: An alternative view of a system of experts." In: Touretzky, D. S., Mozer, M. C., & Hasselmo, M. E. (Eds.), Advances in Neural Information Processing Systems 8. Cambridge, MA: MIT Press.
    • (1996) Advances in Neural Information Processing Systems , vol.8
    • Schaal, S.1    Atkeson, C.G.2
  • 15
    • 84894671097
    • Explanation-based manipulator learning: Acquisition of planning ability through observation
    • Segre, A. B., & DeJong, G. (1985). "Explanation-based manipulator learning: Acquisition of planning ability through observation." In: Conference on Robotics and Automation, pp. 555-560.
    • (1985) Conference on Robotics and Automation , pp. 555-560
    • Segre, A.B.1    DeJong, G.2
  • 16
    • 0029753630
    • Reinforcement learning with eligibility traces
    • Singh, S. P., & Sutton, R. S. (1996). "Reinforcement learning with eligibility traces." Machine Learning.
    • (1996) Machine Learning
    • Singh, S.P.1    Sutton, R.S.2
  • 17
    • 0002995053
    • Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
    • Sutton, R. S. (1990). "Integrated architectures for learning, planning, and reacting based on approximating dynamic programming." In: Proceedings of the International Machine Learning Conference.
    • (1990) Proceedings of the International Machine Learning Conference
    • Sutton, R.S.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.