SCOPUS Information Search Platform

Advances in Neural Information Processing Systems

Volume , Issue , 1997, Pages 1040-1046

Learning from demonstration

(1) Schaal, Stefan a,b

a Georgia Institute of Technology (United States)

b Advanced Telecommunications Research Institute International (Japan)

Author keywords

[No Author keywords available]

Indexed keywords

POLES; REINFORCEMENT LEARNING; SIGNAL PROCESSING;

ANTHROPOMORPHIC ROBOT ARM; LEARNING CONTROL; LEARNING FROM DEMONSTRATION; LEARNING PROBLEM; LINEAR QUADRATIC REGULATOR; MODEL-BASED REINFORCEMENT LEARNING; NONLINEAR LEARNING; PRIOR KNOWLEDGE;

DEMONSTRATIONS;

EID: 84898995067
PISSN: 10495258
EISSN: None
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited: 505

References (19)

1
- 0039816976
- Using local trajectory optimizers to speed up global optimization in dynamic programming
- Moody, Hanson, & Lippmann (Eds.) Morgan Kaufmann
- Atkeson, C. G. (1994). "Using local trajectory optimizers to speed up global optimization in dynamic programming." In: Moody, Hanson, & Lippmann (Eds.), Adv. in Neural Inf. Proc. Sys. 6. Morgan Kaufmann.
- (1994) Adv. in Neural Inf. Proc. Sys. , vol.6
- Atkeson, C.G.¹

2
- 84898973104
- Robot see, robot do: An overview of robot imitation
- Electrotechnical Laboratory, Tsukuba Science City, Japan
- Bakker, P., & Kuniyoshi, Y. (1996). "Robot see, robot do: An overview of robot imitation." Autonomous Systems Section, Electrotechnical Laboratory, Tsukuba Science City, Japan.
- (1996) Autonomous Systems Section
- Bakker, P.¹ Kuniyoshi, Y.²

3
- 0020970738
- Neuronlike adaptive elements that can solve difficult learning control problems
- Barto, A. G., Sutton, R. S., & Anderson, C. W. (1983). "Neuronlike adaptive elements that can solve difficult learning control problems." IEEE Transactions on Systems, Man, and Cybernetics, SMC-13, 5.
- (1983) IEEE Transactions on Systems, Man, and Cybernetics , vol.SMC-13 , pp. 5
- Barto, A.G.¹ Sutton, R.S.² Anderson, C.W.³

4
- 0000859970
- Reinforcement learning applied to linear quadratic regulation
- Hanson, J. S., Cowan, J. D., & Giles, C. L. (Eds.) Morgan Kaufmann
- Bradtke, S. J. (1993). "Reinforcement learning applied to linear quadratic regulation." In: Hanson, J. S., Cowan, J. D., & Giles, C. L. (Eds.), Advances in Neural Inf. Processing Systems 5, pp.295-302. Morgan Kaufmann.
- (1993) Advances in Neural Inf. Processing Systems , vol.5 , pp. 295-302
- Bradtke, S.J.¹

5
- 0001335239
- Acquisition of elementary robot skills from human demonstration
- Pisa, Italy
- Dillmann, R., Kaiser, M., & Ude, A. (1995). "Acquisition of elementary robot skills from human demonstration." In: International Symposium on Intelligent Robotic Systems (SIRS95), Pisa, Italy.
- (1995) International Symposium on Intelligent Robotic Systems (SIRS95)
- Dillmann, R.¹ Kaiser, M.² Ude, A.³

6
- 0004671869
- Temporal difference learning in continuous time and space
- Touretzky, D. S., Mozer, M. C., & Hasselmo, M. E. (Eds.) MIT Press
- Doya, K. (1996). "Temporal difference learning in continuous time and space." In: Touretzky, D. S., Mozer, M. C., & Hasselmo, M. E. (Eds.), Advances in Neural Information Processing Systems 8. MIT Press.
- (1996) Advances in Neural Information Processing Systems , vol.8
- Doya, K.¹

7
- 0021291468
- An approach to automatic robot programming based on inductive learning
- Brady, M., & Paul, R. (Eds.) Cambridge, MA: MIT Press
- Dufay, B., & Latombe, J.-C. (1984). "An approach to automatic robot programming based on inductive learning." In: Brady, M., & Paul, R. (Eds.), Robotics Research, pp.97-115. Cambridge, MA: MIT Press.
- (1984) Robotics Research , pp. 97-115
- Dufay, B.¹ Latombe, J.-C.²

8
- 0004276055
- NY: Academic Press
- Dyer, P., & McReynolds, S. R. (1970). The computation and theory of optimal control. NY: Academic Press.
- (1970) The Computation and Theory of Optimal Control
- Dyer, P.¹ McReynolds, S.R.²

9
- 84899002203
- School of Computer Science, Carnegie Mellon University, Pittsburgh, PA
- Ikeuchi, K. (1993b). "Assembly plan from observation." School of Computer Science, Carnegie Mellon University, Pittsburgh, PA.
- (1993) Assembly Plan from Observation
- Ikeuchi, K.¹

10
- 0002757790
- Teaching by showing in kendama based on optimization principle
- Kawato, M., Gandolfo, F., Gomi, H., & Wada, Y. (1994b). "Teaching by showing in kendama based on optimization principle." In: Proceedings of the International Conference on Artificial Neural Networks (ICANN94), 1, pp.601-606.
- Proceedings of the International Conference on Artificial Neural Networks (ICANN94) , vol.1 , pp. 601-606
- Kawato, M.¹ Gandolfo, F.² Gomi, H.³ Wada, Y.⁴

11
- 0002001532
- Brady, M., Hollerbach, J. M., Johnson, T. L., Lozano-Perez, T., & Mason, M. T. (Eds.) MIT Press
- Lozano-Perez, T. (1982). "Task-Planning." In: Brady, M., Hollerbach, J. M., Johnson, T. L., Lozano-Perez, T., & Mason, M. T. (Eds.), pp.473-498. MIT Press.
- (1982) Task-planning , pp. 473-498
- Lozano-Perez, T.¹

12
- 84898944763
- A Kendama learning robot based on bi-directional theory
- in press
- Miyamoto, H., Schaal, S., Gandolfo, F., Koike, Y., Osu, R., Nakano, E., Wada, Y., & Kawato, M. (in press). "A Kendama learning robot based on bi-directional theory." Neural Networks.
- Neural Networks
- Miyamoto, H.¹ Schaal, S.² Gandolfo, F.³ Koike, Y.⁴ Osu, R.⁵ Nakano, E.⁶ Wada, Y.⁷ Kawato, M.⁸

13
- 0003971885
- Fast, robust adaptive control by learning only forward models
- Moody, J. E., Hanson, S. J., & Lippmann, R. P. (Eds.) Morgan Kaufmann
- Moore, A. (1991a). "Fast, robust adaptive control by learning only forward models." In: Moody, J. E., Hanson, S. J., & Lippmann, R. P. (Eds.), Advances in Neural Inf. Proc. Systems 4. Morgan Kaufmann.
- (1991) Advances in Neural Inf. Proc. Systems , vol.4
- Moore, A.¹

14
- 0038501238
- From isolation to cooperation: An alternative view of a system of experts
- Touretzky, D. S., Mozer, M. C., & Hasselmo, M. E. (Eds.) Cambridge, MA: MIT Press
- Schaal, S., & Atkeson, C. G. (1996). "From isolation to cooperation: An alternative view of a system of experts." In: Touretzky, D. S., Mozer, M. C., & Hasselmo, M. E. (Eds.), Advances in Neural Information Processing Systems 8. Cambridge, MA: MIT Press.
- (1996) Advances in Neural Information Processing Systems , vol.8
- Schaal, S.¹ Atkeson, C.G.²

15
- 84894671097
- Explanation-based manipulator learning: Acquisition of planning ability through observation
- Segre, A. B., & DeJong, G. (1985). "Explanation-based manipulator learning: Acquisition of planning ability through observation." In: Conference on Robotics and Automation, pp.555-560.
- (1985) Conference on Robotics and Automation , pp. 555-560
- Segre, A.B.¹ DeJong, G.²

16
- 0029753630
- Reinforcement learning with eligibility traces
- Singh, S. P., & Sutton, R. S. (1996). "Reinforcement learning with eligibility traces." Machine Learning.
- (1996) Machine Learning
- Singh, S.P.¹ Sutton, R.S.²

17
- 0002995053
- Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
- Sutton, R. S. (1990). "Integrated architectures for learning, planning, and reacting based on approximating dynamic programming." In: Proceedings of the International Machine Learning Conference.
- (1990) Proceedings of the International Machine Learning Conference
- Sutton, R.S.¹

18
- 0004049895
- Ph.D. thesis, Cambridge University (UK)
- Watkins, C. J. C. H. (1989). "Learning with delayed rewards." Ph.D. thesis, Cambridge University (UK).
- (1989) Learning with Delayed Rewards
- Watkins, C.J.C.H.¹

19
- 0001859165
- Pattern recognizing control systems
- Washington: Spartan
- Widrow, B., & Smith, F. W. (1964). "Pattern recognizing control systems." In: 1963 Comp. and Inf. Sciences (COINS) Symp. Proc, 288-317, Washington: Spartan.
- (1964) 1963 Comp. and Inf. Sciences (COINS) Symp. Proc , pp. 288-317
- Widrow, B.¹ Smith, F.W.²

* This information was extracted and analyzed by KISTI from Elsevier's SCOPUS database.