SCOPUS Information Search Platform - View Paper

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volume 2056, 2001, Pages 111-120

Imitation and reinforcement learning in agents with heterogeneous actions

(2) Price, Bob (a); Boutilier, Craig (b)

a UNIVERSITY OF BRITISH COLUMBIA (Canada)

b UNIVERSITY OF TORONTO (Canada)

Author keywords

[No Author keywords available]

Indexed keywords

COMBINATORIAL OPTIMIZATION; DYNAMIC PROGRAMMING; MACHINE LEARNING;

BELLMAN EQUATIONS; EXPERT AGENTS; IN-CONTROL; REINFORCEMENT LEARNING TECHNIQUES;

REINFORCEMENT LEARNING;

EID: 84949445092 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/3-540-45153-6_11 Document Type: Conference Paper

Times cited : (8)

References (17)

1
- 84918834208
- A reinforcement learning approach to job-shop scheduling
- Montreal
- Wei Zhang and Thomas G. Dietterich. A reinforcement learning approach to job-shop scheduling. In IJCAI-95, pages 1114-1120, Montreal, 1995.
- (1995) IJCAI-95 , pp. 1114-1120
- Zhang, W.¹ Dietterich, T.G.²

2
- 0031625420
- Learning evaluation functions for global optimization and boolean satisfiability
- Madison, Wisconsin
- Justin A. Boyan and Andrew W. Moore. Learning evaluation functions for global optimization and boolean satisfiability. In AAAI-98, pages 3-10, July 26-30, 1998, Madison, Wisconsin, 1998.
- (1998) AAAI-98 , pp. 3-10
- Boyan, J.A.¹ Moore, A.W.²

3
- 0010276944
- Implicit imitation in multiagent reinforcement learning
- Bled, SI
- Bob Price and Craig Boutilier. Implicit imitation in multiagent reinforcement learning. In ICML-99, pages 325-334, Bled, SI, 1999.
- (1999) ICML-99 , pp. 325-334
- Price, B.¹ Boutilier, C.²

4
- 0002734328
- Robot see, robot do : An overview of robot imitation
- Brighton, UK
- Paul Bakker and Yasuo Kuniyoshi. Robot see, robot do : An overview of robot imitation. In AISB96 Workshop on Learning in Robots and Animals, pages 3-11, Brighton, UK, 1996.
- (1996) AISB96 Workshop on Learning in Robots and Animals , pp. 3-11
- Bakker, P.¹ Kuniyoshi, Y.²

5
- 0002130986
- Robot learning from demonstration
- Nashville, TN
- C. G. Atkeson and S. Schaal. Robot learning from demonstration. In ICML-97, pages 12-20, Nashville, TN, 1997.
- (1997) ICML-97 , pp. 12-20
- Atkeson, C.G.¹ Schaal, S.²

6
- 84956662672
- Learning to communicate through imitation in autonomous robots
- Lausanne, Switzerland
- Aude Billard and Gillian Hayes. Learning to communicate through imitation in autonomous robots. In ICANN-97, pages 763-68, Lausanne, Switzerland, 1997.
- (1997) ICANN-97 , pp. 763-768
- Billard, A.¹ Hayes, G.²

7
- 14344253698
- Technical Report DAI No. 676, University of Edinburgh, Dept. of Artificial Intelligence
- G. M. Hayes and J. Demiris. A robot controller using learning by imitation. Technical Report DAI No. 676, University of Edinburgh, Dept. of Artificial Intelligence, 1994.
- (1994) A Robot Controller Using Learning by Imitation
- Hayes, G.M.¹ Demiris, J.²

8
- 0028740409
- Learning by watching: Extracting reusable task knowledge from visual observation of human performance
- Yasuo Kuniyoshi, Masayuki Inaba, and Hirochika Inoue. Learning by watching: Extracting reusable task knowledge from visual observation of human performance. IEEE Transactions on Robotics and Automation, 10(6):799-822, 1994.
- (1994) IEEE Transactions on Robotics and Automation , vol.10 , Issue.6 , pp. 799-822
- Kuniyoshi, Y.¹ Inaba, M.² Inoue, H.³

9
- 0012934374
- LEAP: A learning apprentice for VLSI design
- Los Altos, California, Morgan Kaufmann Publishers, Inc
- T. M. Mitchell, S. Mahadevan, and L. Steinberg. LEAP: A learning apprentice for VLSI design. In IJCAI-85, pages 573-580, Los Altos, California, 1985. Morgan Kaufmann Publishers, Inc.
- (1985) IJCAI-85 , pp. 573-580
- Mitchell, T.M.¹ Mahadevan, S.² Steinberg, L.³

10
- 0008861422
- Two kinds of training information for evaluation function learning
- Anaheim, CA, AAAI Press
- Paul E. Utgoff and Jeffrey A. Clouse. Two kinds of training information for evaluation function learning. In AAAI-91, pages 596-600, Anaheim, CA, 1991. AAAI Press.
- (1991) AAAI-91 , pp. 596-600
- Utgoff, P.E.¹ Clouse, J.A.²

11
- 0002803472
- Mapping between dissimilar bodies: Affordances and the algebraic foundations of imitation
- Edinburgh
- Chrystopher Nehaniv and Kerstin Dautenhahn. Mapping between dissimilar bodies: Affordances and the algebraic foundations of imitation. In EWLR-98, pages 64-72, Edinburgh, 1998.
- (1998) EWLR-98 , pp. 64-72
- Nehaniv, C.¹ Dautenhahn, K.²

12
- 33749975326
- Skill reconstruction as induction of LQ controllers with subgoals
- Nagoya
- Dorian Šuc and Ivan Bratko. Skill reconstruction as induction of LQ controllers with subgoals. In IJCAI-97, pages 914-919, Nagoya, 1997.
- (1997) IJCAI-97 , pp. 914-919
- Šuc, D.¹ Bratko, I.²

13
- 0003148586
- Behaviour-based primitives for articulated control
- Zurich
- Maja J. Mataric, Matthew Williamson, John Demiris, and Aswath Mohan. Behaviour-based primitives for articulated control. In SAB-98, pages 165-170, Zurich, 1998.
- (1998) SAB-98 , pp. 165-170
- Mataric, M.J.¹ Williamson, M.² Demiris, J.³ Mohan, A.⁴

14
- 0027684215
- Prioritized sweeping: Reinforcement learning with less data and less real time
- Andrew W. Moore and Christopher G. Atkeson. Prioritized sweeping: Reinforcement learning with less data and less real time. Machine Learning, 13(1):103-30, 1993.
- (1993) Machine Learning , vol.13 , Issue.1 , pp. 103-130
- Moore, A.W.¹ Atkeson, C.G.²

15
- 84949480927
- MIT Press, Cambridge, MA
- Leslie Pack Kaelbling. Learning in Embedded Systems. MIT Press, Cambridge, MA, 1993.
- (1993) Learning in Embedded Systems
- Kaelbling, L.P.¹

16
- 0004255301
- Wiley, New York
- George A. F. Seber. Multivariate Observations. Wiley, New York, 1984.
- (1984) Multivariate Observations
- Seber, G.A.F.¹

17
- 38249003206
- A comparison of the Bonferroni and Scheffé bounds
- J. Mi and Allan R. Sampson. A comparison of the Bonferroni and Scheffé bounds. Journal of Statistical Planning and Inference, 36:101-105, 1993.
- (1993) Journal of Statistical Planning and Inference , vol.36 , pp. 101-105
- Mi, J.¹ Sampson, A.R.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.