메뉴 건너뛰기




Volumn 2056, Issue , 2001, Pages 111-120

Imitation and reinforcement learning in agents with heterogeneous actions

Author keywords

[No Author keywords available]

Indexed keywords

COMBINATORIAL OPTIMIZATION; DYNAMIC PROGRAMMING; MACHINE LEARNING;

EID: 84949445092     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/3-540-45153-6_11     Document Type: Conference Paper
Times cited : (8)

References (17)
  • 1
    • 84918834208 scopus 로고
    • A reinforcement learning approach to job-shop scheduling
    • Montreal
    • Wei Zhang and Thomas G. Dietterich. A reinforcement learning approach to job-shop scheduling. In IJCAI-95, pages 1114-1120, Montreal, 1995.
    • (1995) IJCAI-95 , pp. 1114-1120
    • Zhang, W.1    Dietterich, T.G.2
  • 2
    • 0031625420 scopus 로고    scopus 로고
    • Learning evaluation functions for global optimization and boolean satisfiability
    • Madison, Wisconsin
    • Justin A. Boyan and Andrew W. Moore. Learning evaluation functions for global optimization and boolean satisfiability. In AAAI-98, pages 3-10, July 26-30, 1998, Madison, Wisconsin, 1998.
    • (1998) AAAI-98 , pp. 3-10
    • Boyan, J.A.1    Moore, A.W.2
  • 3
    • 0010276944 scopus 로고    scopus 로고
    • Implicit imitation in multiagent reinforcement learning
    • Bled, SI
    • Bob Price and Craig Boutilier. Implicit imitation in multiagent reinforcement learning. In ICML-99, pages 325-334, Bled, SI, 1999.
    • (1999) ICML-99 , pp. 325-334
    • Price, B.1    Boutilier, C.2
  • 5
    • 0002130986 scopus 로고    scopus 로고
    • Robot learning from demonstration
    • Nashville, TN
    • C. G. Atkeson and S. Schaal. Robot learning from demonstration. In ICML-97, pages 12-20, Nashville, TN, 1997.
    • (1997) ICML-97 , pp. 12-20
    • Atkeson, C.G.1    Schaal, S.2
  • 6
    • 84956662672 scopus 로고    scopus 로고
    • Learning to communicate through imitation in autonomous robots
    • Lausanne, Switzerland
    • Aude Billard and Gillian Hayes. Learning to communicate through imitation in autonomous robots. In ICANN-97, pages 763-68, Lausanne, Switzerland, 1997.
    • (1997) ICANN-97 , pp. 763-768
    • Billard, A.1    Hayes, G.2
  • 7
    • 14344253698 scopus 로고
    • Tech-nical Report DAI No. 676, University of Edinburgh. Dept. of Arti_cial Intelligence
    • G. M. Hayes and J. Demiris. A robot controller using learning by imitation. Tech-nical Report DAI No. 676, University of Edinburgh. Dept. of Arti_cial Intelligence, 1994.
    • (1994) A Robot Controller Using Learning by Imitation
    • Hayes, G.M.1    Demiris, J.2
  • 8
    • 0028740409 scopus 로고
    • Learning by watching: Extracting reusable task knowledge from visual observation of human performance
    • Yasuo Kuniyoshi, Masayuki Inaba, and Hirochika Inoue. Learning by watching: Extracting reusable task knowledge from visual observation of human performance. IEEE Transactions on Robotics and Automation, 10(6):799-822, 1994.
    • (1994) IEEE Transactions on Robotics and Automation , vol.10 , Issue.6 , pp. 799-822
    • Kuniyoshi, Y.1    Inaba, M.2    Inoue, H.3
  • 9
    • 0012934374 scopus 로고
    • LEAP: A learning apprentice for VLSI design
    • Los Altos, California, Morgan Kaufmann Publishers, Inc
    • T. M. Mitchell, S. Mahadevan, and L. Steinberg. LEAP: A learning apprentice for VLSI design. In IJCAI-85, pages 573-580, Los Altos, California,1985. Morgan Kaufmann Publishers, Inc.
    • (1985) IJCAI-85 , pp. 573-580
    • Mitchell, T.M.1    Mahadevan, S.2    Steinberg, L.3
  • 10
    • 0008861422 scopus 로고
    • Two kinds of training information for eval-uation function learning
    • Anaheim, CA, AAAI Press
    • Paul E. Utgoff and Jeffrey A. Clouse. Two kinds of training information for eval-uation function learning. In AAAI-91, pages 596-600, Anaheim, CA,1991. AAAI Press.
    • (1991) AAAI-91 , pp. 596-600
    • Utgoff, P.E.1    Clouse, J.A.2
  • 11
    • 0002803472 scopus 로고    scopus 로고
    • Mapping between dissimilar bod-ies: A_ordances and the algebraic foundations of imitation
    • Edinburgh
    • Chrystopher Nehaniv and Kerstin Dautenhahn. Mapping between dissimilar bod-ies: A_ordances and the algebraic foundations of imitation. In EWLR-98, pages 64-72, Edinburgh, 1998.
    • (1998) EWLR-98 , pp. 64-72
    • Nehaniv, C.1    Dautenhahn, K.2
  • 12
    • 33749975326 scopus 로고    scopus 로고
    • Skill reconstruction as induction of LQ controllers with subgoals
    • Nagoya
    • Dorian _Suc and Ivan Bratko. Skill reconstruction as induction of LQ controllers with subgoals. In IJCAI-97, pages 914-919, Nagoya, 1997.
    • (1997) IJCAI-97 , pp. 914-919
    • _Suc, D.1    Bratko, I.2
  • 13
    • 0003148586 scopus 로고    scopus 로고
    • Behaviour-based primitives for articulated control
    • Zurich
    • Maja J. Mataric, Matthew Williamson, John Demiris, and Aswath Mohan. Behaviour-based primitives for articulated control. In SAB-98, pages 165-170, Zurich, 1998.
    • (1998) SAB-98 , pp. 165-170
    • Mataric, M.J.1    Williamson, M.2    Demiris, J.3    Mohan, A.4
  • 14
    • 0027684215 scopus 로고
    • Atkeson. Prioritized sweeping: Reinforce-ment learning with less data and less real time
    • Andrew W. Moore and Christopher G. Atkeson. Prioritized sweeping: Reinforce-ment learning with less data and less real time. Machine Learning, 13(1):103-30, 1993.
    • (1993) Machine Learning , vol.13 , Issue.1 , pp. 103-130
    • Moore, A.W.1    Christopher, G.2
  • 15
    • 84949480927 scopus 로고
    • Leslie Pack Kaelbling
    • MIT Press, Cambridge,MA
    • Leslie Pack Kaelbling. Learning in Embedded Systems. MIT Press, Cambridge,MA, 1993.
    • (1993) Learning in Embedded Systems


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.