메뉴 건너뛰기




Volumn 2636, Issue , 2003, Pages 33-48

Cooperative learning using advice exchange

Author keywords

[No Author keywords available]

Indexed keywords

BACKPROPAGATION; INTELLIGENT AGENTS; LEARNING SYSTEMS; SUPERVISED LEARNING; TRAFFIC CONTROL;

EID: 4544316833     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/3-540-44826-8     Document Type: Conference Paper
Times cited : (20)

References (39)
  • 1
    • 85152198941 scopus 로고
    • Multi-Agent Reinforcement Learning: Independent vs. Cooperative Agents
    • Amherst, MA
    • M. Tan. Multi-Agent Reinforcement Learning: Independent vs. Cooperative Agents. Proc. of the Tenth International Conference on Machine Learning, Amherst, MA, 330-337, 1993
    • (1993) Proc. of the Tenth International Conference on Machine Learning , pp. 330-337
    • Tan, M.1
  • 2
    • 0000580224 scopus 로고
    • A Temporal-Difference Model of Classical Conditioning
    • R. S. Sutton and A. G. Barto. A Temporal-Difference Model of Classical Conditioning. Tech Report GTE Labs. TR 87-509.2, 1987
    • (1987) Tech Report GTE Labs , vol.2 , pp. 87-509
    • Sutton, R.S.1    Barto, A.G.2
  • 4
    • 0000123778 scopus 로고
    • Self-improving reactive agents based on reinforcement learning, planning and teaching
    • Kluwer Academic publishers
    • L.-J. Lin. Self-improving reactive agents based on reinforcement learning, planning and teaching. Machine Learning 8: 293-321, Kluwer Academic publishers, 1992
    • (1992) Machine Learning , vol.8 , pp. 293-321
    • Lin, L.-J.1
  • 5
    • 34249833101 scopus 로고
    • Technical note: Q-learning
    • Kluwer Academic publishers
    • C. J. C. H. Watkins, P. D. Dayan. Technical note: Q-learning. Machine Learning 8, 3: 279-292, Kluwer Academic publishers, 1992
    • (1992) Machine Learning 8 , vol.3 , pp. 279-292
    • Watkins, C.J.C.H.1    Dayan, P.D.2
  • 6
    • 0003812851 scopus 로고
    • A study of cooperative mechanisms for faster reinforcement learning
    • Computer Science Department, University of Rochester
    • S. D. Whitehead, D. H. Ballard. A study of cooperative mechanisms for faster reinforcement learning. TR 365, Computer Science Department, University of Rochester, 1991
    • (1991) TR 365
    • Whitehead, S.D.1    Ballard, D.H.2
  • 8
    • 23144457147 scopus 로고
    • Teaching by shaping
    • Workshop on Learning by Induction vs. Learning by Demonstration, Tahoe City, CA, USA
    • C. Baroglio. Teaching by shaping. Proc. of ICML-95. Workshop on Learning by Induction vs. Learning by Demonstration, Tahoe City, CA, USA, 1995
    • (1995) Proc. of ICML-95
    • Baroglio, C.1
  • 9
    • 0038849321 scopus 로고    scopus 로고
    • Gerhard Weiß and Sandip Sen, editors, Adaptation and Learning in Multiagent Systems, Springer Verlag, Berlin
    • J. A. Clouse. Learning from an automated training agent. Gerhard Weiß and Sandip Sen, editors, Adaptation and Learning in Multiagent Systems, Springer Verlag, Berlin, 1996
    • (1996) Learning from an automated training agent
    • Clouse, J.A.1
  • 15
    • 0029732210 scopus 로고    scopus 로고
    • Creating advicetaking reinforcement learners
    • R. Maclin, J. Shavlik. Creating advicetaking reinforcement learners. Machine Learning 22: 251-281, 1997
    • (1997) Machine Learning , vol.22 , pp. 251-281
    • Maclin, R.1    Shavlik, J.2
  • 16
    • 0002797521 scopus 로고    scopus 로고
    • Learning in behaviour-based multi-robot systems: Policies, models and other agents
    • Elsvier
    • M. J. Mataric. Learning in behaviour-based multi-robot systems: Policies, models and other agents. Journal of Cognitive Systems Research 2: 81-93, Elsvier, 2001
    • (2001) Journal of Cognitive Systems Research , vol.2 , pp. 81-93
    • Mataric, M.J.1
  • 19
    • 0003200022 scopus 로고    scopus 로고
    • Sensory-motor primitives as a basis for imitation: Linking perception to action and biology to robotics
    • C. Nehaniv & K. Dautenhahn (Eds.), MIT Press
    • M. J. Mataric. Sensory-motor primitives as a basis for imitation: Linking perception to action and biology to robotics. C. Nehaniv & K. Dautenhahn (Eds.), Imitation in animals and artifacts, MIT Press, 2001
    • (2001) Imitation in animals and artifacts
    • Mataric, M.J.1
  • 25
    • 0033362601 scopus 로고    scopus 로고
    • Evolving artificial neural networks
    • X. Yao. Evolving artificial neural networks. Proceedings of the IEEE, 87(9), 1423-1447, 1999
    • (1999) Proceedings of the IEEE , vol.87 , Issue.9 , pp. 1423-1447
    • Yao, X.1
  • 28
    • 23144445904 scopus 로고    scopus 로고
    • The Improvement and Comparison of different Algorithms for Optimizing Neural Networks on the MasPar {MP}-2
    • ICSC Academic Press, Ed.M. Heiss
    • W. Erhard, T. Fink, M. M. Gutzmann, C. Rahn, A. Doering, M. Galicki, The Improvement and Comparison of different Algorithms for Optimizing Neural Networks on the MasPar {MP}-2. Neural Computation {NC}'98, ICSC Academic Press, Ed.M. Heiss, 617-623, 1998
    • (1998) Neural Computation {NC}'98 , pp. 617-623
    • Erhard, W.1    Fink, T.2    Gutzmann, M.M.3    Rahn, C.4    Doering, A.5    Galicki, M.6
  • 32
    • 1842538384 scopus 로고    scopus 로고
    • Masters Thesis, Department of Computer Science, Colorado State University
    • T. Thorpe. Vehicle Traffic Light Control Using SARSA. Masters Thesis, Department of Computer Science, Colorado State University, 1997
    • (1997) Vehicle Traffic Light Control Using SARSA
    • Thorpe, T.1
  • 37
    • 85132026293 scopus 로고    scopus 로고
    • Integrated architectures for learning planning and reacting based on approximating dynamic programming
    • Morgan-Kaufman
    • R. S. Sutton. Integrated architectures for learning planning and reacting based on approximating dynamic programming. Proc. of the Seventh International Conference on Machine Learning, 216-224, Morgan-Kaufman.
    • Proc. of the Seventh International Conference on Machine Learning , pp. 216-224
    • Sutton, R.S.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.