메뉴 건너뛰기




Volumn 5, Issue 1, 1996, Pages 5-28

Learning signaling behaviors and specialization in cooperative agents

Author keywords

Cooperation; Multiagent systems; Reinforcement learning; Specialization

Indexed keywords


EID: 0030153304     PISSN: 10597123     EISSN: None     Source Type: Journal    
DOI: 10.1177/105971239600500102     Document Type: Article
Times cited : (19)

References (23)
  • 2
    • 0029210635 scopus 로고
    • Learning to act using real-time dynamic programming
    • Barto, A. G., Bradtke, S. J., & Singh, S. P. (1995). Learning to act using real-time dynamic programming. Artificial Intelligence, 72, 81-138.
    • (1995) Artificial Intelligence , vol.72 , pp. 81-138
    • Barto, A.G.1    Bradtke, S.J.2    Singh, S.P.3
  • 3
    • 0003602259 scopus 로고
    • Tech. Rep. COINS-89-95. Amherst: Department of Computer and Information Science, University of Massachusetts
    • Barto, A. G., Sutton, R. S., & Watkins, C. J. C. H. (1989). Learning and sequential decision making (Tech. Rep. COINS-89-95) Amherst: Department of Computer and Information Science, University of Massachusetts.
    • (1989) Learning and Sequential Decision Making
    • Barto, A.G.1    Sutton, R.S.2    Watkins, C.J.C.H.3
  • 6
    • 84977005393 scopus 로고
    • Collective robotics: From social insects to robots
    • Kube, C. R., & Zhang, H. (1993). Collective robotics: From social insects to robots. Adaptive Behavior, 2, 189-218.
    • (1993) Adaptive Behavior , vol.2 , pp. 189-218
    • Kube, C.R.1    Zhang, H.2
  • 7
    • 0000123778 scopus 로고
    • Self-improving reactive agents based on reinforcement learning, planning and teaching
    • Lin, L.-J. (1992). Self-improving reactive agents based on reinforcement learning, planning and teaching. Machine Learning, 8, 293-321.
    • (1992) Machine Learning , vol.8 , pp. 293-321
    • Lin, L.-J.1
  • 8
    • 0026880130 scopus 로고
    • Automatic programming of behavior-based robots using reinforcement learning
    • Mahadevan, S., & Connell, J. (1992). Automatic programming of behavior-based robots using reinforcement learning. Artificial Intelligence, 55, 311-365.
    • (1992) Artificial Intelligence , vol.55 , pp. 311-365
    • Mahadevan, S.1    Connell, J.2
  • 12
    • 0010862056 scopus 로고
    • Learning efficient reactive behavioral sequences from basic reflexes in a goal-directed autonomous robot
    • D. Cliff, P. Husbands, J.-A. Meyer, & S. W. Wilson (Eds.). Cambridge, MA: MIT Press
    • Millán, J. del R. (1994). Learning efficient reactive behavioral sequences from basic reflexes in a goal-directed autonomous robot. In D. Cliff, P. Husbands, J.-A. Meyer, & S. W. Wilson (Eds.), From animals to animats III: Third International Conference on Simulation of Adaptive Behavior. Cambridge, MA: MIT Press.
    • (1994) From Animals to Animats III: Third International Conference on Simulation of Adaptive Behavior
    • Millán, J.D.R.1
  • 13
    • 0000714373 scopus 로고
    • A reinforcement connectionist approach to robot path finding in non-maze-like environments
    • Millán, J. del R., & Torras, C. (1992). A reinforcement connectionist approach to robot path finding in non-maze-like environments. Machine Learning, 8, 363-395.
    • (1992) Machine Learning , vol.8 , pp. 363-395
    • Millán, J.D.R.1    Torras, C.2
  • 14
    • 0001187959 scopus 로고
    • Explanation-based neural networks learning for robot control
    • C. L. Giles, S. J. Hanson, & J. D. Cowan (Eds.). San Mateo, CA: Morgan Kaufmann
    • Mitchell, T. M., & Thrun, S.B. (1993). Explanation-based neural networks learning for robot control. In C. L. Giles, S. J. Hanson, & J. D. Cowan (Eds.), Advances in neural information processing systems 5. San Mateo, CA: Morgan Kaufmann.
    • (1993) Advances in Neural Information Processing Systems , vol.5
    • Mitchell, T.M.1    Thrun, S.B.2
  • 17
    • 0011267427 scopus 로고
    • Obstacle avoidance through reinforcement learning
    • J. E. Moody, S. J. Hanson, & R. P. Lippmann (Eds.). San Mateo, CA: Morgan Kaufmann
    • Prescott, T. J., & Mayhew, J. E. W. (1992). Obstacle avoidance through reinforcement learning. In J. E. Moody, S. J. Hanson, & R. P. Lippmann (Eds.), Advances in Neural Information Processing Systems 4. San Mateo, CA: Morgan Kaufmann.
    • (1992) Advances in Neural Information Processing Systems , vol.4
    • Prescott, T.J.1    Mayhew, J.E.W.2
  • 18
    • 0001024813 scopus 로고
    • A case study in the behavior-oriented design of autonomous agents
    • D. Cliff, P. Husbands, J.-A. Meyer, & S. W. Wilson (Eds.). Cambridge MA: MIT Press
    • Steels, L. (1994). A case study in the behavior-oriented design of autonomous agents. In D. Cliff, P. Husbands, J.-A. Meyer, & S. W. Wilson (Eds.), From animals to animats III: Third International Conference on Simulation of Adaptive Behavior. Cambridge MA: MIT Press.
    • (1994) From Animals to Animats III: Third International Conference on Simulation of Adaptive Behavior
    • Steels, L.1
  • 19
    • 0003617454 scopus 로고
    • Unpublished doctoral thesis, Department of Computer and Information Science, University of Massachusetts, Amherst
    • Sutton, R. S. (1984). Temporal credit assignment in reinforcement learning. Unpublished doctoral thesis, Department of Computer and Information Science, University of Massachusetts, Amherst.
    • (1984) Temporal Credit Assignment in Reinforcement Learning
    • Sutton, R.S.1
  • 20
    • 85132026293 scopus 로고
    • Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
    • San Mateo, CA: Morgan Kaufmann
    • Sutton, R. S. (1990). Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. Proceedings of the Seventh International Conference on Machine Learning. San Mateo, CA: Morgan Kaufmann.
    • (1990) Proceedings of the Seventh International Conference on Machine Learning
    • Sutton, R.S.1
  • 21
    • 85152198941 scopus 로고
    • Multi-agent reinforcement learning: Independent vs. cooperative agents
    • San Mateo, CA: Morgan Kaufmann
    • Tan, M. (1993). Multi-agent reinforcement learning: Independent vs. cooperative agents. Proceedings of the Tenth International Conference on Machine Learning. San Mateo, CA: Morgan Kaufmann.
    • (1993) Proceedings of the Tenth International Conference on Machine Learning
    • Tan, M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.