



Volume 2837, 2003, Pages 181-192

COllective INtelligence with sequences of actions: coordinating actions in multi-agent systems

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; FUNCTIONS; HEURISTIC METHODS; LEARNING SYSTEMS; MULTI AGENT SYSTEMS; PARAMETER ESTIMATION; PROBLEM SOLVING; ARTIFICIAL INTELLIGENCE; INTELLIGENT AGENTS; REINFORCEMENT LEARNING;

EID: 9444227681     PISSN: 03029743     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 7

References (22)
  • 1
    • 0037288370
    • Recent advances in hierarchical reinforcement learning
    • to appear
    • A. Barto and S. Mahadevan. Recent advances in hierarchical reinforcement learning. Discrete-Event Systems journal, 2003. to appear.
    • (2003) Discrete-event Systems Journal
    • Barto, A.1    Mahadevan, S.2
  • 3
    • 0014413249
    • The tragedy of the commons
    • G. Hardin. The tragedy of the commons. Science, 162:1243-1248, 1968.
    • (1968) Science , vol.162 , pp. 1243-1248
    • Hardin, G.1
  • 4
    • 0012286079
    • An algorithm for distributed reinforcement learning in cooperative multi-agent systems
    • Morgan Kaufmann, San Francisco, CA
    • M. Lauer and M. Riedmiller. An algorithm for distributed reinforcement learning in cooperative multi-agent systems. In Proc. 17th International Conf. on Machine Learning, pages 535-542. Morgan Kaufmann, San Francisco, CA, 2000.
    • (2000) Proc. 17th International Conf. on Machine Learning , pp. 535-542
    • Lauer, M.1    Riedmiller, M.2
  • 6
    • 9444230770
    • Personal communication with A. Agogino
    • Personal communication with A. Agogino.
  • 9
    • 0003411271
    • Efficient exploration in reinforcement learning
    • Carnegie Mellon University, Pittsburgh, Pennsylvania
    • S. B. Thrun. Efficient exploration in reinforcement learning. Technical Report CMU-CS-92-102, Carnegie Mellon University, Pittsburgh, Pennsylvania, 1992.
    • (1992) Technical Report , vol.CMU-CS-92-102
    • Thrun, S.B.1
  • 10
    • 0036355687
    • Learning sequences of actions in collectives of autonomous agents
    • ACM press
    • K. Tumer, A. Agogino, and D. Wolpert. Learning sequences of actions in collectives of autonomous agents. In Autonomous Agents & Multiagent Systems, pages 378-385, part 1. ACM Press, 2002.
    • (2002) Autonomous Agents & Multiagent Systems , Issue.PART 1 , pp. 378-385
    • Tumer, K.1    Agogino, A.2    Wolpert, D.3
  • 12
    • 34249833101
    • Q-learning
    • Watkins and Dayan. Q-learning. Machine Learning, 8:279-292, 1992.
    • (1992) Machine Learning , vol.8 , pp. 279-292
    • Watkins1    Dayan2
  • 13
    • 9444288363
    • A multiagent framework for planning, reacting, and learning
    • Institut für Informatik, Technische Universität München
    • G. Weiss. A multiagent framework for planning, reacting, and learning. Technical Report FKI-233-99, Institut für Informatik, Technische Universität München, 1999.
    • (1999) Technical Report , vol.FKI-233-99
    • Weiss, G.1
  • 14
    • 9444234731
    • The economic approach to artificial intelligence
    • M. P. Wellman. The economic approach to artificial intelligence. ACM Computing Surveys, 28(4es):14-15, 1996.
    • (1996) ACM Computing Surveys , vol.28 , Issue.4 ES , pp. 14-15
    • Wellman, M.P.1
  • 15
    • 0013250428
    • Market-oriented programming: Some early lessons
    • S. Clearwater, editor, World Scientific, River Edge, New Jersey
    • M. P. Wellman. Market-oriented programming: Some early lessons. In S. Clearwater, editor, Market-Based Control: A Paradigm for Distributed Resource Allocation. World Scientific, River Edge, New Jersey, 1996.
    • (1996) Market-based Control: A Paradigm for Distributed Resource Allocation
    • Wellman, M.P.1
  • 17
    • 0004320981
    • An introduction to COllective INtelligence
    • NASA Ames Research Center
    • D. Wolpert and K. Tumer. An introduction to COllective INtelligence. Technical Report NASA-ARC-IC-99-63, NASA Ames Research Center, 1999. A shorter version of this paper is to appear in: Jeffrey M. Bradshaw, editor, Handbook of Agent Technology, AAAI Press/MIT Press, 1999.
    • (1999) Technical Report , vol.NASA-ARC-IC-99-63
    • Wolpert, D.1    Tumer, K.2
  • 18
    • 0347885021
    • AAAI Press/MIT Press
    • D. Wolpert and K. Tumer. An introduction to COllective INtelligence. Technical Report NASA-ARC-IC-99-63, NASA Ames Research Center, 1999. A shorter version of this paper is to appear in: Jeffrey M. Bradshaw, editor, Handbook of Agent Technology, AAAI Press/MIT Press, 1999.
    • (1999) Handbook of Agent Technology
    • Bradshaw, J.M.1
  • 19
    • 0001309161
    • Optimal payoff functions for members of collectives
    • in press
    • D. Wolpert and K. Tumer. Optimal payoff functions for members of collectives. Advances in Complex Systems, 2001. in press.
    • (2001) Advances in Complex Systems
    • Wolpert, D.1    Tumer, K.2
  • 22
    • 0032691530
    • General principles of learning-based multi-agent systems
    • O. Etzioni, J. P. Müller, and J. M. Bradshaw, editors, New York, May 1-5. ACM Press
    • D. H. Wolpert, K. R. Wheeler, and K. Tumer. General principles of learning-based multi-agent systems. In O. Etzioni, J. P. Müller, and J. M. Bradshaw, editors, Proceedings of the Third Annual Conference on Autonomous Agents (AGENTS-99), pages 77-83, New York, May 1-5 1999. ACM Press.
    • (1999) Proceedings of the Third Annual Conference on Autonomous Agents (AGENTS-99) , pp. 77-83
    • Wolpert, D.H.1    Wheeler, K.R.2    Tumer, K.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.