메뉴 건너뛰기




Volumn 9, Issue 3, 2011, Pages 440-450

Hierarchical state-abstracted and socially augmented Q-Learning for reducing complexity in agent-based learning

Author keywords

Decentralized Markov decision process; Multiagent systems; Reinforcement learning

Indexed keywords

ACTION SELECTION; ACTION SPACES; AGENT BASED; ANALYSIS RESULTS; BRAIN CIRCUITS; COMPLEX PROBLEMS; COMPUTATIONAL COSTS; HIERARCHICAL REPRESENTATION; LEARNING EFFICIENCY; LEARNING PROBLEM; MARKOV DECISION PROCESSES; MEMORY RESOURCES; MULTIROBOTS; NATURAL WORLD; Q-LEARNING; SOCIAL COGNITION; SOCIAL INTELLIGENCE; SOCIAL KNOWLEDGE; STATE ABSTRACTION; STATE REPRESENTATION; TASK COMPLETION TIME; TASK DOMAIN; TASK SPACE; THEORETICAL BOUNDS; UNCERTAIN ENVIRONMENTS;

EID: 79960460645     PISSN: 16726340     EISSN: 10008152     Source Type: Journal    
DOI: 10.1007/s11768-011-1047-6     Document Type: Article
Times cited : (10)

References (33)
  • 1
    • 0036874366 scopus 로고    scopus 로고
    • The complexity of decentralized control of Markov decision processes
    • D. Bernstein, R. Givan, N. Immerman, et al. The complexity of decentralized control of Markov decision processes. Mathematics of Operations Research, 2002, 27(4): 819-840.
    • (2002) Mathematics of Operations Research , vol.27 , Issue.4 , pp. 819-840
    • Bernstein, D.1    Givan, R.2    Immerman, N.3
  • 3
    • 33745892592 scopus 로고    scopus 로고
    • Engines of the brain: the computational instruction set of human cognition
    • R. Granger. Engines of the brain: the computational instruction set of human cognition. AI Magazine, 2006: 27(2): 15-32.
    • (2006) AI Magazine , vol.27 , Issue.2 , pp. 15-32
    • Granger, R.1
  • 4
    • 2942527304 scopus 로고    scopus 로고
    • Derivation and analysis of basic computational operations of thalamocortical circuits
    • A. Rodriguez, J. Whitson, R. Granger. Derivation and analysis of basic computational operations of thalamocortical circuits. Journal of Cognitive Neuroscience, 2004: 16(5): 856-877.
    • (2004) Journal of Cognitive Neuroscience , vol.16 , Issue.5 , pp. 856-877
    • Rodriguez, A.1    Whitson, J.2    Granger, R.3
  • 5
    • 34548679807 scopus 로고    scopus 로고
    • Social components of fitness in primate groups
    • J. B. Silk. Social components of fitness in primate groups. Science, 2007, 317(5843): 1347-1351.
    • (2007) Science , vol.317 , Issue.5843 , pp. 1347-1351
    • Silk, J.B.1
  • 8
    • 4544347163 scopus 로고    scopus 로고
    • Satisficing and optimality
    • M. Byron. Satisficing and optimality. Ethics, 1998: 109(1): 67-93.
    • (1998) Ethics , vol.109 , Issue.1 , pp. 67-93
    • Byron, M.1
  • 12
    • 0038485628 scopus 로고    scopus 로고
    • The spontaneous emergence of leaders and followers in a foraging pair
    • S. A. Rands, G. Cowlishaw, R. A. Pettifor, et al. The spontaneous emergence of leaders and followers in a foraging pair. Nature, 2003, 423(6938): 432-434.
    • (2003) Nature , vol.423 , Issue.6938 , pp. 432-434
    • Rands, S.A.1    Cowlishaw, G.2    Pettifor, R.A.3
  • 13
    • 34447339384 scopus 로고    scopus 로고
    • Dolphin social intelligence: complex alliance relationships in bottlenose dolphins and a consideration of selective environments for extreme brain size evolution in mammals
    • R. C. Connor. Dolphin social intelligence: complex alliance relationships in bottlenose dolphins and a consideration of selective environments for extreme brain size evolution in mammals. Philosophical Transactions of the Royal Society B: Biological Sciences, 2007: 362(1480): 587-602.
    • (2007) Philosophical Transactions of the Royal Society B: Biological Sciences , vol.362 , Issue.1480 , pp. 587-602
    • Connor, R.C.1
  • 14
    • 23844524419 scopus 로고    scopus 로고
    • Complex cooperation among tai chimpanzees
    • F. B. M. Waal and P. L. Tyack (Eds.), Cambridge: Harvard University Press
    • C. Boesch. Complex cooperation among tai chimpanzees. Animal Social Complexity. F. B. M. Waal, P. L. Tyack, eds. Cambridge: Harvard University Press, 2003: 93-110.
    • (2003) Animal Social Complexity , pp. 93-110
    • Boesch, C.1
  • 15
    • 27344449757 scopus 로고    scopus 로고
    • Decentralized control of cooperative systems: Categorization and complexity analysis
    • C. Goldman, S. Zilberstein. Decentralized control of cooperative systems: Categorization and complexity analysis. Journal of Artificial Intelligence Research, 2004: 22(1): 143-174.
    • (2004) Journal of Artificial Intelligence Research , vol.22 , Issue.1 , pp. 143-174
    • Goldman, C.1    Zilberstein, S.2
  • 16
    • 4644369748 scopus 로고    scopus 로고
    • Nash Q-learning for general-sum stochastic games
    • J. Hu, M. Wellman. Nash Q-learning for general-sum stochastic games. Journal of Machine Learning Research, 2003, 4: 1039-1069.
    • (2003) Journal of Machine Learning Research , vol.4 , pp. 1039-1069
    • Hu, J.1    Wellman, M.2
  • 21
    • 34249833101 scopus 로고
    • Technical note: Q-learning
    • C. J. C. H. Watkins, P. Dayan. Technical note: Q-learning. Machine Learning, 1992: 8(3/4): 279-292.
    • (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 279-292
    • Watkins, C.J.C.H.1    Dayan, P.2
  • 22
    • 0028497630 scopus 로고
    • Asynchronous stochastic approximation and Q-learning
    • J. Tsitsiklis. Asynchronous stochastic approximation and Q-learning. Machine Learning, 1994: 16(3): 185-202.
    • (1994) Machine Learning , vol.16 , Issue.3 , pp. 185-202
    • Tsitsiklis, J.1
  • 23
    • 85149834820 scopus 로고
    • Markov games as a framework for multiagent reinforcement learning
    • San Francisco, CA: Morgan Kaufmann Publishers Inc
    • M. L. Littman. Markov games as a framework for multiagent reinforcement learning. Proceedings of the 11th International Conference on Machine Learning, San Francisco, CA: Morgan Kaufmann Publishers Inc., 1994: 157-163.
    • (1994) Proceedings of the 11th International Conference on Machine Learning , pp. 157-163
    • Littman, M.L.1
  • 26
    • 0031636218 scopus 로고    scopus 로고
    • Tree based discretization for continuous state space reinforcement learning
    • Menlo Park, CA: American Association for Artificial Intelligence (AAAI)
    • W. T. B. Uther, M. M. Veloso. Tree based discretization for continuous state space reinforcement learning. Proceedings of the 15th National Conference on Artificial Intelligence, Menlo Park, CA: American Association for Artificial Intelligence (AAAI), 1998: 769-774.
    • (1998) Proceedings of the 15th National Conference on Artificial Intelligence , pp. 769-774
    • Uther, W.T.B.1    Veloso, M.M.2
  • 31
    • 0012286079 scopus 로고    scopus 로고
    • An algorithm for distributed reinforcement learning in cooperative multi-agent systems
    • San Francisco, CA: Morgan Kaufmann Publishers Inc
    • M. Lauer, M. Riedmiller. An algorithm for distributed reinforcement learning in cooperative multi-agent systems. Proceedings of the 17th International Conference on Machine Learning, San Francisco, CA: Morgan Kaufmann Publishers Inc., 2000: 535-542.
    • (2000) Proceedings of the 17th International Conference on Machine Learning , pp. 535-542
    • Lauer, M.1    Riedmiller, M.2
  • 32
    • 0001051322 scopus 로고
    • The significance of the gregarious habit
    • R. C. Miller. The significance of the gregarious habit. Ecology, 1922: 3(2): 122-126.
    • (1922) Ecology , vol.3 , Issue.2 , pp. 122-126
    • Miller, R.C.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.