SCOPUS 정보 검색 플랫폼

Journal of Control Theory and Applications

Volumn 9, Issue 3, 2011, Pages 440-450

Hierarchical state-abstracted and socially augmented Q-Learning for reducing complexity in agent-based learning

(5) Sun, Xueqing a Mao, Tao a Ray, Laura a Shi, Dongqing b Kralik, Jerald b

a DARTMOUTH COLLEGE (United States)

b Lamont Doherty Earth Observatory (United States)

Author keywords

Decentralized Markov decision process; Multiagent systems; Reinforcement learning

Indexed keywords

ACTION SELECTION; ACTION SPACES; AGENT BASED; ANALYSIS RESULTS; BRAIN CIRCUITS; COMPLEX PROBLEMS; COMPUTATIONAL COSTS; HIERARCHICAL REPRESENTATION; LEARNING EFFICIENCY; LEARNING PROBLEM; MARKOV DECISION PROCESSES; MEMORY RESOURCES; MULTIROBOTS; NATURAL WORLD; Q-LEARNING; SOCIAL COGNITION; SOCIAL INTELLIGENCE; SOCIAL KNOWLEDGE; STATE ABSTRACTION; STATE REPRESENTATION; TASK COMPLETION TIME; TASK DOMAIN; TASK SPACE; THEORETICAL BOUNDS; UNCERTAIN ENVIRONMENTS;

ABSTRACTING; LEARNING ALGORITHMS; MAMMALS; MARKOV PROCESSES; MULTI AGENT SYSTEMS; PROBLEM SOLVING; REINFORCEMENT LEARNING;

COMPUTATIONAL COMPLEXITY;

EID: 79960460645 PISSN: 16726340 EISSN: 10008152 Source Type: Journal
DOI: 10.1007/s11768-011-1047-6 Document Type: Article

Times cited : (10)

References (33)

1
- 0036874366
- The complexity of decentralized control of Markov decision processes
- D. Bernstein, R. Givan, N. Immerman, et al. The complexity of decentralized control of Markov decision processes. Mathematics of Operations Research, 2002, 27(4): 819-840.
- (2002) Mathematics of Operations Research , vol.27 , Issue.4 , pp. 819-840
- Bernstein, D.¹ Givan, R.² Immerman, N.³

2
- 1142292487
- The complexity of multiagent systems: The price of silence
- Melbourne, Australia
- Z. Rabinovich, C. V. Goldman, J. S. Rosenschein. The complexity of multiagent systems: The price of silence. Proceedings of the 2nd International Joint Conference on Autonomous Agents and Multiagent Systems, Melbourne, Australia, 2003: 1102-1103.
- (2003) Proceedings of the 2nd International Joint Conference on Autonomous Agents and Multiagent Systems , pp. 1102-1103
- Rabinovich, Z.¹ Goldman, C.V.² Rosenschein, J.S.³

3
- 33745892592
- Engines of the brain: the computational instruction set of human cognition
- R. Granger. Engines of the brain: the computational instruction set of human cognition. AI Magazine, 2006: 27(2): 15-32.
- (2006) AI Magazine , vol.27 , Issue.2 , pp. 15-32
- Granger, R.¹

4
- 2942527304
- Derivation and analysis of basic computational operations of thalamocortical circuits
- A. Rodriguez, J. Whitson, R. Granger. Derivation and analysis of basic computational operations of thalamocortical circuits. Journal of Cognitive Neuroscience, 2004: 16(5): 856-877.
- (2004) Journal of Cognitive Neuroscience , vol.16 , Issue.5 , pp. 856-877
- Rodriguez, A.¹ Whitson, J.² Granger, R.³

5
- 34548679807
- Social components of fitness in primate groups
- J. B. Silk. Social components of fitness in primate groups. Science, 2007, 317(5843): 1347-1351.
- (2007) Science , vol.317 , Issue.5843 , pp. 1347-1351
- Silk, J.B.¹

6
- 0004249605
- (Eds.), Chicago: University of Chicago Press
- B. B. Smuts, D. L. Cheney, R. M. Seyfarth, et al., eds. Primate Societies. Chicago: University of Chicago Press, 1987.
- (1987) Primate Societies
- Smuts, B.B.¹ Cheney, D.L.² Seyfarth, R.M.³

7
- 0003605728
- Oxford: Oxford University Press
- C. Boesch, H. Boesch-Achermann. The Chimpanzees of the Tai Forest: Behavioral Ecology and Evolution. Oxford: Oxford University Press, 2000.
- (2000) The Chimpanzees of the Tai Forest: Behavioral Ecology and Evolution
- Boesch, C.¹ Boesch-Achermann, H.²

8
- 4544347163
- Satisficing and optimality
- M. Byron. Satisficing and optimality. Ethics, 1998: 109(1): 67-93.
- (1998) Ethics , vol.109 , Issue.1 , pp. 67-93
- Byron, M.¹

9
- 0004298344
- Oxford: Clarendon Press
- W. Byrne, A. Whiten. Machiavellian Intelligence: Social Expertise and the Evolution of Intellect in Monkeys, Apes and Humans. Oxford: Clarendon Press, 1988.
- (1988) Machiavellian Intelligence: Social Expertise and the Evolution of Intellect in Monkeys, Apes and Humans
- Byrne, W.¹ Whiten, A.²

10
- 0003840657
- New York: Oxford University Press
- C. W. Clark, M. Mangel. Dynamic State Variable Models in Ecology: Methods and Applications. New York: Oxford University Press, 2000.
- (2000) Dynamic State Variable Models in Ecology: Methods and Applications
- Clark, C.W.¹ Mangel, M.²

11
- 0004250710
- Princeton: Princeton University Press
- L. A. Giraldeau, T. Caraco. Social Foraging Theory. Princeton: Princeton University Press, 2000.
- (2000) Social Foraging Theory
- Giraldeau, L.A.¹ Caraco, T.²

12
- 0038485628
- The spontaneous emergence of leaders and followers in a foraging pair
- S. A. Rands, G. Cowlishaw, R. A. Pettifor, et al. The spontaneous emergence of leaders and followers in a foraging pair. Nature, 2003, 423(6938): 432-434.
- (2003) Nature , vol.423 , Issue.6938 , pp. 432-434
- Rands, S.A.¹ Cowlishaw, G.² Pettifor, R.A.³

13
- 34447339384
- Dolphin social intelligence: complex alliance relationships in bottlenose dolphins and a consideration of selective environments for extreme brain size evolution in mammals
- R. C. Connor. Dolphin social intelligence: complex alliance relationships in bottlenose dolphins and a consideration of selective environments for extreme brain size evolution in mammals. Philosophical Transactions of the Royal Society B: Biological Sciences, 2007: 362(1480): 587-602.
- (2007) Philosophical Transactions of the Royal Society B: Biological Sciences , vol.362 , Issue.1480 , pp. 587-602
- Connor, R.C.¹

14
- 23844524419
- Complex cooperation among tai chimpanzees
- F. B. M. Waal and P. L. Tyack (Eds.), Cambridge: Harvard University Press
- C. Boesch. Complex cooperation among tai chimpanzees. Animal Social Complexity. F. B. M. Waal, P. L. Tyack, eds. Cambridge: Harvard University Press, 2003: 93-110.
- (2003) Animal Social Complexity , pp. 93-110
- Boesch, C.¹

15
- 27344449757
- Decentralized control of cooperative systems: Categorization and complexity analysis
- C. Goldman, S. Zilberstein. Decentralized control of cooperative systems: Categorization and complexity analysis. Journal of Artificial Intelligence Research, 2004: 22(1): 143-174.
- (2004) Journal of Artificial Intelligence Research , vol.22 , Issue.1 , pp. 143-174
- Goldman, C.¹ Zilberstein, S.²

16
- 4644369748
- Nash Q-learning for general-sum stochastic games
- J. Hu, M. Wellman. Nash Q-learning for general-sum stochastic games. Journal of Machine Learning Research, 2003, 4: 1039-1069.
- (2003) Journal of Machine Learning Research , vol.4 , pp. 1039-1069
- Hu, J.¹ Wellman, M.²

17
- 0004260007
- Cambridge: MIT Press
- D. Fudenberg, J. Tirole. Game Theory. Cambridge: MIT Press, 1991.
- (1991) Game Theory
- Fudenberg, D.¹ Tirole, J.²

18
- 0033750210
- Experiments in automatic flock control
- R. Vaughan, N. Sumpter, A. Frost, et al. Experiments in automatic flock control. Robotics and Autonomous Systems, 2000: 31(1/2): 109-116.
- (2000) Robotics and Autonomous Systems , vol.31 , Issue.1-2 , pp. 109-116
- Vaughan, R.¹ Sumpter, N.² Frost, A.³

19
- 0036367289
- Reinforcement learning for landmark-based robot navigation
- New York: ACM
- D. Busquets, R. L. de Mantaras, C. Sierra, et al. Reinforcement learning for landmark-based robot navigation. Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems. New York: ACM, 2002: 841-843.
- (2002) Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems , pp. 841-843
- Busquets, D.¹ de Mantaras, R.L.² Sierra, C.³

20
- 45849101880
- Cambridge: Cambridge University Press
- A. Sanjeev, B. Boaz. Complexity Theory: A Modern Approach. Cambridge: Cambridge University Press, 2009.
- (2009) Complexity Theory: A Modern Approach
- Sanjeev, A.¹ Boaz, B.²

21
- 34249833101
- Technical note: Q-learning
- C. J. C. H. Watkins, P. Dayan. Technical note: Q-learning. Machine Learning, 1992: 8(3/4): 279-292.
- (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

22
- 0028497630
- Asynchronous stochastic approximation and Q-learning
- J. Tsitsiklis. Asynchronous stochastic approximation and Q-learning. Machine Learning, 1994: 16(3): 185-202.
- (1994) Machine Learning , vol.16 , Issue.3 , pp. 185-202
- Tsitsiklis, J.¹

23
- 85149834820
- Markov games as a framework for multiagent reinforcement learning
- San Francisco, CA: Morgan Kaufmann Publishers Inc
- M. L. Littman. Markov games as a framework for multiagent reinforcement learning. Proceedings of the 11th International Conference on Machine Learning, San Francisco, CA: Morgan Kaufmann Publishers Inc., 1994: 157-163.
- (1994) Proceedings of the 11th International Conference on Machine Learning , pp. 157-163
- Littman, M.L.¹

24
- 0031630561
- The dynamics of reinforcement learning in cooperative multiagent systems
- Menlo Park, CA: AAAI Press
- C. Claus, C. Boutilier. The dynamics of reinforcement learning in cooperative multiagent systems. Proceedings of the 15th National Conference on Artificial Intelligence, Menlo Park, CA: AAAI Press, 1998: 746-752.
- (1998) Proceedings of the 15th National Conference on Artificial Intelligence , pp. 746-752
- Claus, C.¹ Boutilier, C.²

25
- 33748309557
- Report TR CS-98-112, Fort Collins, CO: Colorado State University
- L. D. Pyeatt, A. E. Howe. Decision Tree Function Approximation in Reinforcement Learning. Report TR CS-98-112. Fort Collins, CO: Colorado State University, 1998.
- (1998) Decision Tree Function Approximation in Reinforcement Learning
- Pyeatt, L.D.¹ Howe, A.E.²

26
- 0031636218
- Tree based discretization for continuous state space reinforcement learning
- Menlo Park, CA: American Association for Artificial Intelligence (AAAI)
- W. T. B. Uther, M. M. Veloso. Tree based discretization for continuous state space reinforcement learning. Proceedings of the 15th National Conference on Artificial Intelligence, Menlo Park, CA: American Association for Artificial Intelligence (AAAI), 1998: 769-774.
- (1998) Proceedings of the 15th National Conference on Artificial Intelligence , pp. 769-774
- Uther, W.T.B.¹ Veloso, M.M.²

27
- 76249118582
- Cooperative multirobot reinforcement learning: a framework in hybrid state space
- New York: IEEE
- X. Sun, T. Mao, J. D. Kralik, et al. Cooperative multirobot reinforcement learning: a framework in hybrid state space. Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), New York: IEEE, 2009: 1190-1196.
- (2009) Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) , pp. 1190-1196
- Sun, X.¹ Mao, T.² Kralik, J.D.³

28
- 77954208497
- Role selection in multi-robot systems using abstract state-based reinforcement learning
- Boston, MA
- T. Mao, X. Sun, L. E. Ray. Role selection in multi-robot systems using abstract state-based reinforcement learning. Proceedings of the 14th IASTED International Conference on Robotics and Applications, Boston, MA, 2009.
- (2009) Proceedings of the 14th IASTED International Conference on Robotics and Applications
- Mao, T.¹ Sun, X.² Ray, L.E.³

29
- 77951487805
- Multi-agent reinforcement learning and chimpanzee hunting
- New York: IEEE
- M. Z. Sauter, D. Shi, J. D. Kralik. Multi-agent reinforcement learning and chimpanzee hunting. Proceedings of IEEE International Conferenceon Robotics and Biomimetics (ROBIO), New York: IEEE, 2009: 622-626.
- (2009) Proceedings of IEEE International Conferenceon Robotics and Biomimetics (ROBIO) , pp. 622-626
- Sauter, M.Z.¹ Shi, D.² Kralik, J.D.³

30
- 77951457571
- Distributed, heterogeneous, multiagent social coordination via reinforcement learning
- New York: IEEE
- D. Shi, M. Z. Sauter, J. D. Kralik. Distributed, heterogeneous, multiagent social coordination via reinforcement learning. Proceedings of IEEE International Conference on Robotics and Biomimetics (ROBIO), New York: IEEE, 2009: 653-658.
- (2009) Proceedings of IEEE International Conference on Robotics and Biomimetics (ROBIO) , pp. 653-658
- Shi, D.¹ Sauter, M.Z.² Kralik, J.D.³

31
- 0012286079
- An algorithm for distributed reinforcement learning in cooperative multi-agent systems
- San Francisco, CA: Morgan Kaufmann Publishers Inc
- M. Lauer, M. Riedmiller. An algorithm for distributed reinforcement learning in cooperative multi-agent systems. Proceedings of the 17th International Conference on Machine Learning, San Francisco, CA: Morgan Kaufmann Publishers Inc., 2000: 535-542.
- (2000) Proceedings of the 17th International Conference on Machine Learning , pp. 535-542
- Lauer, M.¹ Riedmiller, M.²

32
- 0001051322
- The significance of the gregarious habit
- R. C. Miller. The significance of the gregarious habit. Ecology, 1922: 3(2): 122-126.
- (1922) Ecology , vol.3 , Issue.2 , pp. 122-126
- Miller, R.C.¹

33
- 79960441660
- Webots Reference Manual
- Cyberbotics Ltd
- Webots Reference Manual. Professional Mobile Robot Simulation Software. Cyberbotics Ltd. http://www. cyberbotics. com.
- Professional Mobile Robot Simulation Software

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.