SCOPUS 정보 검색 플랫폼

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics

Volumn 37, Issue 6, 2007, Pages 1567-1580

Self-organizing neural architectures and cooperative learning in a multiagent environment

a NANYANG TECHNOLOGICAL UNIVERSITY (Singapore)

Author keywords

Learning (artificial intelligence); Multi agent systems; Multiagent cooperative learning; Neural net architecture; Reinforcement learning (RL); Self organizing neural architectures

Indexed keywords

BACKPROPAGATION ALGORITHMS; FEEDFORWARD NEURAL NETWORKS; REINFORCEMENT LEARNING; SELF ORGANIZING MAPS;

MULTIAGENT COOPERATIVE LEARNING; RESILIENT-PROPAGATION ALGORITHM; SELF-ORGANIZING NEURAL ARCHITECTURE;

MULTI AGENT SYSTEMS;

ALGORITHM; ARTICLE; ARTIFICIAL NEURAL NETWORK; AUTOMATED PATTERN RECOGNITION; COMPUTER SIMULATION; DECISION SUPPORT SYSTEM; METHODOLOGY; THEORETICAL MODEL;

ALGORITHMS; COMPUTER SIMULATION; DECISION SUPPORT TECHNIQUES; MODELS, THEORETICAL; NEURAL NETWORKS (COMPUTER); PATTERN RECOGNITION, AUTOMATED;

EID: 36749092785 PISSN: 10834419 EISSN: None Source Type: Journal
DOI: 10.1109/TSMCB.2007.907040 Document Type: Article

Times cited : (17)

References (50)

1
- 0004102479
- Cambridge, MA: MIT Press
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction Cambridge, MA: MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

2
- 0000494894
- Computationally feasible bounds for partially observed Markov decision processes
- Jan./Feb
- W. S. Lovejoy, "Computationally feasible bounds for partially observed Markov decision processes," Oper. Res., vol. 39, no. 1, pp. 162-175, Jan./Feb. 1991.
- (1991) Oper. Res , vol.39 , Issue.1 , pp. 162-175
- Lovejoy, W.S.¹

3
- 10944258804
- FALCON: A fusion architecture for learning, cognition, and navigation
- Budapest, Hungary
- A. H. Tan, "FALCON: A fusion architecture for learning, cognition, and navigation," in Proc. IJCNN, Budapest, Hungary, 2004, pp. 3297-3302.
- (2004) Proc. IJCNN , pp. 3297-3302
- Tan, A.H.¹

4
- 33846312864
- Self-organizing cognitive agents and reinforcement learning in a multi-agent environment
- A.-H. Tan and D. Xiao, "Self-organizing cognitive agents and reinforcement learning in a multi-agent environment," in Proc. IEEE/ACM/WIC Int. Conf. Intell. Agent Technol., 2005, pp. 351-357.
- (2005) Proc. IEEE/ACM/WIC Int. Conf. Intell. Agent Technol , pp. 351-357
- Tan, A.-H.¹ Xiao, D.²

5
- 40549121994
- Integrating temporal difference methods and self-organizing neural networks for reinforcement learning with delayed evaluative feedback
- to be published
- A. H. Tan, N. Lu, and D. Xiao, "Integrating temporal difference methods and self-organizing neural networks for reinforcement learning with delayed evaluative feedback," IEEE Trans. Neural Netw. to be published.
- IEEE Trans. Neural Netw
- Tan, A.H.¹ Lu, N.² Xiao, D.³

6
- 0344074958
- A cognitive model of learning to navigate
- D. Gordan and D. Subramanian, "A cognitive model of learning to navigate," in Proc. 19th Annu. Conf. Cognitive Sci. Soc., 1997, pp. 271-276.
- (1997) Proc. 19th Annu. Conf. Cognitive Sci. Soc , pp. 271-276
- Gordan, D.¹ Subramanian, D.²

7
- 0035740527
- From implicit skills to explicit knowledge: A bottom-up model of skill learning
- Mar
- R. Sun, E. Merrill, and T. Peterson, "From implicit skills to explicit knowledge: A bottom-up model of skill learning," Cogn. Sci., vol. 25, no. 2, pp. 203-244, Mar. 2001.
- (2001) Cogn. Sci , vol.25 , Issue.2 , pp. 203-244
- Sun, R.¹ Merrill, E.² Peterson, T.³

8
- 36749051994
- M. Riedmiller and H. Braun, RPROP - A fast adaptive learning algorithm, Univ. Karlsruhe, Karlsruhe, Germany, 1992. Tech. Rep. (Also Proc. of ISCIS VII).
- M. Riedmiller and H. Braun, "RPROP - A fast adaptive learning algorithm," Univ. Karlsruhe, Karlsruhe, Germany, 1992. Tech. Rep. (Also Proc. of ISCIS VII).

9
- 84943274699
- A direct adaptive method for faster back-propagation learning: The RPROP algorithm
- San Francisco, CA
- M. Riedmiller and H. Braun, "A direct adaptive method for faster back-propagation learning: The RPROP algorithm," in Proc. IEEE Int. Conf. Neural Netw., San Francisco, CA, 1993, pp. 586-591.
- (1993) Proc. IEEE Int. Conf. Neural Netw , pp. 586-591
- Riedmiller, M.¹ Braun, H.²

10
- 36749072878
- M. Brenda, V. Jagannathan, and R. Dodhiawala, On optimal cooperation of knowledge sources - An empirical investigation, Boeing Adv. Technol. Center, Boeing Comput. Services, Seattle, WA, Tech. Rep. BCS-G2010-28, 1986.
- M. Brenda, V. Jagannathan, and R. Dodhiawala, "On optimal cooperation of knowledge sources - An empirical investigation," Boeing Adv. Technol. Center, Boeing Comput. Services, Seattle, WA, Tech. Rep. BCS-G2010-28, 1986.

11
- 27144475166
- George Mason Univ, Fairfax, VA, Tech. Rep. GMU-CS-TR-2003-1
- L. Panait and S. Luke, "Cooperative multi-agent learning: The state of the art," George Mason Univ., Fairfax, VA, Tech. Rep. GMU-CS-TR-2003-1, 2003.
- (2003) Cooperative multi-agent learning: The state of the art
- Panait, L.¹ Luke, S.²

12
- 31744441013
- Analysis of a master-slave architecture for distributed evolutionary computations
- Feb
- M. Dubreuil, C. Gagne, and M. Parizeau, "Analysis of a master-slave architecture for distributed evolutionary computations," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 36, no. 1, pp. 229-235, Feb. 2006.
- (2006) IEEE Trans. Syst., Man, Cybern. B, Cybern , vol.36 , Issue.1 , pp. 229-235
- Dubreuil, M.¹ Gagne, C.² Parizeau, M.³

13
- 27744568015
- Bristol, PA: Inst. Phys. Publishing
- H. P. Schwefel, Advantages and Disadvantages of Evolutionary Computation Over Other Approaches, Evolutionary Computation 1: Basic Algorithms and Operators. Bristol, PA: Inst. Phys. Publishing, 2000.
- (2000) Advantages and Disadvantages of Evolutionary Computation Over Other Approaches, Evolutionary Computation 1: Basic Algorithms and Operators
- Schwefel, H.P.¹

14
- 1842535228
- Modular fuzzy-reinforcement learning approach with internal model capabilities for multiagent systems
- Apr
- M. Kaya and R. Alhajj, "Modular fuzzy-reinforcement learning approach with internal model capabilities for multiagent systems" IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 34, no. 2, pp. 1210-1223, Apr. 2004.
- (2004) IEEE Trans. Syst., Man, Cybern. B, Cybern , vol.34 , Issue.2 , pp. 1210-1223
- Kaya, M.¹ Alhajj, R.²

15
- 17444385973
- Fuzzy OLAP association rules mining-based modular reinforcement learning approach for multiagent systems
- Apr
- M. Kaya and R. Alhajj, "Fuzzy OLAP association rules mining-based modular reinforcement learning approach for multiagent systems," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 35, no. 2, pp. 326-338, Apr. 2005.
- (2005) IEEE Trans. Syst., Man, Cybern. B, Cybern , vol.35 , Issue.2 , pp. 326-338
- Kaya, M.¹ Alhajj, R.²

16
- 34249833101
- Q-learning
- C. J. C. H. Watkins and P. Dayan, "Q-learning," Mach. Learn., vol. 8, no. 3/4, pp. 279-292, 1992.
- (1992) Mach. Learn , vol.8 , Issue.3-4 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

17
- 33947192713
- Q-learning based market-driven multi-agent collaboration in robot soccer
- Izmir, Turkey
- H. Kose, U. Tatlidede, C. Mericli, K. Kaplan, and H. L. Akin, "Q-learning based market-driven multi-agent collaboration in robot soccer," in Proc. Turkish Symp. Artif. Intell. Neural Netw., Izmir, Turkey, 2004, pp. 219-228.
- (2004) Proc. Turkish Symp. Artif. Intell. Neural Netw , pp. 219-228
- Kose, H.¹ Tatlidede, U.² Mericli, C.³ Kaplan, K.⁴ Akin, H.L.⁵

18
- 17044414523
- Gradient descent for symmetric and asymmetric multiagent reinforcement learning
- V. Kononen, "Gradient descent for symmetric and asymmetric multiagent reinforcement learning," Web Intell. Agent Syst.: Int. J. (WIAS), vol. 3, no. 1, pp. 17-30, 2005.
- (2005) Web Intell. Agent Syst.: Int. J. (WIAS) , vol.3 , Issue.1 , pp. 17-30
- Kononen, V.¹

19
- 36749089731
- E. F. Yang and D. B. Gu, Multiagent reinforcement learning for multi-robot systems: A survey, Dep. Comput. Sci., Univ. Essex, Colchester, U.K., Tech. Rep. CSM-404, 2004.
- E. F. Yang and D. B. Gu, "Multiagent reinforcement learning for multi-robot systems: A survey," Dep. Comput. Sci., Univ. Essex, Colchester, U.K., Tech. Rep. CSM-404, 2004.

20
- 17444424596
- Cooperative multiagent congestion control for high-speed networks
- Apr
- K. S. Hwang, S. W. Tan, M. C. Hsiao, and C. S. Wu, "Cooperative multiagent congestion control for high-speed networks," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 35, no. 2, pp. 255-268, Apr. 2005.
- (2005) IEEE Trans. Syst., Man, Cybern. B, Cybern , vol.35 , Issue.2 , pp. 255-268
- Hwang, K.S.¹ Tan, S.W.² Hsiao, M.C.³ Wu, C.S.⁴

21
- 0005807053
- Evolving cooperative communicating classifier systems
- L. Bull and T. C. Fogarty, "Evolving cooperative communicating classifier systems," in Proc. 4th Annu. Conf. Evol. Program., 1994, pp. 308-315.
- (1994) Proc. 4th Annu. Conf. Evol. Program , pp. 308-315
- Bull, L.¹ Fogarty, T.C.²

22
- 26444506193
- Reward and diversity in multirobot foraging
- Stockholm, Sweden
- T. Balch, "Reward and diversity in multirobot foraging," in Proc. Agents Learning About, From With Other Agents Workshop, Stockholm, Sweden, 1999.
- (1999) Proc. Agents Learning About, From With Other Agents Workshop
- Balch, T.¹

23
- 84962044659
- The moving target function problem in multi-agent learning
- Paris, France, Jul
- J. M. Vidal and E. H. Durfee, "The moving target function problem in multi-agent learning," in Proc. 3rd Int. Conf. Multi-Agent Syst., Paris, France, Jul. 1998, pp. 317-324.
- (1998) Proc. 3rd Int. Conf. Multi-Agent Syst , pp. 317-324
- Vidal, J.M.¹ Durfee, E.H.²

24
- 1142280924
- Coordination in multiagent reinforcement learning: A Bayesian approach
- G. Chalkiadakis and C. Boutilier, "Coordination in multiagent reinforcement learning: A Bayesian approach," in Proc. 2nd Int. Joint Conf. Anton. Agents Multiagent Syst., 2003, pp. 709-716.
- (2003) Proc. 2nd Int. Joint Conf. Anton. Agents Multiagent Syst , pp. 709-716
- Chalkiadakis, G.¹ Boutilier, C.²

25
- 0001309161
- Optimal payoff functions for members of collectives
- D. H. Wolpert and K. Turner, "Optimal payoff functions for members of collectives," Adv. Complex Systems, vol. 4, no. 2/3, pp. 265-279, 2001.
- (2001) Adv. Complex Systems , vol.4 , Issue.2-3 , pp. 265-279
- Wolpert, D.H.¹ Turner, K.²

26
- 36749006381
- T. Balch, Learning roles: Behavioural diversity in robot teams, Georgia Inst. Technol., Atlanta, GA, Tech. Rep. GIT-CC-97-12, 1997.
- T. Balch, "Learning roles: Behavioural diversity in robot teams," Georgia Inst. Technol., Atlanta, GA, Tech. Rep. GIT-CC-97-12, 1997.

27
- 0001201710
- Learning to behave socially
- M. Mataric, "Learning to behave socially," in Proc. 3rd Int. Conf. Simul. Adaptive Behaviour, 1994, pp. 453-462.
- (1994) Proc. 3rd Int. Conf. Simul. Adaptive Behaviour , pp. 453-462
- Mataric, M.¹

28
- 0031630561
- The dynamics of reinforcement learning in cooperative multiagent systems
- C. Claus and C. Boutilier, "The dynamics of reinforcement learning in cooperative multiagent systems," in Proc. Nat. Conf. Artif. Intell. 1998, pp. 746-752.
- (1998) Proc. Nat. Conf. Artif. Intell , pp. 746-752
- Claus, C.¹ Boutilier, C.²

29
- 0003863106
- Comput. Sci. Dept, Carnegie Mellon Univ, Pittsburgh, PA, Tech. Rep. CMU-CS-00-165
- M. Bowling and M. Veloso, "An analysis of stochastic game theory for multiagent reinforcement learning," Comput. Sci. Dept., Carnegie Mellon Univ., Pittsburgh, PA, Tech. Rep. CMU-CS-00-165, 2000.
- (2000) An analysis of stochastic game theory for multiagent reinforcement learning
- Bowling, M.¹ Veloso, M.²

30
- 1142268794
- Towards a Pareto-optimal solution in general-sum games
- Melbourne, Australia
- R. Mukherjee and S. Sen, "Towards a Pareto-optimal solution in general-sum games," in Proc. 2nd Int. Joint Conf. AAMAS, Melbourne, Australia, 2003, pp. 153-160.
- (2003) Proc. 2nd Int. Joint Conf. AAMAS , pp. 153-160
- Mukherjee, R.¹ Sen, S.²

31
- 84962092523
- Evaluating concurrent reinforcement learners
- Boston, MA
- M. Mundhe and S. Sen, "Evaluating concurrent reinforcement learners," in Proc. 4th ICMAS, Boston, MA, 2000, pp. 421-422.
- (2000) Proc. 4th ICMAS , pp. 421-422
- Mundhe, M.¹ Sen, S.²

32
- 0032359707
- Individual learning of coordination knowledge
- Jul
- S. Sen and M. Sekaran, "Individual learning of coordination knowledge," J. Exp. Theor. Artif. Intell., vol. 10, no. 3, pp. 333-356, Jul. 1998.
- (1998) J. Exp. Theor. Artif. Intell , vol.10 , Issue.3 , pp. 333-356
- Sen, S.¹ Sekaran, M.²

33
- 0028555752
- Learning to coordinate without sharing information
- Seattle, WA
- S. Sen M. Sekaran, and J. Hale, "Learning to coordinate without sharing information," in Proc. 12th Nat. Conf. Artif. Intell., Seattle, WA, 1994, pp. 426-431.
- (1994) Proc. 12th Nat. Conf. Artif. Intell , pp. 426-431
- Sen, S.¹ Sekaran, M.² Hale, J.³

34
- 85149834820
- Markov games as a framework for multiagent reinforcement learning
- San Francisco, CA
- M. L. Littman, "Markov games as a framework for multiagent reinforcement learning," in Proc. 11th Int. Conf. Machine Learning San Francisco, CA, 1994, pp. 157-163.
- (1994) Proc. 11th Int. Conf. Machine Learning , pp. 157-163
- Littman, M.L.¹

35
- 85152198941
- Multi-agent reinforcement learning: Independent vs. cooperative agents
- M. Tan, "Multi-agent reinforcement learning: Independent vs. cooperative agents," in Proc. 10th Int. Conf. Machine Learning, 1993, pp. 330-337.
- (1993) Proc. 10th Int. Conf. Machine Learning , pp. 330-337
- Tan, M.¹

36
- 0000929496
- Multiagent reinforcement learning: Theoretical framework and an algorithm
- J. Hu and M. Wellman, "Multiagent reinforcement learning: Theoretical framework and an algorithm," in Proc. 15th Int. Conf. Machine Learning, 1998, pp. 242-250.
- (1998) Proc. 15th Int. Conf. Machine Learning , pp. 242-250
- Hu, J.¹ Wellman, M.²

37
- 33646007901
- Reinforcement learning in large multi-agent systems
- Utrecht, The Netherlands
- A. Agogino and K. Turner, "Reinforcement learning in large multi-agent systems," in Proc. AAMAS Workshop Coordination Large Scale Multi-Agent Syst., Utrecht, The Netherlands, 2005.
- (2005) Proc. AAMAS Workshop Coordination Large Scale Multi-Agent Syst
- Agogino, A.¹ Turner, K.²

38
- 0021776661
- A massively parallel architecture for a self-organizing neural pattern recognition machine
- Jan
- G. A. Carpenter and S. Grossberg, "A massively parallel architecture for a self-organizing neural pattern recognition machine,:Comput. Vis. Graph. Image Process., vol. 37, no. 1, pp. 54-115, Jan. 1987.
- (1987) Comput. Vis. Graph. Image Process , vol.37 , Issue.1 , pp. 54-115
- Carpenter, G.A.¹ Grossberg, S.²

39
- 84973857317
- ART 2: Self-organization of stable category recognition codes for analog input patterns
- Dec
- G. A. Carpenter and S. Grossberg, "ART 2: Self-organization of stable category recognition codes for analog input patterns," Appl. Opt. vol. 26, no. 23, pp. 4919-4930, Dec. 1987.
- (1987) Appl. Opt , vol.26 , Issue.23 , pp. 4919-4930
- Carpenter, G.A.¹ Grossberg, S.²

40
- 0000473841
- ART 1 and pattern clustering
- B. Moore, "ART 1 and pattern clustering," in Proc. Connectionist Models Summer School, 1988, pp. 174-185.
- (1988) Proc. Connectionist Models Summer School , pp. 174-185
- Moore, B.¹

41
- 0026408256
- Fuzzy ART: Fast stable learning and categorization of analog patterns by an adaptive resonance system
- G. A. Carpenter, S. Grossberg, and D. B. Rosen, "Fuzzy ART: Fast stable learning and categorization of analog patterns by an adaptive resonance system," Neural Netw., vol. 4, pp. 759-771, 1991.
- (1991) Neural Netw , vol.4 , pp. 759-771
- Carpenter, G.A.¹ Grossberg, S.² Rosen, D.B.³

42
- 18144423305
- Structure-adaptable digital neural networks,
- Ph.D. dissertation, Swiss Federal Inst. Technol.-Lausanne, Lausanne, Switzerland
- A. Pérez-Uribe, "Structure-adaptable digital neural networks," Ph.D. dissertation, Swiss Federal Inst. Technol.-Lausanne, Lausanne, Switzerland, 2002.
- (2002)
- Pérez-Uribe, A.¹

43
- 0001842850
- Bottom-up skill learning in reactive sequential decision tasks
- R. Sun, T. Peterson, and E. Merrill, "Bottom-up skill learning in reactive sequential decision tasks," in Proc. 18th Cognitive Sci. Soc. Conf., 1996, pp. 684-690.
- (1996) Proc. 18th Cognitive Sci. Soc. Conf , pp. 684-690
- Sun, R.¹ Peterson, T.² Merrill, E.³

44
- 85007487673
- A new learning rates adaptation strategy for the resilient propagation algorithm
- A. D. Anastasiadis, G. D. Magoulas, and M. N. Vrahatis, "A new learning rates adaptation strategy for the resilient propagation algorithm," in Proc. ESANN, 2004, pp. 1-6.
- (2004) Proc. ESANN , pp. 1-6
- Anastasiadis, A.D.¹ Magoulas, G.D.² Vrahatis, M.N.³

45
- 0031363808
- Layered learning in multiagent systems
- P. Stone, "Layered learning in multiagent systems," in Proc. AAAI/IAAI, 1997, p. 819.
- (1997) Proc. AAAI/IAAI , pp. 819
- Stone, P.¹

46
- 0010276169
- Experiments in learning prototypical situations for variants of the pursuit game
- Kyoto, Japan
- J. Denzinger and M. Fuchs, "Experiments in learning prototypical situations for variants of the pursuit game," in Proc. 2nd ICMAS, Kyoto, Japan, 1996, pp. 48-55.
- (1996) Proc. 2nd ICMAS , pp. 48-55
- Denzinger, J.¹ Fuchs, M.²

47
- 7444263056
- On customizing evolutionary learning of agent behavior
- J. Denzinger and A. Schur, "On customizing evolutionary learning of agent behavior," in Proc. Can. Conf. AI, 2004, pp. 146-160.
- (2004) Proc. Can. Conf. AI , pp. 146-160
- Denzinger, J.¹ Schur, A.²

48
- 0030050933
- Multiagent reinforcement leaning in the iterated prisoner's dilemma
- T. W. Sandholm and R. H. Crites, "Multiagent reinforcement leaning in the iterated prisoner's dilemma," Biosystems, vol. 37, no. 1, pp. 147-166, 1995.
- (1995) Biosystems , vol.37 , Issue.1 , pp. 147-166
- Sandholm, T.W.¹ Crites, R.H.²

49
- 0002335248
- Multi-agent reinforcement learning: A modular approach
- N. Ono and K. Fukomoto, "Multi-agent reinforcement learning: A modular approach," in Proc. 2nd Int. Conf. Multi-Agent Syst., 1996, pp. 252-258.
- (1996) Proc. 2nd Int. Conf. Multi-Agent Syst , pp. 252-258
- Ono, N.¹ Fukomoto, K.²

50
- 0013354070
- Effective learning approach for planning and scheduling in multi-agent domain
- S. Arai and K. Sycara, "Effective learning approach for planning and scheduling in multi-agent domain," in Proc. 6th Int. Conf. Simulation Adaptive Behaviour (From Animals to Animats 6), 2000, pp. 507-516.
- (2000) Proc. 6th Int. Conf. Simulation Adaptive Behaviour (From Animals to Animats 6) , pp. 507-516
- Arai, S.¹ Sycara, K.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.