메뉴 건너뛰기




Volumn 14, Issue 1, 2012, Pages 137-152

Learning automata based multi-agent system algorithms for finding optimal policies in Markov games

Author keywords

learning automata; Markov games; multi agent systems; optimal policy

Indexed keywords

AUTOMATA THEORY; LEARNING ALGORITHMS; LEARNING SYSTEMS; MARKOV PROCESSES; ROBOTS; SOFTWARE AGENTS;

EID: 84856285917     PISSN: 15618625     EISSN: 19346093     Source Type: Journal    
DOI: 10.1002/asjc.315     Document Type: Article
Times cited : (12)

References (29)
  • 3
    • 48049111350 scopus 로고    scopus 로고
    • A multi-agent control scheme for a supply chain model
    • No
    • Boccadoro, M., F. Martinelli, and, P. Valigi, " A multi-agent control scheme for a supply chain model," Asian J. Control, Vol. 10, No. 2, pp. 260-266 (2008).
    • (2008) Asian J. Control , vol.10 , Issue.2 , pp. 260-266
    • Boccadoro, M.1    Martinelli, F.2    Valigi, P.3
  • 4
    • 48049101593 scopus 로고    scopus 로고
    • A hybrid framework for resource allocation among multiple agents moving on discrete environments
    • No
    • Piovesan, J. L., C. T. Abdallah, and, H. G. Tanner, " A hybrid framework for resource allocation among multiple agents moving on discrete environments," Asian J. Control, Vol. 10, No. 2, pp. 171-186 (2008).
    • (2008) Asian J. Control , vol.10 , Issue.2 , pp. 171-186
    • Piovesan, J.L.1    Abdallah, C.T.2    Tanner, H.G.3
  • 8
    • 4644369748 scopus 로고    scopus 로고
    • Nash Q-learning for general-sum stochastic games
    • Hu, J., and, M. P. Wellman, " Nash Q-learning for general-sum stochastic games," J. Mach. Learn. Res., Vol. 4, pp. 1039-1069 (2003).
    • (2003) J. Mach. Learn. Res. , vol.4 , pp. 1039-1069
    • Hu, J.1    Wellman, M.P.2
  • 9
    • 0031630561 scopus 로고    scopus 로고
    • The dynamics of reinforcement learning in cooperative multiagent systems
    • Madison, WI
    • Claus, C., and, C. Boutilier, " The dynamics of reinforcement learning in cooperative multiagent systems," 15th Natl. Conf. Artif. Intell., Madison, WI, pp. 746-752 (1998).
    • (1998) 15th Natl. Conf. Artif. Intell. , pp. 746-752
    • Claus, C.1    Boutilier, C.2
  • 11
    • 38149021278 scopus 로고    scopus 로고
    • Evaluating learning automata as a model for cooperation in complex multi-agent domains
    • Bremen, Germany
    • Khojasteh, M. R., and, M. R. Meybodi, " Evaluating learning automata as a model for cooperation in complex multi-agent domains," RoboCup 2006: Robot Soccer World Cup, Bremen, Germany, Vol. 4434, pp. 410-417, (2007).
    • (2007) RoboCup 2006: Robot Soccer World Cup , vol.4434 , pp. 410-417
    • Khojasteh, M.R.1    Meybodi, M.R.2
  • 13
    • 70349456957 scopus 로고    scopus 로고
    • Multi-automata learning
    • In, Weber, C. M. Elshaw, N. M. Mayer (eds). I-Tech Education and Publishing, Vienna
    • Verbeeck, K., A. Nowe, P. Vrancx, and, M. Peeters, " Multi-automata learning," In Reinforcement Learning, Weber, C., M. Elshaw, N. M. Mayer, (eds). I-Tech Education and Publishing, Vienna, pp. 167-185 (2008).
    • (2008) Reinforcement Learning , pp. 167-185
    • Verbeeck, K.1    Nowe, A.2    Vrancx, P.3    Peeters, M.4
  • 14
    • 58049194007 scopus 로고    scopus 로고
    • Multi-agent reinforcement learning in stochastic single and multi-stage games
    • Verbeeck, K., A. Nowé, M. Peeters, and, K. Tuyls, " Multi-agent reinforcement learning in stochastic single and multi-stage games," Adapt. Agents Multi-Agent Syst. Part III., Vol. 3394, pp. 275-294 (2005).
    • (2005) Adapt. Agents Multi-Agent Syst. Part III. , vol.3394 , pp. 275-294
    • Verbeeck, K.1    Nowé, A.2    Peeters, M.3    Tuyls, K.4
  • 15
    • 0028423534 scopus 로고
    • Decentralized learning of nash equilibria in multi-person stochastic games with incomplete information
    • Sastry, P., V. Phansalkar, and, M. A. L. Thathachar, " Decentralized learning of nash equilibria in multi-person stochastic games with incomplete information," IEEE Trans. Syst. Man Cybern., Vol. 24, pp. 769-777 (1994).
    • (1994) IEEE Trans. Syst. Man Cybern. , vol.24 , pp. 769-777
    • Sastry, P.1    Phansalkar, V.2    Thathachar, M.A.L.3
  • 16
    • 0032687842 scopus 로고    scopus 로고
    • Learning in multilevel games with incomplete information-part i
    • Billard, E. A., and, S. Lakshmivarahan, " Learning in multilevel games with incomplete information-part I," IEEE Trans. Syst. Man Cybern. Part B, Vol. 29, pp. 329-339 (1999).
    • (1999) IEEE Trans. Syst. Man Cybern. Part B , vol.29 , pp. 329-339
    • Billard, E.A.1    Lakshmivarahan, S.2
  • 18
    • 58049196572 scopus 로고    scopus 로고
    • Solving multi-agent Markov decision processes using learning automata
    • Subotica, Serbia
    • Abtahi, F., and, M. R. Meybodi, " Solving multi-agent Markov decision processes using learning automata," 6th Int. Symp. Intell. Syst., Subotica, Serbia, pp. 1-6 (2008).
    • (2008) 6th Int. Symp. Intell. Syst. , pp. 1-6
    • Abtahi, F.1    Meybodi, M.R.2
  • 20
    • 84856290835 scopus 로고    scopus 로고
    • Utilization of networks of learning automata for solving decision problems in decentralized multiagent systems
    • Tehran, Iran
    • Masoumi, B and, M. R. Meybodi, " Utilization of networks of learning automata for solving decision problems in decentralized multiagent systems," 17th Iranian Conf. Electr. Eng., Tehran, Iran (2009).
    • (2009) 17th Iranian Conf. Electr. Eng.
    • Masoumi, B.1    Meybodi, M.R.2
  • 21
    • 58049179535 scopus 로고    scopus 로고
    • Strategy entropy as a measure of strategy convergence in reinforcement learning
    • Wuhan, China
    • Zhuang, X and, Z. Chen, " Strategy entropy as a measure of strategy convergence in reinforcement learning," 1st Int. Conf. Intell. Netw. Intell. Syst., Wuhan, China, pp. 81-84 (2008).
    • (2008) 1st Int. Conf. Intell. Netw. Intell. Syst. , pp. 81-84
    • Zhuang, X.1    Chen, Z.2
  • 23
    • 0000268071 scopus 로고
    • Learning algorithms for two-person zero-sum stochastic games with incomplete information
    • Lakshmivarahan, S., and, K. Narendra, " Learning algorithms for two-person zero-sum stochastic games with incomplete information," Math. Oper. Res., Vol. 6, pp. 379-386 (1981).
    • (1981) Math. Oper. Res. , vol.6 , pp. 379-386
    • Lakshmivarahan, S.1    Narendra, K.2
  • 24
    • 0022738693 scopus 로고
    • Decentralized learning in finite Markov chains
    • Wheeler, R. M., and, K. S. Narendra, " Decentralized learning in finite Markov chains," IEEE Trans. Autom. Control, Vol. 31, pp. 519-526 (1986).
    • (1986) IEEE Trans. Autom. Control , vol.31 , pp. 519-526
    • Wheeler, R.M.1    Narendra, K.S.2
  • 25
    • 84972535636 scopus 로고
    • Equilibrium in a stochastic N-person game
    • Fink, A. M., " Equilibrium in a stochastic N-person game," J. Sci. Hiroshima Univ. Ser. A-I, Vol. 28, pp. 89-93 (1964).
    • (1964) J. Sci. Hiroshima Univ. Ser. A-I , vol.28 , pp. 89-93
    • Fink, A.M.1
  • 27
    • 34247547839 scopus 로고    scopus 로고
    • Information and entropy econometrics - volume overview and synthesis
    • DOI 10.1016/j.jeconom.2006.05.001, PII S0304407606000741
    • Golan, A., " Information and entropy econometrics-volume overview and synthesis," J. Econom., Vol. 138, pp. 379-387 (2007). (Pubitemid 46646965)
    • (2007) Journal of Econometrics , vol.138 , Issue.2 , pp. 379-387
    • Golan, A.1
  • 28
    • 72449169495 scopus 로고    scopus 로고
    • Topological entropy and data rate for practical stability: A scalar case
    • No
    • Xie, L., " Topological entropy and data rate for practical stability: a scalar case," Asian J. Control, Vol. 11, No. 4, pp. 376-385 (2009).
    • (2009) Asian J. Control , vol.11 , Issue.4 , pp. 376-385
    • Xie, L.1
  • 29
    • 1842722362 scopus 로고    scopus 로고
    • Multi-scale entropy analysis of complex physiologic time series
    • Costa, M., A. Goldberger, and, C. Peng, " Multi-scale entropy analysis of complex physiologic time series," Phys. Rev. Lett., Vol. 89, pp. 1-4 (2002).
    • (2002) Phys. Rev. Lett. , vol.89 , pp. 1-4
    • Costa, M.1    Goldberger, A.2    Peng, C.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.