메뉴 건너뛰기




Volumn 58, Issue , 2017, Pages 111-122

The dynamics of reinforcement social learning in networked cooperative multiagent systems

Author keywords

Cooperative games; Multiagent coordination; Multiagent social learning

Indexed keywords

LEARNING ALGORITHMS; TOPOLOGY;

EID: 85006041139     PISSN: 09521976     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.engappai.2016.11.008     Document Type: Article
Times cited : (43)

References (32)
  • 1
  • 2
    • 0036013593 scopus 로고    scopus 로고
    • Statistical mechanics of complex networks
    • Albert, R., Barabási, A.-L., Statistical mechanics of complex networks. Rev. Mod. Phys. 74:1 (2002), 47–97.
    • (2002) Rev. Mod. Phys. , vol.74 , Issue.1 , pp. 47-97
    • Albert, R.1    Barabási, A.-L.2
  • 3
    • 33947692782 scopus 로고    scopus 로고
    • Topology of sensor networks in distributed detection
    • In: Acoustics, Speech and Signal Processing, 2006. In: Proceedings of the 2006 IEEE International Conference on ICASSP 2006. vol. 5, IEEE, pages V–V.
    • Aldosari, S.A., Moura, J.MF., 2006. Topology of sensor networks in distributed detection. In: Acoustics, Speech and Signal Processing, 2006. In: Proceedings of the 2006 IEEE International Conference on ICASSP 2006. vol. 5, IEEE, pages V–V.
    • (2006)
    • Aldosari, S.A.1    Moura, J.M.F.2
  • 4
    • 0342521525 scopus 로고    scopus 로고
    • Scale-free characteristics of random networks: the topology of the world-wide web
    • Barabási, A.-L., Albert, R., Jeong, H., Scale-free characteristics of random networks: the topology of the world-wide web. Physica A: Stat. Mech. Appl. 281:1 (2000), 69–77.
    • (2000) Physica A: Stat. Mech. Appl. , vol.281 , Issue.1 , pp. 69-77
    • Barabási, A.-L.1    Albert, R.2    Jeong, H.3
  • 5
    • 4544271516 scopus 로고    scopus 로고
    • Efficient learning equilibrium
    • Brafman, R.I., Tennenholtz, M., Efficient learning equilibrium. Artif. Intell. 159 (2004), 27–47.
    • (2004) Artif. Intell. , vol.159 , pp. 27-47
    • Brafman, R.I.1    Tennenholtz, M.2
  • 6
    • 0031630561 scopus 로고    scopus 로고
    • The dynamics of reinforcement learning in cooperative multiagent systems
    • In: 1998
    • Claus, C., Boutilier, C., 1998. The dynamics of reinforcement learning in cooperative multiagent systems. In: Proceedings of AAAI'98, 1998, pp. 746–752.
    • (1998) Proceedings of AAAI'98 , pp. 746-752
    • Claus, C.1    Boutilier, C.2
  • 7
    • 84880861539 scopus 로고    scopus 로고
    • Predicting and preventing coordination problems in cooperative learning systems
    • In: 2007
    • Fulda, N., Ventura, D., 2007. Predicting and preventing coordination problems in cooperative learning systems. In: Proceedings of IJCAI'07, 2007, pp. 780–785.
    • (2007) Proceedings of IJCAI'07 , pp. 780-785
    • Fulda, N.1    Ventura, D.2
  • 8
    • 84896062256 scopus 로고    scopus 로고
    • The dynamics of reinforcement social learning in cooperative multiagent systems
    • In: Press
    • Hao, J.Y., Leung, H.F., 2013. The dynamics of reinforcement social learning in cooperative multiagent systems. In: IJCAI'13, AAAI Press, pp. 184–190.
    • (2013) IJCAI'13, AAA , pp. 184-190
    • Hao, J.Y.1    Leung, H.F.2
  • 9
    • 84938057635 scopus 로고    scopus 로고
    • Multiagent reinforcement social learning toward coordination in cooperative multiagent systems
    • Hao, J.Y., Leung, H.-F., Ming, Z., Multiagent reinforcement social learning toward coordination in cooperative multiagent systems. ACM Trans. Auton. Adapt. Syst. (TAAS), 9(4), 2014, 20.
    • (2014) ACM Trans. Auton. Adapt. Syst. (TAAS) , vol.9 , Issue.4 , pp. 20
    • Hao, J.Y.1    Leung, H.-F.2    Ming, Z.3
  • 10
    • 84974799121 scopus 로고    scopus 로고
    • Reinforcement sociallearning of coordination in networked cooperative multiagent systems
    • In: AAAI workshop on multiagent interaction without prior coordination (MIPC 2014).
    • Hao, J.Y., Huang, D.P., Cai, Y., Leung, H.-F., 2014. Reinforcement sociallearning of coordination in networked cooperative multiagent systems. In: AAAI workshop on multiagent interaction without prior coordination (MIPC 2014).
    • (2014)
    • Hao, J.Y.1    Huang, D.P.2    Cai, Y.3    Leung, H.-F.4
  • 11
    • 84944699681 scopus 로고    scopus 로고
    • Heuristic collective learning for efficient and robust emergence of social norms
    • In:
    • Hao, J.Y., Sun, J., Huang, D.P., Cai, Y., Yu, C., 2015. Heuristic collective learning for efficient and robust emergence of social norms. In: Proceedings of AAMAS'15, pp. 1647–1648.
    • (2015) Proceedings of AAMAS'15 , pp. 1647-1648
    • Hao, J.Y.1    Sun, J.2    Huang, D.P.3    Cai, Y.4    Yu, C.5
  • 12
    • 85006044300 scopus 로고    scopus 로고
    • Accelerating norm emergence through hierarchical heuristic learning
    • In: Proceedings of the 22nd European Conference on Artificial Intelligence (ECAI).
    • Hao, J.Y., Sen, S., Yu, C., Yang, T.P., Meng, Z.P., 2016. Accelerating norm emergence through hierarchical heuristic learning. In: Proceedings of the 22nd European Conference on Artificial Intelligence (ECAI).
    • (2016)
    • Hao, J.Y.1    Sen, S.2    Yu, C.3    Yang, T.P.4    Meng, Z.P.5
  • 14
    • 84979256395 scopus 로고    scopus 로고
    • Mobile ad-hoc networking with aodv: a review
    • Jhaveri, R.H., Patel, N.M., Mobile ad-hoc networking with aodv: a review. Int. J. -Gener. Comput. 6:3 (2015), 165–191.
    • (2015) Int. J. -Gener. Comput. , vol.6 , Issue.3 , pp. 165-191
    • Jhaveri, R.H.1    Patel, N.M.2
  • 15
    • 0036932299 scopus 로고    scopus 로고
    • Reinforcement learning of coordination in cooperative multiagent systems
    • In: AAAI'02
    • Kapetanakis, S., Kudenko, D., 2002. Reinforcement learning of coordination in cooperative multiagent systems. In: Proceedings of AAAI'02, pp. 326–331.
    • (2002) Proceedings o , pp. 326-331
    • Kapetanakis, S.1    Kudenko, D.2
  • 16
    • 0012286079 scopus 로고    scopus 로고
    • An algorithm for distributed reinforcement learning in cooperative multi-agent systems
    • In: ICML'00
    • Lauer, M., Riedmiller, M., 2000. An algorithm for distributed reinforcement learning in cooperative multi-agent systems. In: Proceedings of ICML'00, pp. 535–542.
    • (2000) Proceedings o , pp. 535-542
    • Lauer, M.1    Riedmiller, M.2
  • 18
    • 51349117828 scopus 로고    scopus 로고
    • Le Fort-Piat, N., 2007. Hysteretic q-learning: an algorithm for dynamic reinforcement learning in cooperative multiagent teams. In: Proceeding of IROS'07, pp. 64–69.
    • Matignon, L., Laurent, G.J., Le Fort-Piat, N., 2007. Hysteretic q-learning: an algorithm for dynamic reinforcement learning in cooperative multiagent teams. In: Proceeding of IROS'07, pp. 64–69.
    • Matignon, L.1    Laurent, G.J.2
  • 19
    • 85005986707 scopus 로고    scopus 로고
    • Le For-Piat, N., 2008. A study of fmq heuristic in cooperative multi-agent games. In: AAMAS'08 workshop: MSDM, pp. 77–91.
    • Matignon, L., Laurent, G.J., Le For-Piat, N., 2008. A study of fmq heuristic in cooperative multi-agent games. In: AAMAS'08 workshop: MSDM, pp. 77–91.
    • Matignon, L.1    Laurent, G.J.2
  • 20
    • 84857861863 scopus 로고    scopus 로고
    • Independent reinforcement learners in cooperative markov games: a survey regarding coordination problems
    • Matignon, L., Laurent, G.J., Le For-Piat, N., Independent reinforcement learners in cooperative markov games: a survey regarding coordination problems. Knowl. Eng. Rev. 27 (2012), 1–31.
    • (2012) Knowl. Eng. Rev. , vol.27 , pp. 1-31
    • Matignon, L.1    Laurent, G.J.2    Le For-Piat, N.3
  • 21
    • 85005974276 scopus 로고    scopus 로고
    • Reward-based Learning in Cooperative Games
    • Modeling, B., Thomas, K., 2015. Reward-based Learning in Cooperative Games.
    • (2015)
    • Modeling, B.1    Thomas, K.2
  • 22
    • 34247189655 scopus 로고    scopus 로고
    • Lenient learners in cooperative multiagent systems
    • In: AAMAS'06
    • Panait, L., Sullivan, K., Luke, S., 2006. Lenient learners in cooperative multiagent systems. In: Proceedings of AAMAS'06, pp. 801–803.
    • (2006) Proceedings o , pp. 801-803
    • Panait, L.1    Sullivan, K.2    Luke, S.3
  • 23
    • 84984700595 scopus 로고    scopus 로고
    • Intego2: a web tool for measuring and visualizing gene semantic similarities using gene ontology
    • Peng, Jiajie, Li, Hongxiang, Liu, Yongzhuang, Juan, Liran, Jiang, Qinghua, Wang, Yadong, Chen, Jin, Intego2: a web tool for measuring and visualizing gene semantic similarities using gene ontology. BMC Genom., 17, 2016, 530.
    • (2016) BMC Genom. , vol.17 , pp. 530
    • Peng, J.1    Li, H.2    Liu, Y.3    Juan, L.4    Jiang, Q.5    Wang, Y.6    Chen, J.7
  • 24
    • 84954080803 scopus 로고    scopus 로고
    • Robust convention emergence in social networks through self-reinforcing structures dissolution
    • Villatoro, D., Sabater-Mir, J., Sen, S., Robust convention emergence in social networks through self-reinforcing structures dissolution. ACM Trans. Auton. Adapt. Syst., 8(1), 2013, 2.
    • (2013) ACM Trans. Auton. Adapt. Syst. , vol.8 , Issue.1 , pp. 2
    • Villatoro, D.1    Sabater-Mir, J.2    Sen, S.3
  • 25
    • 0942276880 scopus 로고    scopus 로고
    • Complex networks: small-world, scale-free and beyond
    • Wang, X.F., Chen, G.R., Complex networks: small-world, scale-free and beyond. Circuits Syst. Mag. 3:1 (2003), 6–20.
    • (2003) Circuits Syst. Mag. , vol.3 , Issue.1 , pp. 6-20
    • Wang, X.F.1    Chen, G.R.2
  • 26
    • 67649405225 scopus 로고    scopus 로고
    • Reinforcement learning to play an optimal nash equilibrium in team markov games
    • In: NIPS'02
    • Wang, X., Sandholm, T., 2002. Reinforcement learning to play an optimal nash equilibrium in team markov games. In: Proceedings of NIPS'02, pp. 1571–1578.
    • (2002) Proceedings o , pp. 1571-1578
    • Wang, X.1    Sandholm, T.2
  • 27
    • 34249833101 scopus 로고
    • Q-learning
    • Machine Learning, pp. 279–292.
    • Watkins, C.J.C.H., Dayan, P.D., 1992. Q-learning. Machine Learning, pp. 279–292.
    • (1992)
    • Watkins, C.J.C.H.1    Dayan, P.D.2
  • 28
    • 84862278646 scopus 로고    scopus 로고
    • Scholarly network similarities: how bibliographic coupling networks, citation networks, cocitation networks, topical networks, coauthorship networks, and coword networks relate to each other
    • Yan, E., Ding, Y., Scholarly network similarities: how bibliographic coupling networks, citation networks, cocitation networks, topical networks, coauthorship networks, and coword networks relate to each other. J. Assoc. Inf. Sci. Technol. 63:7 (2012), 1313–1326.
    • (2012) J. Assoc. Inf. Sci. Technol. , vol.63 , Issue.7 , pp. 1313-1326
    • Yan, E.1    Ding, Y.2
  • 29
    • 85006022354 scopus 로고    scopus 로고
    • Accelerating norm emergence through hierarchical heuristic learning
    • In: Proceedings of ECAI'16.
    • Yang, T.P., Meng, Z.P., Hao, J.Y., Sen, S., Yu, C., 2016. Accelerating norm emergence through hierarchical heuristic learning. In: Proceedings of ECAI'16.
    • (2016)
    • Yang, T.P.1    Meng, Z.P.2    Hao, J.Y.3    Sen, S.4    Yu, C.5
  • 30
    • 84911927447 scopus 로고    scopus 로고
    • Collective learning for the emergence of social norms in networked multiagent systems
    • Yu, C., Zhang, M.J., Ren, F.H., Collective learning for the emergence of social norms in networked multiagent systems. IEEE Trans. Cybern. 44:12 (2014), 2342–2355.
    • (2014) IEEE Trans. Cybern. , vol.44 , Issue.12 , pp. 2342-2355
    • Yu, C.1    Zhang, M.J.2    Ren, F.H.3
  • 32
    • 85014302063 scopus 로고    scopus 로고
    • An adaptive learning framework for efficient emergence of social norms
    • In: AAMAS'16
    • Yu, C., Lv, H.T., Sen, S., Hao, J.Y., Ren, F.H., Liu, R., 2016. An adaptive learning framework for efficient emergence of social norms. In: Proceedings of AAMAS'16, pp. 1307–1308.
    • (2016) Proceedings o , pp. 1307-1308
    • Yu, C.1    Lv, H.T.2    Sen, S.3    Hao, J.Y.4    Ren, F.H.5    Liu, R.6


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.