SCOPUS Information Search Platform

Expert Systems with Applications

Volume 38, Issue 3, 2011, Pages 1565-1574

Multi-goal Q-learning of cooperative teams

(3) Li, Jing a,b Sheng, Zhaohan a Ng, Kwanchew a

a NANJING UNIVERSITY (China)

b NANJING AGRICULTURAL UNIVERSITY (China)

Author keywords

Cooperative team; Multi agent learning; Multi goal learning; Q learning

Indexed keywords

COGNITIVE MAPS; COOPERATIVE TEAMS; LEARNING GOALS; LEARNING PERFORMANCE; MULTI-AGENT LEARNING; MULTI-GOAL LEARNING; OPTIMAL ACTIONS; PARAMETER VALUES; Q-LEARNING; Q-LEARNING ALGORITHMS; VIRTUAL TEAM;

LEARNING ALGORITHMS;

EID: 78049530668 PISSN: 09574174 EISSN: None Source Type: Journal
DOI: 10.1016/j.eswa.2010.07.071 Document Type: Article

Times cited: (10)

References (21)

1
- 33847068620
- Simulating knowledge dynamics in innovation networks
- P. Ahrweiler, A. Pyka, and N. Gilbert Simulating knowledge dynamics in innovation networks R. Leombruni, M. Richiardi, Industry and labor dynamics: The agent-based computational economics approach 2004 World Scientific Press Singapore 284 296
- (2004) Industry and Labor Dynamics: The Agent-based Computational Economics Approach , pp. 284-296
- Ahrweiler, P.¹ Pyka, A.² Gilbert, N.³

2
- 56549086866
- R. Bergmann et al. (Eds.), MATES 2008. LNAI
- Akchurina, N. (2008). Optimistic-pessimistic Q-learning algorithm for multi-agent systems. In R. Bergmann et al. (Eds.), MATES 2008. LNAI (Vol. 5244, pp. 13-24).
- (2008) Optimistic-pessimistic Q-learning Algorithm for Multi-agent Systems , vol.5244 , pp. 13-24
- Akchurina, N.¹

3
- 34548099216
- Shaping multi-agent systems with gradient reinforcement learning
- O. Buffet, A. Dutech, and F. Charpillet Shaping multi-agent systems with gradient reinforcement learning Autonomous Agents and Multi-agent Systems 15 2007 197 220
- (2007) Autonomous Agents and Multi-agent Systems , vol.15 , pp. 197-220
- Buffet, O.¹ Dutech, A.² Charpillet, F.³

4
- 53349100494
- A reinforcement learning model for supply chain ordering management: An application to the beer game
- S.K. Chaharsooghi, J. Heydari, and S.H. Zegordi A reinforcement learning model for supply chain ordering management: An application to the beer game Decision Support Systems 45 2008 949 959
- (2008) Decision Support Systems , vol.45 , pp. 949-959
- Chaharsooghi, S.K.¹ Heydari, J.² Zegordi, S.H.³

5
- 53849147885
- Dynamic packaging in e-retailing with stochastic demand over finite horizons: A Q-learning approach
- Y. Cheng Dynamic packaging in e-retailing with stochastic demand over finite horizons: A Q-learning approach Expert Systems with Applications 36 2009 472 480
- (2009) Expert Systems with Applications , vol.36 , pp. 472-480
- Cheng, Y.¹

6
- 78049528693
- Learning multi-goal dialogue strategies using reinforcement learning with reduced state-action spaces
- Pittsburgh, Pennsylvania
- Cuayáhuitl, H.; Renals, S.; Lemon, O.; & Shimodaira, H. (2006). Learning multi-goal dialogue strategies using reinforcement learning with reduced state-action spaces. In INTERSPEECH 2006-ICSLP, Pittsburgh, Pennsylvania (Vol. 9, pp. 17-21).
- (2006) INTERSPEECH 2006-ICSLP , vol.9 , pp. 17-21
- Cuayáhuitl, H.¹ Renals, S.² Lemon, O.³ Shimodaira, H.⁴

7
- 0034246487
- Target reaching by using visual information and Q-learning controllers
- C. Distante, A. Anglani, and F. Taurisano Target reaching by using visual information and Q-learning controllers Autonomous Robots 9 2000 41 50
- (2000) Autonomous Robots , vol.9 , pp. 41-50
- Distante, C.¹ Anglani, A.² Taurisano, F.³

8
- 14344266002
- Learning rates for Q-learning
- E. Even-Dar, and Y. Mansour Learning rates for Q-learning Journal of Machine Learning Research 5 2003 1 25
- (2003) Journal of Machine Learning Research , vol.5 , pp. 1-25
- Even-Dar, E.¹ Mansour, Y.²

9
- 2342578813
- Learning behavior-selection by emotions and cognition in a multi-goal robot task
- S.C. Gadanho Learning behavior-selection by emotions and cognition in a multi-goal robot task Journal of Machine Learning Research 4 2003 385 412
- (2003) Journal of Machine Learning Research , vol.4 , pp. 385-412
- Gadanho, S.C.¹

10
- 33847031724
- Learning in innovation networks: Some simulation experiments
- N. Gilbert, P. Ahrweiler, and A. Pyka Learning in innovation networks: Some simulation experiments Physica A 378 2007 100 109
- (2007) Physica A , vol.378 , pp. 100-109
- Gilbert, N.¹ Ahrweiler, P.² Pyka, A.³

11
- 2942538007
- Innovation networks-A simulation approach
- N. Gilbert, A. Pyka, and P. Ahrweiler Innovation networks-A simulation approach Journal of Artificial Societies and Social Simulation 4 2001
- (2001) Journal of Artificial Societies and Social Simulation , vol.4
- Gilbert, N.¹ Pyka, A.² Ahrweiler, P.³

12
- 0035978635
- Modular Q-learning based multi-agent cooperation for robot soccer
- K. Park, Y. Kim, and J. Kim Modular Q-learning based multi-agent cooperation for robot soccer Robotics and Autonomous Systems 35 2001 109 122
- (2001) Robotics and Autonomous Systems , vol.35 , pp. 109-122
- Park, K.¹ Kim, Y.² Kim, J.³

13
- 0004102479
- MIT Press Cambridge, MA
- R.S. Sutton, and A.G. Barto Reinforcement learning: An introduction 1998 MIT Press Cambridge, MA
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

14
- 2042544751
- Multi-agent learning for routing control within an Internet environment
- P.R.J. Tillotson, Q.H. Wu, and P.M. Hughes Multi-agent learning for routing control within an Internet environment Engineering Applications of Artificial Intelligence 17 2004 179 185
- (2004) Engineering Applications of Artificial Intelligence , vol.17 , pp. 179-185
- Tillotson, P.R.J.¹ Wu, Q.H.² Hughes, P.M.³

15
- 31344450384
- An evolutionary dynamical analysis of multi-agent learning in iterated games
- K. Tuyls, P.J. Hoen, and B. Vanschoenwinkel An evolutionary dynamical analysis of multi-agent learning in iterated games Autonomous Agents and Multi-agent Systems 12 2006 115 153
- (2006) Autonomous Agents and Multi-agent Systems , vol.12 , pp. 115-153
- Tuyls, K.¹ Hoen, P.J.² Vanschoenwinkel, B.³

16
- 0001838252
- An illustration of the essential difference between individual and social learning, and its consequences for computational analyses
- N.J. Vriend An illustration of the essential difference between individual and social learning, and its consequences for computational analyses Journal of Economic Dynamics and Control 24 2000 1 19
- (2000) Journal of Economic Dynamics and Control , vol.24 , pp. 1-19
- Vriend, N.J.¹

17
- 50049118902
- Q-learning agents in a Cournot oligopoly model
- L. Waltman, and U. Kaymak Q-learning agents in a Cournot oligopoly model Journal of Economic Dynamics and Control 32 2008 3275 3293
- (2008) Journal of Economic Dynamics and Control , vol.32 , pp. 3275-3293
- Waltman, L.¹ Kaymak, U.²

18
- 34547899534
- A two-layered multi-agent reinforcement learning model and algorithm
- DOI 10.1016/j.jnca.2006.09.004, PII S1084804506000713
- B. Wang, Y. Gao, Z. Chen, J. Xie, and S. Chen A two-layered multi-agent reinforcement learning model and algorithm Journal of Network and Computer Applications 30 2007 1366 1376 (Pubitemid 47259418)
- (2007) Journal of Network and Computer Applications , vol.30 , Issue.4 , pp. 1366-1376
- Wang, B.-N.¹ Gao, Y.² Chen, Z.-Q.³ Xie, J.-Y.⁴ Chen, S.-F.⁵

19
- 0004049893
- PhD thesis, University of Cambridge, England
- Watkins, C. J. C. H. (1989). Learning from delayed rewards. PhD thesis, University of Cambridge, England.
- (1989) Learning from Delayed Rewards
- Watkins, C.J.C.H.¹

20
- 34249833101
- Technical note Q-learning
- C.J.C.H. Watkins Technical note Q-learning Machine Learning 8 1992 279 292
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.J.C.H.¹

21
- 35048843384
- Biologically inspired reinforcement learning: Reward-based decomposition for multi-goal environments
- A. J. Ijspeert et al. (Eds.) LNCS
- Zhou, W.; & Coggins, R. (2004). Biologically inspired reinforcement learning: Reward-based decomposition for multi-goal environments. In A. J. Ijspeert et al. (Eds.), BioADIT 2004. LNCS (Vol. 3141, pp. 80-94).
- (2004) BioADIT 2004 , vol.3141 , pp. 80-94
- Zhou, W.¹ Coggins, R.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.