SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 1886 LNAI, Issue , 2000, Pages 125-135

Experience-based reinforcement learning to acquire effective behavior in a multi-agent domain

(3) Arai, Sachiyo a Sycara, Katia a Payne, Terry R a

a CARNEGIE MELLON UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

INTELLIGENT AGENTS; MACHINE LEARNING; MONTE CARLO METHODS; PROFITABILITY; REINFORCEMENT LEARNING; WAGES; ARTIFICIAL INTELLIGENCE; COMPENSATION (PERSONNEL); MULTI AGENT SYSTEMS;

EFFECTIVE BEHAVIORS; MULTI AGENT; NON-COMBATANT EVACUATION; PROFIT SHARING; Q-LEARNING APPROACH; REINFORCEMENT LEARNING APPROACH; REINFORCEMENT LEARNING METHOD; SUBGOALS;

MULTI AGENT SYSTEMS; REINFORCEMENT LEARNING;

EID: 84867798807 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/3-540-44533-1_16 Document Type: Conference Paper

Times cited : (23)

References (17)

1
- 0004766989
- Generating cooperative behavior by multi-agent reinforcement learning
- Arai, S., Miyazaki, K., Kobayashi, S.: Generating Cooperative Behavior by Multi-Agent Reinforcement Learning, Proceedings of 6th European Workshop on Learning Robots p111-120 (1997).
- (1997) Proceedings of 6th European Workshop on Learning Robots , pp. 111-120
- Arai, S.¹ Miyazaki, K.² Kobayashi, S.³

2
- 0032668674
- Top-down search for coordinating the hierarchical plans of multiple agents
- Clement, J. Bradley and Durfee, H. Edmund: Top-Down Search for Coordinating the Hierarchical Plans of Multiple Agents. Proceedings of the 3rd International Conference on Autonomous Agents, pp252-259 (1999).
- (1999) Proceedings of the 3rd International Conference on Autonomous Agents , pp. 252-259
- Clement, J.B.¹ Durfee, H.E.²

3
- 0026998041
- Reinforcement learning with perceptual aliasing: The perceptual distinctions approach
- Chrisman, L.: Reinforcement learning with perceptual aliasing: The Perceptual Distinctions Approach, Proceedings of the 10th National Conference on Artificial Intelligence, pp.183-188 (1992).
- (1992) Proceedings of the 10th National Conference on Artificial Intelligence , pp. 183-188
- Chrisman, L.¹

4
- 0002887360
- An investigation into reactive planning in complex domains
- Firby, R.J., An Investigation into Reactive Planning in complex Domains, Proceedings of 10th National Conference on Artificial Intelligence '87, 202-206 (1987).
- (1987) Proceedings of 10th National Conference on Artificial Intelligence '87 , pp. 202-206
- Firby, R.J.¹

5
- 0020947621
- Communication and interaction in multi-agent planning
- Georgeff, M.P.: Communication and interaction in Multi-agent Planning, Proceedings of the 3rd National Conference on Artificial Intelligence, pp.125-129 (1983).
- (1983) Proceedings of the 3rd National Conference on Artificial Intelligence , pp. 125-129
- Georgeff, M.P.¹

6
- 0000146518
- Credit assignment in rule discovery systems based on genetic algorithms
- Grefenstette, J. J.: Credit Assignment in Rule Discovery Systems Based on Genetic Algorithms, Machine Learning Vol.3, pp.225-245(1988).
- (1988) Machine Learning , vol.3 , pp. 225-245
- Grefenstette, J.J.¹

7
- 0000929496
- Multiagent reinforcement learning: Theoretical framework and an algorithm
- Hu, Junling and Wellman, Michael P.: Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm, Proceedings of the 15th International Conference on Machine Learning, pp.242-250(1998).
- (1998) Proceedings of the 15th International Conference on Machine Learning , pp. 242-250
- Hu, J.¹ Wellman, M.P.²

8
- 85153938292
- Reinforcement learning algorithm for partially observable Markov decision problems
- Jaakkola, T., Singh, S.P. and Jordan, M.I.: Reinforcement Learning Algorithm for Partially Observable Markov decision Problems, Advances in Neural Information Processing Systems 7(NIPS-94), pp.345-352 (1994).
- (1994) Advances in Neural Information Processing Systems , vol.7 , Issue.NIPS-94 , pp. 345-352
- Jaakkola, T.¹ Singh, S.P.² Jordan, M.I.³

9
- 2342482919
- Instance-based utile distinctions for reinforcement learning with hidden state
- MacCallum, R. A.: Instance-Based Utile Distinctions for Reinforcement Learning with Hidden State, Proceedings of 12th International Conference on Machine Learning, pp387-395(1993).
- (1993) Proceedings of 12th International Conference on Machine Learning , pp. 387-395
- MacCallum, R.A.¹

10
- 0030647149
- Reinforcement learning in the multi-robot domain
- Mataric, J.M.: Reinforcement Learning in the Multi-Robot Domain, Autonomous Robots 4(1), pp.77-83(1997).
- (1997) Autonomous Robots , vol.4 , Issue.1 , pp. 77-83
- Mataric, J.M.¹

11
- 0000089014
- On the rationality of profit sharing in reinforcement learning
- Miyazaki, K., Yamamura, M. and Kobayashi, S.: On the Rationality of Profit Sharing in Reinforcement Learning, Proceedings of the 3rd International Conference on Fuzzy Logic, Neural Nets and Soft Computing, pp.285-288 (1994).
- (1994) Proceedings of the 3rd International Conference on Fuzzy Logic, Neural Nets and Soft Computing , pp. 285-288
- Miyazaki, K.¹ Yamamura, M.² Kobayashi, S.³

12
- 0004762325
- Multiagent coordination with learning classifier systems
- Weiss, G. and Sen, S.(eds.) Berlin, Heidelberg. Springer Verlag
- Sen, S. and Sekaran, M.: Multiagent Coordination with Learning Classifier Systems, in Weiss, G. and Sen, S.(eds.), Adaption and Learning in Multi-agent systems, Berlin, Heidelberg. Springer Verlag, pp.218-233(1995).
- (1995) Adaption and Learning in Multi-agent Systems , pp. 218-233
- Sen, S.¹ Sekaran, M.²

13
- 0029753630
- Reinforcement learning with replacing eligibility traces
- Singh, S.P. and Sutton, R.S.: Reinforcement Learning with Replacing Eligibility Traces, Machine Learning Vol.22, pp.1-37(1996).
- (1996) Machine Learning , vol.22 , pp. 1-37
- Singh, S.P.¹ Sutton, R.S.²

14
- 0032645144
- Team partitioned, opaque transition reinforcement learning
- Stone, P. and Veloso, M.: Team Partitioned, Opaque Transition Reinforcement Learning, Proceedings of the 3rd International Conference on Autonomous Agents, pp.206-212(1999).
- (1999) Proceedings of the 3rd International Conference on Autonomous Agents , pp. 206-212
- Stone, P.¹ Veloso, M.²

15
- 33847202724
- Learning to predict by the methods of temporal differences
- Sutton, R.S.: Learning to Predict by the Methods of Temporal Differences, Machine Learning, Vol. 3, pp.9-44(1988).
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

16
- 34249833101
- Technical note: Q-learning
- Watkins, C. J. H., and Dayan, P.: Technical note: Q-learning, Machine Learning Vol.8, pp.55-68(1992).
- (1992) Machine Learning , vol.8 , pp. 55-68
- Watkins, C.J.H.¹ Dayan, P.²

17
- 0004767188
- Active perception and reinforcement learning
- Whitehead, S. D. and Balland, D. H.: Active perception and Reinforcement Learning, Proceedings of the 1th International Conference on Machine Learning, pp.162-169(1990).
- (1990) Proceedings of the 1th International Conference on Machine Learning , pp. 162-169
- Whitehead, S.D.¹ Balland, D.H.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.