SCOPUS 정보 검색 플랫폼

Volumn E88-D, Issue 5, 2005, Pages 1004-1011

CHQ: A multi-agent reinforcement learning scheme for partially observable Markov decision processes

Author keywords

Multi agent system; Partially observable MDP; Q learning; Reinforcement learning

Indexed keywords

DECISION SUPPORT SYSTEMS; LEARNING SYSTEMS; MARKOV PROCESSES; PROBABILITY;

PARTIALLY OBSERVABLE MDP; Q-LEARNING; REINFORCEMENT LEARNING; STATE TRANSITIONS;

MULTI AGENT SYSTEMS;

EID: 24144454723 PISSN: 09168532 EISSN: None Source Type: Journal
DOI: 10.1093/ietisy/e88-d.5.1004 Document Type: Conference Paper

Times cited : (5)

References (12)

1
- 27844489585
- Prentice-Hall, Englewood Cliffs, NJ
- S.A. Barnett, Instinct and Intelligence: Behavior of Animals and Man, Prentice-Hall, Englewood Cliffs, NJ, 1967.
- (1967) Instinct and Intelligence: Behavior of Animals and Man
- Barnett, S.A.¹

3
- 0032093057
- Agent-mediated electronic commerce: A survey
- R.H. Guttman, A.C. Moukas, and P. Maes, "Agent-mediated electronic commerce: A survey,"Knowledge Engineering Review, vol.13, no.2, pp.147-159, 1998.
- (1998) Knowledge Engineering Review , vol.13 , Issue.2 , pp. 147-159
- Guttman, R.H.¹ Moukas, A.C.² Maes, P.³

4
- 0003673017
- PhD thesis, Carnegie Mellon University, Pittsburgh
- L. Lin, Reinforcement Learning for Robots Using Neural Networks, PhD thesis, Carnegie Mellon University, Pittsburgh, 1993.
- (1993) Reinforcement Learning for Robots Using Neural Networks
- Lin, L.¹

5
- 0027684215
- Prioritized sweeping: Reinforcement learning with less data and less time
- A. Moore and C.G. Atkeson, "Prioritized sweeping: Reinforcement learning with less data and less time,"Mach. Learn., vol. 13, pp.103-130, 1993.
- (1993) Mach. Learn. , vol.13 , pp. 103-130
- Moore, A.¹ Atkeson, C.G.²

6
- 33847202724
- Learning to predict by the method of temporal differences
- R.S. Sutton, "Learning to predict by the method of temporal differences,"Mach. Learn., vol.3, pp.9-44, 1998.
- (1998) Mach. Learn. , vol.3 , pp. 9-44
- Sutton, R.S.¹

8
- 16244381898
- Little Brown & Company
- N. Tinbergen, Animal Behavior, Little Brown & Company, 1974.
- (1974) Animal Behavior
- Tinbergen, N.¹

9
- 34249833101
- Technical note: Q-learning
- C.J.C.H. Watkins and P. Dayan, "Technical note: Q-Learning, "Mach. Learn., vol.8, pp.279-292, 1992.
- (1992) Mach. Learn. , vol.8 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

10
- 0031215211
- HQ-learning
- M. Wiering and J. Schmidhuber, "HQ-learning,"Adaptive Behavior, vol.6, no.2, 1997.
- (1997) Adaptive Behavior , vol.6 , Issue.2
- Wiering, M.¹ Schmidhuber, J.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.