SCOPUS 정보 검색 플랫폼

MODSIM05 - International Congress on Modelling and Simulation: Advances and Applications for Management and Decision Making, Proceedings

Volumn , Issue , 2005, Pages 1594-1600

Solving multiagent markov decision processes: A forest management example

(2) Chadès, Iadine a Bouteiller, Bertrand a

a Unite de Biometrie et Intelligence Artificielle (France)

Author keywords

Multiagent reinforcement learning; Multiagent systems; Stochastic dynamic programming

Indexed keywords

COMPLEXITY LEVELS; DESIGNING AGENTS; FINITE NUMBER; FIXED COST; GLOBAL PROBLEMS; LEARNING METHODS; MARKOV DECISION PROBLEM; MARKOV DECISION PROCESSES; MEMORY SPACE; MULTI-AGENT REINFORCEMENT LEARNING; MULTI-STAND; MULTIAGENT REINFORCEMENT LEARNING ALGORITHM; NEAR-OPTIMAL POLICIES; NEAR-OPTIMAL SOLUTIONS; OPTIMAL DECISION MAKING; OPTIMAL POLICIES; OPTIMAL SOLUTIONS; OPTIMAL STRATEGIES; PLANNING ALGORITHMS; PLANNING METHOD; REINFORCEMENT LEARNING TECHNIQUES; REWARD FUNCTION; SCHNEIDER; SEQUENTIAL DECISION MAKING; SIMULATION TECHNIQUE; SMALL SIZE; STOCHASTIC DYNAMIC PROGRAMMING; SUB-PROBLEMS; MULTI-AGENT MARKOV DECISION PROCESS;

CONVERGENCE OF NUMERICAL METHODS; COST ACCOUNTING; DECISION MAKING; DYNAMIC PROGRAMMING; FORESTRY; LEARNING ALGORITHMS; MARKOV PROCESSES; MULTI AGENT SYSTEMS; OPTIMAL SYSTEMS; OPTIMIZATION; PLANT EXTRACTS; REINFORCEMENT; REINFORCEMENT LEARNING; BEHAVIORAL RESEARCH; LEARNING SYSTEMS;

PROBLEM SOLVING; LEARNING ALGORITHMS;

EID: 80053116521 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (5)

References (13)

1
- 0141965747
- The complexity of decentralized control of markov decision processes
- Bernstein, D. S., Zilberstein, S. & Immerman, N. (2000), The complexity of decentralized control of markov decision processes, in 'Proc. of UAI'.
- (2000) Proc. of UAI
- Bernstein, D.S.¹ Zilberstein, S.² Immerman, N.³

2
- 34548187333
- PhD thesis, Universite Nancy 1, LORIA
- Chadès, I. (2003), Planification distribuée dans les SMA l'aide de processus décisionnels deMarkov, PhD thesis, Universite Nancy 1, LORIA.
- (2003) Planification Distribuée Dans Les SMA l'Aide de Processus Décisionnels DeMarkov
- Chadès, I.¹

3
- 80053096157
- Multiple equilibria solution for the multi-agent mdp coordination problem
- Chadès, I. (2004), Multiple equilibria solution for the multi-agent mdp coordination problem, in 'Workshop of Multi-Agent Markov Decision Processes: Theories and Models, ECAI04'.
- (2004) Workshop of Multi-Agent Markov Decision Processes: Theories and Models, ECAI04
- Chadès, I.¹

4
- 0036040313
- A heuristic approach for solving decentralized-pomdp: Assessment on the pursuit problem
- Chadès, I., Scherrer, B. & Charpillet, F. (2002), A heuristic approach for solving decentralized-pomdp: Assessment on the pursuit problem, in 'the 2002 ACM Symposium on Applied Computing'.
- (2002) The 2002 ACM Symposium on Applied Computing
- Chadès, I.¹ Scherrer, B.² Charpillet, F.³

5
- 84859236073
- Multiagent systems by incremental gradient reinforcement learning
- Dutech, A., Buffet, O. & Charpillet, F. (2001), Multiagent systems by incremental gradient reinforcement learning, in 'Proceedings of IJCAI'01'.
- (2001) Proceedings of IJCAI'01
- Dutech, A.¹ Buffet, O.² Charpillet, F.³

6
- 80053107067
- Solving large weakly coupled markov decision processes: Application to forest management
- Garcia, F. & Sabbadin, R. (2001), Solving large weakly coupled markov decision processes: Application to forest management, in 'MODSIM 2001'.
- (2001) MODSIM 2001
- Garcia, F.¹ Sabbadin, R.²

7
- 84880823326
- Taming decentralized pomdps: Towards efficient policy computation for multiagent settings
- Nair, R., Tambe, M., Yokoo, M., Pynadath, D. & Marsella, S. (2003), Taming decentralized pomdps: Towards efficient policy computation for multiagent settings, in 'IJCAI'03'.
- (2003) IJCAI'03
- Nair, R.¹ Tambe, M.² Yokoo, M.³ Pynadath, D.⁴ Marsella, S.⁵

8
- 0003998452
- John Wiley and Sons, Inc., New York, USA
- Puterman, M. L. (1994), Markov Decision Processes-Discrete Stochastic Dynamic Programming, John Wiley and Sons, Inc., New York, USA.
- (1994) Markov Decision Processes-Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

9
- 0001395498
- Distributed value functions
- Morgan Kaufmann, San Francisco, CA
- Schneider, J., Wong, W.-K., Moore, A. & Riedmiller, M. (1999), Distributed value functions, in 'Proc. ICML 99', Morgan Kaufmann, San Francisco, CA, pp. 371-378.
- (1999) Proc. ICML 99 , pp. 371-378
- Schneider, J.¹ Wong, W.-K.² Moore, A.³ Riedmiller, M.⁴

10
- 1242265508
- Minimizing communication cost in a distributed bayesian network using a decentralized mdp
- Shen, J., Lesser, V. & Carver, N. (2003), Minimizing communication cost in a distributed bayesian network using a decentralized mdp, in 'the second international joint conference on Autonomous agents and multiagent systems', pp. 678-685.
- (2003) The Second International Joint Conference on Autonomous Agents and Multiagent Systems , pp. 678-685
- Shen, J.¹ Lesser, V.² Carver, N.³

11
- 4544279348
- Technical report, Stanford University
- Shoham, Y., Powers, R. & Grenager, T. (2003), Multiagent reinforcement learning: a critical survey, Technical report, Stanford University.
- (2003) Multiagent Reinforcement Learning: A Critical Survey
- Shoham, Y.¹ Powers, R.² Grenager, T.³

12
- 0004102479
- Bradford Book, MIT Press, Cambridge, MA
- Sutton, R. & Barto, G. (1998), Reinforcement Learning: an introduction, Bradford Book, MIT Press, Cambridge, MA.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, G.²

13
- 34249833101
- Q-learning
- Watkins, C. & Dayan, P. (1992), 'Q-learning', Machine Learning 8, 279-292.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.