2005, Pages 1594-1600

Solving multiagent Markov decision processes: A forest management example

Author keywords

Multiagent reinforcement learning; Multiagent systems; Stochastic dynamic programming

Indexed keywords

COMPLEXITY LEVELS; DESIGNING AGENTS; FINITE NUMBER; FIXED COST; GLOBAL PROBLEMS; LEARNING METHODS; MARKOV DECISION PROBLEM; MARKOV DECISION PROCESSES; MEMORY SPACE; MULTI-AGENT REINFORCEMENT LEARNING; MULTI-STAND; MULTIAGENT REINFORCEMENT LEARNING ALGORITHM; NEAR-OPTIMAL POLICIES; NEAR-OPTIMAL SOLUTIONS; OPTIMAL DECISION MAKING; OPTIMAL POLICIES; OPTIMAL SOLUTIONS; OPTIMAL STRATEGIES; PLANNING ALGORITHMS; PLANNING METHOD; REINFORCEMENT LEARNING TECHNIQUES; REWARD FUNCTION; SCHNEIDER; SEQUENTIAL DECISION MAKING; SIMULATION TECHNIQUE; SMALL SIZE; STOCHASTIC DYNAMIC PROGRAMMING; SUB-PROBLEMS; MULTI-AGENT MARKOV DECISION PROCESS

EID: 80053116521     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 5

References (13)
  • 1
    • 0141965747 (Scopus)
    • Bernstein, D. S., Zilberstein, S. & Immerman, N. (2000), The complexity of decentralized control of Markov decision processes, in 'Proc. of UAI'.
  • 5
    • 84859236073 (Scopus)
    • Dutech, A., Buffet, O. & Charpillet, F. (2001), Multiagent systems by incremental gradient reinforcement learning, in 'Proceedings of IJCAI'01'.
  • 6
    • 80053107067 (Scopus)
    • Garcia, F. & Sabbadin, R. (2001), Solving large weakly coupled Markov decision processes: Application to forest management, in 'MODSIM 2001'.
  • 7
    • 84880823326 (Scopus)
    • Nair, R., Tambe, M., Yokoo, M., Pynadath, D. & Marsella, S. (2003), Taming decentralized POMDPs: Towards efficient policy computation for multiagent settings, in 'IJCAI'03'.
  • 9
    • 0001395498 (Scopus)
    • Schneider, J., Wong, W.-K., Moore, A. & Riedmiller, M. (1999), Distributed value functions, in 'Proc. ICML 99', Morgan Kaufmann, San Francisco, CA, pp. 371-378.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.