메뉴 건너뛰기




Volumn 34, Issue , 2009, Pages 89-132

Policy iteration for decentralized control of markov decision processes

Author keywords

[No Author keywords available]

Indexed keywords

BEHAVIORAL RESEARCH; CONTROLLERS; DYNAMIC PROGRAMMING; ITERATIVE METHODS; LEARNING ALGORITHMS; MARKOV PROCESSES; MULTIPURPOSE ROBOTS; PROBABILITY DISTRIBUTIONS; SOFTWARE AGENTS; STOCHASTIC SYSTEMS;

EID: 65349083220     PISSN: None     EISSN: 10769757     Source Type: Journal    
DOI: 10.1613/jair.2667     Document Type: Article
Times cited : (87)

References (41)
  • 2
    • 50549213583 scopus 로고
    • Optimal control of Markov decision processes with incomplete state estimation
    • Astrom, K. J. (1965). Optimal control of Markov decision processes with incomplete state estimation. Journal of Mathematical Analysis and Applications, 10, 174-205.
    • (1965) Journal of Mathematical Analysis and Applications , vol.10 , pp. 174-205
    • Astrom, K.J.1
  • 3
    • 0002430114 scopus 로고
    • Subjectivity and correlation in randomized strategies
    • Aumann, R. J. (1974). Subjectivity and correlation in randomized strategies. Journal of Mathematical Economics, 1, 67-96.
    • (1974) Journal of Mathematical Economics , vol.1 , pp. 67-96
    • Aumann, R.J.1
  • 16
    • 0032073263 scopus 로고    scopus 로고
    • Planning and acting in partially observable stochastic domains
    • Kaelbling, L. P., Littman, M. L., & Cassandra, A. R. (1998). Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101(1-2), 99-134.
    • (1998) Artificial Intelligence , vol.101 , Issue.1-2 , pp. 99-134
    • Kaelbling, L.P.1    Littman, M.L.2    Cassandra, A.R.3
  • 17
    • 0141503453 scopus 로고    scopus 로고
    • Multi-agent influence diagrams for representing and solving games
    • Koller, D., & Milch, B. (2003). Multi-agent influence diagrams for representing and solving games. Games and Economic Behavior, 45(l), 181-221.
    • (2003) Games and Economic Behavior , vol.45 , Issue.L , pp. 181-221
    • Koller, D.1    Milch, B.2
  • 26
    • 0042658750 scopus 로고
    • A feasible computational approach to infinite-horizon partially- observed Markov decision processes
    • Tech. rep, Georgia Institute of Technology. Reprinted in Working Notes of the 1998 AAA I Fall Symposium on Planning Using Partially Observable Markov Decision Processes
    • Platzman, L. K. (1980). A feasible computational approach to infinite-horizon partially- observed Markov decision processes. Tech. rep., Georgia Institute of Technology. Reprinted in Working Notes of the 1998 AAA I Fall Symposium on Planning Using Partially Observable Markov Decision Processes.
    • (1980)
    • Platzman, L.K.1
  • 32
    • 0015658957 scopus 로고
    • The optimal control of partially observable Markov processes over a finite horizon
    • Small wood, R. D., & Sondik, E. J. (1973). The optimal control of partially observable Markov processes over a finite horizon. Operations Research, 21(h), 1071-1088.
    • (1973) Operations Research, 21(h) , pp. 1071-1088
    • Small wood, R.D.1    Sondik, E.J.2
  • 35
    • 0017943242 scopus 로고
    • The optimal control of partially observable Markov processes over the infinite horizon: Discounted costs
    • Sondik, E. J. (1978). The optimal control of partially observable Markov processes over the infinite horizon: Discounted costs. Operations Research, 26, 282-304.
    • (1978) Operations Research , vol.26 , pp. 282-304
    • Sondik, E.J.1
  • 39
    • 0015158656 scopus 로고
    • Separation of estimation and control for discrete time systems
    • Witsenhausen, H. S. (1971). Separation of estimation and control for discrete time systems. Proceedings of the IEEE, 55(11), 1557-1566.
    • (1971) Proceedings of the IEEE , vol.55 , Issue.11 , pp. 1557-1566
    • Witsenhausen, H.S.1
  • 41
    • 0036374229 scopus 로고    scopus 로고
    • Speeding up the convergence of value iteration in partially observable Markov decision processes
    • Zhang, N. L., & Zhang, W. (2001). Speeding up the convergence of value iteration in partially observable Markov decision processes. Journal of Artificial Intelligence Research, 14, 29-51.
    • (2001) Journal of Artificial Intelligence Research , vol.14 , pp. 29-51
    • Zhang, N.L.1    Zhang, W.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.