메뉴 건너뛰기




Volumn 7, Issue , 1997, Pages 199-230

A model approximation scheme for planning in partially observable stochastic domains

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATION THEORY; MARKOV PROCESSES; STOCHASTIC SYSTEMS;

EID: 85016628903     PISSN: 10769757     EISSN: None     Source Type: Journal    
DOI: 10.1613/jair.419     Document Type: Article
Times cited : (22)

References (30)
  • 3
    • 0024680419 scopus 로고
    • Adaptive Aggregation for Infinite Horizon Dynamic Programming
    • Bertsekas, D. P., & Castanon, D. C. (1989). Adaptive Aggregation for Infinite Horizon Dynamic Programming. IEEE trans. on auto. control, vol 34, No 6.
    • (1989) IEEE Trans. on Auto. Control , vol.34 , Issue.6
    • Bertsekas, D.P.1    Castanon, D.C.2
  • 5
    • 0030349220 scopus 로고    scopus 로고
    • Computing optimal policies for partially observable decision processes using compact representations
    • Boutilier, C, & Poole, D. (1996). Computing optimal policies for partially observable decision processes using compact representations. In Proceedings of AAAI-96, 1168-1175.
    • (1996) Proceedings of AAAI-96 , pp. 1168-1175
    • Boutilier, C.1    Poole, D.2
  • 6
    • 0031385391 scopus 로고    scopus 로고
    • A heuristic variable grid solution method for POMDPs
    • Brafman, R. I. (1997). A heuristic variable grid solution method for POMDPs. In Proceedings of AAAI-97, 727-733.
    • (1997) Proceedings of AAAI-97 , pp. 727-733
    • Brafman, R.I.1
  • 14
  • 16
    • 0021486586 scopus 로고
    • The optimal search for a moving target when the search path is constrained
    • Eagle, J. N. (1984). The optimal search for a moving target when the search path is constrained. Operations Research, 32(5), 1107-1115.
    • (1984) Operations Research , vol.32 , Issue.5 , pp. 1107-1115
    • Eagle, J.N.1
  • 17
    • 0031385618 scopus 로고    scopus 로고
    • Incremental methods for computing bounds in partially observable Markov decision processes
    • Hauskrecht, M. (1997). Incremental methods for computing bounds in partially observable Markov decision processes. In Proceedings of AAAI-97, 734-739.
    • (1997) Proceedings of AAAI-97 , pp. 734-739
    • Hauskrecht, M.1
  • 20
    • 0002679852 scopus 로고
    • A survey of algorithmic methods for solving partially observable Markov decision processes
    • Lovejoy, W. S. (1991a). A survey of algorithmic methods for solving partially observable Markov decision processes. Annals of Operations Research, 28 (1), 47-65.
    • (1991) Annals of Operations Research , vol.28 , Issue.1 , pp. 47-65
    • Lovejoy, W.S.1
  • 21
    • 0000494894 scopus 로고
    • Computationally feasible bounds for partially observed Markov decision processes
    • Lovejoy, W. S. (1991b). Computationally feasible bounds for partially observed Markov decision processes. Operations Research, 39 (1), 162-175.
    • (1991) Operations Research , vol.39 , Issue.1 , pp. 162-175
    • Lovejoy, W.S.1
  • 22
    • 0019909899 scopus 로고
    • A survey of partially observable Markov decision processes: Theory, models, and algorithms
    • Monahan, G. E. (1982). A survey of partially observable Markov decision processes: theory, models, and algorithms. Management Science, 28 (1), 1-16.
    • (1982) Management Science , vol.28 , Issue.1 , pp. 1-16
    • Monahan, G.E.1
  • 23
    • 85168129602 scopus 로고
    • Approximating optimal polices for partially observable stochastic domains
    • Parr, R., & Russell, S. (1995). Approximating optimal polices for partially observable stochastic domains. In Proceedings of IJCAI-95, 1088-1094.
    • (1995) Proceedings of IJCAI-95 , pp. 1088-1094
    • Parr, R.1    Russell, S.2
  • 24
    • 0003438819 scopus 로고
    • Ph.D. Thesis, Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology
    • Platzman, L. K. (1977). Finite-memory estimation and control of finite probabilistic systems. Ph.D. Thesis, Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology.
    • (1977) Finite-memory Estimation and Control of Finite Probabilistic Systems
    • Platzman, L.K.1
  • 25
    • 77957101448 scopus 로고
    • Markov decision processes
    • D. P. Heyman and M. J. Sobel (eds.) Elsevier Science Publishers
    • Puterman, M. L. (1990). Markov decision processes. In D. P. Heyman and M. J. Sobel (eds.), Handbooks in OR & MS., Elsevier Science Publishers, Vol. 2, 331-434.
    • (1990) Handbooks in or & MS , vol.2 , pp. 331-434
    • Puterman, M.L.1
  • 27
    • 84936534549 scopus 로고
    • Southwest Fisheries Center Administrative Report H-79-13, National Marine Fisheries Service, Honolulu, Hawaii
    • Sondik, E. J., & Mendelssohn, R. (1979). Information seeking in Markov decision processes, Southwest Fisheries Center Administrative Report H-79-13, National Marine Fisheries Service, Honolulu, Hawaii.
    • (1979) Information Seeking in Markov Decision Processes
    • Sondik, E.J.1    Mendelssohn, R.2
  • 28
    • 0000893414 scopus 로고
    • Partially observed Markov decision processes: A survey
    • White III, C. C. (1991). Partially observed Markov decision processes: A survey. Annals of Operations Research, 32.
    • (1991) Annals of Operations Research , vol.32
    • White, I.I.I.C.C.1
  • 30
    • 0005951145 scopus 로고
    • Finite-memory suboptimal design for partially observed Markov decision processes
    • White III, C. C, & Scherer, W. T., (1994). Finite-memory suboptimal design for partially observed Markov decision processes. Operations Research, 42(3), 440-455.
    • (1994) Operations Research , vol.42 , Issue.3 , pp. 440-455
    • White, I.I.I.C.C.1    Scherer, W.T.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.