메뉴 건너뛰기




Volumn 58, Issue 1, 2010, Pages 214-228

Partially observable Markov decision processes: A geometric technique and analysis

Author keywords

Analysis of algorithms; Artificial intelligence; Combinatorics; Computational complexity; Computers computer science; Dynamic programming; Markov; Mathematics

Indexed keywords

ANALYSIS OF ALGORITHMS; COMBINATORICS; CONVEX HULL; CONVEX POLYTOPES; DEGREE OF OBSERVABILITY; DISCOUNTED REWARD; FINITE STATE; GEOMETRIC TECHNIQUES; INFINITE HORIZONS; MINKOWSKI SUM; PARTIALLY OBSERVABLE MARKOV DECISION PROCESS;

EID: 77249107864     PISSN: 0030364X     EISSN: 15265463     Source Type: Journal    
DOI: 10.1287/opre.1090.0697     Document Type: Article
Times cited : (41)

References (67)
  • 1
    • 33645568007 scopus 로고    scopus 로고
    • An optimal lot-sizing and offline inspection policy in the case of nonrigid demand
    • Anily, S., A. Grosfeld-Nir. 2006. An optimal lot-sizing and offline inspection policy in the case of nonrigid demand. Oper. Res. 54(2) 311-323.
    • (2006) Oper. Res. , vol.54 , Issue.2 , pp. 311-323
    • Anily, S.1    Grosfeld-Nir, A.2
  • 2
    • 0037919794 scopus 로고
    • Optimal control of partially observable Markovian systems
    • Aoki, M. 1965. Optimal control of partially observable Markovian systems. J. Franklin Inst. 280 367-386.
    • (1965) J. Franklin Inst. , vol.280 , pp. 367-386
    • Aoki, M.1
  • 3
    • 50549213583 scopus 로고
    • Optimal control of Markov decision processes with incomplete state estimation
    • Astrom, K. J. 1965. Optimal control of Markov decision processes with incomplete state estimation. J. Math. Anal. Appl. 10 174-205.
    • (1965) J. Math. Anal. Appl. , vol.10 , pp. 174-205
    • Astrom, K.J.1
  • 6
    • 0000129333 scopus 로고
    • Equivalent comparisons of experiments
    • Blackwell, D. 1953. Equivalent comparisons of experiments. Ann. Math. Statist. 24 265-272.
    • (1953) Ann. Math. Statist. , vol.24 , pp. 265-272
    • Blackwell, D.1
  • 15
    • 0011718924 scopus 로고
    • Optimum maintenance with incomplete information
    • Eckles, J. E. 1968. Optimum maintenance with incomplete information. Oper. Res. 16 1058-1067.
    • (1968) Oper. Res. , vol.16 , pp. 1058-1067
    • Eckles, J.E.1
  • 16
    • 0011715912 scopus 로고
    • On a sequential Markovian decision procedure with incomplete information
    • Ehrenfeld, S. 1976. On a sequential Markovian decision procedure with incomplete information. Comput. Oper. Res. 3 39-48.
    • (1976) Comput. Oper. Res. , vol.3 , pp. 39-48
    • Ehrenfeld, S.1
  • 18
    • 0004808420 scopus 로고
    • On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes
    • Fernández-Gaucherand, E., A. Arapostathis, S. I. Marcus. 1991. On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes. Ann. Oper. Res. 29 439-470.
    • (1991) Ann. Oper. Res. , vol.29 , pp. 439-470
    • Fernández-Gaucherand, E.1    Arapostathis, A.2    Marcus, S.I.3
  • 19
    • 4344711578 scopus 로고    scopus 로고
    • From the zonetope construction to the Minkowski addition of convex polytopes
    • Fukuda, K. 2004. From the zonetope construction to the Minkowski addition of convex polytopes. J. Symbolic Comput. 38 1261-1272.
    • (2004) J. Symbolic Comput. , vol.38 , pp. 1261-1272
    • Fukuda, K.1
  • 20
    • 0001734192 scopus 로고
    • Minkowski addition of polytopes: Computational complexity and applications to Grobner bases
    • Gritzmann, P., B. Sturmfels. 1993. Minkowski addition of polytopes: Computational complexity and applications to Grobner bases. SIAM J. Discrete Math. 6(2) 246-269.
    • (1993) SIAM J. Discrete Math. , vol.6 , Issue.2 , pp. 246-269
    • Gritzmann, P.1    Sturmfels, B.2
  • 21
    • 0030134884 scopus 로고    scopus 로고
    • A two-state partially observable Markov decision process with uniformly distributed observations
    • Grosfeld-Nir, A. 1996. A two-state partially observable Markov decision process with uniformly distributed observations. Oper. Res. 44(3) 458-463.
    • (1996) Oper. Res. , vol.44 , Issue.3 , pp. 458-463
    • Grosfeld-Nir, A.1
  • 22
    • 34147157217 scopus 로고    scopus 로고
    • Control limits for two-state partially observable Markov decision processes
    • Grosfeld-Nir, A. 2007. Control limits for two-state partially observable Markov decision processes. Eur. J. Oper. Res. 182 300-304.
    • (2007) Eur. J. Oper. Res. , vol.182 , pp. 300-304
    • Grosfeld-Nir, A.1
  • 23
    • 84898987770 scopus 로고    scopus 로고
    • An improved policy iteration algorithm for partially observable MDPs
    • MIT Press, Cambridge, MA
    • Hansen, E. A. 1998a. An improved policy iteration algorithm for partially observable MDPs. Advances in Neural Inform. Processing Systems 10 (NIPS-97). MIT Press, Cambridge, MA, 1015-1021.
    • (1998) Advances in Neural Inform. Processing Systems 10 (NIPS-97) , pp. 1015-1021
    • Hansen, E.A.1
  • 26
    • 28544443262 scopus 로고    scopus 로고
    • On the existence of stationary optimal policies for partially observed MDPs under the long-run average cost criterion
    • Hsu, S.-P., D.-M. Chuang, A. Arapostathis. 2006. On the existence of stationary optimal policies for partially observed MDPs under the long-run average cost criterion. Systems Control Lett. 55 165-173.
    • (2006) Systems Control Lett. , vol.55 , pp. 165-173
    • Hsu, S.-P.1    Chuang, D.-M.2    Arapostathis, A.3
  • 27
    • 0032073263 scopus 로고    scopus 로고
    • Planning and acting in partially observable stochastic domains
    • Kaelbling, L. P., M. Littman, A. R. Cassandra. 1998. Planning and acting in partially observable stochastic domains. Artificial Intelligence 101 99-134.
    • (1998) Artificial Intelligence , vol.101 , pp. 99-134
    • Kaelbling, L.P.1    Littman, M.2    Cassandra, A.R.3
  • 28
    • 77249178097 scopus 로고
    • Optimum policies for partially observable Markov systems
    • Massachusetts Institute of Technology, Cambridge, MA
    • Kakalik, J. S. 1965. Optimum policies for partially observable Markov systems. Technical Report 18, Operations Research Center, Massachusetts Institute of Technology, Cambridge, MA.
    • (1965) Technical Report 18, Operations Research Center
    • Kakalik, J.S.1
  • 29
    • 0024628848 scopus 로고
    • A partially observable model of decision making by fishermen
    • Lane, D. 1989. A partially observable model of decision making by fishermen. Oper. Res. 37(2) 240-254.
    • (1989) Oper. Res. , vol.37 , Issue.2 , pp. 240-254
    • Lane, D.1
  • 31
    • 0023415789 scopus 로고
    • Some monotonicity results for partially observed Markov decision processes
    • Lovejoy, W. S. 1987. Some monotonicity results for partially observed Markov decision processes. Oper. Res. 35 (5) 736-743.
    • (1987) Oper. Res. , vol.35 , Issue.5 , pp. 736-743
    • Lovejoy, W.S.1
  • 32
    • 0000494894 scopus 로고
    • Computationally feasible bounds for partially observed Markov decision processes
    • Lovejoy, W. S. 1991a. Computationally feasible bounds for partially observed Markov decision processes. Oper. Res. 39 162-175.
    • (1991) Oper. Res. , vol.39 , pp. 162-175
    • Lovejoy, W.S.1
  • 33
    • 0002679852 scopus 로고
    • A survey of algorithmic methods for partially observed Markov decision processes
    • Lovejoy, W. S. 1991b. A survey of algorithmic methods for partially observed Markov decision processes. Ann. Oper. Res. 28 47-66.
    • (1991) Ann. Oper. Res. , vol.28 , pp. 47-66
    • Lovejoy, W.S.1
  • 34
    • 0036374190 scopus 로고    scopus 로고
    • Nonapproximability results for partially observable Markov decision processes
    • Lusena, C., J. Goldsmith, M. Mundhenk. 2001. Nonapproximability results for partially observable Markov decision processes. J. Artificial Intelligence Res. 14 83-103.
    • (2001) J. Artificial Intelligence Res. , vol.14 , pp. 83-103
    • Lusena, C.1    Goldsmith, J.2    Mundhenk, M.3
  • 35
    • 0033876565 scopus 로고    scopus 로고
    • Call admission control and routing in integrated service networks using neuro-dynamic programming
    • Marbach, P., O. Mihatsch, J. N. Tsitsiklis. 2000. Call admission control and routing in integrated service networks using neuro-dynamic programming. IEEE J. Selected Areas Commun. 18(2) 197-208.
    • (2000) IEEE J. Selected Areas Commun. , vol.18 , Issue.2 , pp. 197-208
    • Marbach, P.1    Mihatsch, O.2    Tsitsiklis, J.N.3
  • 38
    • 0019909899 scopus 로고
    • A survey of partially observable Markov decision processes: Theory, models and algorithms
    • Monahan, G. E. 1982. A survey of partially observable Markov decision processes: Theory, models and algorithms. Management Sci. 28 1-16.
    • (1982) Management Sci. , vol.28 , pp. 1-16
    • Monahan, G.E.1
  • 41
    • 0000977910 scopus 로고
    • The complexity of Markov decision processes
    • Papadimitriou, C. H., J. N. Tsitsiklis. 1987. The complexity of Markov decision processes. Math. Oper. Res. 12(3) 441-450.
    • (1987) Math. Oper. Res. , vol.12 , Issue.3 , pp. 441-450
    • Papadimitriou, C.H.1    Tsitsiklis, J.N.2
  • 45
    • 0019037868 scopus 로고
    • Optimal infinite-horizon undiscounted control of finite probabilistic systems
    • Platzman, L. K. 1980. Optimal infinite-horizon undiscounted control of finite probabilistic systems. SIAM J. Control Optim. 18(4) 362-380.
    • (1980) SIAM J. Control Optim. , vol.18 , Issue.4 , pp. 362-380
    • Platzman, L.K.1
  • 49
    • 0004267646 scopus 로고
    • Princeton University Press, Princeton, NJ
    • Rockafellar, R. T. 1970. Convex Analysis. Princeton University Press, Princeton, NJ.
    • (1970) Convex Analysis
    • Rockafellar, R.T.1
  • 50
    • 0000113607 scopus 로고
    • Quality control under Markovian deterioration
    • Ross, S. 1971. Quality control under Markovian deterioration. Management Sci. 17 587-596.
    • (1971) Management Sci. , vol.17 , pp. 587-596
    • Ross, S.1
  • 51
    • 0015665630 scopus 로고
    • Markovian decision processes with probabilistic observation of states
    • Satia, J. K., R. E. Lave. 1973. Markovian decision processes with probabilistic observation of states. Management Sci. 20 1-13.
    • (1973) Management Sci. , vol.20 , pp. 1-13
    • Satia, J.K.1    Lave, R.E.2
  • 52
    • 0015658957 scopus 로고
    • The optimal control of partially observable Markov processes over a finite horizon
    • Smallwood, R. D., E. J. Sondik. 1973. The optimal control of partially observable Markov processes over a finite horizon. Oper. Res. 21 1071-1088.
    • (1973) Oper. Res. , vol.21 , pp. 1071-1088
    • Smallwood, R.D.1    Sondik, E.J.2
  • 54
    • 0017943242 scopus 로고
    • The optimal control of partially observable Markov processes over the infinite horizon: Discounted costs
    • Sondik, E. J. 1978. The optimal control of partially observable Markov processes over the infinite horizon: Discounted costs. Oper. Res. 26 282-304.
    • (1978) Oper. Res. , vol.26 , pp. 282-304
    • Sondik, E.J.1
  • 55
    • 48049112994 scopus 로고
    • On the structure of Blackwell's equivalence classes of information systems
    • Sulganik, E. 1995. On the structure of Blackwell's equivalence classes of information systems. Math. Soc. Sci. 29 213-223.
    • (1995) Math. Soc. Sci. , vol.29 , pp. 213-223
    • Sulganik, E.1
  • 57
    • 0036564813 scopus 로고    scopus 로고
    • Adaptive inventory control for nonstationary demand and partial information
    • Treharne, J. T., C. R. Sox. 2002. Adaptive inventory control for nonstationary demand and partial information. Management Sci. 48(5) 607-624.
    • (2002) Management Sci. , vol.48 , Issue.5 , pp. 607-624
    • Treharne, J.T.1    Sox, C.R.2
  • 58
    • 14244251274 scopus 로고
    • Optimal replacement policy under unobservable states
    • Wang, R. 1977. Optimal replacement policy under unobservable states. J. Appl. Probab. 14 340-348.
    • (1977) J. Appl. Probab. , vol.14 , pp. 340-348
    • Wang, R.1
  • 59
    • 0038295445 scopus 로고
    • A Markov quality control process subject to partial observation
    • White, C. C. 1977. A Markov quality control process subject to partial observation. Management Sci. 23 843-852.
    • (1977) Management Sci. , vol.23 , pp. 843-852
    • White, C.C.1
  • 60
    • 0018443406 scopus 로고
    • Optimal control-limit strategies for a partially observed replacement problem
    • White, C. C. 1979. Optimal control-limit strategies for a partially observed replacement problem. Internat. J. Systems Sci. 10 321-331.
    • (1979) Internat. J. Systems Sci. , vol.10 , pp. 321-331
    • White, C.C.1
  • 61
    • 0019045547 scopus 로고
    • Monotone control laws for noisy, countable-state Markov chains
    • White, C. C. 1980. Monotone control laws for noisy, countable-state Markov chains. Eur. J. Oper. Res. 5 124-132.
    • (1980) Eur. J. Oper. Res. , vol.5 , pp. 124-132
    • White, C.C.1
  • 62
    • 34249925148 scopus 로고
    • A survey of solution techniques for the partially observed decision process
    • White, C. C. 1991. A survey of solution techniques for the partially observed decision process. Ann. Oper. Res. 32 215-230.
    • (1991) Ann. Oper. Res. , vol.32 , pp. 215-230
    • White, C.C.1
  • 63
    • 0024739631 scopus 로고
    • Solution procedures for partially observed Markov decision processes
    • White, C. C., W. T. Scherer. 1989. Solution procedures for partially observed Markov decision processes. Oper. Res. 37 791-797.
    • (1989) Oper. Res. , vol.37 , pp. 791-797
    • White, C.C.1    Scherer, W.T.2
  • 64
    • 0005951145 scopus 로고
    • Finite-memory suboptimal design for partially observed Markov decision processes
    • White, C. C., W. T. Scherer. 1994. Finite-memory suboptimal design for partially observed Markov decision processes. Oper. Res. 42 439-455.
    • (1994) Oper. Res. , vol.42 , pp. 439-455
    • White, C.C.1    Scherer, W.T.2
  • 65
    • 61349089285 scopus 로고    scopus 로고
    • On near optimality of the set of finite-state controllers for average cost POMDP
    • Yu, H., D. Bertsekas. 2008. On near optimality of the set of finite-state controllers for average cost POMDP. Math. Oper. Res. 33(1) 1-11.
    • (2008) Math. Oper. Res. , vol.33 , Issue.1 , pp. 1-11
    • Yu, H.1    Bertsekas, D.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.