Volume 48, Issue 11, 2003, Pages 1951-1961

Two-Person Zero-Sum Markov Games: Receding Horizon Approach

Author keywords

Hindsight optimization; Infinite horizon cost; Markov game; Receding horizon control; Rollout

Indexed keywords

APPROXIMATION THEORY; ERROR ANALYSIS; GAME THEORY; LINEAR CONTROL SYSTEMS; MARKOV PROCESSES; OPTIMIZATION;

EID: 0344395590     PISSN: 00189286     EISSN: None     Source Type: Journal    
DOI: 10.1109/TAC.2003.819077     Document Type: Article
Times cited : (25)

References (34)
  • 1
    • 0001073374
    • Rolling horizon procedures in non-homogeneous Markov decision processes
    • J. M. Alden and R. L. Smith, "Rolling horizon procedures in non-homogeneous Markov decision processes," Oper. Res., vol. 40, pp. S183-S194, 1992.
    • (1992) Oper. Res. , vol.40
    • Alden, J.M.1    Smith, R.L.2
  • 2
    • 0000611954
    • Zero-sum Markov games and worst-case optimal control of queueing systems
    • E. Altman, "Zero-sum Markov games and worst-case optimal control of queueing systems," QUESTA, vol. 21, pp. 415-447, 1995.
    • (1995) QUESTA , vol.21 , pp. 415-447
    • Altman, E.1
  • 3
    • 0039564884
    • Non zero-sum stochastic games in admission, service and routing control in queueing systems
    • E. Altman, "Non zero-sum stochastic games in admission, service and routing control in queueing systems," QUESTA, vol. 23, pp. 259-279, 1996.
    • (1996) QUESTA , vol.23 , pp. 259-279
  • 4
    • 0013297856
    • Monotonicity of optimal policies in a zero sum game: A flow control model
    • E. Altman, "Monotonicity of optimal policies in a zero sum game: A flow control model," Adv. Dyna. Games Applicat., pp. 269-286, 1994.
    • (1994) Adv. Dyna. Games Applicat. , pp. 269-286
  • 5
    • 0033324964
    • Neural approximators and team theory for dynamic routing: A receding horizon approach
    • M. Baglietto, T. Parisini, and R. Zoppoli, "Neural approximators and team theory for dynamic routing: A receding horizon approach," Proc. IEEE Conf. Decision Control, pp. 3283-3288, 1999.
    • (1999) Proc. IEEE Conf. Decision Control , pp. 3283-3288
    • Baglietto, M.1    Parisini, T.2    Zoppoli, R.3
  • 9
    • 0346942368
    • Decision-theoretic planning: Structural assumptions and computational leverage
    • C. Boutilier, T. Dean, and S. Hanks, "Decision-theoretic planning: Structural assumptions and computational leverage," J. Artif. Intell. Res., vol. 11, pp. 1-94, 1999.
    • (1999) J. Artif. Intell. Res. , vol.11 , pp. 1-94
    • Boutilier, C.1    Dean, T.2    Hanks, S.3
  • 10
    • 0013205236
    • Moving horizon control in dynamic games
    • W. A. van den Broek, "Moving horizon control in dynamic games," Comput. Econ. Finance, vol. 122, 1999.
    • (1999) Comput. Econ. Finance , vol.122
    • Van den Broek, W.A.1
  • 13
    • 0242677010
    • Approximate receding horizon approach for Markov decision processes: Average reward case
    • submitted for publication
    • H. S. Chang and S. Marcus, "Approximate receding horizon approach for Markov decision processes: Average reward case," J. Math. Anal. Applicat., vol. 286, no. 2, pp. 636-651, 2001, submitted for publication.
    • (2001) J. Math. Anal. Applicat. , vol.286 , Issue.2 , pp. 636-651
    • Chang, H.S.1    Marcus, S.2
  • 14
    • 0034439756
    • A framework for simulation-based network control via hindsight optimization
    • E. K. P. Chong, R. Givan, and H. S. Chang, "A framework for simulation-based network control via hindsight optimization," in Proc. 39th IEEE Conf. Decision Control, vol. 2000, pp. 1433-1438.
    • Proc. 39th IEEE Conf. Decision Control , vol.2000 , pp. 1433-1438
    • Chong, E.K.P.1    Givan, R.2    Chang, H.S.3
  • 15
    • 0036662695
    • Moving horizon Nash strategies for a military air operation
    • June
    • J. Cruz, M. Simaan, A. Gacic, and Y. Liu, "Moving horizon Nash strategies for a military air operation," IEEE Trans. Aero. Electron. Syst., vol. 38, pp. 989-999, June 2002.
    • (2002) IEEE Trans. Aero. Electron. Syst. , vol.38 , pp. 989-999
    • Cruz, J.1    Simaan, M.2    Gacic, A.3    Liu, Y.4
  • 17
    • 0344030849
    • An algorithm for identifying the ergodic subchains and transient states of a stochastic matrix
    • B. L. Fox and D. M. Landi, "An algorithm for identifying the ergodic subchains and transient states of a stochastic matrix," Commun. ACM, vol. 2, pp. 619-621, 1968.
    • (1968) Commun. ACM , vol.2 , pp. 619-621
    • Fox, B.L.1    Landi, D.M.2
  • 18
    • 84880657398
    • GIB: Steps toward an expert-level bridge-playing program
    • M. L. Ginsberg, "GIB: steps toward an expert-level bridge-playing program," in Proc. IJCAI, 1999, pp. 584-589.
    • (1999) Proc. IJCAI , pp. 584-589
    • Ginsberg, M.L.1
  • 19
    • 0000703817
    • Stochastic games with zero stop probabilities
    • M. Dresher, A. Tucker, and P. Wolfe, Eds. Princeton, NJ: Princeton Univ. Press
    • D. Gillette, "Stochastic games with zero stop probabilities," in Contributions to the Theory of Games, M. Dresher, A. Tucker, and P. Wolfe, Eds. Princeton, NJ: Princeton Univ. Press, vol. III, pp. 179-187.
    • Contributions to the Theory of Games , vol.3 , pp. 179-187
    • Gillette, D.1
  • 20
    • 0025502594
    • Error bounds for rolling horizon policies in discrete-time Markov control processes
    • Oct.
    • O. Hernández-Lerma and J. B. Lasserre, "Error bounds for rolling horizon policies in discrete-time Markov control processes," IEEE Trans. Automat. Contr., vol. 35, pp. 1118-1124, Oct. 1990.
    • (1990) IEEE Trans. Automat. Contr. , vol.35 , pp. 1118-1124
    • Hernández-Lerma, O.1    Lasserre, J.B.2
  • 21
    • 0015300160
    • Team decision theory and information structures in optimal control problems-part I
    • Feb.
    • Y.-C. Ho and K.-C. Chu, "Team decision theory and information structures in optimal control problems-part I," IEEE Trans. Automat. Contr., vol. AC-17, pp. 15-22, Feb. 1972.
    • (1972) IEEE Trans. Automat. Contr. , vol.AC-17 , pp. 15-22
    • Ho, Y.-C.1    Chu, K.-C.2
  • 24
    • 0024011239
    • Optimal, infinite horizon feedback laws for a general class of constrained discrete time systems: Stability and moving-horizon approximations
    • S. S. Keerthi and E. G. Gilbert, "Optimal, infinite horizon feedback laws for a general class of constrained discrete time systems: Stability and moving-horizon approximations," J. Optim. Theory Applicat., vol. 57, pp. 265-293, 1988.
    • (1988) J. Optim. Theory Applicat. , vol.57 , pp. 265-293
    • Keerthi, S.S.1    Gilbert, E.G.2
  • 25
    • 0025462720
    • Receding horizon control of nonlinear systems
    • July
    • D. Q. Mayne and H. Michalska, "Receding horizon control of nonlinear systems," IEEE Trans. Automat. Contr., vol. 35, pp. 814-824, July 1990.
    • (1990) IEEE Trans. Automat. Contr. , vol.35 , pp. 814-824
    • Mayne, D.Q.1    Michalska, H.2
  • 26
    • 0033135677
    • Model predictive control: Past, present, and future
    • M. Morari and J. H. Lee, "Model predictive control: Past, present, and future," Comput. Chem. Eng., vol. 23, pp. 667-682, 1999.
    • (1999) Comput. Chem. Eng. , vol.23 , pp. 667-682
    • Morari, M.1    Lee, J.H.2
  • 27
    • 0002282886
    • Markov Games - A survey
    • E. Roxin, P. Liu, and R. Sternberg, Eds. New York: Marcel Dekker
    • T. Parthasarathy and M. Stern, "Markov Games - A survey," in Differential Games and Control Theory II, E. Roxin, P. Liu, and R. Sternberg, Eds. New York: Marcel Dekker, 1977, pp. 1-46.
    • (1977) Differential Games and Control Theory II , pp. 1-46
    • Parthasarathy, T.1    Stern, M.2
  • 29
    • 0001172487
    • Multichain Markov decision processes with a sample-path constraint: A decomposition approach
    • K. W. Ross and R. Varadarajan, "Multichain Markov decision processes with a sample-path constraint: A decomposition approach," Math. Oper. Res., vol. 16, no. 1, pp. 195-207, 1991.
    • (1991) Math. Oper. Res. , vol.16 , Issue.1 , pp. 195-207
    • Ross, K.W.1    Varadarajan, R.2
  • 30
    • 0000392613
    • Stochastic games
    • L. Shapley, "Stochastic games," in Proc. Nat. Acad. Sci., vol. 39, 1953, pp. 1095-1100.
    • (1953) Proc. Nat. Acad. Sci. , vol.39 , pp. 1095-1100
    • Shapley, L.1
  • 31
    • 0029771680
    • Approximations in dynamic zero-sum games
    • M. Tidball and E. Altman, "Approximations in dynamic zero-sum games," SIAM J. Control Optim., vol. 34, no. 1, pp. 311-328, 1996.
    • (1996) SIAM J. Control Optim. , vol.34 , Issue.1 , pp. 311-328
    • Tidball, M.1    Altman, E.2
  • 32
    • 0028497630
    • Asynchronous stochastic approximation and Q-learning
    • J. Tsitsiklis, "Asynchronous stochastic approximation and Q-learning," Mach. Learn., vol. 16, pp. 185-202, 1994.
    • (1994) Mach. Learn. , vol.16 , pp. 185-202
    • Tsitsiklis, J.1
  • 33
    • 0013201693
    • Discounted Markov games: Generalized policy iteration method
    • J. Van Der Wal, "Discounted Markov games: Generalized policy iteration method," J. of Optim. Theory Applicat., vol. 25, no. 1, pp. 125-138, 1978.
    • (1978) J. of Optim. Theory Applicat. , vol.25 , Issue.1 , pp. 125-138
    • Van Der Wal, J.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.