SCOPUS 정보 검색 플랫폼

IEEE Transactions on Automatic Control

Volumn 48, Issue 11, 2003, Pages 1951-1961

Two-Person Zero-Sum Markov Games: Receding Horizon Approach

(2) Chang, Hyeong Soo a Marcus, Steven I b

a Sogang University (South Korea)

b UNIVERSITY OF MARYLAND (United States)

Author keywords

Hindsight optimization; Infinite horizon cost; Markov game; Receding horizon control; Rollout

Indexed keywords

APPROXIMATION THEORY; ERROR ANALYSIS; GAME THEORY; LINEAR CONTROL SYSTEMS; MARKOV PROCESSES; OPTIMIZATION;

HINDSIGHT OPTIMIZATION; HORIZON CONTROL;

OPTIMAL CONTROL SYSTEMS;

EID: 0344395590 PISSN: 00189286 EISSN: None Source Type: Journal
DOI: 10.1109/TAC.2003.819077 Document Type: Article

Times cited : (25)

References (34)

1
- 0001073374
- Rolling horizon procedures in non-homogeneous Markov decision processes
- J. M. Alden and R. L. Smith, "Rolling horizon procedures in non-homogeneous Markov decision processes," Oper. Res., vol. 40, pp. S183-S194, 1992.
- (1992) Oper. Res. , vol.40
- Alden, J.M.¹ Smith, R.L.²

2
- 0000611954
- Zero-sum Markov games and worst-cast optimal control of queuejng systems
- E. Altman, "Zero-sum Markov games and worst-cast optimal control of queuejng systems," QUESTA, vol. 21, pp. 415-447, 1995.
- (1995) QUESTA , vol.21 , pp. 415-447
- Altman, E.¹

3
- 0039564884
- Non zero-sum stochastic games in admission, service and routing control in queueing systems
- _, "Non zero-sum stochastic games in admission, service and routing control in queueing systems," QUESTA, vol. 23, pp. 259-279, 1996.
- (1996) QUESTA , vol.23 , pp. 259-279

4
- 0013297856
- Monotonicity of optimal policies in a zero sum game: A flow control model
- _, "Monotonicity of optimal policies in a zero sum game: A flow control model," Adv. Dyna. Games Applicat., pp. 269-286, 1994.
- (1994) Adv. Dyna. Games Applicat. , pp. 269-286

5
- 0033324964
- Neural approximators and team theory for dynamic routing: A receding horizon approach
- M. Baglietto, T. Parisini, and R. Zoppoli, "Neural approximators and team theory for dynamic routing: A receding horizon approach," Proc. IEEE Conf. Decision Control, pp. 3283-3288, 1999.
- (1999) Proc. IEEE Conf. Decision Control , pp. 3283-3288
- Baglietto, M.¹ Parisini, T.² Zoppoli, R.³

6
- 0004071782
- New York: Academic
- T. Basar and G. J. Olsder, Dynamic Noncooperative Game Theory. New York: Academic, 1995.
- (1995) Dynamic Noncooperative Game Theory
- Basar, T.¹ Olsder, G.J.²

7
- 0003565783
- Belmont: Athena Scientific
- D. P. Bertsekas, Dynamic Programming and Optimal Control. Belmont: Athena Scientific, 1995, vol. 1/2.
- (1995) Dynamic Programming and Optimal Control , vol.1-2
- Bertsekas, D.P.¹

8
- 0003487482
- Belmont, CA: Athena Scientific
- D. P. Bertsekas and J. Tsitsiklis, Neuro-Dynamic Programming. Belmont, CA: Athena Scientific, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.²

9
- 0346942368
- Decision-theoretic planning: Structural assumptions and computational leverage
- C. Boutilier, T. Dean, and S. Hanks, "Decision-theoretic planning: Structural assumptions and computational leverage," J. Artif. Intell. Res., vol. 11, pp. 1-94, 1999.
- (1999) J. Artif. Intell. Res. , vol.11 , pp. 1-94
- Boutilier, C.¹ Dean, T.² Hanks, S.³

10
- 0013205236
- Moving horizon control in dynamic games
- W. A. van den Broek, "Moving horizon control in dynamic games, " Comput. Econ. Finance, vol. 122, 1999.
- (1999) Comput. Econ. Finance , vol.122
- Van den Broek, W.A.¹

11
- 84955501513
- On-line scheduling via sampling
- H. S. Chang, R. Givan, and E. K. P. Chong, "On-line scheduling via sampling," in Proc. 5th Int. Conf. Artificial Intelligence Planning Scheduling, 2000, pp. 62-71.
- (2000) Proc. 5th Int. Conf. Artificial Intelligence Planning Scheduling , pp. 62-71
- Chang, H.S.¹ Givan, R.² Chong, E.K.P.³

12
- 0345300725
- submitted for publication
- _, "Parallel rollout for online solution of partially observable Markov decision processes,", 2003, submitted for publication.
- (2003) Parallel Rollout for Online Solution of Partially Observable Markov Decision Processes

13
- 0242677010
- Approximate receding horizon approach for Markov decision processes: Average reward case
- submitted for publication
- H. S. Chang and S. Marcus, "Approximate receding horizon approach for Markov decision processes: Average reward case," J. Math. Anal. Applicat., vol. 286, no. 2, pp. 636-651, 2001, submitted for publication.
- (2001) J. Math. Anal. Applicat. , vol.286 , Issue.2 , pp. 636-651
- Chang, H.S.¹ Marcus, S.²

14
- 0034439756
- A framework for simulation-based network control via hindsight optimization
- E. K. P. Chong, R. Givan, and H. S. Chang, "A framework for simulation-based network control via hindsight optimization," in Proc. 39th IEEE Conf. Decision Control, vol. 2000, pp. 1433-1438.
- Proc. 39th IEEE Conf. Decision Control , vol.2000 , pp. 1433-1438
- Chong, E.K.P.¹ Givan, R.² Chang, H.S.³

15
- 0036662695
- Moving horizon nash strategies for a military air operation
- June
- J. Cruz, M. Simaan, A. Gacic, and Y. Liu, "Moving horizon nash strategies for a military air operation," IEEE Trans. Aero. Electron. Syst., vol. 38, pp. 989-999, June 2002.
- (2002) IEEE Trans. Aero. Electron. Syst. , vol.38 , pp. 989-999
- Cruz, J.¹ Simaan, M.² Gacic, A.³ Liu, Y.⁴

16
- 0003989209
- New York: Springer-Verlag
- J. Filar and K. Vrieze, Competitive Markov Decision Processes. New York: Springer-Verlag, 1996.
- (1996) Competitive Markov Decision Processes
- Filar, J.¹ Vrieze, K.²

17
- 0344030849
- An algorithm for identifying the ergodic subchains and transient states of a stochastic matrix
- B. L. Fox and D. M. Landi, "An algorithm for identifying the ergodic subchains and transient states of a stochastic matrix," Commun. ACM, vol. 2, pp. 619-621, 1968.
- (1968) Commun. ACM , vol.2 , pp. 619-621
- Fox, B.L.¹ Landi, D.M.²

18
- 84880657398
- GIB: Steps toward an expert-level bridge-playing program
- M. L. Ginsberg, "GIB: steps toward an expert-level bridge-playing program," in Proc. IJCAI, 1999, pp. 584-589.
- (1999) Proc. IJCAI , pp. 584-589
- Ginsberg, M.L.¹

19
- 0000703817
- Stochastic games with zero stop probabilities
- M. Dresher, A. Tucker, and P. Wolfe, Eds. Princeton, NJ: Princeton Univ. Press
- D. Gillette, "Stochastic games with zero stop probabilities," in Contributions to the Theory of Games, M. Dresher, A. Tucker, and P. Wolfe, Eds. Princeton, NJ: Princeton Univ. Press, vol. III, pp. 179-187.
- Contributions to the Theory of Games , vol.3 , pp. 179-187
- Gillette, D.¹

20
- 0025502594
- Error bounds for rolling horizon policies in discrete-time markov control processes
- Oct.
- O. Hernández-Lerma and J. B. Lasserre, "Error bounds for rolling horizon policies in discrete-time markov control processes," IEEE Trans. Automat. Contr., vol. 35, pp. 1118-1124, Oct. 1990.
- (1990) IEEE Trans. Automat. Contr. , vol.35 , pp. 1118-1124
- Hernández-Lerma, O.¹ Lasserre, J.B.²

21
- 0015300160
- Team decision theory and information structures in optimal control problems-part I
- Feb.
- Y. -C. Ho and K. -C. Chu, "Team decision theory and information structures in optimal control problems-part I," IEEE Trans. Automat. Contr., vol. AC-17, pp. 15-22, Feb. 1972.
- (1972) IEEE Trans. Automat. Contr. , vol.AC-17 , pp. 15-22
- Ho, Y.-C.¹ Chu, K.-C.²

22
- 0004076283
- Amsterdam, The Netherlands: North-Holland
- L. Johansen, Lectures on Macroeconomic Planning. Amsterdam, The Netherlands: North-Holland, 1977.
- (1977) Lectures on Macroeconomic Planning
- Johansen, L.¹

23
- 9444295723
- Fast planning in stochastic games
- M. Kearns, Y. Mansour, and S. Singh, "Fast planning in stochastic games," in Proc. UAI, 2000.
- (2000) Proc. UAI
- Kearns, M.¹ Mansour, Y.² Singh, S.³

24
- 0024011239
- Optimal, infinite horizon feedback laws for a general class of constrained discrete time systems: Stability and moving-horizon approximations
- S. S. Keerthi and E. G. Gilbert, "Optimal, infinite horizon feedback laws for a general class of constrained discrete time systems: Stability and moving-horizon approximations," J. Optim. Theory Applicat., vol. 57, pp. 265-293, 1988.
- (1988) J. Optim. Theory Applicat. , vol.57 , pp. 0265-293
- Keerthi, S.S.¹ Gilbert, E.G.²

25
- 0025462720
- Receding horizon control of nonlinear system
- July
- D. Q. Mayne and H. Michalska, "Receding horizon control of nonlinear system," IEEE Trans. Automat. Contr., vol. 38, pp. 814-824, July 1990.
- (1990) IEEE Trans. Automat. Contr. , vol.38 , pp. 814-824
- Mayne, D.Q.¹ Michalska, H.²

26
- 0033135677
- Model predictive control: Past, present, and future
- M. Morari and J. H. Lee, "Model predictive control: Past, present, and future," Comput. Chem. Eng., vol. 23, pp. 667-682, 1999.
- (1999) Comput. Chem. Eng. , vol.23 , pp. 667-682
- Morari, M.¹ Lee, J.H.²

27
- 0002282886
- Markov Games - A survey
- E. Roxin, P. Liu, and R. Sternberg, Eds. New York: Marcel Dekker
- T. Parthasarathy and M. Stern, "Markov Games - A survey," in Differential Games and Control Theory II, E. Roxin, P. Liu, and R. Sternberg, Eds. New York: Marcel Dekker, 1977, pp. 1-46.
- (1977) Differential Games and Control Theory II , pp. 1-46
- Parthasarathy, T.¹ Stern, M.²

28
- 85102627959
- New York: Wiley
- M. L. Puterman, Markov Decision Processes: Discrete Stochastic Dynamic Programming, New York: Wiley, 1994.
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

29
- 0001172487
- Multichain Markov decision processes with a sample-path constraint: A decomposition approach
- K. W. Ross and R. Varadarajan, "Multichain Markov decision processes with a sample-path constraint: A decomposition approach," Math. Oper. Res., vol. 16, no. 1, pp. 195-207, 1991.
- (1991) Math. Oper. Res. , vol.16 , Issue.1 , pp. 195-207
- Ross, K.W.¹ Varadarajan, R.²

30
- 0000392613
- Stochastic games
- L. Shapley, "Stochastic games," in Proc. Nat. Acad. Sci., vol. 39, 1953, pp. 1095-1100.
- (1953) Proc. Nat. Acad. Sci. , vol.39 , pp. 1095-1100
- Shapley, L.¹

31
- 0029771680
- Approximations in dynamic zero-sum games
- M. Tidball and E. Airman, "Approximations in dynamic zero-sum games," SIAM J. Control Optim., vol. 34, no. 1, pp. 311-328, 1996.
- (1996) SIAM J. Control Optim. , vol.34 , Issue.1 , pp. 311-328
- Tidball, M.¹ Airman, E.²

32
- 0028497630
- Asynchronous stochastic approximation and Q-learning
- J. Tsitsiklis, "Asynchronous stochastic approximation and Q-learning," Mach. Learn., vol. 16, pp. 185-202, 1994.
- (1994) Mach. Learn. , vol.16 , pp. 185-202
- Tsitsiklis, J.¹

33
- 0013201693
- Discounted markov games: Generalized policy iteration method
- J. Van Der Wal, "Discounted markov games: Generalized policy iteration method," J. of Optim. Theory Applicat., vol. 25, no. 1, pp. 125-138, 1978.
- (1978) J. of Optim. Theory Applicat. , vol.25 , Issue.1 , pp. 125-138
- Van Der Wal, J.¹

34
- 0141824325
- Ph.D. dissertation, Eindhoven, The Netherlands
- _, "Stochastic dynamic programming: Successive approximations and nearly optimal strategies for Markov decision processes and Markov games," Ph.D. dissertation, Eindhoven, The Netherlands, 1980.
- (1980) Stochastic Dynamic Programming: Successive Approximations and Nearly Optimal Strategies for Markov Decision Processes and Markov Games

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.