-
1
-
-
0001073374
-
Rolling horizon procedures in non-homogeneous Markov decision processes
-
J. M. Alden and R. L. Smith, "Rolling horizon procedures in non-homogeneous Markov decision processes," Oper. Res., vol. 40, pp. S183-S194, 1992.
-
(1992)
Oper. Res.
, vol.40
-
-
Alden, J.M.1
Smith, R.L.2
-
2
-
-
0000611954
-
Zero-sum Markov games and worst-cast optimal control of queuejng systems
-
E. Altman, "Zero-sum Markov games and worst-cast optimal control of queuejng systems," QUESTA, vol. 21, pp. 415-447, 1995.
-
(1995)
QUESTA
, vol.21
, pp. 415-447
-
-
Altman, E.1
-
3
-
-
0039564884
-
Non zero-sum stochastic games in admission, service and routing control in queueing systems
-
_, "Non zero-sum stochastic games in admission, service and routing control in queueing systems," QUESTA, vol. 23, pp. 259-279, 1996.
-
(1996)
QUESTA
, vol.23
, pp. 259-279
-
-
-
4
-
-
0013297856
-
Monotonicity of optimal policies in a zero sum game: A flow control model
-
_, "Monotonicity of optimal policies in a zero sum game: A flow control model," Adv. Dyna. Games Applicat., pp. 269-286, 1994.
-
(1994)
Adv. Dyna. Games Applicat.
, pp. 269-286
-
-
-
5
-
-
0033324964
-
Neural approximators and team theory for dynamic routing: A receding horizon approach
-
M. Baglietto, T. Parisini, and R. Zoppoli, "Neural approximators and team theory for dynamic routing: A receding horizon approach," Proc. IEEE Conf. Decision Control, pp. 3283-3288, 1999.
-
(1999)
Proc. IEEE Conf. Decision Control
, pp. 3283-3288
-
-
Baglietto, M.1
Parisini, T.2
Zoppoli, R.3
-
9
-
-
0346942368
-
Decision-theoretic planning: Structural assumptions and computational leverage
-
C. Boutilier, T. Dean, and S. Hanks, "Decision-theoretic planning: Structural assumptions and computational leverage," J. Artif. Intell. Res., vol. 11, pp. 1-94, 1999.
-
(1999)
J. Artif. Intell. Res.
, vol.11
, pp. 1-94
-
-
Boutilier, C.1
Dean, T.2
Hanks, S.3
-
10
-
-
0013205236
-
Moving horizon control in dynamic games
-
W. A. van den Broek, "Moving horizon control in dynamic games, " Comput. Econ. Finance, vol. 122, 1999.
-
(1999)
Comput. Econ. Finance
, vol.122
-
-
Van den Broek, W.A.1
-
11
-
-
84955501513
-
On-line scheduling via sampling
-
H. S. Chang, R. Givan, and E. K. P. Chong, "On-line scheduling via sampling," in Proc. 5th Int. Conf. Artificial Intelligence Planning Scheduling, 2000, pp. 62-71.
-
(2000)
Proc. 5th Int. Conf. Artificial Intelligence Planning Scheduling
, pp. 62-71
-
-
Chang, H.S.1
Givan, R.2
Chong, E.K.P.3
-
13
-
-
0242677010
-
Approximate receding horizon approach for Markov decision processes: Average reward case
-
submitted for publication
-
H. S. Chang and S. Marcus, "Approximate receding horizon approach for Markov decision processes: Average reward case," J. Math. Anal. Applicat., vol. 286, no. 2, pp. 636-651, 2001, submitted for publication.
-
(2001)
J. Math. Anal. Applicat.
, vol.286
, Issue.2
, pp. 636-651
-
-
Chang, H.S.1
Marcus, S.2
-
14
-
-
0034439756
-
A framework for simulation-based network control via hindsight optimization
-
E. K. P. Chong, R. Givan, and H. S. Chang, "A framework for simulation-based network control via hindsight optimization," in Proc. 39th IEEE Conf. Decision Control, vol. 2000, pp. 1433-1438.
-
Proc. 39th IEEE Conf. Decision Control
, vol.2000
, pp. 1433-1438
-
-
Chong, E.K.P.1
Givan, R.2
Chang, H.S.3
-
15
-
-
0036662695
-
Moving horizon nash strategies for a military air operation
-
June
-
J. Cruz, M. Simaan, A. Gacic, and Y. Liu, "Moving horizon nash strategies for a military air operation," IEEE Trans. Aero. Electron. Syst., vol. 38, pp. 989-999, June 2002.
-
(2002)
IEEE Trans. Aero. Electron. Syst.
, vol.38
, pp. 989-999
-
-
Cruz, J.1
Simaan, M.2
Gacic, A.3
Liu, Y.4
-
17
-
-
0344030849
-
An algorithm for identifying the ergodic subchains and transient states of a stochastic matrix
-
B. L. Fox and D. M. Landi, "An algorithm for identifying the ergodic subchains and transient states of a stochastic matrix," Commun. ACM, vol. 2, pp. 619-621, 1968.
-
(1968)
Commun. ACM
, vol.2
, pp. 619-621
-
-
Fox, B.L.1
Landi, D.M.2
-
18
-
-
84880657398
-
GIB: Steps toward an expert-level bridge-playing program
-
M. L. Ginsberg, "GIB: steps toward an expert-level bridge-playing program," in Proc. IJCAI, 1999, pp. 584-589.
-
(1999)
Proc. IJCAI
, pp. 584-589
-
-
Ginsberg, M.L.1
-
19
-
-
0000703817
-
Stochastic games with zero stop probabilities
-
M. Dresher, A. Tucker, and P. Wolfe, Eds. Princeton, NJ: Princeton Univ. Press
-
D. Gillette, "Stochastic games with zero stop probabilities," in Contributions to the Theory of Games, M. Dresher, A. Tucker, and P. Wolfe, Eds. Princeton, NJ: Princeton Univ. Press, vol. III, pp. 179-187.
-
Contributions to the Theory of Games
, vol.3
, pp. 179-187
-
-
Gillette, D.1
-
20
-
-
0025502594
-
Error bounds for rolling horizon policies in discrete-time markov control processes
-
Oct.
-
O. Hernández-Lerma and J. B. Lasserre, "Error bounds for rolling horizon policies in discrete-time markov control processes," IEEE Trans. Automat. Contr., vol. 35, pp. 1118-1124, Oct. 1990.
-
(1990)
IEEE Trans. Automat. Contr.
, vol.35
, pp. 1118-1124
-
-
Hernández-Lerma, O.1
Lasserre, J.B.2
-
21
-
-
0015300160
-
Team decision theory and information structures in optimal control problems-part I
-
Feb.
-
Y. -C. Ho and K. -C. Chu, "Team decision theory and information structures in optimal control problems-part I," IEEE Trans. Automat. Contr., vol. AC-17, pp. 15-22, Feb. 1972.
-
(1972)
IEEE Trans. Automat. Contr.
, vol.AC-17
, pp. 15-22
-
-
Ho, Y.-C.1
Chu, K.-C.2
-
24
-
-
0024011239
-
Optimal, infinite horizon feedback laws for a general class of constrained discrete time systems: Stability and moving-horizon approximations
-
S. S. Keerthi and E. G. Gilbert, "Optimal, infinite horizon feedback laws for a general class of constrained discrete time systems: Stability and moving-horizon approximations," J. Optim. Theory Applicat., vol. 57, pp. 265-293, 1988.
-
(1988)
J. Optim. Theory Applicat.
, vol.57
, pp. 0265-293
-
-
Keerthi, S.S.1
Gilbert, E.G.2
-
25
-
-
0025462720
-
Receding horizon control of nonlinear system
-
July
-
D. Q. Mayne and H. Michalska, "Receding horizon control of nonlinear system," IEEE Trans. Automat. Contr., vol. 38, pp. 814-824, July 1990.
-
(1990)
IEEE Trans. Automat. Contr.
, vol.38
, pp. 814-824
-
-
Mayne, D.Q.1
Michalska, H.2
-
26
-
-
0033135677
-
Model predictive control: Past, present, and future
-
M. Morari and J. H. Lee, "Model predictive control: Past, present, and future," Comput. Chem. Eng., vol. 23, pp. 667-682, 1999.
-
(1999)
Comput. Chem. Eng.
, vol.23
, pp. 667-682
-
-
Morari, M.1
Lee, J.H.2
-
27
-
-
0002282886
-
Markov Games - A survey
-
E. Roxin, P. Liu, and R. Sternberg, Eds. New York: Marcel Dekker
-
T. Parthasarathy and M. Stern, "Markov Games - A survey," in Differential Games and Control Theory II, E. Roxin, P. Liu, and R. Sternberg, Eds. New York: Marcel Dekker, 1977, pp. 1-46.
-
(1977)
Differential Games and Control Theory II
, pp. 1-46
-
-
Parthasarathy, T.1
Stern, M.2
-
29
-
-
0001172487
-
Multichain Markov decision processes with a sample-path constraint: A decomposition approach
-
K. W. Ross and R. Varadarajan, "Multichain Markov decision processes with a sample-path constraint: A decomposition approach," Math. Oper. Res., vol. 16, no. 1, pp. 195-207, 1991.
-
(1991)
Math. Oper. Res.
, vol.16
, Issue.1
, pp. 195-207
-
-
Ross, K.W.1
Varadarajan, R.2
-
30
-
-
0000392613
-
Stochastic games
-
L. Shapley, "Stochastic games," in Proc. Nat. Acad. Sci., vol. 39, 1953, pp. 1095-1100.
-
(1953)
Proc. Nat. Acad. Sci.
, vol.39
, pp. 1095-1100
-
-
Shapley, L.1
-
31
-
-
0029771680
-
Approximations in dynamic zero-sum games
-
M. Tidball and E. Airman, "Approximations in dynamic zero-sum games," SIAM J. Control Optim., vol. 34, no. 1, pp. 311-328, 1996.
-
(1996)
SIAM J. Control Optim.
, vol.34
, Issue.1
, pp. 311-328
-
-
Tidball, M.1
Airman, E.2
-
32
-
-
0028497630
-
Asynchronous stochastic approximation and Q-learning
-
J. Tsitsiklis, "Asynchronous stochastic approximation and Q-learning," Mach. Learn., vol. 16, pp. 185-202, 1994.
-
(1994)
Mach. Learn.
, vol.16
, pp. 185-202
-
-
Tsitsiklis, J.1
-
33
-
-
0013201693
-
Discounted markov games: Generalized policy iteration method
-
J. Van Der Wal, "Discounted markov games: Generalized policy iteration method," J. of Optim. Theory Applicat., vol. 25, no. 1, pp. 125-138, 1978.
-
(1978)
J. of Optim. Theory Applicat.
, vol.25
, Issue.1
, pp. 125-138
-
-
Van Der Wal, J.1
|