-
1
-
-
0000611954
-
Zero-sum Markov games and worst-cast optimal control of queueing systems
-
E. Altman, "Zero-sum Markov games and worst-cast optimal control of queueing systems," Queueing Syst., Theory Appl., vol.21, pp. 415-447, 1995.
-
(1995)
Queueing Syst., Theory Appl.
, vol.21
, pp. 415-447
-
-
Altman, E.1
-
2
-
-
0037709910
-
The nonstochastic multiarmed bandit problem
-
P. Auer, N. Cesa-Bianchi, Y. Freund, and R. E. Schapire, "The nonstochastic multiarmed bandit problem," SIAM J. Comput., vol.32, no.1, pp. 48-77, 2002.
-
(2002)
SIAM J. Comput.
, vol.32
, Issue.1
, pp. 48-77
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.E.4
-
4
-
-
0344395590
-
Two-person zero-sum Markov games: Receding horizon approach
-
Nov.
-
H. S. Chang and S. I. Marcus, "Two-person zero-sum Markov games: Receding horizon approach," IEEE Trans. Autom. Control, vol.48, no.11, pp. 1951-1961, Nov. 2003.
-
(2003)
IEEE Trans. Autom. Control
, vol.48
, Issue.11
, pp. 1951-1961
-
-
Chang, H.S.1
Marcus, S.I.2
-
6
-
-
9444295723
-
Fast planning in stochastic games
-
M. Kearns, Y. Mansour, and S. Singh, "Fast planning in stochastic games," in Proc. 16th Conf. Uncertainty Artif. Intell., 2000, pp. 309-316.
-
(2000)
Proc. 16th Conf. Uncertainty Artif. Intell.
, pp. 309-316
-
-
Kearns, M.1
Mansour, Y.2
Singh, S.3
-
7
-
-
0036013019
-
The sample average approximation method for stochastic discrete optimization
-
A. J. Kleywegt, A. Shapiro, and T. Homem-De-Mello, "The sample average approximation method for stochastic discrete optimization," SIAM J. Optim., vol.12, no.2, pp. 479-502, 2001.
-
(2001)
SIAM J. Optim.
, vol.12
, Issue.2
, pp. 479-502
-
-
Kleywegt, A.J.1
Shapiro, A.2
Homem-De-Mello, T.3
-
8
-
-
0000268071
-
Learning algorithms for twoperson zero-sum stochastic games with incomplete information
-
S. Lakshmivarahan and K. S. Narendra, "Learning algorithms for twoperson zero-sum stochastic games with incomplete information," Math. Oper. Res., vol.6, pp. 379-386, 1981.
-
(1981)
Math. Oper. Res.
, vol.6
, pp. 379-386
-
-
Lakshmivarahan, S.1
Narendra, K.S.2
-
9
-
-
0020159814
-
Learning algorithms for twoperson zero-sum stochastic games with incomplete information: A unified approach
-
S. Lakshmivarahan and K. S. Narendra, "Learning algorithms for twoperson zero-sum stochastic games with incomplete information: A unified approach," SIAM J. Control Optim., vol.20, pp. 541-552, 1982.
-
(1982)
SIAM J. Control Optim.
, vol.20
, pp. 541-552
-
-
Lakshmivarahan, S.1
Narendra, K.S.2
-
10
-
-
0030212543
-
Finite time analysis of the pursuit algorithm for learning automata
-
Aug.
-
K. Rajaraman and P. S. Sastry, "Finite time analysis of the pursuit algorithm for learning automata," IEEE Trans. Syst., Man, Cybern. B, vol.26, no.4, pp. 590-598, Aug. 1996.
-
(1996)
IEEE Trans. Syst., Man, Cybern. B
, vol.26
, Issue.4
, pp. 590-598
-
-
Rajaraman, K.1
Sastry, P.S.2
-
11
-
-
84966203785
-
Some aspects of the sequential design of experiments
-
H. Robbins, "Some aspects of the sequential design of experiments," Bull. Amer. Math. Soc., vol.55, pp. 527-535, 1952.
-
(1952)
Bull. Amer. Math. Soc.
, vol.55
, pp. 527-535
-
-
Robbins, H.1
-
12
-
-
0028423534
-
Decentralized learning of Nash equilibria in multi-person stochastic games with incomplete information
-
May
-
P. S. Sastry, V. V. Phansalkar, and M. A. L. Thathachar, "Decentralized learning of Nash equilibria in multi-person stochastic games with incomplete information," IEEE Trans. Syst., Man, Cybern., vol.24, no.5, pp. 769-777, May 1994.
-
(1994)
IEEE Trans. Syst., Man, Cybern.
, vol.24
, Issue.5
, pp. 769-777
-
-
Sastry, P.S.1
Phansalkar, V.V.2
Thathachar, M.A.L.3
-
13
-
-
0141824325
-
-
Ph.D. dissertation, Department of Mathematics, Technische Hogeschool Eindhoven, Eindhoven, The Netherlands
-
J. Van Der Wal, "Stochastic Dynamic Programming: Successive Approximations and Nearly Optimal Strategies for Markov Decision Processes and Markov Games," Ph.D. dissertation, Department of Mathematics, Technische Hogeschool Eindhoven, Eindhoven, The Netherlands, 1980.
-
(1980)
Stochastic Dynamic Programming: Successive Approximations and Nearly Optimal Strategies for Markov Decision Processes and Markov Games
-
-
Wal Der J.Van1
|