-
4
-
-
0002807389
-
Estimation and control in Markov chains
-
P. Mandl, “Estimation and control in Markov chains,” Adv. Appl. Prob., vol. 6, pp. 40–60, 1974.
-
(1974)
Adv. Appl. Prob.
, vol.6
, pp. 40-60
-
-
Mandl, P.1
-
5
-
-
0018678571
-
Adaptive control of Markov chains, I: Finite parameter set
-
V. Borkar and P. Varaiya, “Adaptive control of Markov chains, I: Finite parameter set,” IEEE Trans. Automat. Contr., vol. AC-24, pp. 953–958, 1979.
-
(1979)
IEEE Trans. Automat. Contr.
, vol.AC-24
, pp. 953-958
-
-
Borkar, V.1
Varaiya, P.2
-
6
-
-
0020166640
-
Optimal adaptive controllers for unknown Markov chains
-
P. R. Kumar and W. Lin, “Optimal adaptive controllers for unknown Markov chains,” IEEE Trans. Automat. Contr., vol. AC-27, pp. 765–774, 1982.
-
(1982)
IEEE Trans. Automat. Contr.
, vol.AC-27
, pp. 765-774
-
-
Kumar, P.R.1
Lin, W.2
-
7
-
-
50549213583
-
Optimal control of Markov processes with incomplete state information
-
K. Astrom, “Optimal control of Markov processes with incomplete state information,” J. Math. Anal. Appl., vol. 10, pp. 174–205, 1965.
-
(1965)
J. Math. Anal. Appl.
, vol.10
, pp. 174-205
-
-
Astrom, K.1
-
8
-
-
0020113091
-
Decentralized control of finite state Markov processes
-
K. Hsu and S. I. Marcus, “Decentralized control of finite state Markov processes,” IEEE Trans. Automat. Contr., vol. AC-27, pp. 426–431, 1982.
-
(1982)
IEEE Trans. Automat. Contr.
, vol.AC-27
, pp. 426-431
-
-
Hsu, K.1
Marcus, S.I.2
-
9
-
-
0001700171
-
A Markovian decision process
-
R. E. Bellman, “A Markovian decision process,” J. Math. Mech., vol. 6, pp. 679–684, 1957.
-
(1957)
J. Math. Mech.
, vol.6
, pp. 679-684
-
-
Bellman, R.E.1
-
10
-
-
0003122592
-
Dynamic programming, Markov chains, and the method of successive approximations
-
D. J. White, “Dynamic programming, Markov chains, and the method of successive approximations,” J. Math. Anal. Appl., vol. 6, pp. 373–376, 1963.
-
(1963)
J. Math. Anal. Appl.
, vol.6
, pp. 373-376
-
-
White, D.J.1
-
11
-
-
0017980998
-
Optimal and suboptimal stationary controls for Markov chains
-
P. Varaiya, “Optimal and suboptimal stationary controls for Markov chains,” IEEE Trans. Automat. Contr., vol. AC-23, pp. 388–394, 1978.
-
(1978)
IEEE Trans. Automat. Contr.
, vol.AC-23
, pp. 388-394
-
-
Varaiya, P.1
-
12
-
-
0017961288
-
Multi layer control of large Markov chains
-
J.-P. Forestier and P. Varaiya, “Multi layer control of large Markov chains,” IEEE Trans. Automat. Contr., vol. AC-23, pp. 298–305, 1978.
-
(1978)
IEEE Trans. Automat. Contr.
, vol.AC-23
, pp. 298-305
-
-
Forestier, J.-P.1
Varaiya, P.2
-
13
-
-
0019527937
-
Recursive algorithms for adaptive control of finite Markov chains
-
Y. M. El-Fattah, “Recursive algorithms for adaptive control of finite Markov chains,” IEEE Trans. Syst., Man, Cybern., vol. SMC-11, pp. 135–144, 1981.
-
(1981)
IEEE Trans. Syst., Man, Cybern.
, vol.SMC-11
, pp. 135-144
-
-
El-Fattah, Y.M.1
-
14
-
-
0020114278
-
Learning control of finite Markov chains with unknown transition probabilities
-
M. Sato, K. Abe, and H. Takeda, “Learning control of finite Markov chains with unknown transition probabilities,” IEEE Trans. Automat. Contr., vol. AC-27, pp. 502–505, 1982.
-
(1982)
IEEE Trans. Automat. Contr.
, vol.AC-27
, pp. 502-505
-
-
Sato, M.1
Abe, K.2
Takeda, H.3
-
21
-
-
0016082525
-
Learning automata–A survey
-
K. S. Narendra and M. A. L. Thathachar, “Learning automata—A survey,” IEEE Trans. Syst., Man, Cybern., vol. SMC-4, pp. 323–334, 1974.
-
(1974)
IEEE Trans. Syst., Man, Cybern.
, vol.SMC-4
, pp. 323-334
-
-
Narendra, K.S.1
Thathachar, M.A.L.2
-
23
-
-
0016926517
-
Absolute expediency of Q- and S-model learning algorithms
-
S. Lakshmivarahan and M. A. L. Thathachar, “Absolute expediency of Q- and S-model learning algorithms,” IEEE Trans. Syst., Man, Cybern., vol. SMC-6, pp. 222–226, 1976.
-
(1976)
IEEE Trans. Syst., Man, Cybern.
, vol.SMC-6
, pp. 222-226
-
-
Lakshmivarahan, S.1
Thathachar, M.A.L.2
-
24
-
-
0022099763
-
Learning models for decentralized decision making
-
R. M. Wheeler, Jr. and K. S. Narendra, “Learning models for decentralized decision making,” Automatica, vol. 21, pp. 479–484, 1985.
-
(1985)
Automatica
, vol.21
, pp. 479-484
-
-
Wheeler, R.M.1
Narendra, K.S.2
-
25
-
-
0000268071
-
Learning algorithms for two-person zero-sum stochastic games with incomplete information
-
S. Lakshmivarahan and K. S. Narendra, “Learning algorithms for two-person zero-sum stochastic games with incomplete information,” Math. Oper. Res., vol. 6, pp. 379–386, 1981.
-
(1981)
Math. Oper. Res.
, vol.6
, pp. 379-386
-
-
Lakshmivarahan, S.1
Narendra, K.S.2
-
26
-
-
0020159814
-
Learning algorithms for two-person zero-sum stochastic games with incomplete information: A unified approach
-
S. Lakshmivarahan and K. S. Narendra, “Learning algorithms for two-person zero-sum stochastic games with incomplete information: A unified approach,” SIAM J. Contr. and Opt., vol. 20, pp. 541–552, 1982.
-
(1982)
SIAM J. Contr. and Opt.
, vol.20
, pp. 541-552
-
-
Lakshmivarahan, S.1
Narendra, K.S.2
-
27
-
-
0002021736
-
Equilibrium points in n-person games
-
J. F. Nash, “Equilibrium points in n-person games,” Proc. National Acad. Sci. USA, vol. 36, pp. 48–49, 1950.
-
(1950)
Proc. National Acad. Sci. USA
, vol.36
, pp. 48-49
-
-
Nash, J.F.1
-
28
-
-
0013550306
-
Example of a problem in the joint behavior of two automata
-
V. L. Stefanyuk, “Example of a problem in the joint behavior of two automata,” Automat. Telemekh., vol. 24, pp. 781–784, 1963.
-
(1963)
Automat. Telemekh.
, vol.24
, pp. 781-784
-
-
Stefanyuk, V.L.1
-
29
-
-
84939320230
-
One example of a game for many identical automata
-
S. L. Ginsburg, V. Y. Krylov, and M. L. Tsetlin, “One example of a game for many identical automata,” Automat. Telemekh., vol. 25, pp. 668–671, 1964.
-
(1964)
Automat. Telemekh.
, vol.25
, pp. 668-671
-
-
Ginsburg, S.L.1
Krylov, V.Y.2
Tsetlin, M.L.3
-
30
-
-
84939369952
-
Collective behavior and control problems
-
V. I. Varshavskii, “Collective behavior and control problems,” in Machine Intelligence, 3, D. Michie, Ed. Edinburgh: Edinburgh Univ., 1968.
-
(1968)
Machine Intelligence
, vol.3
-
-
Varshavskii, V.I.1
-
31
-
-
0015534574
-
Competitive and cooperative games of variable-structure stochastic automata
-
R. Viswanathan and K. S. Narendra, “Competitive and cooperative games of variable-structure stochastic automata,” J. Cybern., vol. 3, pp. 1–23, 1973.
-
(1973)
J. Cybern.
, vol.3
, pp. 1-23
-
-
Viswanathan, R.1
Narendra, K.S.2
-
32
-
-
0016483919
-
On the learning behavior of stochastic automata under a nonstationary random environment
-
N. Baba and Y. Sawaragi, “On the learning behavior of stochastic automata under a nonstationary random environment,” IEEE Trans. Syst., Man, Cybern., vol. SMC-5, pp. 273–275, 1975.
-
(1975)
IEEE Trans. Syst., Man, Cybern.
, vol.SMC-5
, pp. 273-275
-
-
Baba, N.1
Sawaragi, Y.2
-
33
-
-
84939351145
-
A new approach to the design of reinforcement schemes for learning automata
-
M. A. L. Thathachar and P. S. Sastry, “A new approach to the design of reinforcement schemes for learning automata,” Dept. Elec. Eng., Ind. Inst. Sci., Bangalore, India, Tech. Rep. EE/60, 1983.
-
(1983)
-
-
Thathachar, M.A.L.1
Sastry, P.S.2
-
34
-
-
0000446786
-
On expediency and convergence in variable-structure automata
-
B. Chandrasekaran and D. W. C. Shen, “On expediency and convergence in variable-structure automata,” IEEE Trans. Syst. Sci. Cybern., vol. SSC-4, pp. 52–60, 1968.
-
(1968)
IEEE Trans. Syst. Sci. Cybern.
, vol.SSC-4
, pp. 52-60
-
-
Chandrasekaran, B.1
Shen, D.W.C.2