SCOPUS Information Retrieval Platform

IEEE Transactions on Automatic Control

Volume 31, Issue 6, 1986, Pages 519-526

Decentralized Learning in Finite Markov Chains

(2) Wheeler, Richard M. a; Narendra, Kumpati S. b

a SANDIA NATIONAL LABORATORIES (United States)

b YALE UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

CONTROL SYSTEMS, OPTIMAL; CONTROL SYSTEMS, STOCHASTIC; DECISION THEORY AND ANALYSIS; PROBABILITY - RANDOM PROCESSES;

DECENTRALIZED LEARNING; DISTRIBUTED DECISION-MAKING; FINITE MARKOV CHAINS; SEQUENTIAL STOCHASTIC GAMES;

SYSTEMS SCIENCE AND CYBERNETICS;

EID: 0022738693 PISSN: 00189286 EISSN: 15582523 Source Type: Journal
DOI: 10.1109/TAC.1986.1104342 Document Type: Article

Times cited : (86)

References (34)

1
- 0003644124
- R. A. Howard, Dynamic Programming and Markov Processes. Cambridge, MA: M.I.T. Press, 1960.
- (1960) Dynamic Programming and Markov Processes.
- Howard, R.A.¹

2
- 0003644137
- S. M. Ross, Applied Probability Models with Optimization Applications. San Francisco, CA: Holden-Day, 1970.
- (1970) Applied Probability Models with Optimization Applications.
- Ross, S.M.¹

3
- 0003421685
- C. Derman, Finite State Markovian Decision Processes. New York: Academic, 1970.
- (1970) Finite State Markovian Decision Processes.
- Derman, C.¹

4
- 0002807389
- Estimation and control in Markov chains
- P. Mandl, “Estimation and control in Markov chains,” Adv. Appl. Prob., vol. 6, pp. 40–60, 1974.
- (1974) Adv. Appl. Prob. , vol.6 , pp. 40-60
- Mandl, P.¹

5
- 0018678571
- Adaptive control of Markov chains, I: Finite parameter set
- V. Borkar and P. Varaiya, “Adaptive control of Markov chains, I: Finite parameter set,” IEEE Trans. Automat. Contr., vol. AC-24, pp. 953–958, 1979.
- (1979) IEEE Trans. Automat. Contr. , vol.AC-24 , pp. 953-958
- Borkar, V.¹ Varaiya, P.²

6
- 0020166640
- Optimal adaptive controllers for unknown Markov chains
- P. R. Kumar and W. Lin, “Optimal adaptive controllers for unknown Markov chains,” IEEE Trans. Automat. Contr., vol. AC-27, pp. 765–774, 1982.
- (1982) IEEE Trans. Automat. Contr. , vol.AC-27 , pp. 765-774
- Kumar, P.R.¹ Lin, W.²

7
- 50549213583
- Optimal control of Markov processes with incomplete state information
- K. Astrom, “Optimal control of Markov processes with incomplete state information,” J. Math. Anal. Appl., vol. 10, pp. 174–205, 1965.
- (1965) J. Math. Anal. Appl. , vol.10 , pp. 174-205
- Astrom, K.¹

8
- 0020113091
- Decentralized control of finite state Markov processes
- K. Hsu and S. I. Marcus, “Decentralized control of finite state Markov processes,” IEEE Trans. Automat. Contr., vol. AC-27, pp. 426–431, 1982.
- (1982) IEEE Trans. Automat. Contr. , vol.AC-27 , pp. 426-431
- Hsu, K.¹ Marcus, S.I.²

9
- 0001700171
- A Markovian decision process
- R. E. Bellman, “A Markovian decision process,” J. Math. Mech., vol. 6, pp. 679–684, 1957.
- (1957) J. Math. Mech. , vol.6 , pp. 679-684
- Bellman, R.E.¹

10
- 0003122592
- Dynamic programming, Markov chains, and the method of successive approximations
- D. J. White, “Dynamic programming, Markov chains, and the method of successive approximations,” J. Math. Anal. Appl., vol. 6, pp. 373–376, 1963.
- (1963) J. Math. Anal. Appl. , vol.6 , pp. 373-376
- White, D.J.¹

11
- 0017980998
- Optimal and suboptimal stationary controls for Markov chains
- P. Varaiya, “Optimal and suboptimal stationary controls for Markov chains,” IEEE Trans. Automat. Contr., vol. AC-23, pp. 388–394, 1978.
- (1978) IEEE Trans. Automat. Contr. , vol.AC-23 , pp. 388-394
- Varaiya, P.¹

12
- 0017961288
- Multi layer control of large Markov chains
- J.-P. Forestier and P. Varaiya, “Multi layer control of large Markov chains,” IEEE Trans. Automat. Contr., vol. AC-23, pp. 298–305, 1978.
- (1978) IEEE Trans. Automat. Contr. , vol.AC-23 , pp. 298-305
- Forestier, J.-P.¹ Varaiya, P.²

13
- 0019527937
- Recursive algorithms for adaptive control of finite Markov chains
- Y. M. El-Fattah, “Recursive algorithms for adaptive control of finite Markov chains,” IEEE Trans. Syst., Man, Cybern., vol. SMC-11, pp. 135–144, 1981.
- (1981) IEEE Trans. Syst., Man, Cybern. , vol.SMC-11 , pp. 135-144
- El-Fattah, Y.M.¹

14
- 0020114278
- Learning control of finite Markov chains with unknown transition probabilities
- M. Sato, K. Abe, and H. Takeda, “Learning control of finite Markov chains with unknown transition probabilities,” IEEE Trans. Automat. Contr., vol. AC-27, pp. 502–505, 1982.
- (1982) IEEE Trans. Automat. Contr. , vol.AC-27 , pp. 502-505
- Sato, M.¹ Abe, K.² Takeda, H.³

15
- 0003650765
- S. Lakshmivarahan, Learning Algorithms: Theory and Applications. New York: Springer-Verlag, 1981.
- (1981) Learning Algorithms: Theory and Applications.
- Lakshmivarahan, S.¹

16
- 0003781528
- R. R. Bush and F. Mosteller, Stochastic Models for Learning. New York: Wiley, 1958.
- (1958) Stochastic Models for Learning.
- Bush, R.R.¹ Mosteller, F.²

17
- 0004014191
- R. C. Atkinson, G. H. Bower, and E. J. Crothers, An Introduction to Mathematical Learning Theory. New York: Wiley, 1965.
- (1965) An Introduction to Mathematical Learning Theory.
- Atkinson, R.C.¹ Bower, G.H.² Crothers, E.J.³

18
- 0003722746
- M. F. Norman, Markov Processes and Learning Models. New York: Academic, 1972.
- (1972) Markov Processes and Learning Models.
- Norman, M.F.¹

19
- 0003799456
- J. M. Mendel and K. S. Fu, Eds., Adaptive, Learning and Pattern Recognition Systems. New York: Academic, 1970.
- (1970) Adaptive, Learning and Pattern Recognition Systems.
- Mendel, J.M.¹ Fu, K.S.²

20
- 0004162272
- M. L. Tsetlin, Automaton Theory and Modeling of Biological Systems. New York: Academic, 1973.
- (1973) Automaton Theory and Modeling of Biological Systems.
- Tsetlin, M.L.¹

21
- 0016082525
- Learning automata–A survey
- K. S. Narendra and M. A. L. Thathachar, “Learning automata—A survey,” IEEE Trans. Syst., Man, Cybern., vol. SMC-4, pp. 323–334, 1974.
- (1974) IEEE Trans. Syst., Man, Cybern. , vol.SMC-4 , pp. 323-334
- Narendra, K.S.¹ Thathachar, M.A.L.²

22
- 0017463231
- Learning automata–A critique
- K. S. Narendra and S. Lakshmivarahan, “Learning automata—A critique,” J. Cybern. Inf. Sci., vol. 4, pp. 53–66, 1977.
- (1977) J. Cybern. Inf. Sci. , vol.4 , pp. 53-66
- Narendra, K.S.¹ Lakshmivarahan, S.²

23
- 0016926517
- Absolute expediency of Q- and S-model learning algorithms
- S. Lakshmivarahan and M. A. L. Thathachar, “Absolute expediency of Q- and S-model learning algorithms,” IEEE Trans. Syst., Man, Cybern., vol. SMC-6, pp. 222–226, 1976.
- (1976) IEEE Trans. Syst., Man, Cybern. , vol.SMC-6 , pp. 222-226
- Lakshmivarahan, S.¹ Thathachar, M.A.L.²

24
- 0022099763
- Learning models for decentralized decision making
- R. M. Wheeler, Jr. and K. S. Narendra, “Learning models for decentralized decision making,” Automatica, vol. 21, pp. 479–484, 1985.
- (1985) Automatica , vol.21 , pp. 479-484
- Wheeler, R.M.¹ Narendra, K.S.²

25
- 0000268071
- Learning algorithms for two-person zero-sum stochastic games with incomplete information
- S. Lakshmivarahan and K. S. Narendra, “Learning algorithms for two-person zero-sum stochastic games with incomplete information,” Math. Oper. Res., vol. 6, pp. 379–386, 1981.
- (1981) Math. Oper. Res. , vol.6 , pp. 379-386
- Lakshmivarahan, S.¹ Narendra, K.S.²

26
- 0020159814
- Learning algorithms for two-person zero-sum stochastic games with incomplete information: A unified approach
- S. Lakshmivarahan and K. S. Narendra, “Learning algorithms for two-person zero-sum stochastic games with incomplete information: A unified approach,” SIAM J. Contr. and Opt., vol. 20, pp. 541–552, 1982.
- (1982) SIAM J. Contr. and Opt. , vol.20 , pp. 541-552
- Lakshmivarahan, S.¹ Narendra, K.S.²

27
- 0002021736
- Equilibrium points in n-person games
- J. F. Nash, “Equilibrium points in n-person games,” Proc. National Acad. Sci. USA, vol. 36, pp. 48–49, 1950.
- (1950) Proc. National Acad. Sci. USA , vol.36 , pp. 48-49
- Nash, J.F.¹

28
- 0013550306
- Example of a problem in the joint behavior of two automata
- V. L. Stefanyuk, “Example of a problem in the joint behavior of two automata,” Automat. Telemekh., vol. 24, pp. 781–784, 1963.
- (1963) Automat. Telemekh. , vol.24 , pp. 781-784
- Stefanyuk, V.L.¹

29
- 84939320230
- One example of a game for many identical automata
- S. L. Ginsburg, V. Y. Krylov, and M. L. Tsetlin, “One example of a game for many identical automata,” Automat. Telemekh., vol. 25, pp. 668–671, 1964.
- (1964) Automat. Telemekh. , vol.25 , pp. 668-671
- Ginsburg, S.L.¹ Krylov, V.Y.² Tsetlin, M.L.³

30
- 84939369952
- Collective behavior and control problems
- V. I. Varshavskii, “Collective behavior and control problems,” in Machine Intelligence, 3, D. Michie, Ed. Edinburgh: Edinburgh Univ., 1968.
- (1968) Machine Intelligence , vol.3
- Varshavskii, V.I.¹

31
- 0015534574
- Competitive and cooperative games of variable-structure stochastic automata
- R. Viswanathan and K. S. Narendra, “Competitive and cooperative games of variable-structure stochastic automata,” J. Cybern., vol. 3, pp. 1–23, 1973.
- (1973) J. Cybern. , vol.3 , pp. 1-23
- Viswanathan, R.¹ Narendra, K.S.²

32
- 0016483919
- On the learning behavior of stochastic automata under a nonstationary random environment
- N. Baba and Y. Sawaragi, “On the learning behavior of stochastic automata under a nonstationary random environment,” IEEE Trans. Syst., Man, Cybern., vol. SMC-5, pp. 273–275, 1975.
- (1975) IEEE Trans. Syst., Man, Cybern. , vol.SMC-5 , pp. 273-275
- Baba, N.¹ Sawaragi, Y.²

33
- 84939351145
- A new approach to the design of reinforcement schemes for learning automata
- M. A. L. Thathachar and P. S. Sastry, “A new approach to the design of reinforcement schemes for learning automata,” Dept. Elec. Eng., Ind. Inst. Sci., Bangalore, India, Tech. Rep. EE/60, 1983.
- (1983)
- Thathachar, M.A.L.¹ Sastry, P.S.²

34
- 0000446786
- On expediency and convergence in variable-structure automata
- B. Chandrasekaran and D. W. C. Shen, “On expediency and convergence in variable-structure automata,” IEEE Trans. Syst. Sci. Cybern., vol. SSC-4, pp. 52–60, 1968.
- (1968) IEEE Trans. Syst. Sci. Cybern. , vol.SSC-4 , pp. 52-60
- Chandrasekaran, B.¹ Shen, D.W.C.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.