메뉴 건너뛰기




Volumn 31, Issue 6, 1986, Pages 519-526

Decentralized Learning in Finite Markov Chains

Author keywords

[No Author keywords available]

Indexed keywords

CONTROL SYSTEMS, OPTIMAL; CONTROL SYSTEMS, STOCHASTIC; DECISION THEORY AND ANALYSIS; PROBABILITY - RANDOM PROCESSES;

EID: 0022738693     PISSN: 00189286     EISSN: 15582523     Source Type: Journal    
DOI: 10.1109/TAC.1986.1104342     Document Type: Article
Times cited : (86)

References (34)
  • 4
    • 0002807389 scopus 로고
    • Estimation and control in Markov chains
    • P. Mandl, “Estimation and control in Markov chains,” Adv. App!. Prob., vol. 6, 40–60, 1974.
    • (1974) Adv. App. Prob. , vol.6 , pp. 40-60
    • Mandl, P.1
  • 5
    • 0018678571 scopus 로고
    • Adaptive control of Markov chains, I: Finite parameter set
    • V. Borkar and P. Varaiya, “Adaptive control of Markov chains, I: Finite parameter set,” IEEE Trans. Automat. Contr., vol. AC-24, pp. 953–958, 1979.
    • (1979) IEEE Trans. Automat. Contr. , vol.AC-24 , pp. 953-958
    • Borkar, V.1    Varaiya, P.2
  • 6
    • 0020166640 scopus 로고
    • Optimal adaptive controllers for unknown Markov chains
    • P. R. Kumar and W. Lin, “Optimal adaptive controllers for unknown Markov chains,” IEEE Trans. Automat. Contr., vol. AC-27, pp. 765–774, 1982.
    • (1982) IEEE Trans. Automat. Contr. , vol.AC-27 , pp. 765-774
    • Kumar, P.R.1    Lin, W.2
  • 7
    • 50549213583 scopus 로고
    • Optimal control of Markov processes with incomplete state information
    • K. Astrom, “Optimal control of Markov processes with incomplete state information,- J. Math. Anal. App!., vol. 10, pp. 174–205, 1965.
    • (1965) J. Math. Anal. App. , vol.10 , pp. 174-205
    • Astrom, K.1
  • 8
    • 0020113091 scopus 로고
    • Decentralized control of finite state Markov processes
    • K. Hsu and S. I. Marcus, “Decentralized control of finite state Markov processes,” IEEE Trans. Automat. Contr., vol. AC-27, pp. 426–431, 431, 1982.
    • (1982) IEEE Trans. Automat. Contr. , vol.AC-27 , pp. 426-431
    • Hsu, K.1    Marcus, S.I.2
  • 9
    • 0001700171 scopus 로고
    • A Markovian decision process
    • R. E. Bellman, “A Markovian decision process,” J. Math. Mech., vol. 6, pp. 679–684, 1957.
    • (1957) J. Math. Mech. , vol.6 , pp. 679-684
    • Bellman, R.E.1
  • 10
    • 0003122592 scopus 로고
    • Dynamic programming, Markov chains, and the method of successive approximations
    • D. J. White, “Dynamic programming, Markov chains, and the method of successive approximations,” J. Math. Anal. Appl., vol. 6, pp. 373–376, 1963.
    • (1963) J. Math. Anal. Appl. , vol.6 , pp. 373-376
    • White, D.J.1
  • 11
    • 0017980998 scopus 로고
    • Optimal and suboptimal stationary controls for Markov chains
    • P. Varaiya, “Optimal and suboptimal stationary controls for Markov chains,” IEEE Trans. Automat. Contr., vol. AC-23, pp. 388–394, 1978.
    • (1978) IEEE Trans. Automat. Contr. , vol.AC-23 , pp. 388-394
    • Varaiya, P.1
  • 12
    • 0017961288 scopus 로고
    • Multi layer control of large Markov chains
    • J.-P. Forestier and P. Varaiya, “Multi layer control of large Markov chains,” IEEE Trans. Automat. Contr., vol. AC-23, pp. 298–305, 1978.
    • (1978) IEEE Trans. Automat. Contr. , vol.AC-23 , pp. 298-305
    • Forestier, J.-P.1    Varaiya, P.2
  • 13
    • 0019527937 scopus 로고
    • Recursive algorithms for adaptive control of finite Markov chains
    • Y. M. El-Fattah, “Recursive algorithms for adaptive control of finite Markov chains,” IEEE Trans. Syst., Man, Cybern., vol. SMC-1l, pp. 135–144, 1981.
    • (1981) IEEE Trans. Syst., Man, Cybern., vol. SMC-1l , pp. 135-144
    • El-Fattah, Y.M.1
  • 14
    • 0020114278 scopus 로고
    • earning control of finite Markov chains with unknown transition probabilities
    • M. Sato, K. Abe, and H. Takeda, Learning control of finite Markov chains with unknown transition probabilities,” IEEE Trans. Automat. Contr., vol. AC-27, 502–505, 1982.
    • (1982) IEEE Trans. Automat. Contr. , vol.AC-27 , pp. 502-505
    • Sato, M.1    Abe, K.2    Takeda, H.3
  • 23
    • 0016926517 scopus 로고
    • Absolute expediency of Q- and 5- model learning algorithms
    • S. Lakshmivarahan and M. A. L. Thathachar, “Absolute expediency of Q- and 5- model learning algorithms,” Trans. Syst., Man, Cybern., vol. SMC-6, pp. 222–226, 1976.
    • (1976) Trans. Syst., Man, Cybern. , vol.SMC-6 , pp. 222-226
    • Lakshmivarahan, S.1    Thathachar, M.A.L.2
  • 24
    • 0022099763 scopus 로고
    • Learning models for decentralized decision making
    • R. M. Wheeler, Jr. and K. S. Narendra, “Learning models for decentralized decision making,” Automatica, vol. 21, pp. 479–484, 1985.
    • (1985) Automatica , vol.21 , pp. 479-484
    • Wheeler, R.M.1    Narendra, K.S.2
  • 25
    • 0000268071 scopus 로고
    • Learning algorithms for two-person person zero-sum stochastic games with incomplete information
    • S. Lakshmivarahan and K. S. Narendra, “Learning algorithms for two-person person zero-sum stochastic games with incomplete information,” Math. Oper. Res., vol. 6, pp. 379–386, 1981.
    • (1981) Math. Oper. Res. , vol.6 , pp. 379-386
    • Lakshmivarahan, S.1    Narendra, K.S.2
  • 26
    • 0020159814 scopus 로고
    • Learning algorithms for two-person person zero-sum stochastic games with incomplete information: A unified approach
    • S. Lakshmivarahan and K. S. Narendra, “Learning algorithms for two-person person zero-sum stochastic games with incomplete information: A unified approach,” SIAM J. Contr. and Opt., vol. 20, pp. 541–552, 1982.
    • (1982) SIAM J. Contr. and Opt. , vol.20 , pp. 541-552
    • Lakshmivarahan, S.1    Narendra, K.S.2
  • 27
    • 0002021736 scopus 로고
    • Equilibrium points in n-person games
    • J. F. Nash, “Equilibrium points in n-person games,” Proc. National Acad. Sci. USA, vol. 36, pp. 48–49, 1950.
    • (1950) Proc. National Acad. Sci. USA , vol.36 , pp. 48-49
    • Nash, J.F.1
  • 28
    • 0013550306 scopus 로고
    • Example of a problem in the joint behavior of two automata
    • V. L. Stefanyuk, “Example of a problem in the joint behavior of two automata,” Automat. Telemekh., vol. 24, pp. 781–784, 1963.
    • (1963) Automat. Telemekh. , vol.24 , pp. 781-784
    • Stefanyuk, V.L.1
  • 29
    • 84939320230 scopus 로고
    • One example of a game for many identical automata
    • S. L. Ginsburg, V. Y. Krylov, and M. L. Tsetlin, “One example of a game for many identical automata,” Automat. Telernekh., vol. 25, pp. 668–671, 1964.
    • (1964) Automat. Telernekh. , vol.25 , pp. 668-671
    • Ginsburg, S.L.1    Krylov, V.Y.2    Tsetlin, M.L.3
  • 30
    • 84939369952 scopus 로고
    • Collective behavior and control problems
    • V. I. Varshavsldi, “Collective behavior and control problems,” in Machine Intelligence, 3, D. Michie, Ed. Edinburgh: Edinburgh Univ., 1968.
    • (1968) Machine Intelligence , vol.3
    • Varshavsldi, V.I.1
  • 31
    • 0015534574 scopus 로고
    • Competitive and cooperative games of variable-structure stochastic automata
    • R. Viswanathan and K. S. Narendra, “Competitive and cooperative games of variable-structure stochastic automata,” J. Cybern., vol. 3, pp. 1–23, 1973.
    • (1973) J. Cybern. , vol.3 , pp. 1-23
    • Viswanathan, R.1    Narendra, K.S.2
  • 32
    • 0016483919 scopus 로고
    • On the learning behavior of stochastic automata under a nonstationary random environment
    • N. Baba and Y. Sawaragi, “On the learning behavior of stochastic automata under a nonstationary random environment,” IEEE Trans. Syst., Man, Cybern., vol. SMC-5, pp. 273–275, 1975.
    • (1975) IEEE Trans. Syst., Man, Cybern. , vol.SMC-5 , pp. 273-275
    • Baba, N.1    Sawaragi, Y.2
  • 33
    • 84939351145 scopus 로고
    • A new approach to the design of reinforcement schemes for learning automata
    • M. A. L. Thathachar and P. S. Sastry, “A new approach to the design of reinforcement schemes for learning automata,” Dept. Elec. Eng., Ind. Inst. Sci., Bangalore, India, Tech. Rep. EE/60, 1983.
    • (1983)
    • Thathachar, M.A.L.1    Sastry, P.S.2
  • 34
    • 0000446786 scopus 로고
    • On expediency and convergence in variable-structure automata
    • B. Chandrasekaran and D. W. C. Shen, “On expediency and convergence in variable-structure automata,” IEEE Trans. Syst. Sci. Cybern., vol. SSC-4, pp. 52–60, 1968.
    • (1968) IEEE Trans. Syst. Sci. Cybern. , vol.SSC-4 , pp. 52-60
    • Chandrasekaran, B.1    Shen, D.W.C.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.