SCOPUS 정보 검색 플랫폼

Volumn 52, Issue 7, 2007, Pages 1349-1355

Recursive learning automata approach to Markov decision processes

Author keywords

Learning automata; Markov decision process (MDP); Sampling

Indexed keywords

FINITE ELEMENT METHOD; MARKOV PROCESSES; OPTIMIZATION; RANDOM PROCESSES; RECURSIVE FUNCTIONS;

FINITE-TIME ANALYSIS; MARKOV DECISION PROCESSES; RECURSIVE AUTOMATA SAMPLING ALGORITHM (RASA);

LEARNING ALGORITHMS;

EID: 34547108579 PISSN: 00189286 EISSN: None Source Type: Journal
DOI: 10.1109/TAC.2007.900859 Document Type: Article

Times cited : (11)

References (18)

1
- 0036682894
- A reinforcement learning approach to automatic generation control
- T. P. I. Ahamed, P. S. N. Rao, and P. S. Sastry, "A reinforcement learning approach to automatic generation control," Electric Power Syst. Res., vol. 63, pp. 9-26, 2002.
- (2002) Electric Power Syst. Res , vol.63 , pp. 9-26
- Ahamed, T.P.I.¹ Rao, P.S.N.² Sastry, P.S.³

2
- 0013535965
- Infinite-horizon policy-gradient estimation
- J. Baxter and P. L. Bartlett, "Infinite-horizon policy-gradient estimation," J. Artif. Intell. Res., vol. 15, pp. 319-350, 2001.
- (2001) J. Artif. Intell. Res , vol.15 , pp. 319-350
- Baxter, J.¹ Bartlett, P.L.²

3
- 0003487482
- Belmont, MA: Athena Scientific
- D. P. Bertsekas and J. N. Tsitsiklis, Neuro-Dynamic Programming. Belmont, MA: Athena Scientific, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

4
- 14644444172
- An adaptive sampling algorithm for solving Markov decision processes
- H. S. Chang, M. C. Fu, J. Hu, and S. I. Marcus, "An adaptive sampling algorithm for solving Markov decision processes," Operat. Res., vol. 53, no. 1, pp. 126-139, 2005.
- (2005) Operat. Res , vol.53 , Issue.1 , pp. 126-139
- Chang, H.S.¹ Fu, M.C.² Hu, J.³ Marcus, S.I.⁴

5
- 34547120053
- London, U.K, Springer-Verlag
- H. S. Chang, M. C. Fu, J. Hu, and S. I. Marcus, Simulation-Based Algorithms for Markov Decision Processes. London, U.K.: Springer-Verlag, 2007.
- (2007) Simulation-Based Algorithms for Markov Decision Processes
- Chang, H.S.¹ Fu, M.C.² Hu, J.³ Marcus, S.I.⁴

7
- 0003952172
- New York: Springer-Verlag
- O. Hernácndez-Lerma, Adaptive Markov Control Processes. New York: Springer-Verlag, 1989.
- (1989) Adaptive Markov Control Processes
- Hernácndez-Lerma, O.¹

8
- 34547118450
- Simulation-based uniform value function estimates of Markov decision processes
- R. Jain and P. Varaiya, "Simulation-based uniform value function estimates of Markov decision processes," SIAM J. Control Optim., vol. 45, no. 5, pp. 1633-1656, 2006.
- (2006) SIAM J. Control Optim , vol.45 , Issue.5 , pp. 1633-1656
- Jain, R.¹ Varaiya, P.²

9
- 0004139304
- New York: Marcel Dekker
- A. S. Poznyak, K. Najim, and E. Gomez-Ramirez, Self-Learning Control of Finite Markov Chains. New York: Marcel Dekker, 2000.
- (2000) Self-Learning Control of Finite Markov Chains
- Poznyak, A.S.¹ Najim, K.² Gomez-Ramirez, E.³

10
- 85102627959
- New York: Wiley
- M. L. Puterman, Markov Decision Processes: Discrete Stochastic Dynamic Programming. New York: Wiley, 1994.
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

14
- 2942609194
- New York: Springer-Verlag
- P. S. Sastry and M. A. L. Thathachar, Networks of Learning Automata: Techniques for Online Stochastic Optimization. New York: Springer-Verlag, 2003.
- (2003) Networks of Learning Automata: Techniques for Online Stochastic Optimization
- Sastry, P.S.¹ Thathachar, M.A.L.²

15
- 0004225404
- 2nd ed. New York: Springer-Verlag
- A. N. Shiryaev, Probability, 2nd ed. New York: Springer-Verlag, 1995.
- (1995) Probability
- Shiryaev, A.N.¹

16
- 0004102479
- Cambridge, MA: MIT Press
- R. Sutton and A. Barto, Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.