SCOPUS 정보 검색 플랫폼

Volumn 121, Issue 1, 2000, Pages 31-47

Near-optimal polynomial time algorithm for learning in certain classes of stochastic games

Author keywords

[No Author keywords available]

Indexed keywords

DECISION THEORY; GAME THEORY; LEARNING ALGORITHMS; LEARNING SYSTEMS; MARKOV PROCESSES; POLYNOMIALS;

MULTI-AGENT SYSTEMS; POLYNOMIAL TIME LEARNING SYSTEMS; STOCHASTIC GAMES;

ARTIFICIAL INTELLIGENCE;

EID: 0034247018 PISSN: 00043702 EISSN: None Source Type: Journal
DOI: 10.1016/S0004-3702(00)00039-4 Document Type: Article

Times cited : (31)

References (10)

1
- 0003430191
- New York: Wiley
- Alon N., Spencer J.H., Erdos P. The Probabilistic Method. 1992;Wiley, New York.
- (1992) The Probabilistic Method
- Alon, N.¹ Spencer, J.H.² Erdos, P.³

2
- 0000182415
- A measure of asymptotic efficiency for tests of a hypothesis based on the sum of observations
- Chernoff H. A measure of asymptotic efficiency for tests of a hypothesis based on the sum of observations. Ann. Math. Statist. Vol. 23:1952;493-509.
- (1952) Ann. Math. Statist. , vol.23 , pp. 493-509
- Chernoff, H.¹

4
- 0000929496
- Multiagent reinforcement learning: Theoretical framework and an algorithm
- Hu J., Wellman M.P. Multiagent reinforcement learning: Theoretical framework and an algorithm. Proc. 15th International Conference on Machine Learning (ICML-98), Madison, WI. 1998.
- (1998) Proc. 15th International Conference on Machine Learning (ICML-98), Madison, WI
- Hu, J.¹ Wellman, M.P.²

5
- 0012257655
- Near-optimal reinforcement learning in polynomial time
- Kearns M., Singh S. Near-optimal reinforcement learning in polynomial time. Proc. 15th International Conference on Machine Learning (ICML-98), Madison, WI. 1998.
- (1998) Proc. 15th International Conference on Machine Learning (ICML-98), Madison, WI
- Kearns, M.¹ Singh, S.²

6
- 85149834820
- Markov games as a framework for multi-agent reinforcement learning
- Littman M.L. Markov games as a framework for multi-agent reinforcement learning. Proc. 11th International Conference on Machine Learning, New Brunswick, NJ. 1994;157-163.
- (1994) Proc. 11th International Conference on Machine Learning, New Brunswick, NJ , pp. 157-163
- Littman, M.L.¹

7
- 0041166779
- Dynamic non-Bayesian decision-making
- Monderer D., Tennenholtz M. Dynamic non-Bayesian decision-making. J. Artificial Intelligence Res. Vol. 7:1997;231-248.
- (1997) J. Artificial Intelligence Res. , vol.7 , pp. 231-248
- Monderer, D.¹ Tennenholtz, M.²

8
- 0019537014
- An order field property for stochastic games when one player controls transition probabilities
- Parthasarathy T., Raghavan T.E.S. An order field property for stochastic games when one player controls transition probabilities. J. Optim. Theory Appl. Vol. 33:1981;375-392.
- (1981) J. Optim. Theory Appl. , vol.33 , pp. 375-392
- Parthasarathy, T.¹ Raghavan, T.E.S.²

9
- 0000392613
- Stochastic games
- Shapley L.S. Stochastic games. Proc. Nat. Acad. Sci. USA. Vol. 39:1953;1095-1100.
- (1953) Proc. Nat. Acad. Sci. USA , vol.39 , pp. 1095-1100
- Shapley, L.S.¹

10
- 0009656414
- Linear programming and undiscounted stochastic game in which one player controls transitions
- Vrieze O.J. Linear programming and undiscounted stochastic game in which one player controls transitions. OR Spektrum. Vol. 3:1981;29-35.
- (1981) OR Spektrum , vol.3 , pp. 29-35
- Vrieze, O.J.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.