SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

IEICE Transactions on Information and Systems

Volumn E82-D, Issue 12, 1999, Pages 1618-1626

Strategy acquisition for the game "othello" based on reinforcement learning

(3) Yoshioka, Taku a Ishii, Shin a Ito, Minoru a

a NARA INSTITUTE OF SCIENCE AND TECHNOLOGY (Japan)

Author keywords

Min max strategy; Normalized gaussian network; Othello; Reinforcement learning

Indexed keywords

ALGORITHMS; APPROXIMATION THEORY; KNOWLEDGE REPRESENTATION; LEARNING SYSTEMS; MATHEMATICAL MODELS; NEURAL NETWORKS; TABLE LOOKUP;

MIN-MAX STRATEGY; NORMALIZED GAUSSIAN NETWORK; OTHELLO; REINFORCEMENT LEARNING; TEMPORAL DIFFERENCE ERROR;

KNOWLEDGE ACQUISITION;

EID: 0033342921 PISSN: 09168532 EISSN: None Source Type: Journal
DOI: None Document Type: Article

Times cited : (26)

References (20)

1
- 0029679044
- 1996.
- L.P. Kaclbling, M.L. Littman, and A.W. Moore, "Reinforcement learning: A survey," J. Artificial Intell. Res., vol.4, pp.237-285, 1996.
- M.L. Littman, and A.W. Moore, "Reinforcement Learning: a Survey," J. Artificial Intell. Res., Vol.4, Pp.237-285
- Kaclbling, L.P.¹

2
- 0029276036
- 1995.
- G. Tesauro, "Temporal difference learning and TD-Gammon," Commun. ACM, vol.38, pp.58-68, 1995.
- "Temporal Difference Learning and TD-Gammon," Commun. ACM, Vol.38, Pp.58-68
- Tesauro, G.¹

3
- 85027101010
- 1998.
- J. Baxter, A. Tridgell, and L. Weaver, "A chess program that learns by combining TD(A) with game-tree search," Proc. 15th Int. Conf. Machine Learning, pp.28-36, 1998.
- A. Tridgell, and L. Weaver, "A Chess Program that Learns by Combining TD(A) with Game-tree Search," Proc. 15th Int. Conf. Machine Learning, Pp.28-36
- Baxter, J.¹

4
- 85027114828
- 1994.
- N.N. Schraudolph, P. Dayan, and T.J. Sejnowski, "Temporal difference learning of position evaluation in the game of Go," Adv. Neural Inf. Proc. Syst., vol.6, pp.817-824, 1994.
- P. Dayan, and T.J. Sejnowski, "Temporal Difference Learning of Position Evaluation in the Game of Go," Adv. Neural Inf. Proc. Syst., Vol.6, Pp.817-824
- Schraudolph, N.N.¹

5
- 33847202724
- 1988.
- R.S. Sutton, "Learning to predict by the methods of temporal difference," Machine Learning, vol.3, pp.9-44, 1988.
- "Learning to Predict by the Methods of Temporal Difference," Machine Learning, Vol.3, Pp.9-44
- Sutton, R.S.¹

6
- 58049085683
- 1989.
- J. Moody and C.J. Darken, "Fast learning in networks of locally-tuned processing units," Neural Computation, vol.1, pp.281-294, 1989.
- "Fast Learning in Networks of Locally-tuned Processing Units," Neural Computation, Vol.1, Pp.281-294
- Moody, J.¹ Darken, C.J.²

7
- 85027113272
- 1998.
- T. Yoshioka, S. Ishii, and M. Ito, "Strategy acquisition of the game "Othello" based on reinforcement learning," Int. Conf. Neural Info. Proc., pp.841-844, 1998.
- S. Ishii, and M. Ito, "Strategy Acquisition of the Game "Othello" Based on Reinforcement Learning," Int. Conf. Neural Info. Proc., Pp.841-844
- Yoshioka, T.¹

8
- 79957749002
- 1995.
- M.E. Harmon, L.C. Baird, and A.H. Klopf, "Reinforcement learning applied to a differential game," Adaptive Behavior, vol.4, no.l, 1995.
- L.C. Baird, and A.H. Klopf, "Reinforcement Learning Applied to a Differential Game," Adaptive Behavior, Vol.4, No.l
- Harmon, M.E.¹

9
- 34249833101
- 1992.
- C.J.C.H. Watkins and P. Dayan, "Q-Learning," Machine Learning, vol.8, pp.279-292, 1992.
- "Q-Learning," Machine Learning, Vol.8, Pp.279-292
- Watkins, C.J.C.¹ Dayan, P.²

10
- 0004102479
- 1998.
- R.S. Sutton and R.S. Barto, Reinforcement learning: An introduction, MIT Press, 1998.
- Reinforcement Learning: an Introduction, MIT Press
- Sutton, R.S.¹ Barto, R.S.²

11
- 85027132093
- 1996.
- A. Leouski and P. Utgoff, "What a neural network can learn about Othello," Technical Report, 90-10, University of Massachusetts, Amherst, 1996.
- "What a Neural Network Can Learn about Othello," Technical Report, 90-10, University of Massachusetts, Amherst
- Leouski, A.¹ Utgoff, P.²

12
- 85027096662
- 1998.
- T. Yoshioka and S. Ishii, "Learning of an evaluation function of the game Othello by EM algorithm," IEICE Technical Report, NC98-41, 1998.
- "Learning of an Evaluation Function of the Game Othello by EM Algorithm," IEICE Technical Report, NC98-41
- Yoshioka, T.¹ Ishii, S.²

13
- 0025419413
- 1990.
- K.F. Lee and S. Mahajan, "The development of a world class Othello program," Artificial Intelligence, vol.43, pp.21-36, 1990.
- "The Development of a World Class Othello Program," Artificial Intelligence, Vol.43, Pp.21-36
- Lee, K.F.¹ Mahajan, S.²

14
- 85027158462
- 1995.
- M. Büro, "Statistical feature combination for the evaluation of game positions," J. Artificial Intell. Res., vol.3, pp.373382, 1995.
- "Statistical Feature Combination for the Evaluation of Game Positions," J. Artificial Intell. Res., Vol.3, Pp.373382
- Büro, M.¹

15
- 85027135771
- 1995.
- J.A. Boyan and A.W. Moore, "Generalization in reinforcement learning: Safely approximation the value function," Advances in Neural Information Processing Systems 7, pp.369-376, MIT Press, 1995.
- "Generalization in Reinforcement Learning: Safely Approximation the Value Function," Advances in Neural Information Processing Systems 7, Pp.369-376, MIT Press
- Boyan, J.A.¹ Moore, A.W.²

16
- 0038501238
- 1996.
- S. Schaal and C.C. Atkeson, "From isolation to cooperation: An alternative view of a system of experts," Advances in Neural Information Processing Systems 8, pp.605-611, MIT Press, 1996.
- "From Isolation to Cooperation: an Alternative View of a System of Experts," Advances in Neural Information Processing Systems 8, Pp.605-611, MIT Press
- Schaal, S.¹ Atkeson, C.C.²

17
- 0032312876
- 1998.
- J. Morimoto and K. Doya, "Reinforcement learning of dynamic motor sequence: learning to stand up," Proc. IEEE/RSJ Int. Conf. Intell. Robots & Syst., vol.3, pp.17211726, 1998.
- "Reinforcement Learning of Dynamic Motor Sequence: Learning to Stand Up," Proc. IEEE/RSJ Int. Conf. Intell. Robots & Syst., Vol.3, Pp.17211726
- Morimoto, J.¹ Doya, K.²

18
- 0002997066
- 1999.
- M. Sato and S. Ishii, "Reinforcement learning based on online EM algorithm," Advances in Neural Information Processing Systems 11, pp.1052-1058, MIT Press, 1999.
- "Reinforcement Learning Based on Online EM Algorithm," Advances in Neural Information Processing Systems 11, Pp.1052-1058, MIT Press
- Sato, M.¹ Ishii, S.²

19
- 85027126627
- 1998.
- S. Ishii and M. Sato, "On-line EM algorithm and reinforcement learning," Int. Conf. Artificial Neural Networks, pp.1127-1132, 1998.
- "On-line EM Algorithm and Reinforcement Learning," Int. Conf. Artificial Neural Networks, Pp.1127-1132
- Ishii, S.¹ Sato, M.²

20
- 85027197301
- 1998.
- M. Sato and S. Ishii, "On-line EM algorithm for mixture of local experts," Proc. Fifth Int. Conf. Neural Inf. Proc., vol.3, pp.1397-1401, 1998.
- "On-line EM Algorithm for Mixture of Local Experts," Proc. Fifth Int. Conf. Neural Inf. Proc., Vol.3, Pp.1397-1401
- Sato, M.¹ Ishii, S.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.