SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 2063, Issue , 2001, Pages 133-150

Chess neighborhoods, function combination, and reinforcement learning

(2) Levinson, Robert a Weber, Ryan a

a UNIVERSITY OF CALIFORNIA (United States)

Author keywords

Computer chess; Exponentiated gradient; Gradient descent; Linear regression; Multi layer neural nets; Reinforcement learning; Temporal difference learning; Value function approximation

Indexed keywords

COMPUTER GAMES; GRAPHIC METHODS; LEARNING SYSTEMS; LINEAR REGRESSION;

COMPUTER CHESS; GRADIENT DESCENT; GRAPH REPRESENTATION; GRAPH-BASED MODELING; LEARNING EXPERIENCES; PRIOR KNOWLEDGE; TEMPORAL DIFFERENCE LEARNING; VALUE FUNCTION APPROXIMATION;

REINFORCEMENT LEARNING;

EID: 84898646291 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/3-540-45579-5_9 Document Type: Conference Paper

Times cited : (6)

References (29)

1
- 84958768821
- New Advances in Adaptive Pattern-Oriented Chess
- H.J. van den Herik and J. W.H., pp. 312-233 Uiterwijk, Universiteit Maastricht, The Netherlands
- Allen, J., Hamilton, E., and Levinson, R. New Advances in Adaptive Pattern-Oriented Chess (1997). In H.J. van den Herik and J. W.H., Uiterwijk. Advances in Computer Chess 8, pp. 312-233., Universiteit Maastricht, The Netherlands.
- (1997) Advances in Computer Chess , vol.8 , pp. 312-233
- Allen, J.¹ Hamilton, E.² Levinson, R.³

2
- 0002882372
- A chess program that learns by combining TD(λ) with game tree search
- Madision, WI, Morgan Kaufmann
- th International Conference on Machine Learning (ICML-98), pages 28-36. Madision, WI. 1998. Morgan Kaufmann.
- (1998) th International Conference on Machine Learning (ICML-98) , pp. 28-36
- Baxter, J.¹ Tridgell, A.² Weaver, L.³

3
- 0004118921
- Cambridge: MIT Press
- Ballard, D. H. An Introduction to Natural Computation. Cambridge: MIT Press.
- An Introduction to Natural Computation
- Ballard, D.H.¹

4
- 0003140349
- Random Evaluation in Chess
- A
- Beal, D. F., & Smith, M.C. (1994). Random Evaluation in Chess. ICCA Journal, Vol. 17, No. 1, pp. 3-9 (A).
- (1994) ICCA Journal , vol.17 , Issue.1 , pp. 3-9
- Beal, D.F.¹ Smith, M.C.²

5
- 0004502426
- Learning Piece Values Using Temporal Differences
- September
- Beal, D. F., & Smith, M.C. Learning Piece Values Using Temporal Differences. Journal of The International Computer Chess Association, September 1997.
- (1997) Journal of The International Computer Chess Association
- Beal, D.F.¹ Smith, M.C.²

6
- 56349117542
- First results from using temporal difference learning in Shogi
- H. J. van den Herik and H. Iida, editors, volume 1558 of Lecture Notes in Computer Science, Tsukuba, Japan, Springer-Verlag
- Beal, D. F., & Smith, M.C. First results from using temporal difference learning in Shogi. In H. J. van den Herik and H. Iida, editors, Proceedings of the First International Conference on Computers and Games (CG-98), volume 1558 of Lecture Notes in Computer Science, page 114, Tsukuba, Japan, 1998. Springer-Verlag.
- (1998) Proceedings of the First International Conference on Computers and Games (CG-98) , pp. 114
- Beal, D.F.¹ Smith, M.C.²

7
- 84948176317
- Oxford Univ. Press, ISBN 0-19-853864-2
- Bishop, Christopher M. Neural Networks for Pattern Recognition, Oxford Univ. Press, 1998. ISBN 0-19-853864-2.
- (1998) Neural Networks for Pattern Recognition
- Christopher, M.B.¹

8
- 0001771345
- Linear least-squares algorithms for temporal difference learning
- Bradtke, S. J., and Barto, A. G. (1996). Linear least-squares algorithms for temporal difference learning. Machine Learning, 22, 33-57.
- (1996) Machine Learning , vol.22 , pp. 33-57
- Bradtke, S.J.¹ Barto, A.G.²

9
- 85168770830
- A unified theory of heuristic evaluation functions and its applications to learning
- Christensen, J. and Korf, R. (1986). A unified theory of heuristic evaluation functions and its applications to learning. Proceedings of AAAI-86 (pp. 148-152).
- (1986) Proceedings of AAAI-86 , pp. 148-152
- Christensen, J.¹ Korf, R.²

10
- 0007943864
- Machine learning in computer chess: The next generation
- September
- Fürnkranz, J., Machine learning in computer chess: The next generation. International Computer Chess Association Journal, 19(3): 147-160, September (1996).
- (1996) International Computer Chess Association Journal , vol.19 , Issue.3 , pp. 147-160
- Fürnkranz, J.¹

11
- 5844312285
- Ph.D thesis. University of California, San Diego. San Diego, CA
- Gherrity, M. A Game-Learning Machine. Ph.D thesis. University of California, San Diego. San Diego, CA. 1993.
- (1993) A Game-Learning Machine
- Gherrity, M.¹

12
- 0346641007
- Worst-case loss bounds for sigmoided linear neurons
- MIT Press, Cambridge, MA
- Helmbold, D. P., Kivinen, J., and Warmuth, M. K. (1996a), Worst-case loss bounds for sigmoided linear neurons, in "Advances in Neural Information Processing Systems 8," MIT Press, Cambridge, MA.
- (1996) Advances in Neural Information Processing Systems 8
- Helmbold, D.P.¹ Kivinen, J.² Warmuth, M.K.³

13
- 84969385265
- A New Research Scope
- Herik, H.J. van den. A New Research Scope. International Computer Chess Association Journal 21(4), 1998.
- (1998) International Computer Chess Association Journal , vol.21 , Issue.4
- Herik, H.J.V.D.¹

14
- 84969380755
- Additive versus exponentiated gradient updates for linear prediction
- Kivinen, J. and Warmuth, M. K. Additive versus exponentiated gradient updates for linear prediction. Information and Computation. Vol. 2, pp. 285-318, 1998.
- (1998) Information and Computation , vol.2 , pp. 285-318
- Kivinen, J.¹ Warmuth, M.K.²

15
- 0007894587
- Adaptive pattern-oriented chess
- L. Birnbaum and G. Collins (Eds.), Morgan Kaufmann
- th International Workshop on Machine Learning, pp. 85-89, Morgan Kaufmann.
- (1991) th International Workshop on Machine Learning , pp. 85-89
- Levinson, R.A.¹ Snyder, R.²

16
- 5844322623
- International Computer Chess Association Journal, September
- Levinson, R. A., and Snyder, R., "Distance: Towards the Unification of Chess Knowledge", International Computer Chess Association Journal 16(3): 123-136, September 1993.
- (1993) Distance: Towards the Unification of Chess Knowledge , vol.16 , Issue.3 , pp. 123-136
- Levinson, R.A.¹ Snyder, R.²

17
- 84958795986
- Pattern-level Temporal Difference Learning, Data Fusion, and Chess
- Sensor Fusion: Architectures, Algorithms, and Applications IV
- th Annual Conference on Aerospace/Defense Sensing and Controls: Sensor Fusion: Architectures, Algorithms, and Applications IV.
- (2000) th Annual Conference on Aerospace/Defense Sensing and Controls
- Levinson, R.A.¹ Weber, R.J.²

18
- 0001928981
- On-line learning of linear functions
- Littlestone, N., Long, P.M., and Warmuth, M. K. (1995), On-line learning of linear functions, Journal of Computational Complexity 5, 1-23.
- (1995) Journal of Computational Complexity , vol.5 , pp. 1-23
- Littlestone, N.¹ Long, P.M.² Warmuth, M.K.³

19
- 0003961852
- Addison-Wesley, Reading, Massachusetts
- Pearl, J. (1984). Heuristics: Intelligent Search Strategies for Computer Problem Solving. Addison-Wesley, Reading, Massachusetts.
- (1984) Heuristics: Intelligent Search Strategies for Computer Problem Solving
- Pearl, J.¹

20
- 84969406622
- Pellen, Luke. Neural net chess program Octavius: http://home.seol.net.au/luke/Octavius (1999).
- (1999) Neural net chess program Octavius
- Pellen, L.¹

21
- 0001201756
- Some studies in machine learning using the game of checkers
- Samuel, A. (1959). Some studies in machine learning using the game of checkers. IBM J. of Research and Development, 3, 210-229.
- (1959) IBM J. of Research and Development , vol.3 , pp. 210-229
- Samuel, A.¹

22
- 84958778241
- Swarthmore College, Swarthmore, PA
- Scott, J. Machine Learning in Games: the Morph Project, Swarthmore College, Swarthmore, PA. http://forum.swarthmore.edu/~jay/learn-game/projects/morph.html.
- Machine Learning in Games: The Morph Project
- Scott, J.¹

23
- 77956735234
- A chess program that uses its transposition table to learn from experience
- Slate, D.J., A chess program that uses its transposition table to learn from experience. International Computer Chess Association Journal 10(2): 59-71, 1987.
- (1987) International Computer Chess Association Journal , vol.10 , Issue.2 , pp. 59-71
- Slate, D.J.¹

24
- 33847202724
- Learning to predict by the methods of temporal differences
- Sutton, R. S. (1988). Learning to predict by the methods of temporal differences. Machine Learning, 3, 9-44.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

25
- 0004102479
- Cambridge: MIT Press
- Sutton, R. S., & Barto, A.G. (1998). Reinforcement Learning: An Introduction. Cambridge: MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

26
- 0029276036
- Temporal Difference Learning and TD-Gammon
- Tesauro, G. Temporal Difference Learning and TD-Gammon. Communications of the ACM, Vol 38, No 3, March 1995.
- (1995) Communications of the ACM , vol.38 , Issue.3
- Tesauro, G.¹

27
- 0001046225
- Practical Issues in Temporal Difference Learning
- Tesauro, G. Practical Issues in Temporal Difference Learning. Machine Learning, 8: 257-278, 1992.
- (1992) Machine Learning , vol.8 , pp. 257-278
- Tesauro, G.¹

28
- 0003215153
- Learning to Play the Game of Chess
- G. Tesauro, D. Touretzky, and T. Leen (eds.), MIT Press
- Thrun, S., 1995. Learning to Play the Game of Chess. In Advances in Neural Information Processing Systems (NIPS) 7, G. Tesauro, D. Touretzky, and T. Leen (eds.), MIT Press.
- (1995) Advances in Neural Information Processing Systems (NIPS) 7
- Thrun, S.¹

29
- 0004113431
- Prentice Hall, Engel-wood Cliffs, NJ
- Widrow, B., and Stearns, S. (1985), "Adaptive Signal Processing," Prentice Hall, Engel-wood Cliffs, NJ.
- (1985) Adaptive Signal Processing
- Widrow, B.¹ Stearns, S.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.