SCOPUS 정보 검색 플랫폼

Journal of Artificial Intelligence Research

Volumn 30, Issue , 2007, Pages 659-684

Learning to play using low-complexity rule-based policies: Illustrations through Ms. Pac-Man

(2) Szita, István a Lorincz, András a

a EÖTVÖS LORÁND UNIVERSITY (Hungary)

Author keywords

[No Author keywords available]

Indexed keywords

COMBINATORIAL MATHEMATICS; COMPUTATIONAL COMPLEXITY; GAME THEORY; OPTIMIZATION;

COMBINATORIAL REINFORCEMENT LEARNING; DECISION LISTS; ENTROPY OPTIMIZED POLICIES;

LEARNING SYSTEMS;

EID: 38349162555 PISSN: None EISSN: 10769757 Source Type: Journal
DOI: 10.1613/jair.2368 Document Type: Article

Times cited : (55)

References (38)

1
- 17444384857
- Application of the cross-entropy method to the buffer allocation problem in a simulation-based environment
- Allon, G., Kroese, D. P., Raviv, T., & Rubinstein, R. Y. (2005). Application of the cross-entropy method to the buffer allocation problem in a simulation-based environment. Annals of Operations Research, 134, 137-151.
- (2005) Annals of Operations Research , vol.134 , pp. 137-151
- Allon, G.¹ Kroese, D.P.² Raviv, T.³ Rubinstein, R.Y.⁴

2
- 0040999713
- Toward a model of mind as a laissez-faire economy of idiots
- Baum, E. B. (1996). Toward a model of mind as a laissez-faire economy of idiots. In Proceedings of the 13rd International Conference on Machine Learning, pp. 28-36.
- (1996) Proceedings of the 13rd International Conference on Machine Learning , pp. 28-36
- Baum, E.B.¹

3
- 38349096103
- Machines that learn to play games, chap
- Nova Science Publishers, Inc
- Baxter, J., Tridgell, A., & Weaver, L. (2001). Machines that learn to play games, chap. Reinforcement learning and chess, pp. 91-116. Nova Science Publishers, Inc.
- (2001) Reinforcement learning and chess , pp. 91-116
- Baxter, J.¹ Tridgell, A.² Weaver, L.³

4
- 0003487482
- Athena Scientific
- Bertsekas, D. P., & Tsitsiklis, J. N. (1996). Neuro-Dynamic Programming. Athena Scientific.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

5
- 38349171751
- Bonet, J. S. D., & Stauffer, C. P. (1999). Learning to play Pac-Man using incremental reinforcement learning.. [Online; accessed 09 October 2006].
- Bonet, J. S. D., & Stauffer, C. P. (1999). Learning to play Pac-Man using incremental reinforcement learning.. [Online; accessed 09 October 2006].

6
- 27144439387
- Applications of Learning Classifier Systems, chap
- Springer
- Bull, L. (2004). Applications of Learning Classifier Systems, chap. Learning Classifier Systems: A Brief Introduction, pp. 3-13. Springer.
- (2004) Learning Classifier Systems: A Brief Introduction , pp. 3-13
- Bull, L.¹

7
- 27144439387
- Foundations of Learning Classifier Systems, chap
- Springer
- Bull, L., & Kovacs, T. (2005). Foundations of Learning Classifier Systems, chap. Foundations of Learning Classifier Systems: An Introduction, pp. 3-14. Springer.
- (2005) Foundations of Learning Classifier Systems: An Introduction , pp. 3-14
- Bull, L.¹ Kovacs, T.²

8
- 38349150853
- Courtillat, P. (2001). NoN-SeNS Pacman 1.6 with C sourcecode.. [Online; accessed 09 October 2006].
- Courtillat, P. (2001). NoN-SeNS Pacman 1.6 with C sourcecode.. [Online; accessed 09 October 2006].

9
- 38349116176
- Cross-entropic learning of a machine for the decision in a partially observable universe
- To appear
- Dambreville, F. (2006). Cross-entropic learning of a machine for the decision in a partially observable universe. Journal of Global Optimization. To appear.
- (2006) Journal of Global Optimization
- Dambreville, F.¹

10
- 17444409624
- A tutorial on the cross-entropy method
- de Boer, P.-T., Kroese, D. P., Mannor, S., & Rubinstein, R. Y. (2004). A tutorial on the cross-entropy method. Annals of Operations Research, 134, 19-67.
- (2004) Annals of Operations Research , vol.134 , pp. 19-67
- de Boer, P.-T.¹ Kroese, D.P.² Mannor, S.³ Rubinstein, R.Y.⁴

11
- 84901386407
- Gallagher, M., & Ryan, A. (2003). Learning to play pac-man: An evolutionary, rule-based approach. In et. al., R. S. (Ed.), Proc. Congress on Evolutionary Computation, pp. 2462-2469.
- Gallagher, M., & Ryan, A. (2003). Learning to play pac-man: An evolutionary, rule-based approach. In et. al., R. S. (Ed.), Proc. Congress on Evolutionary Computation, pp. 2462-2469.

12
- 0000746883
- Escaping brittleness: The possibilities of general-purpose learning algorithms applied to parallel rule-based systems
- Mitchell, Michalski, & Carbonell Eds, chap. 20, pp, Morgan Kaufmann
- Holland, J. H. (1986). Escaping brittleness: The possibilities of general-purpose learning algorithms applied to parallel rule-based systems. In Mitchell, Michalski, & Carbonell (Eds.), Machine Learning, an Artificial Intelligence Approach. Volume II, chap. 20, pp. 593-623. Morgan Kaufmann.
- (1986) Machine Learning, an Artificial Intelligence Approach , vol.2 , pp. 593-623
- Holland, J.H.¹

13
- 0036927460
- Sequence alignment by rare event simulation
- Keith, J., & Kroese, D. P. (2002). Sequence alignment by rare event simulation. In Proceedings of the 2002 Winter Simulation Conference, pp. 320-327.
- (2002) Proceedings of the 2002 Winter Simulation Conference , pp. 320-327
- Keith, J.¹ Kroese, D.P.²

14
- 0003882343
- MIT Press
- Koza, J. (1992). Genetic programming: on the programming of computers by means of natural selection. MIT Press.
- (1992) Genetic programming: On the programming of computers by means of natural selection
- Koza, J.¹

15
- 80053632021
- Evolving a neural network location evaluator to play Ms. Pac-Man
- Lucas, S. M. (2005). Evolving a neural network location evaluator to play Ms. Pac-Man. In IEEE Symposium on Computational Intelligence and Games, pp. 203-210.
- (2005) IEEE Symposium on Computational Intelligence and Games , pp. 203-210
- Lucas, S.M.¹

16
- 1942516890
- The cross-entropy method for fast policy search
- Mannor, S., Rubinstein, R. Y., & Gat, Y. (2003). The cross-entropy method for fast policy search. In 20th International Conference on Machine Learning.
- (2003) 20th International Conference on Machine Learning
- Mannor, S.¹ Rubinstein, R.Y.² Gat, Y.³

17
- 17444377167
- On the convergence of the cross-entropy method
- Margolin, L. (2004). On the convergence of the cross-entropy method. Annals of Operations Research, 134, 201-214.
- (2004) Annals of Operations Research , vol.134 , pp. 201-214
- Margolin, L.¹

18
- 17444414191
- Basis function adaptation in temporal difference reinforcement learning
- Menache, I., Mannor, S., & Shimkin, N. (2005). Basis function adaptation in temporal difference reinforcement learning. Annals of Operations Research, 134(1), 215-238.
- (2005) Annals of Operations Research , vol.134 , Issue.1 , pp. 215-238
- Menache, I.¹ Mannor, S.² Shimkin, N.³

19
- 0031215849
- The equation for response to selection and its use for prediction
- Muehlenbein, H. (1998). The equation for response to selection and its use for prediction. Evolutionary Computation, 5, 303-346.
- (1998) Evolutionary Computation , vol.5 , pp. 303-346
- Muehlenbein, H.¹

20
- 26944432123
- Improving adaptive game AI with evolutionary learning
- Ponsen, M., & Spronck, P. (2004). Improving adaptive game AI with evolutionary learning. In Computer Games: Artificial Intelligence, Design and Education.
- (2004) Computer Games: Artificial Intelligence, Design and Education
- Ponsen, M.¹ Spronck, P.²

21
- 0013464725
- Decision-theoretic planning with concurrent temporally extended actions
- Rohanimanesh, K., & Mahadevan, S. (2001). Decision-theoretic planning with concurrent temporally extended actions. In Proceedings of the 17th Conference on Uncerainty in Artificial Intelligence, pp. 472-479.
- (2001) Proceedings of the 17th Conference on Uncerainty in Artificial Intelligence , pp. 472-479
- Rohanimanesh, K.¹ Mahadevan, S.²

22
- 0000228665
- The cross-entropy method for combinatorial and continuous optimization
- Rubinstein, R. Y. (1999). The cross-entropy method for combinatorial and continuous optimization. Methodology and Computing in Applied Probability, 1, 127-190.
- (1999) Methodology and Computing in Applied Probability , vol.1 , pp. 127-190
- Rubinstein, R.Y.¹

23
- 0001201756
- Some studies in machine learning using the game of checkers
- Samuel, A. L. (1959). Some studies in machine learning using the game of checkers. IBM Journal of Research and Development, 6, 211-229.
- (1959) IBM Journal of Research and Development , vol.6 , pp. 211-229
- Samuel, A.L.¹

24
- 38349109282
- Master's thesis, École Polytechnique Fédérale de Lausanne
- Schaul, T. (2005). Evolving a compact concept-based Sokoban solver. Master's thesis, École Polytechnique Fédérale de Lausanne.
- (2005) Evolving a compact concept-based Sokoban solver
- Schaul, T.¹

25
- 0041644968
- A computer scientist's view of life, the universe, and everything
- Freksa, C, Jantzen, M, & Valk, R, Eds, Foundations of Computer Science: Potential, Theory, Cognition, of, Springer, Berlin
- Schmidhuber, J. (1997). A computer scientist's view of life, the universe, and everything. In Freksa, C., Jantzen, M., & Valk, R. (Eds.), Foundations of Computer Science: Potential - Theory - Cognition, Vol. 1337 of Lecture Notes in Computer Science, pp. 201-208. Springer, Berlin.
- (1997) Lecture Notes in Computer Science , vol.1337 , pp. 201-208
- Schmidhuber, J.¹

26
- 33744791741
- Adaptive game ai with dynamic scripting
- Spronck, P., Ponsen, M., Sprinkhuizen-Kuyper, I., & Postma, E. (2006). Adaptive game ai with dynamic scripting. Machine Learning, 63(3), 217-248.
- (2006) Machine Learning , vol.63 , Issue.3 , pp. 217-248
- Spronck, P.¹ Ponsen, M.² Sprinkhuizen-Kuyper, I.³ Postma, E.⁴

27
- 38349176261
- Online adaptation of computer game opponent AI
- Spronck, P., Sprinkhuizen-Kuyper, I., & Postma, E. (2003). Online adaptation of computer game opponent AI. In Proceedings of the 15th Belgium-Netherlands Conference on Artificial Intelligence, pp. 291-298.
- (2003) Proceedings of the 15th Belgium-Netherlands Conference on Artificial Intelligence , pp. 291-298
- Spronck, P.¹ Sprinkhuizen-Kuyper, I.² Postma, E.³

28
- 0004102479
- MIT Press, Cambridge
- Sutton, R. S., & Barto, A. G. (1998). Reinforcement Learning: An Introduction. MIT Press, Cambridge.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

29
- 33745683202
- Szabó, Z., Póczos, B., & Lorincz, A. (2006). Cross-entropy optimization for independent process analysis. In ICA, pp. 909-916.
- Szabó, Z., Póczos, B., & Lorincz, A. (2006). Cross-entropy optimization for independent process analysis. In ICA, pp. 909-916.

30
- 38349174986
- How to select the 100 voxels that are best for prediction - a simplistic approach
- Tech. rep, Eötvös Loránd University, Hungary
- Szita, I. (2006). How to select the 100 voxels that are best for prediction - a simplistic approach. Tech. rep., Eötvös Loránd University, Hungary.
- (2006)
- Szita, I.¹

31
- 33845344721
- Learning Tetris using the noisy cross-entropy method
- Szita, I., & Lorincz, A. (2006). Learning Tetris using the noisy cross-entropy method. Neural Computation, 18(12), 2936-2941.
- (2006) Neural Computation , vol.18 , Issue.12 , pp. 2936-2941
- Szita, I.¹ Lorincz, A.²

32
- 0000985504
- TD-Gammon, a self-teaching backgammon program, achieves master-level play
- Tesauro, G. (1994). TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation, 6(2), 215-219.
- (1994) Neural Computation , vol.6 , Issue.2 , pp. 215-219
- Tesauro, G.¹

33
- 38349117491
- Automatic rule ordering for dynamic scripting
- Timuri, T., Spronck, P., & van den Herik, J. (2007). Automatic rule ordering for dynamic scripting. In The Third Artificial Intelligence and Interactive Digital Entertainment Conference, pp. 49-54.
- (2007) The Third Artificial Intelligence and Interactive Digital Entertainment Conference , pp. 49-54
- Timuri, T.¹ Spronck, P.² van den Herik, J.³

34
- 38349156059
- Bachelor's thesis, Department of Information Technology and Electrical Engineering
- Tiong, A. L. K. (2002). Rule set representation and fitness functions for an artificial pac man playing agent. Bachelor's thesis, Department of Information Technology and Electrical Engineering.
- (2002) Rule set representation and fitness functions for an artificial pac man playing agent
- Tiong, A.L.K.¹

35
- 0037158688
- Fast hands-free writing by gaze direction
- Ward, D. J., & MacKay, D. J. C. (2002). Fast hands-free writing by gaze direction. Nature, 418, 838-540.
- (2002) Nature , vol.418 , pp. 838-540
- Ward, D.J.¹ MacKay, D.J.C.²

36
- 38349110468
- Wikipedia (2006). Pac-Man -Wikipedia, the free encyclopedia. Wikipedia. [Online; accessed 20 May 2007].
- Wikipedia (2006). Pac-Man -Wikipedia, the free encyclopedia. Wikipedia. [Online; accessed 20 May 2007].

37
- 0023364261
- Arithmetic coding for data compression
- Witten, I. A., Neal, R. M., & Cleary, J. G. (1987). Arithmetic coding for data compression. Communications of the ACM, 30, 520-540.
- (1987) Communications of the ACM , vol.30 , pp. 520-540
- Witten, I.A.¹ Neal, R.M.² Cleary, J.G.³

38
- 0031118203
- No free lunch theorems for optimization
- Wolpert, D. H., & Macready, W. G. (1997). No free lunch theorems for optimization. IEEE Transactions on Evolutionary Computation, 1, 67-82.
- (1997) IEEE Transactions on Evolutionary Computation , vol.1 , pp. 67-82
- Wolpert, D.H.¹ Macready, W.G.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.