SCOPUS 정보 검색 플랫폼

Volumn 7895 LNAI, Issue PART 2, 2013, Pages 385-396

Opponent modelling by sequence prediction and lookahead in two-player games

Author keywords

Game Theory; Lookahead; Multi Agent Learning; Opponent Modelling; Reinforcement Learning; Sequence Prediction

Indexed keywords

ACTION SELECTION; HARD PROBLEMS; LEARNING SPEED; LOOKAHEAD; MEMORY LENGTH; MULTI-AGENT LEARNING; SEQUENCE PREDICTION; TWO-PLAYER GAMES;

GAME THEORY; MULTI AGENT SYSTEMS; REINFORCEMENT LEARNING; SOFT COMPUTING;

ALGORITHMS;

EID: 84884369341 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-642-38610-7_36 Document Type: Conference Paper

Times cited : (12)

References (20)

1
- 0004049893
- PhD thesis, Cambridge
- Watkins, C.J.C.H.: Learning from delayed rewards. PhD thesis, Cambridge (1989)
- (1989) Learning from Delayed Rewards
- Watkins, C.J.C.H.¹

3
- 0030365402
- Learning models of intelligent agents
- Carmel, Markovitch: Learning models of intelligent agents. In: Proc. of 13th Int. Conf. on AI, AAAI, pp. 62-67 (1996)
- (1996) Proc. Of 13th Int. Conf. On AI, AAAI , pp. 62-67
- Carmel, M.¹

4
- 29344453415
- Non-stationary policy learning in 2-player zero sum games
- Jensen, B., Gini, S.: Non-stationary policy learning in 2-player zero sum games. In: Proc. of 20th Int. Conf. on AI, pp. 789-794 (2005)
- (2005) Proc. Of 20th Int. Conf. On AI , pp. 789-794
- Jensen, B.¹ Gini, S.²

5
- 84884367760
- arXiv:1108.3298
- Knoll, de Freitas: A machine learning perspective on predictive coding with paq. arXiv:1108.3298 (2011)
- (2011) A Machine Learning Perspective on Predictive Coding with Paq
- De Knoll, F.¹

6
- 0000115094
- Generation of random sequences by human subjects: Cognitive operations or psychological process?
- Treisman, Faulkner: Generation of random sequences by human subjects: Cognitive operations or psychological process? JEP: General 116, 337-355 (1987)
- (1987) JEP: General , vol.116 , pp. 337-355
- Treisman, F.¹

10
- 33748158636
- Lempel, Ziv: Compression of individual sequences via variable-rate coding (1978)
- (1978) Compression of Individual Sequences Via Variable-rate Coding
- Lempel, Z.¹

11
- 84884371517
- Knoll, B.: Text prediction and classification using string matching (2009)
- (2009) Text Prediction and Classification Using String Matching
- Knoll, B.¹

12
- 0025516650
- Implementing the ppm data compression scheme
- Moffat, A.: Implementing the ppm data compression scheme. IEEE Transactions on Communications 38, 1917-1921 (1990)
- (1990) IEEE Transactions on Communications , vol.38 , pp. 1917-1921
- Moffat, A.¹

13
- 24944489873
- Activelezi: An incremental parsing algorithm for sequential prediction
- Gopalratnam, K., Cook, D.J.: Activelezi: An incremental parsing algorithm for sequential prediction. In: 16th Int. FLAIRS Conf., pp. 38-42 (2003)
- (2003) 16th Int. FLAIRS Conf. , pp. 38-42
- Gopalratnam, K.¹ Cook, D.J.²

14
- 0028404750
- Discrete sequence prediction and its applications
- Laird, P., Saul, R.: Discrete sequence prediction and its applications. Machine Learning 15, 43-68 (1994)
- (1994) Machine Learning , vol.15 , pp. 43-68
- Laird, P.¹ Saul, R.²

16
- 0041965934
- Learning precise timing with lstm recurrent networks
- Gers, F.A., Schraudolph, N.N., Schmidhuber, J.: Learning precise timing with lstm recurrent networks. JMLR 3, 115-143 (2002)
- (2002) JMLR , vol.3 , pp. 115-143
- Gers, F.A.¹ Schraudolph, N.N.² Schmidhuber, J.³

17
- 0036531878
- Multiagent learning using a variable learning rate
- Bowling, M., Veloso, M.: Multiagent learning using a variable learning rate. Artificial Intelligence 136, 215-250 (2002)
- (2002) Artificial Intelligence , vol.136 , pp. 215-250
- Bowling, M.¹ Veloso, M.²

18
- 84899897951
- Non-linear dynamics in multiagent reinforcement learning algorithms
- Abdallah, S., Lesser, V.R.: Non-linear dynamics in multiagent reinforcement learning algorithms. In: AAMAS (3), pp. 1321-1324 (2008)
- (2008) AAMAS , Issue.3 , pp. 1321-1324
- Abdallah, S.¹ Lesser, V.R.²

19
- 84884368079
- Multi-agent learning with policy prediction
- Zhang, Lesser: Multi-agent learning with policy prediction. In: AAAI (2010)
- (2010) AAAI
- Zhang, L.¹

20
- 80052009144
- Adaptive opponent modelling for the iterated prisoner's dilemma
- Piccolo, E., Squillero, G.: Adaptive opponent modelling for the iterated prisoner's dilemma. In: IEEE CEC, pp. 836-841 (2011)
- (2011) IEEE CEC , pp. 836-841
- Piccolo, E.¹ Squillero, G.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.