SCOPUS 정보 검색 플랫폼

Autonomous Agents and Multi-Agent Systems

Volumn 10, Issue 2, 2005, Pages 103-130

Learning and exploiting relative weaknesses of opponent agents

(2) Markovitch, Shaul a Reger, Ronit a

a TECHNION ISRAEL INSTITUTE OF TECHNOLOGY (Israel)

Author keywords

Machine learning; Multi agent systems; Opponent modelling

Indexed keywords

BOUNDEDLY RATIONAL AGENT; MODEL BASED LEARNING; MODEL-FREE LEARNING; OPPONENT MODELING;

ALGORITHMS; COMPUTATIONAL COMPLEXITY; DECISION SUPPORT SYSTEMS; FINITE AUTOMATA; LEARNING SYSTEMS; MATHEMATICAL MODELS; PROBLEM SOLVING;

MULTI AGENT SYSTEMS;

EID: 14744285085 PISSN: 13872532 EISSN: None Source Type: Journal
DOI: 10.1007/s10458-004-6977-7 Document Type: Article

Times cited : (28)

References (52)

1
- 0038245603
- Master's thesis, Department of mathematics and Computer Science, Vrije Universiteit, Amsterdam, The Netherlands
- V. Allis, "A knowledge-based approach of Connect-Four - the game is solved: White wins," Master's thesis, Department of mathematics and Computer Science, Vrije Universiteit, Amsterdam, The Netherlands, 1988.
- (1988) A Knowledge-based Approach of Connect-four - The Game is Solved: White Wins
- Allis, V.¹

2
- 0018062404
- On the complexity of minimum inference of regular sets
- D. Angluin, "On the complexity of minimum inference of regular sets," Information and Control vol. 39, pp. 337-350, 1978.
- (1978) Information and Control , vol.39 , pp. 337-350
- Angluin, D.¹

3
- 14744292247
- C. Atkeson, and J. Santamaria, "A comparison of direct and model-based reinforcement learning," 1997.
- (1997) A Comparison of Direct and Model-based Reinforcement Learning
- Atkeson, C.¹ Santamaria, J.²

4
- 0031635794
- Opponent modeling in poker
- Madison, Wisconsin
- D. Billings, D. Papp, J. Schaeffer, and D. Szafron, "Opponent modeling in poker," in Proceedings of the Fifteenth National Conference on Artificial Intelligence, Madison, Wisconsin, pp. 493-499, 1998.
- (1998) Proceedings of the Fifteenth National Conference on Artificial Intelligence , pp. 493-499
- Billings, D.¹ Papp, D.² Schaeffer, J.³ Szafron, D.⁴

5
- 85137650338
- Multi-robot team response to a multi-robot opponent team
- J. Bruce, M. Bowling, B. Browning, and M. Veloso, "Multi-robot team response to a multi-robot opponent team," in Proceedings of IROS-2002 workshop on Collaborative Rabots, 2002.
- (2002) Proceedings of IROS-2002 Workshop on Collaborative Rabots
- Bruce, J.¹ Bowling, M.² Browning, B.³ Veloso, M.⁴

6
- 85169126305
- Learning models of the opponent's strategy in game-playing
- North Carolina
- D. Carmel, and S. Markovitch, "Learning models of the opponent's strategy in game-playing," in, Proceedings of The AAAI Fall Symposium on Games: Planning and Learning, North Carolina, 1993.
- (1993) Proceedings of the AAAI Fall Symposium on Games: Planning and Learning
- Carmel, D.¹ Markovitch, S.²

7
- 0030349572
- Incorporating opponent models into adversary search
- Portland, Oregon
- D. Carmel, and S. Markovitch, "Incorporating Opponent Models into Adversary Search". in, Proceedings of the Thirteenth National Conference on Artificial Intelligence. Portland, Oregon, pp. 120-125.
- Proceedings of the Thirteenth National Conference on Artificial Intelligence , pp. 120-125
- Carmel, D.¹ Markovitch, S.²

8
- 4243574392
- Learning and using opponent models in adversary search
- Technion
- D. Carmel, and S. Markovitch, "Learning and using opponent models in adversary search," Technical Report CIS9609, Technion, 1996b.
- (1996) Technical Report , vol.CIS9609
- Carmel, D.¹ Markovitch, S.²

9
- 0030365402
- Learning models of intelligent agents
- Portland, Oregon
- D. Carmel, and S. Markovitch, "Learning models of intelligent agents," in, Proceedings of the Thirteenth National Conference on Artificial Intelligence. Portland, Oregon, pp. 62-67, 1996c.
- (1996) Proceedings of the Thirteenth National Conference on Artificial Intelligence , pp. 62-67
- Carmel, D.¹ Markovitch, S.²

10
- 0032344579
- Model-based learning of interaction strategies in multi-agent systems
- D. Carmel, and S. Markovitch, "Model-based learning of interaction strategies in multi-agent systems," Journal of Experimental and Theoretical Artificial Intelligence, vol.10, no. 3, pp.309-332, 1998.
- (1998) Journal of Experimental and Theoretical Artificial Intelligence , vol.10 , Issue.3 , pp. 309-332
- Carmel, D.¹ Markovitch, S.²

11
- 0033423368
- Exploration strategies for model-based learning in multiagent systems
- D. Carmel, and S. Markovitch, "Exploration strategies for model-based learning in multiagent systems," Autonomous Agents and Multi-agent Systems, vol. 2, no. 2, pp. 141-172,1999.
- (1999) Autonomous Agents and Multi-agent Systems , vol.2 , Issue.2 , pp. 141-172
- Carmel, D.¹ Markovitch, S.²

12
- 0035397965
- Probabilistic opponent-model search
- H.Donkers, J. Uiterwijk, and H. van den Herik, "Probabilistic opponent-model search," Information Sciences vol. 135 no. 3-4, 123-149, 2001.
- (2001) Information Sciences , vol.135 , Issue.3-4 , pp. 123-149
- Donkers, H.¹ Uiterwijk, J.² Van Den Herik, H.³

13
- 0029547692
- Efficient algorithms for learning to play repeated games against computationally bounded adversaries
- IEEE Computer Society Press, Los Alamitos, CA
- Y. Freund, M. Kearns, Y. Mansour, D. Ron, and R. Rubinfeld, "Efficient algorithms for learning to play repeated games against computationally bounded adversaries," in, Proceeding, of the 36th Annual Symposium on Foundations of Computer Science. IEEE Computer Society Press, Los Alamitos, CA, pp. 332-341, 1995.
- (1995) Proceeding, of the 36th Annual Symposium on Foundations of Computer Science , pp. 332-341
- Freund, Y.¹ Kearns, M.² Mansour, Y.³ Ron, D.⁴ Rubinfeld, R.⁵

14
- 0034915471
- Strategies anticipating a difference in search depth using opponent-model search
- X. Gao, H. Iida, J. W. Uiterwijk, and H. J. van den Herik, "Strategies anticipating a difference in search depth using opponent-model search," Theoretical Computer Science, vol. 252 no. 1-2, pp. 83-104, 2001.
- (2001) Theoretical Computer Science , vol.252 , Issue.1-2 , pp. 83-104
- Gao, X.¹ Iida, H.² Uiterwijk, J.W.³ Van Den Herik, H.J.⁴

15
- 14744287089
- Performance of (D,d)-OM search in othello
- Shikawa, Japan
- X. Gao, H. Iida, J. W. H. M. Uiterwijk, and H. J. van den Herik, "Performance of (D,d)-OM search in othello," in, Proceedings of JSSST 14th Conference, Shikawa, Japan, pp. 229-232, 1997.
- (1997) Proceedings of JSSST 14th Conference , pp. 229-232
- Gao, X.¹ Iida, H.² Uiterwijk, J.W.H.M.³ Van Den Herik, H.J.⁴

16
- 14744306098
- A speculative strategy
- X. Gao, H. Iida, J. W. H. M. Uiterwijk, and H. J. Van den Herik, "A Speculative Strategy," Lecture Notes in Computer Science, vol. 1558, pp.74-92, 1999.
- (1999) Lecture Notes in Computer Science , vol.1558 , pp. 74-92
- Gao, X.¹ Iida, H.² Uiterwijk, J.W.H.M.³ Van Den Herik, H.J.⁴

17
- 0006389601
- A rigorous, operational formalization of recursive modeling
- V. Lesser and L. Gasser (eds.), San Francisco, CA, USA, AAAI Press
- P. J. Gmytrasiewicz, and E. H. Durfee, "A rigorous, operational formalization of recursive modeling," in V. Lesser and L. Gasser (eds.), Proceedings of the First International Conference on Multi-Agent Systems (ICMAS-95). San Francisco, CA, USA, AAAI Press, 1995.
- (1995) Proceedings of the First International Conference on Multi-Agent Systems (ICMAS-95)
- Gmytrasiewicz, P.J.¹ Durfee, E.H.²

18
- 0031681785
- Bayesian update of recursive agent models
- P. J. Gmytrasiewicz, S. Noh, and T. Kellogg, "Bayesian update of recursive agent models," User Modeling and User-Adapted Interaction, An International Journal, Special Issue on Learning for User Modeling, vol. 8, no. 1-2, pp. 49-69, 1998.
- (1998) User Modeling and User-adapted Interaction, An International Journal, Special Issue on Learning for User Modeling , vol.8 , Issue.1-2 , pp. 49-69
- Gmytrasiewicz, P.J.¹ Noh, S.² Kellogg, T.³

19
- 0001878752
- Deep thought
- T. Marsland and J. Schaeffer (eds.), Springer New York
- F.-H. Hsu, T. Ananthraman, M. Campbell, and A. Nowatzyk, "Deep thought," in T. Marsland and J. Schaeffer (eds.), Computers, Chess and Cognition. Springer New York, pp. 55-78, 1990.
- (1990) Computers, Chess and Cognition , pp. 55-78
- Hsu, F.-H.¹ Ananthraman, T.² Campbell, M.³ Nowatzyk, A.⁴

20
- 0030361831
- Generation of attributes for learning algorithms
- Menlo Park, AAAI Press / MIT Press
- Y.-J. Hu, and D. F. Kibler, "Generation of attributes for learning algorithms," in, Proceedings of the Thirteenth National Conference on Artificial Intelligence and the Eighth Innovative Applications of Artificial Intelligence Conference, Menlo Park, AAAI Press / MIT Press, pp. 806-811, 1996.
- (1996) Proceedings of the Thirteenth National Conference on Artificial Intelligence and the Eighth Innovative Applications of Artificial Intelligence Conference , pp. 806-811
- Hu, Y.-J.¹ Kibler, D.F.²

21
- 0001666986
- Potential applications of opponent-model search, part I: The domain of applicability
- H. Iida, J. W. H. M. Uiterwijk, H. J. van den Herik, and I. S. Herschberg, "Potential applications of opponent-model search, Part I: The Domain of Applicability," ICCA Journal vol. 16 no. (4), pp. 201-208, 1993.
- (1993) ICCA Journal , vol.16 , Issue.4 , pp. 201-208
- Iida, H.¹ Uiterwijk, J.W.H.M.² Van Den Herik, H.J.³ Herschberg, I.S.⁴

22
- 0004693797
- Potential applications of opponent-model search, part II: Risks and strategies
- H. Iida, J. W. H. M. Uiterwijk, H. J. van den Herik, and I. S. Herschberg, "Potential applications of opponent-model search, Part II: Risks and strategies," ICCA Journal, vol. 17, no. 1, pp. 10-14, 1994.
- (1994) ICCA Journal , vol.17 , Issue.1 , pp. 10-14
- Iida, H.¹ Uiterwijk, J.W.H.M.² Van Den Herik, H.J.³ Herschberg, I.S.⁴

23
- 0004089723
- Ph.D. thesis, Carnegie Mellon University
- P. J. Jansen, "Using knowledge about the opponent in game-tree search," Ph.D. thesis, Carnegie Mellon University, 1992.
- (1992) Using Knowledge about the Opponent in Game-tree Search
- Jansen, P.J.¹

24
- 0007943080
- Search versus knowledge in game-playing programs revisited
- Nagoya, Japan
- A. Junghanns, and J. Schaeffer, "Search versus knowledge in game-playing programs revisited". in Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence (IJCAI-97). Nagoya, Japan, pp. 692-697, 1997.
- (1997) Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence (IJCAI-97) , pp. 692-697
- Junghanns, A.¹ Schaeffer, J.²

25
- 0029679044
- Reinforcement learning, a survey
- L. P. Kaelbling, M. L. Littman, and A. P. Moore, "Reinforcement learning, a survey," Journal of Artificial Intelligence Research, vol. 4, pp. 237-285, 1996.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.P.³

26
- 85149834820
- Markov games as a framework for multi-agent reinforcement learning
- New Brunswick, NJ, Morgan Kaufmann
- M. L. Littman, "Markov games as a framework for multi-agent reinforcement learning," in, Proceedings of the 11th International Conference on Machine Learning (ML-94), New Brunswick, NJ, Morgan Kaufmann, pp. 157-163, 1994.
- (1994) Proceedings of the 11th International Conference on Machine Learning (ML-94) , pp. 157-163
- Littman, M.L.¹

27
- 0036778917
- Feature generation using general constructor functions
- S. Markovitch, and D. Rosenstein, "Feature generation using general constructor functions," Machine Learning, vol. 49, pp. 59-98, 2001.
- (2001) Machine Learning , vol.49 , pp. 59-98
- Markovitch, S.¹ Rosenstein, D.²

28
- 5844259149
- Learning of resource allocation strategies for game playing
- S.Markovitch, and Y. Sella, "Learning of resource allocation strategies for game playing," Computational Intelligence, vol. 12 no. (1), pp. 88-105, 1996.
- (1996) Computational Intelligence , vol.12 , Issue.1 , pp. 88-105
- Markovitch, S.¹ Sella, Y.²

29
- 0002011817
- Constructive induction on decision trees
- N. S. Sridharan (ed.), Detroit, MI, USA, Morgan Kaufmann
- C. J. Matheus, and L. A. Rendell, "Constructive induction on decision trees," in, N. S. Sridharan (ed.), Proceedings of the 11th International Joint Conference on Artificial Intelligence, Detroit, MI, USA, Morgan Kaufmann, pp. 645-650, 1989.
- (1989) Proceedings of the 11th International Joint Conference on Artificial Intelligence , pp. 645-650
- Matheus, C.J.¹ Rendell, L.A.²

30
- 0027684215
- Prioritized sweeping, reinforcement learning with less data and less time
- A. W. Moore, and C. G. Atkeson, "Prioritized sweeping, reinforcement learning with less data and less time," Machine Learning, vol. 13 , pp.103-130, 1993.
- (1993) Machine Learning , vol.13 , pp. 103-130
- Moore, A.W.¹ Atkeson, C.G.²

31
- 84949966497
- Learn your opponent's strategy (in polynomial time)!
- G. Weiss and S. Sen (eds.), Springer-Verlag
- Y. Mor, C. Goldman, and J. Rosenschein, "Learn your opponent's strategy (in polynomial time)!," in, G. Weiss and S. Sen (eds.), Adaptation and Learning in Multi-agent Systems, Lecture Notes in Artificial Intelligence, vol. 1042. Springer-Verlag, 1996.
- (1996) Adaptation and Learning in Multi-agent Systems, Lecture Notes in Artificial Intelligence , vol.1042
- Mor, Y.¹ Goldman, C.² Rosenschein, J.³

32
- 85156219280
- Pathology on game trees: A summary of results
- Stanford, California
- D. S. Nau, "Pathology on game trees: A summary of results," in Proceedings of the First National Conference on Artificial Intelligence, Stanford, California, pp. 102-104, 1980.
- (1980) Proceedings of the First National Conference on Artificial Intelligence , pp. 102-104
- Nau, D.S.¹

33
- 0020207407
- An investigation of the causes of pathology in games
- D. S. Nau, "An investigation of the causes of pathology in games," Artificial Intelligence vol. 19, pp. 257-278, 1982.
- (1982) Artificial Intelligence , vol.19 , pp. 257-278
- Nau, D.S.¹

34
- 0025389210
- Boolean feature discovery in empirical learning
- G. Pagallo, and D. Haussler, "Boolean feature discovery in empirical learning," Machine Learning vol. 5 no. 1, pp. 71-99, 1990.
- (1990) Machine Learning , vol.5 , Issue.1 , pp. 71-99
- Pagallo, G.¹ Haussler, D.²

35
- 0020787874
- On the nature of pathology in game searching
- J. Pearl, "On the nature of pathology in game searching," Artificial Intelligence, vol. 20, pp. 427-453, 1983.
- (1983) Artificial Intelligence , vol.20 , pp. 427-453
- Pearl, J.¹

36
- 33744584654
- Induction of decision trees
- Morgan Kaufmann
- J. R.Quinlan, "Induction of decision trees," in Machine Learning, vol. 1, Morgan Kaufmann, pp. 81-106, 1986.
- (1986) Machine Learning , vol.1 , pp. 81-106
- Quinlan, J.R.¹

37
- 0020882218
- Non-minimax strategies for use against fallible opponents
- Los Altos, CA, William Kaufman
- A. Reibman, and B. Ballard, "Non-minimax strategies for use against fallible opponents," in, Proceedings of the international conference on artificial intelligence AAAI-83, Los Altos, CA, William Kaufman,pp. 338-343, 1983.
- (1983) Proceedings of the International Conference on Artificial Intelligence AAAI-83 , pp. 338-343
- Reibman, A.¹ Ballard, B.²

38
- 0003711660
- Artificial Intelligence, Cambridge, Mass, MIT Press
- S. Russell, and E. Wefald, Do the right thing: studies in limited rationality, Artificial Intelligence, Cambridge, Mass, MIT Press, 1991.
- (1991) Do the Right Thing: Studies in Limited Rationality
- Russell, S.¹ Wefald, E.²

39
- 0030050933
- Multiagent reinforcement learning and the iterated Prisoner's Dilemma
- T. W. Sandholm, and R. H. Crites, "Multiagent reinforcement learning and the iterated Prisoner's Dilemma," Biosystems Journal vol. 37, 147-166, 1995.
- (1995) Biosystems Journal , vol.37 , pp. 147-166
- Sandholm, T.W.¹ Crites, R.H.²

40
- 27444435127
- Modeling auction price uncertainty using boosting-based conditional density estimation
- R. Schapire, P. Stone, D. McAllester, M. Littman, and J. Csirik, "Modeling auction price uncertainty using boosting-based conditional density estimation," in Proceedings of the Nineteenth International Conference on Machine Learning, 2002.
- (2002) Proceedings of the Nineteenth International Conference on Machine Learning
- Schapire, R.¹ Stone, P.² McAllester, D.³ Littman, M.⁴ Csirik, J.⁵

41
- 1242330197
- Learning to take risks
- S. Sen, and N. Arora, "Learning to take risks," in AAAI-97 Workshop on Multiagent Learning, pp. 59-64, 1997.
- (1997) AAAI-97 Workshop on Multiagent Learning , pp. 59-64
- Sen, S.¹ Arora, N.²

42
- 0001842882
- Learning in multiagent systems
- G. Weiss (ed.), Cambridge, Massachusetts, The MIT Press, Chapt. 6
- S. Sen, and G. Weiss, "Learning in multiagent systems," in, G. Weiss (ed.), Multiagent Systems: A Modern Approach to Distributed Artificial Intelligence. Cambridge, Massachusetts, The MIT Press, Chapt. 6, pp. 259-298, 1999.
- (1999) Multiagent Systems: A Modern Approach to Distributed Artificial Intelligence , pp. 259-298
- Sen, S.¹ Weiss, G.²

43
- 0004077471
- Cambridge, Massachusetts, The MIT Press
- H. A. Simon, Models of Bounded Rationality, Volume 1. Cambridge, Massachusetts, The MIT Press, 1982.
- (1982) Models of Bounded Rationality , vol.1
- Simon, H.A.¹

44
- 0002634812
- Defining and using ideal teammate and opponent agent models
- Menlo Park, CA, AAAI Press
- P.Stone, P. Riley, and M. Veloso, "Defining and using ideal teammate and opponent agent models," in, Proceedings of the 7th Conference on Artificial Intelligence (AAAI-00) and of the 12th Conference on Innovative Applications of Artificial Intelligence (IAAI-00). Menlo Park, CA, AAAI Press, pp. 1040-1045, 2000.
- (2000) Proceedings of the 7th Conference on Artificial Intelligence (AAAI-00) and of the 12th Conference on Innovative Applications of Artificial Intelligence (IAAI-00) , pp. 1040-1045
- Stone, P.¹ Riley, P.² Veloso, M.³

45
- 85132026293
- Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
- R.Sutton, "Integrated architectures for learning, planning, and reacting based on approximating dynamic programming," in Proceedings of the Seventh International Conference on Machine Learning. pp. 216-224, 1990.
- (1990) Proceedings of the Seventh International Conference on Machine Learning , pp. 216-224
- Sutton, R.¹

46
- 77956290651
- Generalizing adversarial reinforcement learning
- W. T. B. Uther, and M. M. Veloso, "Generalizing adversarial reinforcement learning," in Proceedings of the AAAI Fall Symposium on Model Directed Autonomous Systems, 1997.
- (1997) Proceedings of the AAAI Fall Symposium on Model Directed Autonomous Systems
- Uther, W.T.B.¹ Veloso, M.M.²

47
- 0002632292
- The impact of nested agent models in an information economy
- V. Lesser (ed.), Kyoto, Japan, The MIT Press, Cambridge, MA, USA
- J. M. Vidai, and E. H. Durfee, "The impact of nested agent models in an information economy," in, V. Lesser (ed.), Proceedings of the Second International Conference on Multi-Agent Systems (IC-MAS'96). Kyoto, Japan, The MIT Press, Cambridge, MA, USA, 1995.
- (1995) Proceedings of the Second International Conference on Multi-agent Systems (IC-MAS'96)
- Vidai, J.M.¹ Durfee, E.H.²

48
- 84956981872
- Using recursive agent models effectively
- M. Wooldridge, J. P. Müller, and M. Tambe (eds.), Proceedings on the IJCAI Workshop on Intelligent Agents II: Agent Theories, Architectures, and Languages, Springer-Verlag, Heidelberg, Germany
- J. M. Vidai, and E. H. Durfee, "Using recursive agent models effectively," in M. Wooldridge, J. P. Müller, and M. Tambe (eds.), Proceedings on the IJCAI Workshop on Intelligent Agents II: Agent Theories, Architectures, and Languages, vol. 1037 of LNAI. Springer-Verlag, Heidelberg, Germany, pp. 171-186, 1996.
- (1996) LNAI , vol.1037 , pp. 171-186
- Vidai, J.M.¹ Durfee, E.H.²

49
- 0004049893
- Ph.D. thesis, University of Cambridge
- C. J. Watkins, "Learning from delayed rewards," Ph.D. thesis, University of Cambridge, 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.J.¹

50
- 34249833101
- Q-learaing
- C. J. Watkins, and P. Dayan, "Q-Learaing," Machine Learning vol. 8, pp. 279-292, 1992.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.J.¹ Dayan, P.²

51
- 0003782395
- Springer-Verlag
- G. Weiss, and S. Sen, Adaptation and learning in multi-agent systems, Lectures Notes in Articial Intelligence, vol. 1042. Springer-Verlag, 1996.
- (1996) Adaptation and Learning in Multi-agent Systems, Lectures Notes in Articial Intelligence , vol.1042
- Weiss, G.¹ Sen, S.²

52
- 0346861528
- Optimizing decision quality with contract algorithms
- Montreal, Canada
- S. Zilberstein,"Optimizing decision quality with contract algorithms". in, Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence. Montreal, Canada, pp. 1576-1582, 1995.
- (1995) Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence , pp. 1576-1582
- Zilberstein, S.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.