메뉴 건너뛰기




Volumn 10, Issue 2, 2005, Pages 103-130

Learning and exploiting relative weaknesses of opponent agents

Author keywords

Machine learning; Multi agent systems; Opponent modelling

Indexed keywords

BOUNDEDLY RATIONAL AGENT; MODEL BASED LEARNING; MODEL-FREE LEARNING; OPPONENT MODELING;

EID: 14744285085     PISSN: 13872532     EISSN: None     Source Type: Journal    
DOI: 10.1007/s10458-004-6977-7     Document Type: Article
Times cited : (28)

References (52)
  • 2
    • 0018062404 scopus 로고
    • On the complexity of minimum inference of regular sets
    • D. Angluin, "On the complexity of minimum inference of regular sets," Information and Control vol. 39, pp. 337-350, 1978.
    • (1978) Information and Control , vol.39 , pp. 337-350
    • Angluin, D.1
  • 8
    • 4243574392 scopus 로고    scopus 로고
    • Learning and using opponent models in adversary search
    • Technion
    • D. Carmel, and S. Markovitch, "Learning and using opponent models in adversary search," Technical Report CIS9609, Technion, 1996b.
    • (1996) Technical Report , vol.CIS9609
    • Carmel, D.1    Markovitch, S.2
  • 11
    • 0033423368 scopus 로고    scopus 로고
    • Exploration strategies for model-based learning in multiagent systems
    • D. Carmel, and S. Markovitch, "Exploration strategies for model-based learning in multiagent systems," Autonomous Agents and Multi-agent Systems, vol. 2, no. 2, pp. 141-172,1999.
    • (1999) Autonomous Agents and Multi-agent Systems , vol.2 , Issue.2 , pp. 141-172
    • Carmel, D.1    Markovitch, S.2
  • 14
    • 0034915471 scopus 로고    scopus 로고
    • Strategies anticipating a difference in search depth using opponent-model search
    • X. Gao, H. Iida, J. W. Uiterwijk, and H. J. van den Herik, "Strategies anticipating a difference in search depth using opponent-model search," Theoretical Computer Science, vol. 252 no. 1-2, pp. 83-104, 2001.
    • (2001) Theoretical Computer Science , vol.252 , Issue.1-2 , pp. 83-104
    • Gao, X.1    Iida, H.2    Uiterwijk, J.W.3    Van Den Herik, H.J.4
  • 21
    • 0001666986 scopus 로고
    • Potential applications of opponent-model search, part I: The domain of applicability
    • H. Iida, J. W. H. M. Uiterwijk, H. J. van den Herik, and I. S. Herschberg, "Potential applications of opponent-model search, Part I: The Domain of Applicability," ICCA Journal vol. 16 no. (4), pp. 201-208, 1993.
    • (1993) ICCA Journal , vol.16 , Issue.4 , pp. 201-208
    • Iida, H.1    Uiterwijk, J.W.H.M.2    Van Den Herik, H.J.3    Herschberg, I.S.4
  • 22
    • 0004693797 scopus 로고
    • Potential applications of opponent-model search, part II: Risks and strategies
    • H. Iida, J. W. H. M. Uiterwijk, H. J. van den Herik, and I. S. Herschberg, "Potential applications of opponent-model search, Part II: Risks and strategies," ICCA Journal, vol. 17, no. 1, pp. 10-14, 1994.
    • (1994) ICCA Journal , vol.17 , Issue.1 , pp. 10-14
    • Iida, H.1    Uiterwijk, J.W.H.M.2    Van Den Herik, H.J.3    Herschberg, I.S.4
  • 27
    • 0036778917 scopus 로고    scopus 로고
    • Feature generation using general constructor functions
    • S. Markovitch, and D. Rosenstein, "Feature generation using general constructor functions," Machine Learning, vol. 49, pp. 59-98, 2001.
    • (2001) Machine Learning , vol.49 , pp. 59-98
    • Markovitch, S.1    Rosenstein, D.2
  • 28
    • 5844259149 scopus 로고    scopus 로고
    • Learning of resource allocation strategies for game playing
    • S.Markovitch, and Y. Sella, "Learning of resource allocation strategies for game playing," Computational Intelligence, vol. 12 no. (1), pp. 88-105, 1996.
    • (1996) Computational Intelligence , vol.12 , Issue.1 , pp. 88-105
    • Markovitch, S.1    Sella, Y.2
  • 30
    • 0027684215 scopus 로고
    • Prioritized sweeping, reinforcement learning with less data and less time
    • A. W. Moore, and C. G. Atkeson, "Prioritized sweeping, reinforcement learning with less data and less time," Machine Learning, vol. 13 , pp.103-130, 1993.
    • (1993) Machine Learning , vol.13 , pp. 103-130
    • Moore, A.W.1    Atkeson, C.G.2
  • 33
    • 0020207407 scopus 로고
    • An investigation of the causes of pathology in games
    • D. S. Nau, "An investigation of the causes of pathology in games," Artificial Intelligence vol. 19, pp. 257-278, 1982.
    • (1982) Artificial Intelligence , vol.19 , pp. 257-278
    • Nau, D.S.1
  • 34
    • 0025389210 scopus 로고
    • Boolean feature discovery in empirical learning
    • G. Pagallo, and D. Haussler, "Boolean feature discovery in empirical learning," Machine Learning vol. 5 no. 1, pp. 71-99, 1990.
    • (1990) Machine Learning , vol.5 , Issue.1 , pp. 71-99
    • Pagallo, G.1    Haussler, D.2
  • 35
    • 0020787874 scopus 로고
    • On the nature of pathology in game searching
    • J. Pearl, "On the nature of pathology in game searching," Artificial Intelligence, vol. 20, pp. 427-453, 1983.
    • (1983) Artificial Intelligence , vol.20 , pp. 427-453
    • Pearl, J.1
  • 36
    • 33744584654 scopus 로고
    • Induction of decision trees
    • Morgan Kaufmann
    • J. R.Quinlan, "Induction of decision trees," in Machine Learning, vol. 1, Morgan Kaufmann, pp. 81-106, 1986.
    • (1986) Machine Learning , vol.1 , pp. 81-106
    • Quinlan, J.R.1
  • 39
    • 0030050933 scopus 로고
    • Multiagent reinforcement learning and the iterated Prisoner's Dilemma
    • T. W. Sandholm, and R. H. Crites, "Multiagent reinforcement learning and the iterated Prisoner's Dilemma," Biosystems Journal vol. 37, 147-166, 1995.
    • (1995) Biosystems Journal , vol.37 , pp. 147-166
    • Sandholm, T.W.1    Crites, R.H.2
  • 43
    • 0004077471 scopus 로고
    • Cambridge, Massachusetts, The MIT Press
    • H. A. Simon, Models of Bounded Rationality, Volume 1. Cambridge, Massachusetts, The MIT Press, 1982.
    • (1982) Models of Bounded Rationality , vol.1
    • Simon, H.A.1
  • 45
    • 85132026293 scopus 로고
    • Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
    • R.Sutton, "Integrated architectures for learning, planning, and reacting based on approximating dynamic programming," in Proceedings of the Seventh International Conference on Machine Learning. pp. 216-224, 1990.
    • (1990) Proceedings of the Seventh International Conference on Machine Learning , pp. 216-224
    • Sutton, R.1
  • 48
    • 84956981872 scopus 로고    scopus 로고
    • Using recursive agent models effectively
    • M. Wooldridge, J. P. Müller, and M. Tambe (eds.), Proceedings on the IJCAI Workshop on Intelligent Agents II: Agent Theories, Architectures, and Languages, Springer-Verlag, Heidelberg, Germany
    • J. M. Vidai, and E. H. Durfee, "Using recursive agent models effectively," in M. Wooldridge, J. P. Müller, and M. Tambe (eds.), Proceedings on the IJCAI Workshop on Intelligent Agents II: Agent Theories, Architectures, and Languages, vol. 1037 of LNAI. Springer-Verlag, Heidelberg, Germany, pp. 171-186, 1996.
    • (1996) LNAI , vol.1037 , pp. 171-186
    • Vidai, J.M.1    Durfee, E.H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.