메뉴 건너뛰기




Volumn 10, Issue 3, 1998, Pages 309-332

Model-based learning of interaction strategies in multi-agent systems

Author keywords

Model based learning; Multi agent system

Indexed keywords


EID: 0032344579     PISSN: 0952813X     EISSN: 13623079     Source Type: Journal    
DOI: 10.1080/095281398146789     Document Type: Article
Times cited : (44)

References (31)
  • 1
    • 0000926141 scopus 로고
    • The structure of Nash equilibrium in repeated games with finite automata
    • Abreu, D., and Rubinstein, A., 1988, The structure of Nash equilibrium in repeated games with finite automata. Econometrica, 56(6): 1259-1281.
    • (1988) Econometrica , vol.56 , Issue.6 , pp. 1259-1281
    • Abreu, D.1    Rubinstein, A.2
  • 2
    • 0018062404 scopus 로고
    • On the complexity of minimum inference of regular sets
    • Angluin, D., 1978, On the complexity of minimum inference of regular sets. Information and Control, 39: 337-350.
    • (1978) Information and Control , vol.39 , pp. 337-350
    • Angluin, D.1
  • 3
    • 0023453626 scopus 로고
    • Learning regular sets from queries and counter-examples
    • Angluin, D., 1987, Learning regular sets from queries and counter-examples. Information and Computation, 75: 87-106.
    • (1987) Information and Computation , vol.75 , pp. 87-106
    • Angluin, D.1
  • 5
    • 0029210635 scopus 로고
    • Learning to act using real-time dynamic programming
    • Barto, A. G., Bradtke, S. J., and Singh, S. P., 1995, Learning to act using real-time dynamic programming. Artificial Intelligence, 72(1): 81-138.
    • (1995) Artificial Intelligence , vol.72 , Issue.1 , pp. 81-138
    • Barto, A.G.1    Bradtke, S.J.2    Singh, S.P.3
  • 6
    • 0011471586 scopus 로고
    • The complexity of computing a best response automaton in repeated games with mixed strategies
    • Ben-Porath, E., 1990, The complexity of computing a best response automaton in repeated games with mixed strategies. Games and Economic Behaviour, 2: 1-12.
    • (1990) Games and Economic Behaviour , vol.2 , pp. 1-12
    • Ben-Porath, E.1
  • 14
    • 38249006045 scopus 로고
    • Bounded versus unbounded rationality: The tyranny of the weak
    • Gilboa, I., and Samet, R. E., 1989, Bounded versus unbounded rationality: The tyranny of the weak. Games and Economic Behaviour, 1: 213-221.
    • (1989) Games and Economic Behaviour , vol.1 , pp. 213-221
    • Gilboa, I.1    Samet, R.E.2
  • 15
    • 0001187706 scopus 로고
    • Complexity of automaton identification from given data
    • Gold, E. M., 1978, Complexity of automaton identification from given data. Information and Control, 37: 301-320.
    • (1978) Information and Control , vol.37 , pp. 301-320
    • Gold, E.M.1
  • 18
    • 0011514818 scopus 로고
    • Computable strategies for repeated Prisoner's Dilemma
    • Knoblauch, V., 1994, Computable strategies for repeated Prisoner's Dilemma. Games and Economic Behaviour, 7: 381-389.
    • (1994) Games and Economic Behaviour , vol.7 , pp. 381-389
    • Knoblauch, V.1
  • 20
    • 0000948830 scopus 로고
    • On players with a bounded number of states
    • Papadimitriou, C. H., 1992, On players with a bounded number of states. Games and Economic Behaviour, 4: 122-131.
    • (1992) Games and Economic Behaviour , vol.4 , pp. 122-131
    • Papadimitriou, C.H.1
  • 22
    • 85037549689 scopus 로고
    • Inductive inference, DFAs and computational complexity
    • K. P. Jantke, (ed.), Berlin: Springer-Verlag
    • Pitt, L., 1989, Inductive inference, DFAs and computational complexity. In K. P. Jantke, (ed.) Analogical and Inductive Inference, Lecture Notes in AI, Vol. 397 (Berlin: Springer-Verlag), pp. 18-44.
    • (1989) Analogical and Inductive Inference, Lecture Notes in AI , vol.397 , pp. 18-44
    • Pitt, L.1
  • 23
    • 0001349185 scopus 로고
    • Inference of finite automata using homing sequences
    • Rivest, R. L., and Schapire, R. E., 1993, Inference of finite automata using homing sequences. Information and Computation, 103(2): 299-347.
    • (1993) Information and Computation , vol.103 , Issue.2 , pp. 299-347
    • Rivest, R.L.1    Schapire, R.E.2
  • 25
    • 46149134052 scopus 로고
    • Finite automata play the repeated Prisoner's Dilemma
    • Rubinstein, A., 1986, Finite automata play the repeated Prisoner's Dilemma. Journal of Economic Theory, 39: 83-96.
    • (1986) Journal of Economic Theory , vol.39 , pp. 83-96
    • Rubinstein, A.1
  • 26
    • 0030050933 scopus 로고
    • Multiagent reinforcement learning and the iterated Prisoner's Dilemma
    • Sandholm, T. W., and Crites, R. H., 1995, Multiagent reinforcement learning and the iterated Prisoner's Dilemma. Biosystems Journal, 37: 147-166.
    • (1995) Biosystems Journal , vol.37 , pp. 147-166
    • Sandholm, T.W.1    Crites, R.H.2
  • 28
    • 0004342269 scopus 로고
    • Technical Report STAN-CS-TR-94-1511 (CA: Stanford University, Department of Computer Science)
    • Shoham, Y., and Tennenholtz, M., 1994, Co-Learning and the evolution of social activity. Technical Report STAN-CS-TR-94-1511 (CA: Stanford University, Department of Computer Science).
    • (1994) Co-Learning and the Evolution of Social Activity
    • Shoham, Y.1    Tennenholtz, M.2
  • 29
    • 85132026293 scopus 로고
    • Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
    • San Mateo, CA (Morgan Kaufman)
    • Sutton, R. S., 1990, Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. In Proceedings of the 7 th International Conference on Machine Learning, San Mateo, CA (Morgan Kaufman), pp. 216-224.
    • (1990) Proceedings of the 7 Th International Conference on Machine Learning , pp. 216-224
    • Sutton, R.S.1
  • 30


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.