SCOPUS 정보 검색 플랫폼

Journal of Experimental and Theoretical Artificial Intelligence

Volumn 10, Issue 3, 1998, Pages 309-332

Model-based learning of interaction strategies in multi-agent systems

(2) Carmel, David a Markovitch, Shaul a

a Techion (Israel)

Author keywords

Model based learning; Multi agent system

Indexed keywords

EID: 0032344579 PISSN: 0952813X EISSN: 13623079 Source Type: Journal
DOI: 10.1080/095281398146789 Document Type: Article

Times cited : (44)

References (31)

1
- 0000926141
- The structure of Nash equilibrium in repeated games with finite automata
- Abreu, D., and Rubinstein, A., 1988, The structure of Nash equilibrium in repeated games with finite automata. Econometrica, 56(6): 1259-1281.
- (1988) Econometrica , vol.56 , Issue.6 , pp. 1259-1281
- Abreu, D.¹ Rubinstein, A.²

2
- 0018062404
- On the complexity of minimum inference of regular sets
- Angluin, D., 1978, On the complexity of minimum inference of regular sets. Information and Control, 39: 337-350.
- (1978) Information and Control , vol.39 , pp. 337-350
- Angluin, D.¹

3
- 0023453626
- Learning regular sets from queries and counter-examples
- Angluin, D., 1987, Learning regular sets from queries and counter-examples. Information and Computation, 75: 87-106.
- (1987) Information and Computation , vol.75 , pp. 87-106
- Angluin, D.¹

4
- 84936824515
- New York: Basic Books)
- Axelrod, R., 1984, The Evolution of Cooperation (New York: Basic Books).
- (1984) The Evolution of Cooperation
- Axelrod, R.¹

5
- 0029210635
- Learning to act using real-time dynamic programming
- Barto, A. G., Bradtke, S. J., and Singh, S. P., 1995, Learning to act using real-time dynamic programming. Artificial Intelligence, 72(1): 81-138.
- (1995) Artificial Intelligence , vol.72 , Issue.1 , pp. 81-138
- Barto, A.G.¹ Bradtke, S.J.² Singh, S.P.³

6
- 0011471586
- The complexity of computing a best response automaton in repeated games with mixed strategies
- Ben-Porath, E., 1990, The complexity of computing a best response automaton in repeated games with mixed strategies. Games and Economic Behaviour, 2: 1-12.
- (1990) Games and Economic Behaviour , vol.2 , pp. 1-12
- Ben-Porath, E.¹

7
- 0003565779
- New York: Prentice-Hall)
- Bertsekas, D. P., 1987, Dynamic Programming: Deterministic and Stochastic Models (New York: Prentice-Hall).
- (1987) Dynamic Programming: Deterministic and Stochastic Models
- Bertsekas, D.P.¹

8
- 0030349572
- Incorporating opponent models into adversary search
- Portland, Oregon, August
- Carmel, D., and Markovitch, S., 1996a, Incorporating opponent models into adversary search. In Proceedings of Thirteenth National Conference on Artificial Intelligence (AAAI 96), Portland, Oregon, August, pp. 120-125.
- (1996) Proceedings of Thirteenth National Conference on Artificial Intelligence (AAAI 96) , pp. 120-125
- Carmel, D.¹ Markovitch, S.²

9
- 0030365402
- Learning models of intelligent agents
- Portland, Oregon, August
- Carmel, D., and Markovitch, S., 1996b, Learning models of intelligent agents. In Proceedings of Thirteenth National Conference on Artificial Intelligence (AAAI 96), Portland, Oregon, August, pp. 62-67.
- (1996) Proceedings of Thirteenth National Conference on Artificial Intelligence (AAAI 96) , pp. 62-67
- Carmel, D.¹ Markovitch, S.²

10
- 0042413243
- Exploration and adaptation in multi-agent systems: A model-based approach
- Nagoya, Japan, August
- Carmel, D., and Markovitch, S., 1997, Exploration and adaptation in multi-agent systems: a model-based approach. In Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence (IJCAI 97), Nagoya, Japan, August pp. 606-611.
- (1997) Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence (IJCAI 97) , pp. 606-611
- Carmel, D.¹ Markovitch, S.²

11
- 0004116989
- (Cambridge, Mass: M IT Press)
- Cormen, T. H., Leiserson, C. E., and Rivest, R. L., 1990, Introduction to Algorithms (Cambridge, Mass: M IT Press).
- (1990) Introduction to Algorithms
- Cormen, T.H.¹ Leiserson, C.E.² Rivest, R.L.³

12
- 0028062304
- Optimality and domination in repeated games with bounded players
- Montreal, Quebec (ACM Press)
- Fortnow, L., and Whang, D., 1994, Optimality and domination in repeated games with bounded players. In Proceedings of the 26th Annual ACM Symposium on Theory and Computing, Montreal, Quebec (ACM Press), pp. 741-749.
- (1994) Proceedings of the 26Th Annual ACM Symposium on Theory and Computing , pp. 741-749
- Fortnow, L.¹ Whang, D.²

13
- 0027307379
- Efficient learning of typical finite automata from random walls
- San Diego, CA, May (ACM Press)
- Freund, Y., Kearns, M., Ron, D., Rubinfeld, R., Schapire, R. E., and Sellie, L., 1993, Efficient learning of typical finite automata from random walls. In Proceedings of the 25th Annual ACM Symposium on Theory and Computing, San Diego, CA, May (ACM Press), pp. 315-324.
- (1993) Proceedings of the 25Th Annual ACM Symposium on Theory and Computing , pp. 315-324
- Freund, Y.¹ Kearns, M.² Ron, D.³ Rubinfeld, R.⁴ Schapire, R.E.⁵ Sellie, L.⁶

14
- 38249006045
- Bounded versus unbounded rationality: The tyranny of the weak
- Gilboa, I., and Samet, R. E., 1989, Bounded versus unbounded rationality: The tyranny of the weak. Games and Economic Behaviour, 1: 213-221.
- (1989) Games and Economic Behaviour , vol.1 , pp. 213-221
- Gilboa, I.¹ Samet, R.E.²

15
- 0001187706
- Complexity of automaton identification from given data
- Gold, E. M., 1978, Complexity of automaton identification from given data. Information and Control, 37: 301-320.
- (1978) Information and Control , vol.37 , pp. 301-320
- Gold, E.M.¹

16
- 0003620778
- Mass: Addison-Wesley
- Hopcroft, J. E., and Ullman, J. D., 1979, Introduction to Automata Theory, Languages and Computation (Mass: Addison-Wesley), 1979.
- (1979) Introduction to Automata Theory, Languages and Computation , pp. 1979
- Hopcroft, J.E.¹ Ullman, J.D.²

17
- 0030350177
- Learning to take actions
- Portland, Oregon
- Khardon, R., 1996, Learning to take actions. In Proceeding of the Thirteenth National Conference on Artificial Intelligence (AAAI-96), Portland, Oregon, pp. 787-792.
- (1996) In Proceeding of the Thirteenth National Conference on Artificial Intelligence (AAAI-96) , pp. 787-792
- Khardon, R.¹

18
- 0011514818
- Computable strategies for repeated Prisoner's Dilemma
- Knoblauch, V., 1994, Computable strategies for repeated Prisoner's Dilemma. Games and Economic Behaviour, 7: 381-389.
- (1994) Games and Economic Behaviour , vol.7 , pp. 381-389
- Knoblauch, V.¹

19
- 84949966497
- Learn your opponent's strategy (In polynomial time)
- G. Weiß and S. Sen, Berlin: Springer-Verlag
- Mor, V., Goldman, C. V., and Rosenschein, J. S., 1996, Learn your opponent's strategy (in polynomial time). In G. Weiß and S. Sen (eds) Adaptation and Learning in Multi-agent Systems, Lecture Notes in AI (Berlin: Springer-Verlag).
- (1996) Adaptation and Learning in Multi-Agent Systems, Lecture Notes in AI
- Mor, V.¹ Goldman, C.V.² Rosenschein, J.S.³

20
- 0000948830
- On players with a bounded number of states
- Papadimitriou, C. H., 1992, On players with a bounded number of states. Games and Economic Behaviour, 4: 122-131.
- (1992) Games and Economic Behaviour , vol.4 , pp. 122-131
- Papadimitriou, C.H.¹

21
- 0000977910
- The complexity of Markov decision processes
- Papadimitriou, C. H., and Tsitsiklis, J. N., 1987, The complexity of Markov decision processes. Mathematics of Operations Research, 12(3): 441-450.
- (1987) Mathematics of Operations Research , vol.12 , Issue.3 , pp. 441-450
- Papadimitriou, C.H.¹ Tsitsiklis, J.N.²

22
- 85037549689
- Inductive inference, DFAs and computational complexity
- K. P. Jantke, (ed.), Berlin: Springer-Verlag
- Pitt, L., 1989, Inductive inference, DFAs and computational complexity. In K. P. Jantke, (ed.) Analogical and Inductive Inference, Lecture Notes in AI, Vol. 397 (Berlin: Springer-Verlag), pp. 18-44.
- (1989) Analogical and Inductive Inference, Lecture Notes in AI , vol.397 , pp. 18-44
- Pitt, L.¹

23
- 0001349185
- Inference of finite automata using homing sequences
- Rivest, R. L., and Schapire, R. E., 1993, Inference of finite automata using homing sequences. Information and Computation, 103(2): 299-347.
- (1993) Information and Computation , vol.103 , Issue.2 , pp. 299-347
- Rivest, R.L.¹ Schapire, R.E.²

24
- 0041410936
- Exactly learning automata with small cover time
- Santa Cruz, Ca
- Ron, D., and Rubinfeled, R., 1995, Exactly learning automata with small cover time. In Proceedings of the Eighth Annual ACM Conference on Computational Learning Theory, Santa Cruz, Ca, pp. 427-436.
- (1995) Proceedings of the Eighth Annual ACM Conference on Computational Learning Theory , pp. 427-436
- Ron, D.¹ Rubinfeled, R.²

25
- 46149134052
- Finite automata play the repeated Prisoner's Dilemma
- Rubinstein, A., 1986, Finite automata play the repeated Prisoner's Dilemma. Journal of Economic Theory, 39: 83-96.
- (1986) Journal of Economic Theory , vol.39 , pp. 83-96
- Rubinstein, A.¹

26
- 0030050933
- Multiagent reinforcement learning and the iterated Prisoner's Dilemma
- Sandholm, T. W., and Crites, R. H., 1995, Multiagent reinforcement learning and the iterated Prisoner's Dilemma. Biosystems Journal, 37: 147-166.
- (1995) Biosystems Journal , vol.37 , pp. 147-166
- Sandholm, T.W.¹ Crites, R.H.²

27
- 0028555752
- Learning to coordinate without sharing information
- Seattle, Washington
- Sen, S., Sekaran, M., and Hale, J., 1994, Learning to coordinate without sharing information. In Proceeding of the Twelfth National Conference on Artificial Intelligence (AAAI-94), Seattle, Washington, pp. 426-431.
- (1994) In Proceeding of the Twelfth National Conference on Artificial Intelligence (AAAI-94) , pp. 426-431
- Sen, S.¹ Sekaran, M.² Hale, J.³

28
- 0004342269
- Technical Report STAN-CS-TR-94-1511 (CA: Stanford University, Department of Computer Science)
- Shoham, Y., and Tennenholtz, M., 1994, Co-Learning and the evolution of social activity. Technical Report STAN-CS-TR-94-1511 (CA: Stanford University, Department of Computer Science).
- (1994) Co-Learning and the Evolution of Social Activity
- Shoham, Y.¹ Tennenholtz, M.²

29
- 85132026293
- Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
- San Mateo, CA (Morgan Kaufman)
- Sutton, R. S., 1990, Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. In Proceedings of the 7 th International Conference on Machine Learning, San Mateo, CA (Morgan Kaufman), pp. 216-224.
- (1990) Proceedings of the 7 Th International Conference on Machine Learning , pp. 216-224
- Sutton, R.S.¹

30
- 34249833101
- Technical notes: Q-learning
- Watkins, C. J. C. H., and P. Dayan, 1992, Technical notes: Q-learning. Machine Learning, 8: 279-292.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

31
- 0003782395
- Berlin: Springer-Verlag
- Weiß, G., and Sen, S., 1996, Adaptation and Learning in Multi-agent Systems, Lecture Notes in AI, Vol. 1042 (Berlin: Springer-Verlag).
- (1996) Adaptation and Learning in Multi-Agent Systems, Lecture Notes in AI , pp. 1042
- Weiß, G.¹ Sen, S.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.