SCOPUS Information Search Platform - View Article

Autonomous Agents and Multi-Agent Systems

Volume 5, Issue 3, 2002, Pages 289-304

Pricing in agent economies using multi-agent Q-learning

Authors (2): Tesauro, Gerald ᵃ; Kephart, Jeffrey O. ᵃ

ᵃ IBM T. J. Watson Research Center (United States)

Author keywords

Adaptive multi agent systems; Agent economies; Machine learning; Reinforcement learning

Indexed keywords

AGENT ECONOMIES;

ADAPTIVE SYSTEMS; COMPETITION; DECISION MAKING; LEARNING ALGORITHMS; MARKETING; MULTI AGENT SYSTEMS; OPTIMAL SYSTEMS;

LEARNING SYSTEMS;

EID: 0036274424 PISSN: 13872532 EISSN: None Source Type: Journal
DOI: 10.1023/A:1015504423309 Document Type: Article

Times cited: 125

References (17)

1
- 85156187730
- Improving elevator performance using reinforcement learning
- D. Touretzky et al. (eds.), MIT Press
- R. H. Crites and A. G. Barto, "Improving elevator performance using reinforcement learning," in D. Touretzky et al. (eds.), Advances in Neural Information Processing Systems, MIT Press, 1996, vol. 8, pp. 1017-1023.
- (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1017-1023
- Crites, R.H.¹ Barto, A.G.²

2
- 0004260007
- MIT Press: Cambridge, MA
- D. Fudenberg and J. Tirole, Game Theory, MIT Press: Cambridge, MA, 1991.
- (1991) Game Theory
- Fudenberg, D.¹ Tirole, J.²

3
- 84880673269
- Shopbots and pricebots
- A. Greenwald and J. O. Kephart, "Shopbots and pricebots," to appear in: Proc. IJCAI-99, 1999.
- (1999) Proc. IJCAI-99
- Greenwald, A.¹ Kephart, J.O.²

4
- 0000929496
- Multiagent reinforcement learning: Theoretical framework and an algorithm
- J. Hu and M. P. Wellman, "Multiagent reinforcement learning: theoretical framework and an algorithm," Proc. ICML-98, 1998.
- (1998) Proc. ICML-98
- Hu, J.¹ Wellman, M.P.²

5
- 0041893011
- Price-war dynamics in a free-market economy of software agents
- Los Angeles
- J. O. Kephart, J. E. Hanson, and J. Sairamesh, "Price-war dynamics in a free-market economy of software agents," in Proc. ALIFE-VI, Los Angeles, 1998.
- (1998) Proc. ALIFE-VI
- Kephart, J.O.¹ Hanson, J.E.² Sairamesh, J.³

6
- 0003758853
- Princeton Univ. Press: Princeton, NJ
- D. Kreps, A Course in Microeconomic Theory, Princeton Univ. Press: Princeton, NJ, 1990.
- (1990) A Course in Microeconomic Theory
- Kreps, D.¹

7
- 85149834820
- Markov games as a framework for multi-agent reinforcement learning
- Morgan Kaufmann
- M. L. Littman, "Markov games as a framework for multi-agent reinforcement learning," Proc. Eleventh Int. Conf. Machine Learning, Morgan Kaufmann, 1994, pp. 157-163.
- (1994) Proc. Eleventh Int. Conf. Machine Learning , pp. 157-163
- Littman, M.L.¹

8
- 84981290796
- Dynamics of price and quality differentiation in information and computational markets
- ACM Press
- J. Sairamesh and J. O. Kephart, "Dynamics of price and quality differentiation in information and computational markets," Proc. First Int. Conf. Information and Computation Economics (ICE-98), ACM Press, 1998, pp. 28-36.
- (1998) Proc. First Int. Conf. Information and Computation Economics (ICE-98) , pp. 28-36
- Sairamesh, J.¹ Kephart, J.O.²

9
- 0010623451
- On multiagent Q-Learning in a semi-competitive domain
- Workshop on Adaptation and Learning in Multiagent Systems, Montreal, Canada
- T. W. Sandholm and R. H. Crites, "On multiagent Q-Learning in a semi-competitive domain," 14th Int. Joint Conf. Artificial Intelligence (IJCAI-95) Workshop on Adaptation and Learning in Multiagent Systems, Montreal, Canada, 1995, pp. 71-77.
- (1995) 14th Int. Joint Conf. Artificial Intelligence (IJCAI-95) , pp. 71-77
- Sandholm, T.W.¹ Crites, R.H.²

10
- 84962045565
- Multi-agent Q-learning and regression trees for automated pricing decisions
- to appear
- M. Sridharan and G. Tesauro, "Multi-agent Q-learning and regression trees for automated pricing decisions," Proc. ICML-00, to appear, 2000.
- (2000) Proc. ICML-00
- Sridharan, M.¹ Tesauro, G.²

11
- 0029276036
- Temporal difference learning and TD-Gammon
- G. Tesauro, "Temporal difference learning and TD-Gammon," Comm. of the ACM, vol. 38, no. 3, pp. 58-67, 1995.
- (1995) Comm. of the ACM , vol.38 , Issue.3 , pp. 58-67
- Tesauro, G.¹

12
- 85010804349
- Foresight-based pricing algorithms in an economy of software agents
- ACM Press
- G. J. Tesauro and J. O. Kephart, "Foresight-based pricing algorithms in an economy of software agents," Proc. First Int. Conf. Information and Computation Economics (ICE-98), ACM Press, 1998, pp. 37-44.
- (1998) Proc. First Int. Conf. Information and Computation Economics (ICE-98) , pp. 37-44
- Tesauro, G.J.¹ Kephart, J.O.²

13
- 0141771048
- Foresight-based pricing algorithms in agent economies
- to appear
- G. J. Tesauro and J. O. Kephart, "Foresight-based pricing algorithms in agent economies," Decision Support Sciences, to appear, 1999.
- (1999) Decision Support Sciences
- Tesauro, G.J.¹ Kephart, J.O.²

14
- 0032357654
- Learning nested agent models in an information economy
- to appear
- J. M. Vidal and E. H. Durfee, "Learning nested agent models in an information economy," J. Experimental and Theoretical AI, to appear, 1998.
- (1998) J. Experimental and Theoretical AI
- Vidal, J.M.¹ Durfee, E.H.²

15
- 0004049893
- Ph.D. thesis, Cambridge University
- C. J. C. H. Watkins, "Learning from delayed rewards," Ph.D. thesis, Cambridge University, 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.J.C.H.¹

16
- 34249833101
- Q-learning
- C. J. C. H. Watkins and P. Dayan, "Q-learning," Machine Learning, vol. 8, pp. 279-292, 1992.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

17
- 85156225449
- High-performance job-shop scheduling with a time-delay TD(λ) network
- D. Touretzky et al. (eds.), MIT Press
- W. Zhang and T. G. Dietterich, "High-performance job-shop scheduling with a time-delay TD(λ) network," in D. Touretzky et al. (eds.), Advances in Neural Information Processing Systems, MIT Press, 1996, vol. 8, pp. 1024-1030.
- (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1024-1030
- Zhang, W.¹ Dietterich, T.G.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.