SCOPUS 정보 검색 플랫폼

Volumn 24, Issue 7, 2008, Pages 687-693

A gradient-based reinforcement learning approach to dynamic pricing in partially-observable environments

Author keywords

Dynamic pricing; Grid; Policy gradient; Reinforcement learning

Indexed keywords

ELECTRONIC COMMERCE; GRID COMPUTING; LEARNING ALGORITHMS; PARAMETER ESTIMATION;

COMPUTING JOB; GRID MARKET ENVIRONMENT; MULTIPLE SELLERS;

REINFORCEMENT LEARNING;

EID: 44249093442 PISSN: 0167739X EISSN: None Source Type: Journal
DOI: 10.1016/j.future.2008.02.012 Document Type: Article

Times cited : (19)

References (13)

1
- 0013535965
- Infinite-horizon policy-gradient estimation
- Baxter J., and Bartlett P.L. Infinite-horizon policy-gradient estimation. Journal of Artificial Intelligence Research 15 (2001) 319-350
- (2001) Journal of Artificial Intelligence Research , vol.15 , pp. 319-350
- Baxter, J.¹ Bartlett, P.L.²

2
- 0242708616
- Dynamic pricing in the presence of inventory considerations: Research overview, current practices and future directions
- Elmaghraby W., and Keskinocak P. Dynamic pricing in the presence of inventory considerations: Research overview, current practices and future directions. Management Science 49 10 (2003) 1287-1309
- (2003) Management Science , vol.49 , Issue.10 , pp. 1287-1309
- Elmaghraby, W.¹ Keskinocak, P.²

3
- 34047241684
- C. Li, H. Wang, Y. Zhang, Dynamic pricing decision in a duopolistic retailing market, in: Proceedings of the Sixth World Congress on Intelligent Control and Automation, WCICA, June 2006, pp. 6993-6997
- C. Li, H. Wang, Y. Zhang, Dynamic pricing decision in a duopolistic retailing market, in: Proceedings of the Sixth World Congress on Intelligent Control and Automation, WCICA, June 2006, pp. 6993-6997

4
- 23044485603
- Dynamic pricing models for electronic business
- Narahari Y., Raju C.V.L., Ravikumar K., and Shah S. Dynamic pricing models for electronic business. Sadhana 30 2-3 (2005) 231-256
- (2005) Sadhana , vol.30 , Issue.2-3 , pp. 231-256
- Narahari, Y.¹ Raju, C.V.L.² Ravikumar, K.³ Shah, S.⁴

6
- 84962045565
- M. Sridharan, G.J. Tesauro, Multi-agent Q-learning and regression trees for automated pricing decisions, in: Proceedings of the Seventh International Conference on Machine Learning, 2000, pp. 927-934
- M. Sridharan, G.J. Tesauro, Multi-agent Q-learning and regression trees for automated pricing decisions, in: Proceedings of the Seventh International Conference on Machine Learning, 2000, pp. 927-934

7
- 0004102479
- MIT Press
- Sutton R., and Barto A.G. Reinforcement Learning: An Introduction (1998), MIT Press
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.G.²

8
- 84898939480
- Policy gradient methods for reinforcement learning with function approximation
- Sutton R.S., McAllester D., Singh S., and Mansour Y. Policy gradient methods for reinforcement learning with function approximation. Advances in Neural Information Processing Systems vol. 12 (2000) 1057-1063
- (2000) Advances in Neural Information Processing Systems , vol.12 , pp. 1057-1063
- Sutton, R.S.¹ McAllester, D.² Singh, S.³ Mansour, Y.⁴

9
- 44249105275
- G.J. Tesauro, J.O. Kephart, Pricing in agent economies with multi-agent Q-learning, in: Proceedings of Workshop on Decision Theoretic and Game Theoretic Agents. University College London, London, July 6, 1999
- G.J. Tesauro, J.O. Kephart, Pricing in agent economies with multi-agent Q-learning, in: Proceedings of Workshop on Decision Theoretic and Game Theoretic Agents. University College London, London, July 6, 1999

11
- 33745767064
- A taxonomy of data grids for distributed data sharing, management and processing
- Venugopal S., Buyya R., and Ramamohanarao K. A taxonomy of data grids for distributed data sharing, management and processing. ACM Computing Surveys 38 1 (2006) 1-53
- (2006) ACM Computing Surveys , vol.38 , Issue.1 , pp. 1-53
- Venugopal, S.¹ Buyya, R.² Ramamohanarao, K.³

12
- 0000337576
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- Williams R.J. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning 8 (1992) 229-256
- (1992) Machine Learning , vol.8 , pp. 229-256
- Williams, R.J.¹

13
- 44249121565
- C.S. Yeo, R. Buyya, Pricing for utility-driven resource management and allocation in clusters, in: Proceedings of the 12th International Conference on Advanced Computing and Communication, ADCOM, 2004
- C.S. Yeo, R. Buyya, Pricing for utility-driven resource management and allocation in clusters, in: Proceedings of the 12th International Conference on Advanced Computing and Communication, ADCOM, 2004

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.