메뉴 건너뛰기




Volumn 24, Issue 7, 2008, Pages 687-693

A gradient-based reinforcement learning approach to dynamic pricing in partially-observable environments

Author keywords

Dynamic pricing; Grid; Policy gradient; Reinforcement learning

Indexed keywords

ELECTRONIC COMMERCE; GRID COMPUTING; LEARNING ALGORITHMS; PARAMETER ESTIMATION;

EID: 44249093442     PISSN: 0167739X     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.future.2008.02.012     Document Type: Article
Times cited : (19)

References (13)
  • 2
    • 0242708616 scopus 로고    scopus 로고
    • Dynamic pricing in the presence of inventory considerations: Research overview, current practices and future directions
    • Elmaghraby W., and Keskinocak P. Dynamic pricing in the presence of inventory considerations: Research overview, current practices and future directions. Management Science 49 10 (2003) 1287-1309
    • (2003) Management Science , vol.49 , Issue.10 , pp. 1287-1309
    • Elmaghraby, W.1    Keskinocak, P.2
  • 3
    • 34047241684 scopus 로고    scopus 로고
    • C. Li, H. Wang, Y. Zhang, Dynamic pricing decision in a duopolistic retailing market, in: Proceedings of the Sixth World Congress on Intelligent Control and Automation, WCICA, June 2006, pp. 6993-6997
    • C. Li, H. Wang, Y. Zhang, Dynamic pricing decision in a duopolistic retailing market, in: Proceedings of the Sixth World Congress on Intelligent Control and Automation, WCICA, June 2006, pp. 6993-6997
  • 4
    • 23044485603 scopus 로고    scopus 로고
    • Dynamic pricing models for electronic business
    • Narahari Y., Raju C.V.L., Ravikumar K., and Shah S. Dynamic pricing models for electronic business. Sadhana 30 2-3 (2005) 231-256
    • (2005) Sadhana , vol.30 , Issue.2-3 , pp. 231-256
    • Narahari, Y.1    Raju, C.V.L.2    Ravikumar, K.3    Shah, S.4
  • 5
    • 33644903192 scopus 로고    scopus 로고
    • Learning dynamic prices in multi-seller electronic markets with price sensitive customers, stochastic demands, and inventory replenishments
    • Game-theoretic Analysis and Stochastic Simulation of Negotiation Agents (Special Issue)
    • Raju C.V.L., Narahari Y., and Ravi Kumar K. Learning dynamic prices in multi-seller electronic markets with price sensitive customers, stochastic demands, and inventory replenishments. Game-theoretic Analysis and Stochastic Simulation of Negotiation Agents. IEEE Transactions on Systems, Man, and Cybernetics, Part C 36 1 (2006) 92-106 (Special Issue)
    • (2006) IEEE Transactions on Systems, Man, and Cybernetics, Part C , vol.36 , Issue.1 , pp. 92-106
    • Raju, C.V.L.1    Narahari, Y.2    Ravi Kumar, K.3
  • 6
    • 84962045565 scopus 로고    scopus 로고
    • M. Sridharan, G.J. Tesauro, Multi-agent Q-learning and regression trees for automated pricing decisions, in: Proceedings of the Seventh International Conference on Machine Learning, 2000, pp. 927-934
    • M. Sridharan, G.J. Tesauro, Multi-agent Q-learning and regression trees for automated pricing decisions, in: Proceedings of the Seventh International Conference on Machine Learning, 2000, pp. 927-934
  • 9
    • 44249105275 scopus 로고    scopus 로고
    • G.J. Tesauro, J.O. Kephart, Pricing in agent economies with multi-agent Q-learning, in: Proceedings of Workshop on Decision Theoretic and Game Theoretic Agents. University College London, London, July 6, 1999
    • G.J. Tesauro, J.O. Kephart, Pricing in agent economies with multi-agent Q-learning, in: Proceedings of Workshop on Decision Theoretic and Game Theoretic Agents. University College London, London, July 6, 1999
  • 10
    • 33745604967 scopus 로고    scopus 로고
    • Adaptive utility-based scheduling in resource-constrained systems
    • 18th Australian Joint Conference on Artificial Intelligence, December 5-9, Sydney, Australia, 2005
    • Vengerov D. Adaptive utility-based scheduling in resource-constrained systems. 18th Australian Joint Conference on Artificial Intelligence, December 5-9, Sydney, Australia, 2005. Springer-Verlag Lecture Notes in Computer Science vol. 3809 (2005) 477-488
    • (2005) Springer-Verlag Lecture Notes in Computer Science , vol.3809 , pp. 477-488
    • Vengerov, D.1
  • 11
    • 33745767064 scopus 로고    scopus 로고
    • A taxonomy of data grids for distributed data sharing, management and processing
    • Venugopal S., Buyya R., and Ramamohanarao K. A taxonomy of data grids for distributed data sharing, management and processing. ACM Computing Surveys 38 1 (2006) 1-53
    • (2006) ACM Computing Surveys , vol.38 , Issue.1 , pp. 1-53
    • Venugopal, S.1    Buyya, R.2    Ramamohanarao, K.3
  • 12
    • 0000337576 scopus 로고
    • Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • Williams R.J. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning 8 (1992) 229-256
    • (1992) Machine Learning , vol.8 , pp. 229-256
    • Williams, R.J.1
  • 13
    • 44249121565 scopus 로고    scopus 로고
    • C.S. Yeo, R. Buyya, Pricing for utility-driven resource management and allocation in clusters, in: Proceedings of the 12th International Conference on Advanced Computing and Communication, ADCOM, 2004
    • C.S. Yeo, R. Buyya, Pricing for utility-driven resource management and allocation in clusters, in: Proceedings of the 12th International Conference on Advanced Computing and Communication, ADCOM, 2004


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.