Volume 36, Issue 1, 2006, Pages 92-106

Learning dynamic prices in multiseller electronic retail markets with price sensitive customers, stochastic demands, and inventory replenishments

Author keywords

Dynamic pricing; Inventory replenishments; Markovian game; Multi agent learning; Online retail markets; Price sensitive customers; Reinforcement learning (RL); Stochastic demands

Indexed keywords

ELECTRONIC COMMERCE; GAME THEORY; INVENTORY CONTROL; MARKETING; MARKOV PROCESSES; MULTI AGENT SYSTEMS; SALES;

EID: 33644903192     PISSN: 10946977     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSMCC.2005.860578     Document Type: Article
Times cited: 43

References (44)
  • 2
    • 18744371204
    • "Reinforcement learning in Markovian evolutionary games"
    • Denver, Colorado, USA
    • V. S. Borkar, "Reinforcement learning in Markovian evolutionary games," Adv. Complex Syst., vol. 5, pp. 55-72, Denver, Colorado, USA, 2002.
    • (2002) Adv. Complex Syst. , vol.5 , pp. 55-72
    • Borkar, V.S.1
  • 4
    • 23044507058
    • "Dynamic pricing and reinforcement learning"
    • Department of Statistics University of British Columbia, Vancouver, Canada, Tech. Rep
    • A. Carvalho and M. Puterman, "Dynamic pricing and reinforcement learning," Department of Statistics, University of British Columbia, Vancouver, Canada, Tech. Rep., 2003.
    • (2003)
    • Carvalho, A.1    Puterman, M.2
  • 6
    • 0242706243
    • "Dynamic pricing strategies for manufacturing with stochastic demand and discretionary sales"
    • Northwestern University, Evanston, IL, USA, Tech. Rep
    • L. Chan, D. Simchi-Levi, and J. Swann, "Dynamic pricing strategies for manufacturing with stochastic demand and discretionary sales," Industrial Engineering and Operations Research, Northwestern University, Evanston, IL, USA, Tech. Rep., 2002.
    • (2002) Industrial Engineering and Operations Research
    • Chan, L.1    Simchi-Levi, D.2    Swann, J.3
  • 7
    • 84957011065
    • "Dynamic pricing with limited competitor information in a multi-agent economy"
    • Presented at the Conf. Cooperative Information Systems [Online]. Available: citeseer.nj.nec.com/dasgupta01dynamic.html
    • P. Dasgupta and R. Das, "Dynamic pricing with limited competitor information in a multi-agent economy," presented at the Conf. Cooperative Information Systems, pp. 299-310, 2000. [Online]. Available: citeseer.nj.nec.com/dasgupta01dynamic.html
    • (2000) , pp. 299-310
    • Dasgupta, P.1    Das, R.2
  • 10
    • 0242708616
    • "Dynamic pricing: Research overview, current practices and future directions"
    • W. Elmaghraby and P. Keskinocak, "Dynamic pricing: Research overview, current practices and future directions," Management Science, vol. 49, no. 10, pp. 1287-1309.
    • Management Science , vol.49 , Issue.10 , pp. 1287-1309
    • Elmaghraby, W.1    Keskinocak, P.2
  • 11
    • 0001381152
    • "On n-person stochastic games with denumerable state space"
    • A. Federgruen, "On n-person stochastic games with denumerable state space," Adv. Appl. Probability, vol. 10, pp. 452-471, 1978.
    • (1978) Adv. Appl. Probability , vol.10 , pp. 452-471
    • Federgruen, A.1
  • 12
    • 0032674986
    • "Combined pricing and inventory control under uncertainty"
    • A. Federgruen and A. Heching, "Combined pricing and inventory control under uncertainty," Oper. Res., vol. 47, pp. 454-475, 1999.
    • (1999) Oper. Res. , vol.47 , pp. 454-475
    • Federgruen, A.1    Heching, A.2
  • 13
    • 0028480132
    • "Optimal dynamic pricing of inventories with stochastic demand over finite horizons"
    • G. Gallego and G. van Ryzin, "Optimal dynamic pricing of inventories with stochastic demand over finite horizons," Manage. Sci., vol. 40, no. 8, pp. 999-1020, 1994.
    • (1994) Manage. Sci. , vol.40 , Issue.8 , pp. 999-1020
    • Gallego, G.1    van Ryzin, G.2
  • 14
    • 0038344813
    • "Adaptive strategies for price markdown in a multi-unit descending price auction: A comparative study"
    • M. Gupta, K. Ravikumar, and M. Kumar, "Adaptive strategies for price markdown in a multi-unit descending price auction: A comparative study," in Proc. IEEE Conf. Systems, Man, and Cybernetics, 2002, pp. 373-378.
    • (2002) Proc. IEEE Conf. Systems, Man, and Cybernetics , pp. 373-378
    • Gupta, M.1    Ravikumar, K.2    Kumar, M.3
  • 15
    • 2942744741
    • "Uncoupled dynamics do not lead to Nash equilibrium"
    • S. Hart and A. Mas-Colell, "Uncoupled dynamics do not lead to Nash equilibrium," Amer. Econ. Rev., vol. 93, pp. 1830-1836, 2003.
    • (2003) Amer. Econ. Rev. , vol.93 , pp. 1830-1836
    • Hart, S.1    Mas-Colell, A.2
  • 16
    • 33644899291
    • "Equilibrium price dispersion with consumer inventories"
    • P. Hong, R. P. McAfee, and A. Nayyar, "Equilibrium price dispersion with consumer inventories," J. Econ. Theory, vol. 22, pp. 1-15, 2001.
    • (2001) J. Econ. Theory , vol.22 , pp. 1-15
    • Hong, P.1    McAfee, P.2    Nayyar, A.3
  • 17
    • 0000929496
    • "Multiagent reinforcement learning: Theoretical framework and an algorithm"
    • San Francisco, CA, [Online]. Available: citeseer.nj.nec.com/hu98multiagent.html
    • J. Hu and M. P. Wellman, "Multiagent reinforcement learning: Theoretical framework and an algorithm," in Proc. 15th Int. Conf. Machine Learning, San Francisco, CA, 1998, pp. 242-250. [Online]. Available: citeseer.nj.nec.com/hu98multiagent.html
    • (1998) Proc. 15th Int. Conf. Machine Learning , pp. 242-250
    • Hu, J.1    Wellman, M.P.2
  • 18
    • 23044437621
    • "Online reinforcement learning in multiagent systems"
    • William E. Simon Graduate School of Business Administration, University of Rochester, Rochester, NY, USA, Tech. Rep
    • J. Hu and Y. Zhang, "Online reinforcement learning in multiagent systems," William E. Simon Graduate School of Business Administration, University of Rochester, Rochester, NY, USA, Tech. Rep., 2002.
    • (2002)
    • Hu, J.1    Zhang, Y.2
  • 20
    • 0012826632
    • "Pseudo-convergent Q-learning by competitive pricebots"
    • San Francisco, CA, [Online]. Available: citeseer.nj.nec.com/306829.html
    • J. O. Kephart and G. J. Tesauro, "Pseudo-convergent Q-learning by competitive pricebots," in Proc. 17th Int. Conf. Machine Learning, San Francisco, CA, 2000, pp. 463-470. [Online]. Available: citeseer.nj.nec.com/306829.html
    • (2000) Proc. 17th Int. Conf. Machine Learning , pp. 463-470
    • Kephart, J.O.1    Tesauro, G.J.2
  • 21
    • 0343893613
    • "Actor-critic type learning algorithms for markov decision processes"
    • V. R. Konda and V. S. Borkar, "Actor-critic type learning algorithms for markov decision processes," SIAM J. Control Optim., vol. 38, pp. 94-123, 1999.
    • (1999) SIAM J. Control Optim. , vol.38 , pp. 94-123
    • Konda, V.R.1    Borkar, V.S.2
  • 22
    • 0024031728
    • "The newsboy problem with price-dependent demand distribution"
    • A. Lau and H. Lau, "The newsboy problem with price-dependent demand distribution," IIE Trans., vol. 20, pp. 168-175, 1988.
    • (1988) IIE Trans. , vol.20 , pp. 168-175
    • Lau, A.1    Lau, H.2
  • 23
    • 33644931035
    • "A machine-learning approach to optimal bid pricing"
    • IBM, Research Report, Jul
    • R. Lawrence, "A machine-learning approach to optimal bid pricing," IBM, Research Report, Jul. 2002.
    • (2002)
    • Lawrence, R.1
  • 24
    • 0346913265
    • "Convergent multiple-timescales reinforcement learning algorithms in normal form games"
    • D. Leslie and E. Collins, "Convergent multiple-timescales reinforcement learning algorithms in normal form games," Ann. Appl. Probability, vol. 13, pp. 1231-1251, 2003.
    • (2003) Ann. Appl. Probability , vol.13 , pp. 1231-1251
    • Leslie, D.1    Collins, E.2
  • 25
    • 85149834820
    • "Markov games as a framework for multi-agent reinforcement learning"
    • San Francisco, CA, USA
    • M. L. Littman, "Markov games as a framework for multi-agent reinforcement learning," in Proc. 11th Int. Conf. Machine Learning, San Francisco, CA, USA, 1994, pp. 157-163.
    • (1994) Proc. 11th Int. Conf. Machine Learning , pp. 157-163
    • Littman, M.L.1
  • 26
    • 0242466944
    • "Friend-or-foe q-learning in general-sum games"
    • Williamstown, MA, USA
    • M. L. Littman, "Friend-or-foe q-learning in general-sum games," in Proc. 18th Int. Conf. Machine Learning, Williamstown, MA, USA, 2001, pp. 322-328.
    • (2001) Proc. 18th Int. Conf. Machine Learning , pp. 322-328
    • Littman, M.L.1
  • 27
    • 0032642848
    • "Revenue management: Research overview and prospects"
    • J. McGill and G. van Ryzin, "Revenue management: Research overview and prospects," Transport. Sci., vol. 33, no. 2, pp. 233-256, 1999.
    • (1999) Transport. Sci. , vol.33 , Issue.2 , pp. 233-256
    • McGill, J.1    van Ryzin, G.2
  • 28
    • 23044435348
    • "Multi-agent learning for dynamic pricing games of service markets"
    • K. Ravikumar, G. Batra, and R. Saluja, "Multi-agent learning for dynamic pricing games of service markets," Communicated, 2002.
    • (2002) Communicated
    • Ravikumar, K.1    Batra, G.2    Saluja, R.3
  • 29
    • 0000561934
    • "The theory of sales: A simple model of equilibrium price dispersion with identical agents"
    • S. Salop and J. Stiglitz, "The theory of sales: A simple model of equilibrium price dispersion with identical agents," Amer. Econ. Rev., vol. 72, no. 5, pp. 1121-1130, 1982.
    • (1982) Amer. Econ. Rev. , vol.72 , Issue.5 , pp. 1121-1130
    • Salop, S.1    Stiglitz, J.2
  • 31
    • 0040030247
    • "E-commerce and operations research in airline planning, marketing, and distribution"
    • B. Smith, D. Gunther, B. Rao, and R. Ratliff, "E-commerce and operations research in airline planning, marketing, and distribution," Interfaces, vol. 31, no. 2, 2001.
    • (2001) Interfaces , vol.31 , Issue.2
    • Smith, B.1    Gunther, D.2    Rao, B.3    Ratliff, R.4
  • 33
    • 84962045565
    • "Multi-agent q-learning and regression trees for automated pricing decisions"
    • San Francisco, CA
    • M. Sridharan and G. J. Tesauro, "Multi-agent q-learning and regression trees for automated pricing decisions," in Proc. 17th Int. Conf. Machine Learning, San Francisco, CA, 2000.
    • (2000) Proc. 17th Int. Conf. Machine Learning
    • Sridharan, M.1    Tesauro, G.J.2
  • 34
    • 0002734011
    • "The economics of information"
    • G. Stigler, "The economics of information," J. Political Econ., vol. 69, pp. 213-225, 1961.
    • (1961) J. Political Econ. , vol.69 , pp. 213-225
    • Stigler, G.1
  • 35
    • 0000604783
    • "Equilibrium in product markets with imperfect information"
    • J. Stiglitz, "Equilibrium in product markets with imperfect information," Amer. Econ. Rev. Proc., vol. 69, pp. 339-345, 1979.
    • (1979) Amer. Econ. Rev. Proc. , vol.69 , pp. 339-345
    • Stiglitz, J.1
  • 36
    • 2442544666
    • "Dynamic pricing models to improve supply chain performance"
    • Ph.D. Dissertation, IEMS, Northwestern University, Evanston, IL, USA
    • J. Swann, "Dynamic pricing models to improve supply chain performance," Ph.D. Dissertation, IEMS, Northwestern University, Evanston, IL, USA, 2001.
    • (2001)
    • Swann, J.1
  • 37
    • 23044509119
    • "Flexible pricing policies: Introduction and a survey of implementation in various industries"
    • General Motors Corporation, Contract Report # CR-99/04/ESL, Oct
    • J. Swann, "Flexible pricing policies: Introduction and a survey of implementation in various industries," General Motors Corporation, Contract Report # CR-99/04/ESL, Oct. 1999.
    • (1999)
    • Swann, J.1
  • 40
    • 0016553601
    • "A dynamic nonstationary inventory problem for a price/quantity setting firm"
    • G. Thowsen, "A dynamic nonstationary inventory problem for a price/quantity setting firm," Naval Res. Logistics Quart., vol. 22, pp. 461-476, 1975.
    • (1975) Naval Res. Logistics Quart. , vol.22 , pp. 461-476
    • Thowsen, G.1
  • 41
    • 0000878787
    • "A model of sales"
    • H. R. Varian, "A model of sales," Amer. Econ. Rev., pp. 651-659, 1980.
    • (1980) Amer. Econ. Rev. , pp. 651-659
    • Varian, H.R.1
  • 42
    • 0003240094
    • "Differential pricing and efficiency"
    • [Online]. Available: www.firstmonday.dk
    • H. R. Varian, "Differential pricing and efficiency," First Monday, vol. 1, 1996. [Online]. Available: www.firstmonday.dk.
    • (1996) First Monday , vol.1
    • Varian, H.R.1
  • 44
    • 0043034832
    • "Monopoly and uncertainty"
    • E. Zabel, "Monopoly and uncertainty," Rev. Econ. Studies, vol. 37, pp. 205-219, 1970.
    • (1970) Rev. Econ. Studies , vol.37 , pp. 205-219
    • Zabel, E.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.