SCOPUS Information Retrieval Platform

IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews

Volume 36, Issue 1, 2006, Pages 92-106

Learning dynamic prices in multiseller electronic retail markets with price sensitive customers, stochastic demands, and inventory replenishments

(3) Raju Chinthalapati, V. L. a; Yadati, Narahari b; Karumanchi, Ravikumar c,d

a LONDON SCHOOL OF ECONOMICS (United Kingdom)

b INDIAN INSTITUTE OF SCIENCE (India)

c IBM INDIA RESEARCH LAB (India)

d GENERAL MOTORS CORPORATION (United States)

Author keywords

Dynamic pricing; Inventory replenishments; Markovian game; Multi agent learning; Online retail markets; Price sensitive customers; Reinforcement learning (RL); Stochastic demands

Indexed keywords

ELECTRONIC COMMERCE; GAME THEORY; INVENTORY CONTROL; MARKETING; MARKOV PROCESSES; MULTI AGENT SYSTEMS; SALES;

DYNAMIC PRICING; INVENTORY REPLENISHMENTS; MARKOVIAN GAME; MULTI-AGENT LEARNING; ONLINE RETAIL MARKETS; REINFORCEMENT LEARNING (RL); STOCHASTIC DEMANDS;

LEARNING SYSTEMS;

EID: 33644903192 PISSN: 10946977 EISSN: None Source Type: Journal
DOI: 10.1109/TSMCC.2005.860578 Document Type: Article

Times cited : (43)

References (44)

1
- 33644898019
- "Strategic pricebot dynamics"
- Denver, Colorado, USA
- G. A. J. O. Kephart and G. J. Tesauro, "Strategic pricebot dynamics," in Proc. 1st ACM Conf. Electronic Commerce EC-99, Denver, Colorado, USA, 1999.
- (1999) Proc. 1st ACM Conf. Electronic Commerce EC-99
- Kephart, G.A.J.O.¹ Tesauro, G.J.²

2
- 18744371204
- "Reinforcement learning in Markovian evolutionary games"
- Denver, Colorado, USA
- V. S. Borkar, "Reinforcement learning in Markovian evolutionary games," Adv. Complex Syst., vol. 5, pp. 55-72, Denver, Colorado, USA, 2002.
- (2002) Adv. Complex Syst. , vol.5 , pp. 55-72
- Borkar, V.S.¹

3
- 0011979798
- "Automated strategy searches in an electronic goods market: Learning and complex price schedules"
- C. Brooks, R. Fay, R. Das, J. K. MacKie-Mason, J. Kephart, and E. Durfee, "Automated strategy searches in an electronic goods market: learning and complex price schedules," in Proc. 1st ACM Conf. Electronic Commerce EC-99, 1999, pp. 31-40.
- (1999) Proc. 1st ACM Conf. Electronic Commerce EC-99 , pp. 31-40
- Brooks, C.¹ Fay, R.² Das, R.³ MacKie-Mason, J.K.⁴ Kephart, J.⁵ Durfee, E.⁶

4
- 23044507058
- "Dynamic pricing and reinforcement learning"
- Department of Statistics University of British Columbia, Vancouver, Canada, Tech. Rep
- A. Carvalho and M. Puterman, "Dynamic pricing and reinforcement learning," Department of Statistics, University of British Columbia, Vancouver, Canada, Tech. Rep., 2003.
- (2003)
- Carvalho, A.¹ Puterman, M.²

5
- 33644913867
- "Review of dynamic and online pricing research to improve supply chain performance"
- Kluwer, Norwell, MA
- L. Chan, Z. J. M. Shen, D. Simchi-Levi, and J. Swann, "Review of dynamic and online pricing research to improve supply chain performance," Handbook on Supply Chain Analysis in the eBusiness Era, Kluwer, Norwell, MA, 2001.
- (2001) Handbook on Supply Chain Analysis in the EBusiness Era
- Chan, L.¹ Shen, Z.J.M.² Simchi-Levi, D.³ Swann, J.⁴

6
- 0242706243
- "Dynamic pricing strategies for manufacturing with stochastic demand and discretionary sales"
- Northwestern University, Evanston, IL, USA, Tech. Rep
- L. Chan, D. Simchi-Levi, and J. Swann, "Dynamic pricing strategies for manufacturing with stochastic demand and discretionary sales," Industrial Engineering and Operations Research, Northwestern University, Evanston, IL, USA, Tech. Rep., 2002.
- (2002) Industrial Engineering and Operations Research
- Chan, L.¹ Simchi-Levi, D.² Swann, J.³

7
- 84957011065
- "Dynamic pricing with limited competitor information in a multi-agent economy"
- Presented at the Conf. Cooperative Information Systems [Online]. Available: citeseer.nj.nec.com/dasgupta01dynamic.html
- P. Dasgupta and R. Das, "Dynamic pricing with limited competitor information in a multi-agent economy," presented at the Conf. Cooperative Information Systems, pp. 299-310, 2000. [Online]. Available: citeseer.nj.nec.com/dasgupta01dynamic.html
- (2000) , pp. 299-310
- Dasgupta, P.¹ Das, R.²

8
- 23044515505
- "Dynamic pricing strategies under a finite time horizon"
- J. DiMicco, A. Greenwald, and P. Maes, "Dynamic pricing strategies under a finite time horizon," in Proc. 3rd ACM Conf. Electronic Commerce EC-01, 2001, pp. 51-60.
- (2001) Proc. 3rd ACM Conf. Electronic Commerce EC-01 , pp. 51-60
- DiMicco, J.¹ Greenwald, A.² Maes, P.³

9
- 23044440169
- [Online]. Available: citeseer.nj.nec.com/563289.html
- J. M. DiMicco, A. Greenwald, and P. Maes, Learning Curve: A Simulation-Based Approach to Dynamic Pricing, 2002. [Online]. Available: citeseer.nj.nec.com/563289.html
- (2002) Learning Curve: A Simulation-Based Approach to Dynamic Pricing
- DiMicco, J.M.¹ Greenwald, A.² Maes, P.³

10
- 0242708616
- "Dynamic pricing: Research overview, current practices and future directions"
- W. Elmaghraby and P. Keskinocak, "Dynamic pricing: Research overview, current practices and future directions," Management Science, vol. 49, no. 10, pp. 1287-1309.
- Management Science , vol.49 , Issue.10 , pp. 1287-1309
- Elmaghraby, W.¹ Keskinocak, P.²

11
- 0001381152
- "On n-person stochastic games with denumerable state space"
- A. Federgruen, "On n-person stochastic games with denumerable state space," Adv. Appl. Probability, vol. 10, pp. 452-471, 1978.
- (1978) Adv. Appl. Probability , vol.10 , pp. 452-471
- Federgruen, A.¹

12
- 0032674986
- "Combined pricing and inventory control under uncertainty"
- A. Federgruen and A. Heching, "Combined pricing and inventory control under uncertainty," Oper. Res., vol. 47, pp. 454-475, 1999.
- (1999) Oper. Res. , vol.47 , pp. 454-475
- Federgruen, A.¹ Heching, A.²

13
- 0028480132
- "Optimal dynamic pricing of inventories with stochastic demand over finite horizons"
- G. Gallego and G. van Ryzin, "Optimal dynamic pricing of inventories with stochastic demand over finite horizons," Manage. Sci., vol. 40, no. 8, pp. 999-1020, 1994.
- (1994) Manage. Sci. , vol.40 , Issue.8 , pp. 999-1020
- Gallego, G.¹ van Ryzin, G.²

14
- 0038344813
- "Adaptive strategies for price markdown in a multi-unit descending price auction: A comparative study"
- M. Gupta, K. Ravikumar, and M. Kumar, "Adaptive strategies for price markdown in a multi-unit descending price auction: A comparative study," in Proc. IEEE Conf. Systems, Man, and Cybernetics, 2002, pp. 373-378.
- (2002) Proc. IEEE Conf. Systems, Man, and Cybernetics , pp. 373-378
- Gupta, M.¹ Ravikumar, K.² Kumar, M.³

15
- 2942744741
- "Uncoupled dynamics do not lead to Nash equilibrium"
- S. Hart and A. Mas-Colell, "Uncoupled dynamics do not lead to Nash equilibrium," Amer. Econ. Rev., vol. 93, pp. 1830-1836, 2003.
- (2003) Amer. Econ. Rev. , vol.93 , pp. 1830-1836
- Hart, S.¹ Mas-Colell, A.²

16
- 33644899291
- "Equilibrium price dispersion with consumer inventories"
- P. Hong, R. P. McAfee, and A. Nayyar, "Equilibrium price dispersion with consumer inventories," J. Econ. Theory, vol. 22, pp. 1-15, 2001.
- (2001) J. Econ. Theory , vol.22 , pp. 1-15
- Hong, P.¹ McAfee, P.² Nayyar, A.³

17
- 0000929496
- "Multiagent reinforcement learning: Theoretical framework and an algorithm"
- San Francisco, CA, [Online]. Available: citeseer.nj.nec.com/hu98multiagent.html
- J. Hu and M. P. Wellman, "Multiagent reinforcement learning: theoretical framework and an algorithm," in Proc. 15th Int. Conf. Machine Learning, San Francisco, CA, 1998, pp. 242-250. [Online]. Available: citeseer.nj.nec.com/hu98multiagent.html
- (1998) Proc. 15th Int. Conf. Machine Learning , pp. 242-250
- Hu, J.¹ Wellman, M.P.²

18
- 23044437621
- "Online reinforcement learning in multiagent systems"
- William E. Simon Graduate School of Business Administration, University of Rochester, Rochester, NY, USA, Tech. Rep
- J. Hu and Y. Zhang, "Online reinforcement learning in multiagent systems," William E. Simon Graduate School of Business Administration, University of Rochester, Rochester, NY, USA, Tech. Rep., 2002.
- (2002)
- Hu, J.¹ Zhang, Y.²

19
- 0001835872
- "Prices and optimal inventory policies"
- S. Karlin and C. Carr, "Prices and optimal inventory policies," Studies in Applied Probability Manage. Sci., pp. 159-172, 1962.
- (1962) Studies in Applied Probability Manage. Sci. , pp. 159-172
- Karlin, S.¹ Carr, C.²

20
- 0012826632
- "Pseudo-convergent Q-learning by competitive pricebots"
- San Francisco, CA, [Online]. Available: citeseer.nj.nec.com/306829.html
- J. O. Kephart and G. J. Tesauro, "Pseudo-convergent Q-learning by competitive pricebots," in Proc. 17th Int. Conf. Machine Learning, San Francisco, CA, 2000, pp. 463-470. [Online]. Available: citeseer.nj.nec.com/306829.html
- (2000) Proc. 17th Int. Conf. Machine Learning , pp. 463-470
- Kephart, J.O.¹ Tesauro, G.J.²

21
- 0343893613
- "Actor-critic type learning algorithms for Markov decision processes"
- V. R. Konda and V. S. Borkar, "Actor-critic type learning algorithms for Markov decision processes," SIAM J. Control Optim., vol. 38, pp. 94-123, 1999.
- (1999) SIAM J. Control Optim. , vol.38 , pp. 94-123
- Konda, V.R.¹ Borkar, V.S.²

22
- 0024031728
- "The newsboy problem with price-dependent demand distribution"
- A. Lau and H. Lau, "The newsboy problem with price-dependent demand distribution," IIE Trans., vol. 20, pp. 168-175, 1988.
- (1988) IIE Trans. , vol.20 , pp. 168-175
- Lau, A.¹ Lau, H.²

23
- 33644931035
- "A machine-learning approach to optimal bid pricing"
- IBM, Research Report, Jul
- R. Lawrence, "A machine-learning approach to optimal bid pricing," IBM, Research Report, Jul. 2002.
- (2002)
- Lawrence, R.¹

24
- 0346913265
- "Convergent multiple-timescales reinforcement learning algorithms in normal form games"
- D. Leslie and E. Collins, "Convergent multiple-timescales reinforcement learning algorithms in normal form games," Ann. Appl. Probability, vol. 13, pp. 1231-1251, 2003.
- (2003) Ann. Appl. Probability , vol.13 , pp. 1231-1251
- Leslie, D.¹ Collins, E.²

25
- 85149834820
- "Markov games as a framework for multi-agent reinforcement learning"
- San Francisco, CA, USA
- M. L. Littman, "Markov games as a framework for multi-agent reinforcement learning," in Proc. 11th Int. Conf. Machine Learning, San Francisco, CA, USA, 1994, pp. 157-163.
- (1994) Proc. 11th Int. Conf. Machine Learning , pp. 157-163
- Littman, M.L.¹

26
- 0242466944
- "Friend-or-foe q-learning in general-sum games"
- Williamstown, MA, USA
- M. L. Littman, "Friend-or-foe q-learning in general-sum games," in Proc. 18th Int. Conf. Machine Learning, Williamstown, MA, USA, 2001, pp. 322-328.
- (2001) Proc. 18th Int. Conf. Machine Learning , pp. 322-328
- Littman, M.L.¹

27
- 0032642848
- "Revenue management: Research overview and prospects"
- J. McGill and G. van Ryzin, "Revenue management: Research overview and prospects," Transport. Sci., vol. 33, no. 2, pp. 233-256, 1999.
- (1999) Transport. Sci. , vol.33 , Issue.2 , pp. 233-256
- McGill, J.¹ van Ryzin, G.²

28
- 23044435348
- "Multi-agent learning for dynamic pricing games of service markets"
- K. Ravikumar, G. Batra, and R. Saluja, "Multi-agent learning for dynamic pricing games of service markets," Communicated, 2002.
- (2002) Communicated
- Ravikumar, K.¹ Batra, G.² Saluja, R.³

29
- 0000561934
- "The theory of sales: A simple model of equilibrium price dispersion with identical agents"
- S. Salop and J. Stiglitz, "The theory of sales: A simple model of equilibrium price dispersion with identical agents," Amer. Econ. Rev., vol. 72, no. 5, pp. 1121-1130, 1982.
- (1982) Amer. Econ. Rev. , vol.72 , Issue.5 , pp. 1121-1130
- Salop, S.¹ Stiglitz, J.²

30
- 0004175691
- Cambridge, MA: HBS Press
- C. Shapiro and H. Varian, Information Rules. Cambridge, MA: HBS Press, 1998.
- (1998) Information Rules
- Shapiro, C.¹ Varian, H.²

31
- 0040030247
- "E-commerce and operations research in airline planning, marketing, and distribution"
- B. Smith, D. Gunther, B. Rao, and R. Ratliff, "E-commerce and operations research in airline planning, marketing, and distribution," Interfaces, vol. 31, no. 2, 2001.
- (2001) Interfaces , vol.31 , Issue.2
- Smith, B.¹ Gunther, D.² Rao, B.³ Ratliff, R.⁴

32
- 0004283165
- Cambridge, MA: MIT Press
- M. Smith, J. Bailey, and E. Brynjolfsson, Understanding Digital Markets: Review and Assessment. Cambridge, MA: MIT Press, 2000.
- (2000) Understanding Digital Markets: Review and Assessment
- Smith, M.¹ Bailey, J.² Brynjolfsson, E.³

33
- 84962045565
- "Multi-agent q-learning and regression trees for automated pricing decisions"
- San Francisco, CA
- M. Sridharan and G. J. Tesauro, "Multi-agent q-learning and regression trees for automated pricing decisions," in Proc. 17th Int. Conf. Machine Learning, San Francisco, CA, 2000.
- (2000) Proc. 17th Int. Conf. Machine Learning
- Sridharan, M.¹ Tesauro, G.J.²

34
- 0002734011
- "The economics of information"
- G. Stigler, "The economics of information," J. Political Econ., vol. 69, pp. 213-225, 1961.
- (1961) J. Political Econ. , vol.69 , pp. 213-225
- Stigler, G.¹

35
- 0000604783
- "Equilibrium in product markets with imperfect information"
- J. Stiglitz, "Equilibrium in product markets with imperfect information," Amer. Econ. Rev. Proc., vol. 69, pp. 339-345, 1979.
- (1979) Amer. Econ. Rev. Proc. , vol.69 , pp. 339-345
- Stiglitz, J.¹

36
- 2442544666
- "Dynamic pricing models to improve supply chain performance"
- Ph.D. Dissertation, IEMS, Northwestern University, Evanston, IL, USA
- J. Swann, "Dynamic pricing models to improve supply chain performance," Ph.D. Dissertation, IEMS, Northwestern University, Evanston, IL, USA, 2001.
- (2001)
- Swann, J.¹

37
- 23044509119
- "Flexible pricing policies: Introduction and a survey of implementation in various industries"
- General Motors Corporation, Contract Report # CR-99/04/ESL, Oct
- J. Swann, "Flexible pricing policies: Introduction and a survey of implementation in various industries," General Motors Corporation, Contract Report # CR-99/04/ESL, Oct. 1999.
- (1999)
- Swann, J.¹

38
- 23044485958
- "Pricing in agent economies using multiagent q-learning"
- London, U.K
- G. Tesauro and J. O. Kephart, "Pricing in agent economies using multiagent q-learning," in Proc. Workshop on Decision Theoretic and Game Theoretic Agents, London, U.K., 1999.
- (1999) Proc. Workshop on Decision Theoretic and Game Theoretic Agents
- Tesauro, G.¹ Kephart, J.O.²

39
- 23044453781
- "Pricing in agent economies using neural networks and multi-agent q-learning"
- Stockholm, Sweden
- G. Tesauro and J. O. Kephart, "Pricing in agent economies using neural networks and multi-agent q-learning," in Proc. Workshop ABS-3: Learning About, From and with other Agents (Held in Conjunction with IJCAI'99), Stockholm, Sweden, 1999.
- (1999) Proc. Workshop ABS-3: Learning About, From and With Other Agents (Held in Conjunction With IJCAI'99)
- Tesauro, G.¹ Kephart, J.O.²

40
- 0016553601
- "A dynamic nonstationary inventory problem for a price/quantity setting firm"
- G. Thowsen, "A dynamic nonstationary inventory problem for a price/quantity setting firm," Naval Res. Logistics Quart., vol. 22, pp. 461-476, 1975.
- (1975) Naval Res. Logistics Quart. , vol.22 , pp. 461-476
- Thowsen, G.¹

41
- 0000878787
- "A model of sales"
- H. R. Varian, "A model of sales," Amer. Econ. Rev., pp. 651-659, 1980.
- (1980) Amer. Econ. Rev. , pp. 651-659
- Varian, H.R.¹

42
- 0003240094
- "Differential pricing and efficiency"
- [Online]. Available: www.firstmonday.dk
- H. R. Varian, "Differential pricing and efficiency," First Monday, vol. 1, 1996. [Online]. Available: www.firstmonday.dk.
- (1996) First Monday , vol.1
- Varian, H.R.¹

43
- 34249833101
- "Q-learning"
- C. J. C. H. Watkins and P. Dayan, "Q-learning," Machine Learning, vol. 8, pp. 279-292, 1992.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

44
- 0043034832
- "Monopoly and uncertainty"
- E. Zabel, "Monopoly and uncertainty," Rev. Econ. Studies, vol. 37, pp. 205-219, 1970.
- (1970) Rev. Econ. Studies , vol.37 , pp. 205-219
- Zabel, E.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.