메뉴 건너뛰기




Volumn 221, Issue 1, 2012, Pages 99-109

Reinforcement learning for joint pricing, lead-time and scheduling decisions in make-to-order systems

Author keywords

Pricing; Q learning; Reinforcement learning (RL); Scheduling; Semi Markov Decision Problem (SMDP); Simulation based optimization

Indexed keywords

CAPACITY CONSTRAINTS; DISCRETE-EVENT SIMULATION MODEL; EXPECTED PROFITS; FIXED COST; HEURISTIC POLICIES; INVENTORY HOLDING; JOINT PRICING; LEADTIME; MAKE TO ORDER; PLANNING HORIZONS; Q-LEARNING; Q-LEARNING ALGORITHMS; SCHEDULING DECISIONS; SEMI-MARKOV DECISION PROBLEMS; SIMULATION-BASED OPTIMIZATIONS; STOCHASTIC DEMAND; TARDINESS COST; VARIABLE COSTS;

EID: 84860259623     PISSN: 03772217     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.ejor.2012.03.020     Document Type: Article
Times cited : (37)

References (55)
  • 1
    • 38549172189 scopus 로고    scopus 로고
    • Pricing and manufacturing decisions when demand is a function of price in multiple period
    • H. Ahn, M. Gumus, and P. Kaminsky Pricing and manufacturing decisions when demand is a function of price in multiple period Operations Research 55 6 2007 1039 1057
    • (2007) Operations Research , vol.55 , Issue.6 , pp. 1039-1057
    • Ahn, H.1    Gumus, M.2    Kaminsky, P.3
  • 2
    • 0242708615 scopus 로고    scopus 로고
    • Revenue management and e-commerce
    • E.A. Boyd, and I.C. Bilegan Revenue management and e-commerce Management Science 49 10 2003 1363 1386
    • (2003) Management Science , vol.49 , Issue.10 , pp. 1363-1386
    • Boyd, E.A.1    Bilegan, I.C.2
  • 4
    • 0038141847 scopus 로고    scopus 로고
    • The analysis of optimal control model in matching problem between manufacturing and marketing
    • M. Chen, and M. Chu The analysis of optimal control model in matching problem between manufacturing and marketing European Journal of Operational Research 150 2 2003 293 303
    • (2003) European Journal of Operational Research , vol.150 , Issue.2 , pp. 293-303
    • Chen, M.1    Chu, M.2
  • 6
    • 0003160484 scopus 로고
    • Towards fasters stochastic gradient search
    • J. Moody, S. Hanson, R. Lippmann, Morgan Kaufmann San Mateo, CA
    • C. Darken, and J. Moody Towards fasters stochastic gradient search J. Moody, S. Hanson, R. Lippmann, Advances in Neural Information Processing Systems vol. 4 1992 Morgan Kaufmann San Mateo, CA 1009 1016
    • (1992) Advances in Neural Information Processing Systems , vol.4 , pp. 1009-1016
    • Darken, C.1    Moody, J.2
  • 7
    • 0032643313 scopus 로고    scopus 로고
    • Solving semi-markov decision problems using average reward reinforcement learning
    • T.K. Das, A. Gosavi, S. Mahadevan, and N. Marchalleck Solving semi-markov decision problems using average reward reinforcement learning Management Science 45 4 1999 560 574
    • (1999) Management Science , vol.45 , Issue.4 , pp. 560-574
    • Das, T.K.1    Gosavi, A.2    Mahadevan, S.3    Marchalleck, N.4
  • 8
    • 33646375445 scopus 로고    scopus 로고
    • Joint production and pricing decision with setup costs and capacity constraints
    • S. Deng, and C.A. Yano Joint production and pricing decision with setup costs and capacity constraints Management Science 52 5 2006 741 756
    • (2006) Management Science , vol.52 , Issue.5 , pp. 741-756
    • Deng, S.1    Yano, C.A.2
  • 9
    • 0001131108 scopus 로고
    • Single facility due date setting with multiple customer classes
    • I. Duenya Single facility due date setting with multiple customer classes Management Science 41 4 1995 608 619
    • (1995) Management Science , vol.41 , Issue.4 , pp. 608-619
    • Duenya, I.1
  • 10
    • 0002987480 scopus 로고
    • Quoting customer lead times
    • I. Duenya, and W.J. Hopp Quoting customer lead times Management Science 41 1 1995 43 57
    • (1995) Management Science , vol.41 , Issue.1 , pp. 43-57
    • Duenya, I.1    Hopp, W.J.2
  • 11
    • 0032628965 scopus 로고    scopus 로고
    • Pricing and lead time decision for make-to-order firms with contingent orders
    • F. Easton, and D. Moodie Pricing and lead time decision for make-to-order firms with contingent orders European Journal of Operational Research 116 2 1999 305 318
    • (1999) European Journal of Operational Research , vol.116 , Issue.2 , pp. 305-318
    • Easton, F.1    Moodie, D.2
  • 12
    • 77957780588 scopus 로고
    • Marketing-production joint decision-making
    • J. Eliashberg, G.L. Lilien, North Holland Amsterdam
    • J. Eliashberg, and R. Steinberg Marketing-production joint decision-making J. Eliashberg, G.L. Lilien, Handbooks in Operations Research and Management Science Vol. 5 1993 North Holland Amsterdam 827 880
    • (1993) Handbooks in Operations Research and Management Science , vol.5 , pp. 827-880
    • Eliashberg, J.1    Steinberg, R.2
  • 14
    • 0033116210 scopus 로고    scopus 로고
    • Coordination of pricing and multi-period production for constant priced goods
    • S.M. Gilbert Coordination of pricing and multi-period production for constant priced goods European Journal of Operational Research 114 2 1999 330 337
    • (1999) European Journal of Operational Research , vol.114 , Issue.2 , pp. 330-337
    • Gilbert, S.M.1
  • 15
    • 0034476593 scopus 로고    scopus 로고
    • Coordination of pricing and multi-period production across multiple constant priced goods
    • S.M. Gilbert Coordination of pricing and multi-period production across multiple constant priced goods Management Science 46 12 2000 1602 1616
    • (2000) Management Science , vol.46 , Issue.12 , pp. 1602-1616
    • Gilbert, S.M.1
  • 16
    • 2342446663 scopus 로고    scopus 로고
    • A reinforcement learning algorithm based on policy iteration for average reward: Empirical results with yield management and convergence analysis
    • A. Gosavi A reinforcement learning algorithm based on policy iteration for average reward: empirical results with yield management and convergence analysis Machine Learning 55 1 2004 5 29
    • (2004) Machine Learning , vol.55 , Issue.1 , pp. 5-29
    • Gosavi, A.1
  • 17
    • 0742319170 scopus 로고    scopus 로고
    • Reinforcement learning for long-run average cost
    • A. Gosavi Reinforcement learning for long-run average cost European Journal of Operations Research 155 3 2004 654 674
    • (2004) European Journal of Operations Research , vol.155 , Issue.3 , pp. 654-674
    • Gosavi, A.1
  • 18
    • 0036722536 scopus 로고    scopus 로고
    • A reinforcement learning approach to a single leg airline revenue management problem with multiple fare classes and overbooking
    • A. Gosavi, N. Bandla, and T.K. Das A reinforcement learning approach to a single leg airline revenue management problem with multiple fare classes and overbooking IIE Transactions 34 9 2002 729 742
    • (2002) IIE Transactions , vol.34 , Issue.9 , pp. 729-742
    • Gosavi, A.1    Bandla, N.2    Das, T.K.3
  • 20
    • 33847617835 scopus 로고    scopus 로고
    • Reinforcement learning versus heuristics for order acceptance on a single resource
    • M.M. Hing, A. van Harten, and P.C. Schuur Reinforcement learning versus heuristics for order acceptance on a single resource Journal of Heuristic 13 2 2007 167 187
    • (2007) Journal of Heuristic , vol.13 , Issue.2 , pp. 167-187
    • Hing, M.M.1    Van Harten, A.2    Schuur, P.C.3
  • 24
    • 0035261939 scopus 로고    scopus 로고
    • Scheduling and reliable lead time quotation for orders with availability intervals and lead time sensitive revenues
    • P. Keskinocak, R. Ravi, and S. Tayur Scheduling and reliable lead time quotation for orders with availability intervals and lead time sensitive revenues Management Science 47 2 2001 264 279
    • (2001) Management Science , vol.47 , Issue.2 , pp. 264-279
    • Keskinocak, P.1    Ravi, R.2    Tayur, S.3
  • 25
    • 0007990754 scopus 로고    scopus 로고
    • Optimal joint pricing and lot sizing with fixed and variable capacity
    • D. Kim, and W.J. Lee Optimal joint pricing and lot sizing with fixed and variable capacity European Journal of Operational Research 109 1 1998 212 227
    • (1998) European Journal of Operational Research , vol.109 , Issue.1 , pp. 212-227
    • Kim, D.1    Lee, W.J.2
  • 27
    • 0029393210 scopus 로고
    • Estimating flowtimes and setting due-dates in complex production systems
    • S.R. Lawrence Estimating flowtimes and setting due-dates in complex production systems IIE Transactions 27 5 1995 657 688
    • (1995) IIE Transactions , vol.27 , Issue.5 , pp. 657-688
    • Lawrence, S.R.1
  • 28
    • 5544319794 scopus 로고    scopus 로고
    • Ford heeds the profits
    • S. Leibs Ford heeds the profits CFO Magazine 16 9 2000 33 35
    • (2000) CFO Magazine , vol.16 , Issue.9 , pp. 33-35
    • Leibs, S.1
  • 29
  • 30
    • 0029752592 scopus 로고    scopus 로고
    • Average reward reinforcement learning: Foundations, algorithms and empirical results
    • S. Mahadevan Average reward reinforcement learning: foundations, algorithms and empirical results Machine Learning 22 1 1996 159 195
    • (1996) Machine Learning , vol.22 , Issue.1 , pp. 159-195
    • Mahadevan, S.1
  • 34
    • 0000696160 scopus 로고
    • A queueing reward system with several customer classes
    • B. Miller A queueing reward system with several customer classes Management Science 16 3 1969 234 245
    • (1969) Management Science , vol.16 , Issue.3 , pp. 234-245
    • Miller, B.1
  • 36
    • 0032003355 scopus 로고    scopus 로고
    • Lead-time setting, capacity utilization and pricing decision under lead-time dependent demand
    • K. Palaka, S. Erlebacher, and D.H. Kropp Lead-time setting, capacity utilization and pricing decision under lead-time dependent demand IIE Transactions 30 2 1998 151 163
    • (1998) IIE Transactions , vol.30 , Issue.2 , pp. 151-163
    • Palaka, K.1    Erlebacher, S.2    Kropp, D.H.3
  • 37
    • 0035124331 scopus 로고    scopus 로고
    • Intelligent dynamic control of single-product serial production lines
    • C.D. Paternina-Arboleda, and T.K. Das Intelligent dynamic control of single-product serial production lines IIE Transaction 33 1 2001 65 77
    • (2001) IIE Transaction , vol.33 , Issue.1 , pp. 65-77
    • Paternina-Arboleda, C.D.1    Das, T.K.2
  • 38
    • 0001122069 scopus 로고
    • Simultaneous price-production decisions
    • D. Pekelman Simultaneous price-production decisions Operations Research 22 4 1974 788 794
    • (1974) Operations Research , vol.22 , Issue.4 , pp. 788-794
    • Pekelman, D.1
  • 39
    • 84990604706 scopus 로고
    • Using neural networks to determine internally-set due-date assinments for shop scheduling
    • P. Philipoom, L. Rees, and L. Wiegmann Using neural networks to determine internally-set due-date assinments for shop scheduling Decision Science 25 5-6 1994 825 851
    • (1994) Decision Science , vol.25 , Issue.56 , pp. 825-851
    • Philipoom, P.1    Rees, L.2    Wiegmann, L.3
  • 40
    • 0029252917 scopus 로고
    • Multiproduct production/inventory control under random demands
    • J. Qiu, and R. Loulou Multiproduct production/inventory control under random demands IEEE Transactions on Automatic Control 40 2 1995 350 356
    • (1995) IEEE Transactions on Automatic Control , vol.40 , Issue.2 , pp. 350-356
    • Qiu, J.1    Loulou, R.2
  • 42
    • 0042072128 scopus 로고    scopus 로고
    • Price and time competition for service delivery, manufacturing and service
    • K.C. So Price and time competition for service delivery, manufacturing and service Operations Management 2 4 2000 392 409
    • (2000) Operations Management , vol.2 , Issue.4 , pp. 392-409
    • So, K.C.1
  • 43
    • 0032210055 scopus 로고    scopus 로고
    • Price, delivery time guarantees and capacity selection
    • K.C. So, and J.-S. Song Price, delivery time guarantees and capacity selection European Journal of Operational Research 111 1 1998 28 49
    • (1998) European Journal of Operational Research , vol.111 , Issue.1 , pp. 28-49
    • So, K.C.1    Song, J.-S.2
  • 44
    • 33750298269 scopus 로고    scopus 로고
    • Revenue management in make-to-order manufacturing - An application to the iron and steel industry
    • T. Spengler, S. Rehkopf, and T. Volling Revenue management in make-to-order manufacturing - an application to the iron and steel industry OR Spectrum 29 1 2007 158 171
    • (2007) OR Spectrum , vol.29 , Issue.1 , pp. 158-171
    • Spengler, T.1    Rehkopf, S.2    Volling, T.3
  • 46
    • 33847202724 scopus 로고
    • Learning to predict by the method of temporal differences
    • R.L. Sutton Learning to predict by the method of temporal differences Machine Learning 3 1 1988 9 44
    • (1988) Machine Learning , vol.3 , Issue.1 , pp. 9-44
    • Sutton, R.L.1
  • 48
    • 84915161869 scopus 로고
    • Simultaneous price-repoduction decision making with production adjustment costs
    • Vanthienen, L.G.; 1975. Simultaneous price-repoduction decision making with production adjustment costs. In: TIMS XX International Meeting. pp. 249-254.
    • (1975) TIMS XX International Meeting , pp. 249-254
    • Vanthienen, L.G.1
  • 49
  • 50
    • 13544273051 scopus 로고    scopus 로고
    • Dynamic pricing and lead-time policies
    • S. Webster Dynamic pricing and lead-time policies Decision Science 33 4 2002 579 599
    • (2002) Decision Science , vol.33 , Issue.4 , pp. 579-599
    • Webster, S.1
  • 51
    • 0002816196 scopus 로고
    • Inventory control and price theory
    • T. Whitin Inventory control and price theory Management Science 2 1 1955 61 68
    • (1955) Management Science , vol.2 , Issue.1 , pp. 61-68
    • Whitin, T.1
  • 53
    • 67349148632 scopus 로고    scopus 로고
    • The impact of estimation error on the dynamic order admission policy in B2B MTO environments
    • A. Wu, and D. Chiang The impact of estimation error on the dynamic order admission policy in B2B MTO environments Expert Systems with Applications 36 9 2009 11782 11791
    • (2009) Expert Systems with Applications , vol.36 , Issue.9 , pp. 11782-11791
    • Wu, A.1    Chiang, D.2
  • 54
    • 12344291007 scopus 로고    scopus 로고
    • Coordinated pricing and production/procurement decisions: A review
    • A. Chakravarty, J. Eliashberg, Engineering and Manufacturing Perspectives Kluwer Academic Publishers Boston, MA
    • C.A. Yano, and S.M. Gilbert Coordinated pricing and production/ procurement decisions: a review A. Chakravarty, J. Eliashberg, Managing Business Interfaces: Marketing Engineering and Manufacturing Perspectives 2004 Kluwer Academic Publishers Boston, MA
    • (2004) Managing Business Interfaces: Marketing
    • Yano, C.A.1    Gilbert, S.M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.