Volume 19, Issue 5, 2009, Pages 331-344

Reinforcement learning in supply chains

Author keywords

Agent based modeling; Reinforcement learning; Supply chain management

Indexed keywords

AGENT-BASED MODELING; COGNITIVE PSYCHOLOGY; DECISION MAKERS; EFFECTIVE MANAGEMENT; HUMAN BEING; INDEPENDENT AGENTS; LEARNING AGENTS; MULTI-AGENT SETTING; MULTI-STAGE; POTENTIAL MECHANISM; TIME PERIODS;

EID: 71049115473     PISSN: 01290657     EISSN: None     Source Type: Journal    
DOI: 10.1142/S0129065709002063     Document Type: Article
Times cited: 30

References (48)
  • 2
    • 0002823699
    • Managing supply chain inventory: Pitfalls and opportunities
    • H. L. Lee and C. Billington, Managing supply chain inventory: Pitfalls and opportunities, Sloan Management Review (Spring) (1993) 65-73.
    • (1993) Sloan Management Review (Spring) , pp. 65-73
    • Lee, H.L.1    Billington, C.2
  • 3
    • 0035481191
    • Market power and efficiency in a computational electricity market with discriminatory double-auction pricing
    • J. Nicolaisen, V. Petrov and L. Tesfatsion, Market power and efficiency in a computational electricity market with discriminatory double-auction pricing, IEEE Transactions on Evolutionary Computation 5(5) (2001) 504-523.
    • (2001) IEEE Transactions on Evolutionary Computation , vol.5 , Issue.5 , pp. 504-523
    • Nicolaisen, J.1    Petrov, V.2    Tesfatsion, L.3
  • 4
    • 0001248680
    • Le comportement de l'homme rationnel devant le risque: Critique des postulats et axiomes de l'Ecole Americaine [Behavior of rational man in the face of risk: Critique of the postulates and axioms of the American School]
    • M. Allais, Le comportement de l'homme rationnel devant le risque: Critique des postulats et axiomes de l'Ecole Americaine [Behavior of rational man in the face of risk: Critique of the postulates and axioms of the American School], Econometrica 21 (1953) 503-546.
    • (1953) Econometrica , vol.21 , pp. 503-546
    • Allais, M.1
  • 5
    • 84957363402
    • Risk, ambiguity and the Savage axioms
    • D. Ellsberg, Risk, ambiguity and the Savage axioms, Quarterly Journal of Economics 75 (1961) 643-669.
    • (1961) Quarterly Journal of Economics , vol.75 , pp. 643-669
    • Ellsberg, D.1
  • 7
    • 0016264378
    • Judgment under uncertainty: Heuristics and biases
    • A. Tversky and D. Kahneman, Judgment under uncertainty: Heuristics and biases, Science 185 (1974) 1124-1131.
    • (1974) Science , vol.185 , pp. 1124-1131
    • Tversky, A.1    Kahneman, D.2
  • 8
    • 0019392722
    • The framing of decisions and the psychology of choice
    • A. Tversky and D. Kahneman, The framing of decisions and the psychology of choice, Science 211 (1981) 453-458.
    • (1981) Science , vol.211 , pp. 453-458
    • Tversky, A.1    Kahneman, D.2
  • 9
    • 31744450082
    • Advances in prospect theory: Cumulative representation of uncertainty
    • A. Tversky and D. Kahneman, Advances in prospect theory: Cumulative representation of uncertainty, Journal of Risk and Uncertainty 5 (1992) 297-323.
    • (1992) Journal of Risk and Uncertainty , vol.5 , pp. 297-323
    • Tversky, A.1    Kahneman, D.2
  • 10
    • 0141711957
    • Multiattribute decision making in context: A dynamic neural network methodology
    • S. J. Leven and D. S. Levine, Multiattribute decision making in context: A dynamic neural network methodology, Cognitive Science 20 (1996) 271-299.
    • (1996) Cognitive Science , vol.20 , pp. 271-299
    • Leven, S.J.1    Levine, D.S.2
  • 12
    • 0038829878
    • Predicting how people play games: Reinforcement learning in experimental games with unique, mixed strategy equilibria
    • I. Erev and A. E. Roth, Predicting how people play games: Reinforcement learning in experimental games with unique, mixed strategy equilibria, American Economic Review 88(4) (1998) 848-881.
    • (1998) American Economic Review , vol.88 , Issue.4 , pp. 848-881
    • Erev, I.1    Roth, A.E.2
  • 13
    • 0038709220
    • Strategic play and adaptive learning in the sealed bid bargaining mechanism
    • T. Daniel, D. Seale and A. Rapoport, Strategic play and adaptive learning in the sealed bid bargaining mechanism, Journal of Mathematical Psychology 42 (1998) 133-166.
    • (1998) Journal of Mathematical Psychology , vol.42 , pp. 133-166
    • Daniel, T.1    Seale, D.2    Rapoport, A.3
  • 14
    • 36049009101
    • A multi-agent system for building project memories to facilitate the design process
    • D. Monticolo, V. Hilaire, S. Gomes and A. Koukam, A multi-agent system for building project memories to facilitate the design process, Integrated Computer-Aided Engineering 15(1) (2008) 3-20.
    • (2008) Integrated Computer-Aided Engineering , vol.15 , Issue.1 , pp. 3-20
    • Monticolo, D.1    Hilaire, V.2    Gomes, S.3    Koukam, A.4
  • 22
    • 0000361362
    • Modeling managerial behavior: Misperceptions of feedback in a dynamic decision making experiment
    • J. D. Sterman, Modeling managerial behavior: Misperceptions of feedback in a dynamic decision making experiment, Management Science 35(3) (1989) 321-339.
    • (1989) Management Science , vol.35 , Issue.3 , pp. 321-339
    • Sterman, J.D.1
  • 24
    • 0036643207
    • Computers play the Beer Game: Can artificial agents manage supply chains?
    • S. O. Kimbrough, D. J. Wu and F. Zhong, Computers play the Beer Game: Can artificial agents manage supply chains?, Decision Support Systems 33(3) (2002) 323-333.
    • (2002) Decision Support Systems , vol.33 , Issue.3 , pp. 323-333
    • Kimbrough, S.O.1    Wu, D.J.2    Zhong, F.3
  • 25
    • 53349100494
    • A reinforcement learning model for supply chain ordering management: An application to the Beer Game
    • S. K. Chaharsooghi, J. Heydari and H. Zegordi, A reinforcement learning model for supply chain ordering management: An application to the Beer Game, Decision Support Systems 45 (2008) 949-959.
    • (2008) Decision Support Systems , vol.45 , pp. 949-959
    • Chaharsooghi, S.K.1    Heydari, J.2    Zegordi, H.3
  • 28
    • 0037151232
    • Inventory management in supply chains: A reinforcement learning approach
    • DOI 10.1016/S0925-5273(00)00156-0, PII S0925527300001560
    • I. Giannoccaro and P. Pontrandolfo, Inventory management in supply chains: A reinforcement learning approach, International Journal of Production Economics 78 (2002) 153-161. (Pubitemid 34532511)
    • (2002) International Journal of Production Economics , vol.78 , Issue.2 , pp. 153-161
    • Giannoccaro, I.1    Pontrandolfo, P.2
  • 29
    • 0010400851
    • DASCh: Dynamics analysis of supply chains
    • Center for Electronic Commerce, ERIM
    • H. V. D. Parunak, R. Savit, R. L. Riolo and S. J. Clark, DASCh: Dynamics analysis of supply chains, Executive Report (Center for Electronic Commerce, ERIM, 1999).
    • (1999) Executive Report
    • Parunak, H.V.D.1    Savit, R.2    Riolo, R.L.3    Clark, S.J.4
  • 32
    • 0019537951
    • Toward a modern theory of adaptive networks: Expectation and prediction
    • R. S. Sutton and A. G. Barto, Toward a modern theory of adaptive networks: Expectation and prediction, Psychological Review 88 (1981) 135-170.
    • (1981) Psychological Review , vol.88 , pp. 135-170
    • Sutton, R.S.1    Barto, A.G.2
  • 37
    • 0001812752
    • Exploration and exploitation in organizational learning
    • J. March, Exploration and exploitation in organizational learning, Organization Science 2(1) (1991).
    • (1991) Organization Science , vol.2 , Issue.1
    • March, J.1
  • 38
    • 0000723997
    • Generalization in reinforcement learning: Successful examples using sparse coarse coding
    • R. S. Sutton, Generalization in reinforcement learning: Successful examples using sparse coarse coding, Advances in Neural Information Processing Systems (1996).
    • (1996) Advances in Neural Information Processing Systems
    • Sutton, R.S.1
  • 40
    • 0031231885
    • Experiments with Reinforcement Learning in Problems with Continuous State and Action Spaces
    • J. C. Santamaria, R. S. Sutton and A. Ram, Experiments with reinforcement learning in problems with continuous state and action spaces, Adaptive Behavior 6(2) (1998) 163-218. (Pubitemid 128049217)
    • (1997) Adaptive Behavior , vol.6 , Issue.2 , pp. 163-218
    • Santamaria, J.C.1    Sutton, R.S.2    Ram, A.3
  • 41
    • 0003477315
    • Reinforcement learning with high-dimensional, continuous actions
    • Wright Laboratory
    • L. Baird and H. Klopf, Reinforcement learning with high-dimensional, continuous actions, Technical Report WL-TR-93-1147 (Wright Laboratory, 1993).
    • (1993) Technical Report WL-TR-93-1147
    • Baird, L.1    Klopf, H.2
  • 42
    • 0001046225
    • Practical issues in temporal difference learning
    • G. Tesauro, Practical issues in temporal difference learning, Machine Learning 8 (1992) 257-277.
    • (1992) Machine Learning , vol.8 , pp. 257-277
    • Tesauro, G.1
  • 43
    • 0000985504
    • TD-Gammon, a self-teaching backgammon program achieves master-level play
    • G. Tesauro, TD-Gammon, a self-teaching backgammon program achieves master-level play, Neural Computation 6 (1994) 215-219.
    • (1994) Neural Computation , vol.6 , pp. 215-219
    • Tesauro, G.1
  • 44
    • 0029276036
    • Temporal difference learning and TD-Gammon
    • G. Tesauro, Temporal difference learning and TD-Gammon, Communications of the ACM 38 (1995) 58-68.
    • (1995) Communications of the ACM , vol.38 , pp. 58-68
    • Tesauro, G.1
  • 48
    • 0003270924
    • Issues in using function approximation for reinforcement learning
    • Hillsdale, NJ: Lawrence Erlbaum Publisher
    • S. Thrun and A. Schwartz, Issues in using function approximation for reinforcement learning, Proceedings of the Fourth Connectionist Models Summer School (Hillsdale, NJ: Lawrence Erlbaum Publisher, 1993).
    • (1993) Proceedings of the Fourth Connectionist Models Summer School
    • Thrun, S.1    Schwartz, A.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.