SCOPUS 정보 검색 플랫폼

International Journal of Neural Systems

Volumn 19, Issue 5, 2009, Pages 331-344

Reinforcement learning in supply chains

(3) Valluri, Annapurna a North, Michael J b MacAl, Charles M b

a UNIVERSITY OF PENNSYLVANIA (United States)

b ARGONNE NATIONAL LABORATORY (United States)

Author keywords

Agent based modeling; Reinforcement learning; Supply chain management

Indexed keywords

AGENT-BASED MODELING; COGNITIVE PSYCHOLOGY; DECISION MAKERS; EFFECTIVE MANAGEMENT; HUMAN BEING; INDEPENDENT AGENTS; LEARNING AGENTS; MULTI-AGENT SETTING; MULTI-STAGE; POTENTIAL MECHANISM; TIME PERIODS;

ARTIFICIAL INTELLIGENCE; EDUCATION; INTELLIGENT AGENTS; LEARNING ALGORITHMS; REINFORCEMENT; REINFORCEMENT LEARNING; SUPPLY CHAIN MANAGEMENT;

SUPPLY CHAINS;

ALGORITHM; ARTICLE; ARTIFICIAL NEURAL NETWORK; AUTOMATED PATTERN RECOGNITION; BIOLOGICAL MODEL; COMPUTER SIMULATION; DECISION SUPPORT SYSTEM; GAME; HUMAN; PROBABILITY; REINFORCEMENT;

ALGORITHMS; COMPUTER SIMULATION; DECISION SUPPORT TECHNIQUES; GAME THEORY; HUMANS; MARKOV CHAINS; MODELS, NEUROLOGICAL; NEURAL NETWORKS (COMPUTER); PATTERN RECOGNITION, AUTOMATED; REINFORCEMENT (PSYCHOLOGY);

EID: 71049115473 PISSN: 01290657 EISSN: None Source Type: Journal
DOI: 10.1142/S0129065709002063 Document Type: Article

Times cited : (30)

References (48)

1
- 0001116969
- Simulation of order fulfillment in divergent assembly supply chains
- T. J. Strader, F. Lin and M. J. Shaw, Simulation of order fulfillment in divergent assembly supply chains, Journal of Artificial Societies and Social Simulation 1(2) (1998) [http://jasss.soc.survey.ac.uk/ 1/2/s.html].
- (1998) Journal of Artificial Societies and Social Simulation , vol.1 , pp. 2
- Strader, T.J.¹ Lin, F.² Shaw, M.J.³

2
- 0002823699
- Managing supply chain inventory: Pitfalls and opportunities
- H. L. Lee and C. Billington, Managing supply chain inventory: Pitfalls and opportunities, Sloan Management Review (Spring) (1993) 65-73.
- (1993) Sloan Management Review (Spring , pp. 65-73
- Lee, H.L.¹ Billington, C.²

3
- 0035481191
- Market power and efficiency in a computational electricity market with discriminatory double-auction pricing
- J. Nicolaisen, V. Petrov and L. Tesfatsion, Market power and efficiency in a computational electricity market with discriminatory double-auction pricing, IEEE Transactions on Evolutionary Computation 5(5) (2001) 504-523.
- (2001) IEEE Transactions on Evolutionary Computation , vol.5 , Issue.5 , pp. 504-523
- Nicolaisen, J.¹ Petrov, V.² Tesfatsion, L.³

4
- 0001248680
- Le comportement de l'homme rationnel devant le risqúe: Critique des postulats et axiommes de l'Ecole Americaine [Behavior of rational man in the critique of the postulates of the American School.]
- M. Allias, Le comportement de l'homme rationnel devant le risqúe: Critique des postulats et axiommes de l'Ecole Americaine [Behavior of rational man in the critique of the postulates of the American School.], Econometrica 21 (1953) 503-546.
- (1953) Econometrica , vol.21 , pp. 503-546
- Allias, M.¹

5
- 84957363402
- Risk, ambiguity and the savage axioms
- D. Ellsberg, Risk, ambiguity and the savage axioms, Quarterly Journal of Economics 75 (1961) 643-669.
- (1961) Quarterly Journal of Economics , vol.75 , pp. 643-669
- Ellsberg, D.¹

6
- 84884079276
- Princeton, NJ: Princeton University Press
- J. von Neumann and O. Morgenstern, Theory of Games and Economic Behavior (Princeton, NJ: Princeton University Press, 1944).
- (1944) Theory of Games and Economic Behavior
- Von Neumann, J.¹ Morgenstern, O.²

7
- 0016264378
- Judgment under uncertainty: Heuristics and biases
- A. Tversky and D. Kahneman, Judgment under uncertainty: Heuristics and biases, Science 185 (1974) 1124-1131.
- (1974) Science , vol.185 , pp. 1124-1131
- Tversky, A.¹ Kahneman, D.²

8
- 0019392722
- The framing of decisions and the psychology of choice
- A. Tversky and D. Kahneman, The framing of decisions and the psychology of choice, Science 211 (1981) 453-458.
- (1981) Science , vol.211 , pp. 453-458
- Tversky, A.¹ Kahneman, D.²

9
- 31744450082
- Advances in prospect theory: Cumulative representation of uncertainty
- A. Tversky and D. Kahneman, Advances in prospect theory: Cumulative representation of uncertainty, Journal of Risk and Uncertainty 5 (1992) 297-323.
- (1992) Journal of Risk and Uncertainty , vol.5 , pp. 297-323
- Tversky, A.¹ Kahneman, D.²

10
- 0141711957
- Multiattribute decision making in context: A dynamic neural network methodology
- S. J. Leven and D. S. Levine, Multiattribute decision making in context: A dynamic neural network methodology, Cognitive Science 20 (1996) 271-299.
- (1996) Cognitive Science , vol.20 , pp. 271-299
- Leven, S.J.¹ Levine, D.S.²

11
- 0001316418
- Theories of bounded rationality
- Simon H. (ed.), Cambridge, MA: MIT Press
- H. Simon, Theories of bounded rationality, in Simon H. (ed.), Models of Bounded Rationality. Behavioral Economics and Business Organization (Cambridge, MA: MIT Press, 1982).
- (1982) Models of Bounded Rationality. Behavioral Economics and Business Organization
- Simon, H.¹

12
- 0038829878
- Predicting how people play games: Reinforcement learning in experimental games with unique
- L. Erev and A. E. Roth, Predicting how people play games: Reinforcement learning in experimental games with unique, mixed strategy equilibria, American Economic Review 88(4) (1998) 848-881.
- (1998) Mixed Strategy Equilibria, American Economic Review , vol.88 , Issue.4 , pp. 848-881
- Erev, L.¹ Roth, A.E.²

13
- 0038709220
- Strategic play and adaptive learning in the sealed bid bargaining mechanism
- T. Daniel, D. Seale and A. Rapoport, Strategic play and adaptive learning in the sealed bid bargaining mechanism, Journal of Mathematical Psychology 42 (1998) 133-166.
- (1998) Journal of Mathematical Psychology , vol.42 , pp. 133-166
- Daniel, T.¹ Seale, D.² Rapoport, A.³

14
- 36049009101
- A multi-agent system for building project memories to facilitate the design process
- D. Monticolov, V. Hilaire, S. Gomes and A. Koukam, A multi-agent system for building project memories to facilitate the design process, Integrated Computer- Aided Engineering 15(1) (2008) 3-20.
- (2008) Integrated Computer- Aided Engineering , vol.15 , Issue.1 , pp. 3-20
- Monticolov, D.¹ Hilaire, V.² Gomes, S.³ Koukam, A.⁴

15
- 58449112725
- Reverse engineering a social agentbased hidden Markov model
- H. C. Chen, M. Goldberg, M. Magdon-Ismail and W. A. Wallace, Reverse engineering a social agentbased hidden Markov model, International Journal of Neural Systems 18(6) (2008) 491-526.
- (2008) International Journal of Neural Systems , vol.18 , Issue.6 , pp. 491-526
- Chen, H.C.¹ Goldberg, M.² Magdon-Ismail, M.³ Wallace, W.A.⁴

16
- 28344457777
- Multi-agent systems for the simulation of land use and land cover change: A review
- D. Parker, S. Manson, M. Janssen, M. Hoffman and P. Deadman, Multi-agent systems for the simulation of land use and land cover change: A review, Annals of the Association of American Geographers (2003).
- (2003) Annals of the Association of American Geographers
- Parker, D.¹ Manson, S.² Janssen, M.³ Hoffman, M.⁴ Deadman, P.⁵

17
- 0003530707
- Cambridge, MA: MIT Press
- J. M. Epstein and R. Axtell, Growing Artificial Societies (Cambridge, MA: MIT Press, 1996).
- (1996) Growing Artificial Societies
- Epstein, J.M.¹ Axtell, R.²

18
- 0036407552
- On adaptive emergence of trust behavior in the game of Stag Hunt
- C. Fang, S. O. Kimbrough, S. Pace, A. Valluri and Z. Zheng, On adaptive emergence of trust behavior in the game of Stag Hunt, Group Decision and Negotiation 11(6) (2002) 449-467.
- (2002) Group Decision and Negotiation , vol.11 , Issue.6 , pp. 449-467
- Fang, C.¹ Kimbrough, S.O.² Pace, S.³ Valluri, A.⁴ Zheng, Z.⁵

19
- 84921065499
- Oxford, USA
- M. J. North and C. M. Macal, Managing Business Complexity: Discovering Strategic Solutions with Agent-based Modeling and Simulation (Oxford, USA, 2007).
- (2007) Managing Business Complexity: Discovering Strategic Solutions with Agent-based Modeling and Simulation
- North, M.J.¹ MacAl, C.M.²

20
- 0003905320
- Buckingham, UK and Philadelphia, PA: Open University Press
- N. Gilbert and K. G. Troitzsch, Simulation for the Social Scientist (Buckingham, UK and Philadelphia, PA: Open University Press, 1999).
- (1999) Simulation for the Social Scientist
- Gilbert, N.¹ Troitzsch, K.G.²

21
- 0038702140
- Agentbased modeling vs. equation-based modeling: A case study and users' guide
- H. V. D. Parunak, R. Savit and R. L. Riolo, Agentbased modeling vs. equation-based modeling: A case study and users' guide, Proceedings of Multi-agent systems and Agent-based Simulation (1998).
- (1998) Proceedings of Multi-agent Systems and Agent-based Simulation
- Parunak, H.V.D.¹ Savit, R.² Riolo, R.L.³

22
- 0000361362
- Modeling managerial behavior: Misperceptions of feedback in a dynamic decision making experiment
- J. D. Sterman, Modeling managerial behavior: Misperceptions of feedback in a dynamic decision making experiment, Management Science 35(3) (1989) 21-339.
- (1989) Management Science , vol.35 , Issue.3 , pp. 21-339
- Sterman, J.D.¹

23
- 0002663868
- The Bullwhip Effect in supply chains
- H. L. Lee, V. Padmanabhan and S. Whang, The Bullwhip Effect in supply chains, Sloan Management Review (1997) 93-102.
- (1997) Sloan Management Review , pp. 93-102
- Lee, H.L.¹ Padmanabhan, V.² Whang, S.³

24
- 0036643207
- Computers play the Beer Game: Can artificial agents manage supply chains
- S. O. Kimbrough, D. J. Wu and F. Zhong, Computers play the Beer Game: Can artificial agents manage supply chains, Decision Support Systems 33(3) (2002) 323-333.
- (2002) Decision Support Systems , vol.33 , Issue.3 , pp. 323-333
- Kimbrough, S.O.¹ Wu, D.J.² Zhong, F.³

25
- 53349100494
- A reinforcement learning model for supply chain ordering management: An application to the Beer Game
- S. K. Chaharsooghi, J. Heydari and H. Zegordi, A reinforcement learning model for supply chain ordering management: An application to the Beer Game, Decision Support Systems 45 (2008) 949-959.
- (2008) Decision Support Systems , vol.45 , pp. 949-959
- Chaharsooghi, S.K.¹ Heydari, J.² Zegordi, H.³

26
- 40949095618
- Q-Learning in a competitive supply chain
- Montreal, Que.
- T. van Tongeren, U. Kaymak, D. Naso and E. van Asperen, Q-Learning in a competitive supply chain, IEEE 2007 International Conference on Systems, Man and Cybernetics (Montreal, Que.) (2007) 1211-1216.
- (2007) IEEE 2007 International Conference on Systems, Man and Cybernetics , pp. 1211-1216
- Van Tongeren, T.¹ Kaymak, U.² Naso, D.³ Van Asperen, E.⁴

27
- 38949165997
- Policy transition of reinforcement learning for an agent based SCM system
- Singapore
- G. Zhao, and R. Sun, Policy transition of reinforcement learning for an agent based SCM system," 2006 IEEE International Conference on Industrial Informatics, Singapore (2006).
- (2006) 2006 IEEE International Conference on Industrial Informatics
- Zhao, G.¹ Sun, R.²

28
- 0037151232
- Inventory management in supply chains: A reinforcement learning approach
- DOI 10.1016/S0925-5273(00)00156-0, PII S0925527300001560
- I. Giannoccaro and P. Pontrandolfo, Inventory management in supply chains: A reinforcement learning approach, International Journal of Production Economics 78 (2002) 153-161. (Pubitemid 34532511)
- (2002) International Journal of Production Economics , vol.78 , Issue.2 , pp. 153-161
- Giannoccaro, I.¹ Pontrandolfo, P.²

29
- 0010400851
- DASCh: Dynamics analysis of supply chains
- Center for Electronic Commerce, ERIM
- H. V. D. Parunak, R. Savit, R. L. Riolo and S. J. Clark, DASCh: Dynamics analysis of supply chains, Executive Report (Center for Electronic Commerce, ERIM, 1999).
- (1999) Executive Report
- Parunak, H.V.D.¹ Savit, R.² Riolo, R.L.³ Clark, S.J.⁴

30
- 71049182532
- Swarm Development Group, Seattle, WA USA
- M. J. North and C. M. Macal, The beer dock: Three and a half implementations of the Beer Distribution Game SwarmFest 2002 Proceedings (Swarm Development Group, Seattle, WA USA, 2002), p. 17.
- (2002) The Beer Dock: Three and A Half Implementations of the Beer Distribution Game Swarm Fest 2002 Proceedings , pp. 17
- North, M.J.¹ MacAl, C.M.²

31
- 0034171704
- Agentoriented supply-chain management
- M. S. Fox, M. Barbuceanu and R. Teigen, Agentoriented supply-chain management, International Journal of Flexible Manufacturing Systems 12 (2000) 165-188.
- (2000) International Journal of Flexible Manufacturing Systems , vol.12 , pp. 165-188
- Fox, M.S.¹ Barbuceanu, M.² Teigen, R.³

32
- 0019537951
- Toward a modern theory of adaptive networks: Expectation and prediction
- R. S. Sutton and A. G. Barto, Toward a modern theory of adaptive networks: Expectation and prediction, Psychological Review 88 (1981) 135-140.
- (1981) Psychological Review , vol.88 , pp. 135-140
- Sutton, R.S.¹ Barto, A.G.²

33
- 0000580224
- A temporal-difference model of classical conditioning
- R. S. Sutton and A. G. Barto, A temporal-difference model of classical conditioning, Proceedings of the Ninth Annual Conference of the Cognitive Science Society (1987) 355-378.
- (1987) Proceedings of the Ninth Annual Conference of the Cognitive Science Society , pp. 355-378
- Sutton, R.S.¹ Barto, A.G.²

34
- 0004007508
- Cambridge, MA: MIT Press
- R. S. Sutton and A. G. Barto, Reinforcement Learning (Cambridge, MA: MIT Press, 1998).
- (1998) Reinforcement Learning
- Sutton, R.S.¹ Barto, A.G.²

35
- 0004049893
- PhD Thesis King's College, Oxford
- C. Watkins, Learning from Delayed Rewards (PhD Thesis King's College, Oxford, 1989).
- (1989) Learning from Delayed Rewards
- Watkins, C.¹

36
- 0004192228
- Englewood Cliffs, N.J.: Prentice-Hall
- R. D. Cyert and J. G. March, A Behavioral Theory of the Firm (Englewood Cliffs, N.J.: Prentice-Hall, 1963).
- (1963) A Behavioral Theory of the Firm
- Cyert, R.D.¹ March, J.G.²

37
- 0001812752
- Exploration and exploitation in organizational learning
- J. March, Exploration and exploitation in organizational learning, Organization Science 2(1) (1991).
- (1991) Organization Science , vol.2 , pp. 1
- March, J.¹

38
- 0000723997
- Generalization in reinforcement learning: Successful examples using sparse coarse coding
- R. S. Sutton, Generalization in reinforcement learning: Successful examples using sparse coarse coding, Advances in Neural Information Processing Systems (1996).
- (1996) Advances in Neural Information Processing Systems
- Sutton, R.S.¹

39
- 85151728371
- Residual Algorithms: Reinforcement learning with function approximation
- L. Baird, Residual Algorithms: Reinforcement learning with function approximation, in Proceedings of the International Conference on Machine Learning (1995).
- (1995) Proceedings of the International Conference on Machine Learning
- Baird, L.¹

40
- 0031231885
- Experiments with Reinforcement Learning in Problems with Continuous State and Action Spaces
- J. C. Santamaria, R. S. Sutton and A. Ram, Experiments with reinforcement learning in problems with continuous state and action spaces, Adaptive Behavior 6(2) (1998) 163-218. (Pubitemid 128049217)
- (1997) Adaptive Behavior , vol.6 , Issue.2 , pp. 163-218
- Santamaria, J.C.¹ Sutton, R.S.² Ram, A.³

41
- 0003477315
- Reinforcement learning with high-dimensional, continuous actions
- Wright Laboratory
- L. Baird and H. Klopf, Reinforcement learning with high-dimensional, continuous actions, Technical Report WL-TR-93-1147 (Wright Laboratory, 1993).
- (1993) Technical Report WL-TR- 93-1147
- Baird, L.¹ Klopf, H.²

42
- 0001046225
- Practical issues in temporal difference learning
- G. Tesauro, Practical issues in temporal difference learning, Machine Learning 8 (1992) 257-277.
- (1992) Machine Learning , vol.8 , pp. 257-277
- Tesauro, G.¹

43
- 0000985504
- TD-Gammon, a self-teaching backgammon program achieves master-level play
- G. Tesauro, TD-Gammon, a self-teaching backgammon program achieves master-level play, Neural Computation 6 (1994) 215-219.
- (1994) Neural Computation , vol.6 , pp. 215-219
- Tesauro, G.¹

44
- 0029276036
- Temporal difference learning and TDGammon
- G. Tesauro, Temporal difference learning and TDGammon, Communications of the ACM9 38 (1995) 58-68.
- (1995) Communications of the ACM9 , vol.38 , pp. 58-68
- Tesauro, G.¹

45
- 0031185223
- Learning in dynamic decision tasks: Computational model and empirical evidence
- F. Gibson, M. Fichman and D. C. Plaut, Learning in dynamic decision tasks: Computational model and empirical evidence, Organizational Behavior and Human Decision Processes 71(1) (1997) 1-35.
- (1997) Organizational Behavior and Human Decision Processes , vol.71 , Issue.1 , pp. 1-35
- Gibson, F.¹ Fichman, M.² Plaut, D.C.³

46
- 71049120855
- Available at
- ROAD: Repast Organization for Architecture and Design Home Page, Available at http://repast. sourceforge.net/ (2009).
- (2009) ROAD: Repast Organization for Architecture and Design Home Page

47
- 0013528313
- Scaling reinforcement learning toward Robocup soccer
- Williams College: Morgan Kaufman
- P. Stone and R. S. Sutton, Scaling reinforcement learning toward Robocup soccer, Proceedings of the Eighteenth International Conference on Machine Learning, (Williams College: Morgan Kaufman, 2001), pp. 537-1534
- (2001) Proceedings of the Eighteenth International Conference on Machine Learning , pp. 537-1534
- Stone, P.¹ Sutton, R.S.²

48
- 0003270924
- Issues in using function approximation for reinforcement learning
- Hillsdale, NJ: Lawrence Erlbaum Publisher
- S. Thrun and A. Schwartz, Issues in using function approximation for reinforcement learning, Proceedings of the Fourth Connectionist Models Summer School (Hillsdale, NJ: Lawrence Erlbaum Publisher, 1993).
- (1993) Proceedings of the Fourth Connectionist Models Summer School
- Thrun, S.¹ Schwartz, A.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.