Volume 20, Issue 3, 2009, Pages 538-551

Near-term liability of exploitation: Exploration and exploitation in multistage problems

Author keywords

Exploration and exploitation; Maximization; Multistage problems; Reinforcement learning; Softmax choice rule


EID: 70350391648     PISSN: 10477039     EISSN: 15265455     Source Type: Journal    
DOI: 10.1287/orsc.1080.0376     Document Type: Article
Times cited: 47

References (90)
  • 3
    • 85012688561
    • Princeton University Press, Princeton, NJ
    • Bellman, R. 1957. Dynamic Programming. Princeton University Press, Princeton, NJ.
    • (1957) Dynamic Programming
    • Bellman, R.1
  • 4
    • 0037769506
    • Process management and technological innovation: A longitudinal study of the photography and paint industries
    • Benner, M., M. Tushman. 2002. Process management and technological innovation: A longitudinal study of the photography and paint industries. Admin. Sci. Quart. 47 676-706.
    • (2002) Admin. Sci. Quart. , vol.47 , pp. 676-706
    • Benner, M.1    Tushman, M.2
  • 5
    • 0037391031
    • Exploitation, exploration and process management: The productivity dilemma revisited
    • Benner, M., M. Tushman. 2003. Exploitation, exploration and process management: The productivity dilemma revisited. Acad. Management Rev. 28(2) 238-257.
    • (2003) Acad. Management Rev. , vol.28 , Issue.2 , pp. 238-257
    • Benner, M.1    Tushman, M.2
  • 6
    • 16244399557
    • Approximate optimal control as a model for motor learning
    • Berthier, N., M. Rosenstein, A. Barto. 2005. Approximate optimal control as a model for motor learning. Psych. Rev. 112(2) 329-346.
    • (2005) Psych. Rev. , vol.112 , Issue.2 , pp. 329-346
    • Berthier, N.1    Rosenstein, M.2    Barto, A.3
  • 7
    • 0001430502
    • Milestones for successful venture planning
    • Block, Z., I. MacMillan. 1985. Milestones for successful venture planning. Harvard Bus. Rev. 63(5) 184-190.
    • (1985) Harvard Bus. Rev. , vol.63 , Issue.5 , pp. 184-190
    • Block, Z.1    MacMillan, I.2
  • 8
    • 0010465428
    • Noise in the modeling and control of dynamical systems
    • Breeden, J., F. Dinkelacker, A. Huber. 1990. Noise in the modeling and control of dynamical systems. Physical Rev. A 42 5827-5836.
    • (1990) Physical Rev. A , vol.42 , pp. 5827-5836
    • Breeden, J.1    Dinkelacker, F.2    Huber, A.3
  • 9
    • 0003341991
    • Feedback delays in complex dynamic decision tasks
    • P. Frensch, J. Funke, eds. Erlbaum Associates, Hillsdale, NJ
    • Brehmer, B. 1995. Feedback delays in complex dynamic decision tasks. P. Frensch, J. Funke, eds. Complex Problem Solving, The European Perspective. Erlbaum Associates, Hillsdale, NJ.
    • (1995) Complex Problem Solving, The European Perspective
    • Brehmer, B.1
  • 10
    • 0030305349
    • Organizational evolution, learning and selection: A genetic algorithm-based model
    • Bruderer, E., J. Singh. 1996. Organizational evolution, learning and selection: A genetic algorithm-based model. Acad. Management J. 39 1322-1329.
    • (1996) Acad. Management J , vol.39 , pp. 1322-1329
    • Bruderer, E.1    Singh, J.2
  • 11
    • 18644365144
    • Experience-weighted attraction learning in normal form games
    • Camerer, C., T. H. Ho. 1999. Experience-weighted attraction learning in normal form games. Econometrica 67 837-874.
    • (1999) Econometrica , vol.67 , pp. 837-874
    • Camerer, C.1    Ho, T.H.2
  • 12
    • 12344331136
    • Blind variation and selective retention in creative thought processes
    • Campbell, D. T. 1960. Blind variation and selective retention in creative thought processes. Psych. Rev. 67 380-400.
    • (1960) Psych. Rev. , vol.67 , pp. 380-400
    • Campbell, D.T.1
  • 14
    • 2942703911
    • Matchmaking
    • Daw, N., P. Dayan. 2004. Matchmaking. Science 304 1753-1754.
    • (2004) Science , vol.304 , pp. 1753-1754
    • Daw, N.1    Dayan, P.2
  • 15
    • 33745223257
    • Cortical substrates for exploratory decisions in humans
    • Daw, N., J. O'Doherty, P. Dayan, B. Seymour, R. Dolan. 2006. Cortical substrates for exploratory decisions in humans. Nature 441(15) 876-879.
    • (2006) Nature , vol.441 , Issue.15 , pp. 876-879
    • Daw, N.1    O'Doherty, J.2    Dayan, P.3    Seymour, B.4    Dolan, R.5
  • 18
    • 8344285850
    • Learning from t-mazes to labyrinths: Learning from model-based feedback
    • Denrell, J., C. Fang, D. Levinthal. 2004. Learning from t-mazes to labyrinths: Learning from model-based feedback. Management Sci. 50(10) 1366-1378.
    • (2004) Management Sci , vol.50 , Issue.10 , pp. 1366-1378
    • Denrell, J.1    Fang, C.2    Levinthal, D.3
  • 19
    • 0001891355
    • Behavioral decision theory: Processes of judgment and choice
    • Einhorn, H., R. Hogarth. 1981. Behavioral decision theory: Processes of judgment and choice. J. Accounting Res. 19(1) 1-32.
    • (1981) J. Accounting Res. , vol.19 , Issue.1 , pp. 1-32
    • Einhorn, H.1    Hogarth, R.2
  • 21
    • 70350394336
    • Robust local search for spacecraft operations using adaptive noise
    • Workshop Planning Scheduling Space, Darmstadt, Germany
    • Fukunaga, A., G. Rabideau, S. Chien. 2004. Robust local search for spacecraft operations using adaptive noise. Proc. 4th Internat. Workshop Planning Scheduling Space, Darmstadt, Germany.
    • (2004) Proc. 4th Internat. Workshop Planning Scheduling Space
    • Fukunaga, A.1    Rabideau, G.2    Chien, S.3
  • 22
    • 61549113484
    • Simple models of discrete choice and their performance in bandit experiments
    • Gans, N., G. Knox, R. Croson. 2007. Simple models of discrete choice and their performance in bandit experiments. Manufacturing Service Oper. Management 9(4) 383-408.
    • (2007) Manufacturing Service Oper. Management , vol.9 , Issue.4 , pp. 383-408
    • Gans, N.1    Knox, G.2    Croson, R.3
  • 23
    • 0026252729
    • How the Baldrige award really works
    • Garvin, D. 1991. How the Baldrige award really works. Harvard Bus. Rev. 69(6) 80-93.
    • (1991) Harvard Bus. Rev. , vol.69 , Issue.6 , pp. 80-93
    • Garvin, D.1
  • 24
    • 0034339412
    • Looking forward and looking backward: Cognitive and experiential search
    • Gavetti, G., D. Levinthal. 2000. Looking forward and looking backward: Cognitive and experiential search. Admin. Sci. Quart. 45(1) 113-137.
    • (2000) Admin. Sci. Quart. , vol.45 , Issue.1 , pp. 113-137
    • Gavetti, G.1    Levinthal, D.2
  • 25
    • 21944448494
    • Strategy making in novel and complex worlds: The power of analogy
    • Gavetti, G., D. Levinthal, J. Rivkin. 2005. Strategy making in novel and complex worlds: The power of analogy. Strategic Management J. 26 691-712.
    • (2005) Strategic Management J , vol.26 , pp. 691-712
    • Gavetti, G.1    Levinthal, D.2    Rivkin, J.3
  • 26
    • 0031185223
    • Learning in dynamic decision tasks: Computational model and empirical evidence
    • Gibson, F., M. Fichman, D. Plaut. 1997. Learning in dynamic decision tasks: Computational model and empirical evidence. Organ. Behav. Human Decision Processes 71 11-35.
    • (1997) Organ. Behav. Human Decision Processes , vol.71 , pp. 11-35
    • Gibson, F.1    Fichman, M.2    Plaut, D.3
  • 31
    • 44949271572
    • On the control of complex dynamic systems
    • Jackson, E. 1991a. On the control of complex dynamic systems. Physica D 50 341-366.
    • (1991) Physica D , vol.50 , pp. 341-366
    • Jackson, E.1
  • 32
    • 33645050210
    • Controls of dynamic flows with attractors
    • Jackson, E. 1991b. Controls of dynamic flows with attractors. Physical Rev. A 44 4839-4853.
    • (1991) Physical Rev. A , vol.44 , pp. 4839-4853
    • Jackson, E.1
  • 34
    • 58149417364
    • On the psychology of prediction
    • Kahneman, D., A. Tversky. 1973. On the psychology of prediction. Psych. Rev. 80 237-251.
    • (1973) Psych. Rev. , vol.80 , pp. 237-251
    • Kahneman, D.1    Tversky, A.2
  • 38
    • 0001958927
    • Computer simulations of organizations as experiential learning systems: Implications for organization theory
    • K. Carley, M. Prietula, eds. Lawrence Erlbaum Associates, Hillsdale, NJ
    • Lant, T. K. 1994. Computer simulations of organizations as experiential learning systems: Implications for organization theory. K. Carley, M. Prietula, eds. Computational Organization Theory. Lawrence Erlbaum Associates, Hillsdale, NJ.
    • (1994) Computational Organization Theory
    • Lant, T.K.1
  • 39
    • 0000188232
    • Managing discontinuous change: A simulation study of organizational learning and entrepreneurship
    • Lant, T., S. Mezias. 1990. Managing discontinuous change: A simulation study of organizational learning and entrepreneurship. Strategic Management J. 11(4) 147-179.
    • (1990) Strategic Management J , vol.11 , Issue.4 , pp. 147-179
    • Lant, T.1    Mezias, S.2
  • 40
    • 0002840570
    • An organizational learning model of convergence and reorientation
    • Lant, T., S. Mezias. 1992. An organizational learning model of convergence and reorientation. Organ. Sci. 3(1) 47-71.
    • (1992) Organ. Sci. , vol.3 , Issue.1 , pp. 47-71
    • Lant, T.1    Mezias, S.2
  • 42
    • 33745217336
    • Best to go with what you know?
    • Lee, D. 2006. Best to go with what you know? Nature 441(15) 822-823.
    • (2006) Nature , vol.441 , Issue.15 , pp. 822-823
    • Lee, D.1
  • 44
    • 0031176451
    • Adaptation on rugged landscapes
    • Levinthal, D. 1997. Adaptation on rugged landscapes. Management Sci. 43(7) 934-950.
    • (1997) Management Sci , vol.43 , Issue.7 , pp. 934-950
    • Levinthal, D.1
  • 45
    • 1542793915
    • A model of adaptive organizational search
    • Levinthal, D., J. March. 1981. A model of adaptive organizational search. J. Econom. Behav. Organ. 2 307-333.
    • (1981) J. Econom. Behav. Organ. , vol.2 , pp. 307-333
    • Levinthal, D.1    March, J.2
  • 49
    • 0001230471
    • Bounded rationality, ambiguity and the engineering of choice
    • March, J. 1978. Bounded rationality, ambiguity and the engineering of choice. Bell J. Econom. 9(2) 587-608.
    • (1978) Bell J. Econom. , vol.9 , Issue.2 , pp. 587-608
    • March, J.1
  • 50
    • 0001812752
    • Exploration and exploitation in organization learning
    • March, J. 1991. Exploration and exploitation in organization learning. Organ. Sci. 2(1) 71-87.
    • (1991) Organ. Sci. , vol.2 , Issue.1 , pp. 71-87
    • March, J.1
  • 51
    • 0033472505
    • Avoiding complexity catastrophe in co-evolutionary pockets: Strategies for rugged landscape
    • McKelvey, B. 1999. Avoiding complexity catastrophe in co-evolutionary pockets: Strategies for rugged landscape. Organ. Sci. 10(3) 294-321.
    • (1999) Organ. Sci. , vol.10 , Issue.3 , pp. 294-321
    • McKelvey, B.1
  • 52
    • 84937350040
    • Steps towards artificial intelligence
    • Minsky, M. 1961. Steps towards artificial intelligence. Proc. Inst. Radio Engineers 49(1) 8-30.
    • (1961) Proc. Inst. Radio Engineers , vol.49 , Issue.1 , pp. 8-30
    • Minsky, M.1
  • 53
    • 0029981543
    • A framework for mesencephalic dopamine systems based on predictive Hebbian learning
    • Montague, P. R., P. Dayan, T. J. Sejnowski. 1996. A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J. Neuroscience 16 1936-1947.
    • (1996) J. Neuroscience , vol.16 , pp. 1936-1947
    • Montague, P.R.1    Dayan, P.2    Sejnowski, T.J.3
  • 54
    • 0142058800
    • A computational substrate for incentive salience
    • McClure, S., N. Daw, P. Montague. 2003. A computational substrate for incentive salience. Trends in Neurosciences 26 423-428.
    • (2003) Trends in Neurosciences , vol.26 , pp. 423-428
    • McClure, S.1    Daw, N.2    Montague, P.3
  • 56
    • 1942520195
    • Dissociable roles of ventral and dorsal striatum in instrumental conditioning
    • O'Doherty, J., P. Dayan, J. Schultz, R. Deichmann, K. Friston, R. Dolan. 2004. Dissociable roles of ventral and dorsal striatum in instrumental conditioning. Science 304 452-454.
    • (2004) Science , vol.304 , pp. 452-454
    • O'Doherty, J.1    Dayan, P.2    Schultz, J.3    Deichmann, R.4    Friston, K.5    Dolan, R.6
  • 57
    • 70350396045
    • Technical Report CCER-90-13, Department of Physics, Beckman Institute, University of Illinois, Urbana-Champaign
    • Ohle, F., F. Dinkelacker, A. Huber, M. Welge. 1990. Adaptive control of chaotic systems. Technical Report CCER-90-13, Department of Physics, Beckman Institute, University of Illinois, Urbana-Champaign.
    • (1990) Adaptive control of chaotic systems
    • Ohle, F.1    Dinkelacker, F.2    Huber, A.3    Welge, M.4
  • 58
    • 84939129511
    • Robust action and the rise of the Medici, 1400-1434
    • Padgett, J., C. Ansell. 1993. Robust action and the rise of the Medici, 1400-1434. Amer. J. Sociol. 98(6) 1259-1319.
    • (1993) Amer. J. Sociol. , vol.98 , Issue.6 , pp. 1259-1319
    • Padgett, J.1    Ansell, C.2
  • 60
    • 60449090848
    • Can science be a business?
    • (October)
    • Pisano, G. 2006. Can science be a business? Harvard Bus. Rev. (October) 1-12.
    • (2006) Harvard Bus. Rev. , pp. 1-12
    • Pisano, G.1
  • 63
    • 0034207532
    • Imitation of complex strategies
    • Rivkin, J. W. 2000. Imitation of complex strategies. Management Sci. 46(6) 824-844.
    • (2000) Management Sci , vol.46 , Issue.6 , pp. 824-844
    • Rivkin, J.W.1
  • 64
    • 58149324992
    • Learning in extensive form games: Experimental data and simple dynamic models in the intermediate term
    • Roth, A., I. Erev. 1995. Learning in extensive form games: Experimental data and simple dynamic models in the intermediate term. Games and Econom. Behav. 8 164-212.
    • (1995) Games and Econom. Behav. , vol.8 , pp. 164-212
    • Roth, A.1    Erev, I.2
  • 65
    • 0001201756
    • Some studies in machine learning using the game of checkers
    • Samuel, A. 1959. Some studies in machine learning using the game of checkers. IBM J. Res. Development 32 211-229.
    • (1959) IBM J. Res. Development , vol.32 , pp. 211-229
    • Samuel, A.1
  • 66
    • 0001201757
    • Some studies in machine learning using the game of checkers II-Recent progress
    • Samuel, A. 1967. Some studies in machine learning using the game of checkers II-Recent progress. IBM J. Res. Development 11 601-617.
    • (1967) IBM J. Res. Development , vol.11 , pp. 601-617
    • Samuel, A.1
  • 68
    • 0030896968
    • A neural substrate of prediction and reward
    • Schultz, W., P. Dayan, P. R. Montague. 1997. A neural substrate of prediction and reward. Science 275 1593-1599.
    • (1997) Science , vol.275 , pp. 1593-1599
    • Schultz, W.1    Dayan, P.2    Montague, P.R.3
  • 69
    • 70350392288
    • Incubation in problem solving as a context effect
    • Lawrence Erlbaum Associates, Mahwah, NJ
    • Seabrook, R., Z. Dienes. 2003. Incubation in problem solving as a context effect. Proc. 25th Meeting Cognitive Sci. Soc., Lawrence Erlbaum Associates, Mahwah, NJ.
    • (2003) Proc. 25th Meeting Cognitive Sci. Soc.
    • Seabrook, R.1    Dienes, Z.2
  • 70
    • 1842813242
    • Incubation in insight problem solving
    • Segal, E. 2004. Incubation in insight problem solving. Creativity Res. J. 16(1) 141-148.
    • (2004) Creativity Res. J , vol.16 , Issue.1 , pp. 141-148
    • Segal, E.1
  • 71
    • 0027706953
    • An empirical study of greedy local search for satisfiability testing
    • Selman, B., H. Kautz. 1993. An empirical study of greedy local search for satisfiability testing. Proc. Amer. Assoc. Artificial Intelligence, 46-51.
    • (1993) Proc. Amer. Assoc. Artificial Intelligence , pp. 46-51
    • Selman, B.1    Kautz, H.2
  • 73
    • 84959810873
    • A behavioral model of rational choice
    • Simon, H. 1955. A behavioral model of rational choice. Quart. J. Econom. 69(1) 99-118.
    • (1955) Quart. J. Econom. , vol.69 , Issue.1 , pp. 99-118
    • Simon, H.1
  • 74
    • 0002977490
    • Scientific discovery and the psychology of problem solving
    • R. Colodny, ed. University of Pittsburgh Press, Pittsburgh
    • Simon, H. A. 1966. Scientific discovery and the psychology of problem solving. R. Colodny, ed. Mind and Cosmos. University of Pittsburgh Press, Pittsburgh, 22-40.
    • (1966) Mind and Cosmos , pp. 22-40
    • Simon, H.A.1
  • 76
    • 11944265738
    • Invariants of human behavior
    • Simon, H. 1990. Invariants of human behavior. Annual Rev. Psych. 41 1-19.
    • (1990) Annual Rev. Psych. , vol.41 , pp. 1-19
    • Simon, H.1
  • 77
    • 0012969532
    • Rational decision-making in business organizations
    • A. Lindbeck, ed. World Scientific Publishing, Singapore
    • Simon, H. 1992. Rational decision-making in business organizations. A. Lindbeck, ed. Nobel Lectures, Economics 1969-1980. World Scientific Publishing, Singapore.
    • (1992) Nobel Lectures, Economics 1969-1980
    • Simon, H.1
  • 78
    • 0008188538
    • Foresight in insight? A Darwinian answer
    • R. J. Sternberg, J. E. Davidson, eds. MIT Press, Cambridge, MA
    • Simonton, D. 1995. Foresight in insight? A Darwinian answer. R. J. Sternberg, J. E. Davidson, eds. The Nature of Insight. MIT Press, Cambridge, MA, 465-494.
    • (1995) The Nature of Insight , pp. 465-494
    • Simonton, D.1
  • 79
    • 38649096027
    • Misperceptions of feedback in dynamic decision making
    • Sterman, J. 1989. Misperceptions of feedback in dynamic decision making. Organ. Behav. Human Decision Processes 43(3) 301-328.
    • (1989) Organ. Behav. Human Decision Processes , vol.43 , Issue.3 , pp. 301-328
    • Sterman, J.1
  • 80
    • 0031123734
    • Unanticipated side effects of successful quality programs: Exploring a paradox of organizational improvement
    • Sterman, J., N. Repenning, F. Kofman. 1997. Unanticipated side effects of successful quality programs: Exploring a paradox of organizational improvement. Management Sci. 43(4) 503-521.
    • (1997) Management Sci , vol.43 , Issue.4 , pp. 503-521
    • Sterman, J.1    Repenning, N.2    Kofman, F.3
  • 82
    • 0001394603
    • An adaptive network that constructs and uses an internal model of its world
    • Sutton, R., A. Barto. 1981. An adaptive network that constructs and uses an internal model of its world. Cognition Brain Theory 4(3) 217-246.
    • (1981) Cognition Brain Theory , vol.4 , Issue.3 , pp. 217-246
    • Sutton, R.1    Barto, A.2
  • 86
    • 0002621983
    • Animal intelligence: An experimental study of the associative processes in animals
    • Monograph Supplements
    • Thorndike, E. 1898. Animal intelligence: An experimental study of the associative processes in animals. Psych. Rev. Monograph Supplements 8.
    • (1898) Psych. Rev. , pp. 8
    • Thorndike, E.1
  • 89
    • 1942443226
    • Predicting risk sensitivity in humans and lower animals: Risk as variance or coefficient of variation
    • Weber, E., S. Shafir, A.-R. Blais. 2004. Predicting risk sensitivity in humans and lower animals: Risk as variance or coefficient of variation. Psych. Rev. 111 430-445.
    • (2004) Psych. Rev. , vol.111 , pp. 430-445
    • Weber, E.1    Shafir, S.2    Blais, A.-R.3
  • 90
    • 0023319002
    • Activation and metacognition of inaccessible stored information: Potential bases of incubation effects in problem solving
    • Yaniv, I., D. Meyer. 1987. Activation and metacognition of inaccessible stored information: Potential bases of incubation effects in problem solving. J. Experiment. Psych. Learn., Memory, Cognition 13 187-205.
    • (1987) J. Experiment. Psych. Learn., Memory, Cognition , vol.13 , pp. 187-205
    • Yaniv, I.1    Meyer, D.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.