



Volume 469, Issue 2153, 2013, Pages

Thermodynamics as a theory of decision-making with information-processing costs

Author keywords

Bounded rationality; Decision making; Information processing

Indexed keywords

COSTS; DATA PROCESSING; DECISION MAKING; EXPERIMENTS; FREE ENERGY; MACHINERY; OPTIMIZATION; PHYSICS; THERMODYNAMICS; VARIATIONAL TECHNIQUES

EID: 84877280273     PISSN: 13645021     EISSN: 14712946     Source Type: Journal    
DOI: 10.1098/rspa.2012.0683     Document Type: Article
Times cited : (236)

References (92)
  • 3
    • 33847675011
    • Tonic dopamine: Opportunity costs and the control of response vigor
    • doi:10.1007/s00213-006-0502-4
    • Niv Y, Daw ND, Joel D, Dayan P. 2007 Tonic dopamine: opportunity costs and the control of response vigor. Psychopharmacology 191, 507-520. (doi:10.1007/s00213-006-0502-4).
    • (2007) Psychopharmacology , vol.191 , pp. 507-520
    • Niv, Y.1    Daw, N.D.2    Joel, D.3    Dayan, P.4
  • 4
    • 84877264809
    • Model-based reinforcement learning as cognitive search: Neurocomputational theories
    • Boston, MA: MIT Press
    • Daw ND. 2012 Model-based reinforcement learning as cognitive search: neurocomputational theories. In Cognitive search: evolution, algorithms, and the brain. Boston, MA: MIT Press.
    • (2012) Cognitive Search: Evolution, Algorithms, and the Brain
    • Daw, N.D.1
  • 8
    • 58149433367
    • Rational choice and the structure of the environment
    • doi:10.1037/h0042769
    • Simon HA. 1956 Rational choice and the structure of the environment. Psychol. Rev. 63, 129-138. (doi:10.1037/h0042769).
    • (1956) Psychol. Rev. , vol.63 , pp. 129-138
    • Simon, H.A.1
  • 9
    • 0000597948
    • Theories of bounded rationality
    • (eds CB McGuire, R Radner), Amsterdam, The Netherlands: North-Holland
    • Simon H. 1972 Theories of bounded rationality. In Decision and organization (eds CB McGuire, R Radner), pp. 161-176. Amsterdam, The Netherlands: North-Holland.
    • (1972) Decision and Organization , pp. 161-176
    • Simon, H.1
  • 11
    • 0002148464
    • Information processing and bounded rationality: A survey
    • doi:10.2307/136022
    • Lipman B. 1995 Information processing and bounded rationality: a survey. Canad. J. Econ. 28, 42-67. (doi:10.2307/136022).
    • (1995) Canad. J. Econ. , vol.28 , pp. 42-67
    • Lipman, B.1
  • 12
    • 0342398261
    • Rationality and intelligence
    • (ed. C Mellish), San Francisco, CA: Morgan Kaufmann
    • Russell SJ. 1995 Rationality and intelligence. In Proc. 14th Int. Joint Conf. on Artificial Intelligence (ed. C Mellish), pp. 950-957. San Francisco, CA: Morgan Kaufmann.
    • (1995) Proc. 14th Int. Joint Conf. on Artificial Intelligence , pp. 950-957
    • Russell, S.J.1
  • 14
    • 0031256620
    • Rationality and bounded rationality
    • doi:10.1006/game.1997.0585
    • Aumann RJ. 1997 Rationality and bounded rationality. Games Econ. Behav. 21, 2-14. (doi:10.1006/game.1997.0585).
    • (1997) Games Econ. Behav. , vol.21 , pp. 2-14
    • Aumann, R.J.1
  • 17
    • 2942700268
    • Maps of bounded rationality: Psychology for behavioral economics
    • doi:10.1257/000282803322655392
    • Kahneman D. 2003 Maps of bounded rationality: psychology for behavioral economics. Am. Econ. Rev. 93, 1449-1475. (doi:10.1257/000282803322655392).
    • (2003) Am. Econ. Rev. , vol.93 , pp. 1449-1475
    • Kahneman, D.1
  • 20
    • 0348166371
    • Quantal response equilibria for normal form games
    • doi:10.1006/game.1995.1023
    • McKelvey RD, Palfrey TR. 1995 Quantal response equilibria for normal form games. Games Econ. Behav. 10, 6-38. (doi:10.1006/game.1995.1023).
    • (1995) Games Econ. Behav. , vol.10 , pp. 6-38
    • McKelvey, R.D.1    Palfrey, T.R.2
  • 21
    • 0003087027
    • Quantal response equilibria for extensive form games
    • doi:10.1007/BF01426213
    • McKelvey R, Palfrey TR. 1998 Quantal response equilibria for extensive form games. Exp. Econ. 1, 9-41. (doi:10.1007/BF01426213).
    • (1998) Exp. Econ. , vol.1 , pp. 9-41
    • McKelvey, R.1    Palfrey, T.R.2
  • 22
    • 14344265932
    • Information theory - The bridge connecting bounded rational game theory and statistical physics
    • (eds D Braha, Y Bar-Yam). Cambridge, MA: Perseus Books
    • Wolpert DH. 2004 Information theory-the bridge connecting bounded rational game theory and statistical physics. In Complex engineering systems (eds D Braha, Y Bar-Yam). Cambridge, MA: Perseus Books.
    • (2004) Complex Engineering Systems
    • Wolpert, D.H.1
  • 23
    • 84858266821
    • Hysteresis effects of changing parameters of noncooperative games
    • doi:10.1103/PhysRevE.85.036102
    • Wolpert DH, Harré M, Bertschinger N, Olbrich E, Jost J. 2012 Hysteresis effects of changing parameters of noncooperative games. Phys. Rev. E 85, 036102. (doi:10.1103/PhysRevE.85.036102).
    • (2012) Phys. Rev. E , vol.85 , pp. 036102
    • Wolpert, D.H.1    Harré, M.2    Bertschinger, N.3    Olbrich, E.4    Jost, J.5
  • 27
    • 0002297105
    • Conditional logit analysis of qualitative choice behavior
    • (ed. P Zarembka), New York, NY: Academic Press
    • McFadden D. 1974 Conditional logit analysis of qualitative choice behavior. In Frontiers in econometrics (ed. P Zarembka), pp. 105-142. New York, NY: Academic Press.
    • (1974) Frontiers in Econometrics , pp. 105-142
    • McFadden, D.1
  • 28
    • 12844285237
    • A new class of symmetric utility rules for gambles, subjective marginal probability functions, and a generalized Bayes' rule
    • Meginnis JR. 1976 A new class of symmetric utility rules for gambles, subjective marginal probability functions, and a generalized Bayes' rule. In Proc. American Statistical Association, Business and Economic Statistics Section, pp. 471-476.
    • (1976) Proc. American Statistical Association, Business and Economic Statistics Section , pp. 471-476
    • Meginnis, J.R.1
  • 29
    • 0000466473
    • Learning mixed equilibria
    • doi:10.1006/game.1993.1021
    • Fudenberg D, Kreps D. 1993 Learning mixed equilibria. Games Econ. Behav. 5, 320-367. (doi:10.1006/game.1993.1021).
    • (1993) Games Econ. Behav. , vol.5 , pp. 320-367
    • Fudenberg, D.1    Kreps, D.2
  • 33
    • 79961193055
    • Lecture Notes on Artificial Intelligence, vol. 6830, Berlin, Germany: Springer
    • Ortega PA, Braun DA. 2011 Information, utility and bounded rationality. Lecture Notes on Artificial Intelligence, vol. 6830, pp. 269-274. Berlin, Germany: Springer.
    • (2011) Information, Utility and Bounded Rationality , pp. 269-274
    • Ortega, P.A.1    Braun, D.A.2
  • 35
    • 0000125532
    • Prospect theory: An analysis of decision under risk
    • doi:10.2307/1914185
    • Kahneman D, Tversky A. 1979 Prospect theory: an analysis of decision under risk. Econometrica 47, 263-291. (doi:10.2307/1914185).
    • (1979) Econometrica , vol.47 , pp. 263-291
    • Kahneman, D.1    Tversky, A.2
  • 36
    • 67649653956
    • The free-energy principle: A rough guide to the brain?
    • doi:10.1016/j.tics.2009.04.005
    • Friston K. 2009 The free-energy principle: a rough guide to the brain? Trends Cogn. Sci. 13, 293-301. (doi:10.1016/j.tics.2009.04.005).
    • (2009) Trends Cogn. Sci. , vol.13 , pp. 293-301
    • Friston, K.1
  • 37
    • 75549090229
    • The free-energy principle: A unified brain theory?
    • doi:10.1038/nrn2787
    • Friston K. 2010 The free-energy principle: a unified brain theory? Nat. Rev. Neurosci. 11, 127-138. (doi:10.1038/nrn2787).
    • (2010) Nat. Rev. Neurosci. , vol.11 , pp. 127-138
    • Friston, K.1
  • 39
    • 0000328287
    • Irreversibility and heat generation in the computing process
    • doi:10.1147/rd.53.0183
    • Landauer R. 1961 Irreversibility and heat generation in the computing process. IBM J. Res. Dev. 5, 183-191. (doi:10.1147/rd.53.0183).
    • (1961) IBM J. Res. Dev. , vol.5 , pp. 183-191
    • Landauer, R.1
  • 40
    • 0000244133
    • Energy and information
    • doi:10.1038/scientificamerican0971-179
    • Tribus M, McIrvine EC. 1971 Energy and information. Scient. Am. 225, 179-188. (doi:10.1038/scientificamerican0971-179).
    • (1971) Scient. Am. , vol.225 , pp. 179-188
    • Tribus, M.1    McIrvine, E.C.2
  • 41
    • 0015680909
    • Logical reversibility of computation
    • doi:10.1147/rd.176.0525
    • Bennett CH. 1973 Logical reversibility of computation. IBM J. Res. Dev. 17, 525-532. (doi:10.1147/rd.176.0525).
    • (1973) IBM J. Res. Dev. , vol.17 , pp. 525-532
    • Bennett, C.H.1
  • 42
    • 0010146684
    • The thermodynamics of computation-A review
    • doi:10.1007/BF02084158
    • Bennett CH. 1982 The thermodynamics of computation-a review. Int. J. Theoret. Phys. 21, 905-940. (doi:10.1007/BF02084158).
    • (1982) Int. J. Theoret. Phys. , vol.21 , pp. 905-940
    • Bennett, C.H.1
  • 44
  • 45
    • 33750347385
    • The physics of optimal decision making: A formal analysis of models of performance in two-alternative forced-choice tasks
    • doi:10.1037/0033-295X.113.4.700
    • Bogacz R, Brown E, Moehlis J, Holmes P, Cohen JD. 2006 The physics of optimal decision making: a formal analysis of models of performance in two-alternative forced-choice tasks. Psychol. Rev. 113, 700-765. (doi:10.1037/0033-295X.113.4.700).
    • (2006) Psychol. Rev. , vol.113 , pp. 700-765
    • Bogacz, R.1    Brown, E.2    Moehlis, J.3    Holmes, P.4    Cohen, J.D.5
  • 46
    • 84908178296
    • Free energy and the generalized optimality equations for sequential decision making
    • Edinburgh, UK
    • Ortega PA, Braun DA. 2012 Free energy and the generalized optimality equations for sequential decision making. In European Workshop for Reinforcement Learning, Edinburgh, UK.
    • (2012) European Workshop for Reinforcement Learning
    • Ortega, P.A.1    Braun, D.A.2
  • 47
    • 28844435646
    • A linear theory for control of non-linear stochastic systems
    • doi:10.1103/PhysRevLett.95.200201
    • Kappen HJ. 2005 A linear theory for control of non-linear stochastic systems. Phys. Rev. Lett. 95, 200201. (doi:10.1103/PhysRevLett.95.200201).
    • (2005) Phys. Rev. Lett. , vol.95 , pp. 200201
    • Kappen, H.J.1
  • 48
    • 84864055301
    • Linearly solvable Markov decision problems
    • Vancouver, Canada
    • Todorov E. 2006 Linearly solvable Markov decision problems. In Advances in neural information processing systems, Vancouver, Canada, vol. 19, pp. 1369-1376.
    • (2006) Advances in Neural Information Processing Systems , vol.19 , pp. 1369-1376
    • Todorov, E.1
  • 49
    • 67650915125
    • Efficient computation of optimal actions
    • doi:10.1073/pnas.0710743106
    • Todorov E. 2009 Efficient computation of optimal actions. Proc. Natl Acad. Sci. USA 106, 11478-11483. (doi:10.1073/pnas.0710743106).
    • (2009) Proc. Natl Acad. Sci. USA , vol.106 , pp. 11478-11483
    • Todorov, E.1
  • 50
    • 84862024986
    • Optimal control as a graphical model inference problem
    • doi:10.1007/S10994-012-5278-7
    • Kappen HJ, Gómez V, Opper M. 2012 Optimal control as a graphical model inference problem. Mach. Learn. 87, 159-182. (doi:10.1007/S10994-012-5278-7).
    • (2012) Mach. Learn. , vol.87 , pp. 159-182
    • Kappen, H.J.1    Gómez, V.2    Opper, M.3
  • 52
    • 0003787146
    • Princeton, NJ: Princeton University Press
    • Bellman RE. 1957 Dynamic programming. Princeton, NJ: Princeton University Press.
    • (1957) Dynamic Programming
    • Bellman, R.E.1
  • 55
    • 33646467175
    • Princeton, NJ: Princeton University Press
    • Hansen LP, Sargent TJ. 2008 Robustness. Princeton, NJ: Princeton University Press.
    • (2008) Robustness
    • Hansen, L.P.1    Sargent, T.J.2
  • 56
    • 0015615984
    • Optimal stochastic linear systems with exponential performance criteria and their relation to deterministic differential games
    • doi:10.1109/TAC.1973.1100265
    • Jacobson D. 1973 Optimal stochastic linear systems with exponential performance criteria and their relation to deterministic differential games. IEEE Trans. Automat. Control 18, 124-131. (doi:10.1109/TAC.1973.1100265).
    • (1973) IEEE Trans. Automat. Control , vol.18 , pp. 124-131
    • Jacobson, D.1
  • 57
    • 0024085053
    • State-space formulae for all stabilizing controllers that satisfy an H∞-norm bound and relations to risk sensitivity
    • doi:10.1016/0167-6911(88)90055-2
    • Glover K, Doyle J. 1988 State-space formulae for all stabilizing controllers that satisfy an H∞-norm bound and relations to risk sensitivity. Syst. Control Lett. 11, 167-172. (doi:10.1016/0167-6911(88)90055-2).
    • (1988) Syst. Control Lett. , vol.11 , pp. 167-172
    • Glover, K.1    Doyle, J.2
  • 58
    • 84866328209
    • Risk-sensitive path integral control
    • Catalina Island, CA
    • van den Broek JL, Wiegerinck WAJJ, Kappen HJ. 2010 Risk-sensitive path integral control. In UAI, Catalina Island, CA, vol. 6, pp. 1-8.
    • (2010) UAI , vol.6 , pp. 1-8
    • Van Den Broek, J.L.1    Wiegerinck, W.A.J.J.2    Kappen, H.J.3
  • 59
    • 84957363402
    • Risk, ambiguity and the Savage axioms
    • doi:10.2307/1884324
    • Ellsberg D. 1961 Risk, ambiguity and the Savage axioms. Q. J. Econ. 75, 643-669. (doi:10.2307/1884324).
    • (1961) Q. J. Econ. , vol.75 , pp. 643-669
    • Ellsberg, D.1
  • 60
    • 33750564062
    • Ambiguity aversion, robustness, and the variational representation of preferences
    • doi:10.1111/j.1468-0262.2006.00716.x
    • Maccheroni F, Marinacci M, Rustichini A. 2006 Ambiguity aversion, robustness, and the variational representation of preferences. Econometrica 74, 1447-1498. (doi:10.1111/j.1468-0262.2006.00716.x).
    • (2006) Econometrica , vol.74 , pp. 1447-1498
    • Maccheroni, F.1    Marinacci, M.2    Rustichini, A.3
  • 61
    • 0001248680
    • Le comportement de l'homme rationnel devant le risque: Critique des postulats et axiomes de l'école américaine
    • doi:10.2307/1907921
    • Allais M. 1953 Le comportement de l'homme rationnel devant le risque: critique des postulats et axiomes de l'école américaine. Econometrica 21, 503-546. (doi:10.2307/1907921).
    • (1953) Econometrica , vol.21 , pp. 503-546
    • Allais, M.1
  • 63
    • 65549115914
    • Economic decision-making compared with an equivalent motor task
    • doi:10.1073/pnas.0900102106
    • Wu S, Delgado MR, Maloney LT. 2009 Economic decision-making compared with an equivalent motor task. Proc. Natl Acad. Sci. USA 106, 6088-6093. (doi:10.1073/pnas.0900102106).
    • (2009) Proc. Natl Acad. Sci. USA , vol.106 , pp. 6088-6093
    • Wu, S.1    Delgado, M.R.2    Maloney, L.T.3
  • 64
    • 31744450082
    • Advances in prospect theory: Cumulative representation of uncertainty
    • doi:10.1007/BF00122574
    • Tversky A, Kahneman D. 1992 Advances in prospect theory: cumulative representation of uncertainty. J. Risk Uncertain. 5, 297-323. (doi:10.1007/BF00122574).
    • (1992) J. Risk Uncertain , vol.5 , pp. 297-323
    • Tversky, A.1    Kahneman, D.2
  • 65
    • 0001627170
    • A generalization of the quasilinear mean with application to the measurement of income inequality and decision theory resolving the Allais paradox
    • doi:10.2307/1912052
    • Hong CS. 1983 A generalization of the quasilinear mean with application to the measurement of income inequality and decision theory resolving the Allais paradox. Econometrica 51, 1065-1092. (doi:10.2307/1912052).
    • (1983) Econometrica , vol.51 , pp. 1065-1092
    • Hong, C.S.1
  • 66
    • 0002048382
    • Über eine Klasse der Mittelwerte
    • Nagumo M. 1930 Über eine Klasse der Mittelwerte. Japan J. Math. 7, 71-79.
    • (1930) Japan J. Math. , vol.7 , pp. 71-79
    • Nagumo, M.1
  • 67
    • 0000971462
    • Sur la notion de la moyenne
    • Kolmogorov A. 1930 Sur la notion de la moyenne. Rend. Accad. Lincei 12, 388-391.
    • (1930) Rend. Accad. Lincei , vol.12 , pp. 388-391
    • Kolmogorov, A.1
  • 69
    • 0026482814
    • The analysis of visual motion: A comparison of neuronal and psychophysical performance
    • Britten K, Shadlen MN, Newsome WT, Movshon JA. 1992 The analysis of visual motion: a comparison of neuronal and psychophysical performance. J. Neurosci. 12, 4745-4767.
    • (1992) J. Neurosci. , vol.12 , pp. 4745-4767
    • Britten, K.1    Shadlen, M.N.2    Newsome, W.T.3    Movshon, J.A.4
  • 70
    • 0037860978
    • Statistical decision theory and trade-offs in the control of motor response
    • doi:10.1163/156856803322467527
    • Trommershauser J, Maloney LT, Landy MS. 2003 Statistical decision theory and trade-offs in the control of motor response. Spat. Vis. 16, 255-275. (doi:10.1163/156856803322467527).
    • (2003) Spat. Vis. , vol.16 , pp. 255-275
    • Trommershauser, J.1    Maloney, L.T.2    Landy, M.S.3
  • 71
    • 4344685154
    • Optimality principles in sensorimotor control
    • doi:10.1038/nn1309
    • Todorov E. 2004 Optimality principles in sensorimotor control. Nat. Neurosci. 7, 907-915. (doi:10.1038/nn1309).
    • (2004) Nat. Neurosci. , vol.7 , pp. 907-915
    • Todorov, E.1
  • 72
    • 84878184749
    • Motor control is decision-making
    • doi:10.1016/j.conb.2012.05.003
    • Wolpert DM, Landy MS. 2012 Motor control is decision-making. Curr. Opin. Neurobiol. 22, 996-1003. (doi:10.1016/j.conb.2012.05.003).
    • (2012) Curr. Opin. Neurobiol. , vol.22 , pp. 996-1003
    • Wolpert, D.M.1    Landy, M.S.2
  • 73
    • 78049261093
    • Risk-sensitive optimal feedback control accounts for sensorimotor behavior under uncertainty
    • doi:10.1371/journal.pcbi.1000857
    • Nagengast AJ, Braun DA, Wolpert DM. 2010 Risk-sensitive optimal feedback control accounts for sensorimotor behavior under uncertainty. PLoS Comput. Biol. 6, e1000857. (doi:10.1371/journal.pcbi.1000857).
    • (2010) PLoS Comput. Biol. , vol.6
    • Nagengast, A.J.1    Braun, D.A.2    Wolpert, D.M.3
  • 74
    • 79959731729
    • Risk-sensitivity and the mean-variance tradeoff: Decision making in sensorimotor control
    • doi:10.1098/rspb.2010.2518
    • Nagengast AJ, Braun DA, Wolpert DM. 2011 Risk-sensitivity and the mean-variance tradeoff: decision making in sensorimotor control. Proc. R. Soc. B 278, 2325-2332. (doi:10.1098/rspb.2010.2518).
    • (2011) Proc. R. Soc. B , vol.278 , pp. 2325-2332
    • Nagengast, A.J.1    Braun, D.A.2    Wolpert, D.M.3
  • 75
    • 79959347132
    • Risk sensitivity in a motor task with speed-accuracy trade-off
    • doi:10.1152/jn.00804.2010
    • Nagengast AJ, Braun DA, Wolpert DM. 2011 Risk sensitivity in a motor task with speed-accuracy trade-off. J. Neurophysiol. 105, 2668-2674. (doi:10.1152/jn.00804.2010).
    • (2011) J. Neurophysiol. , vol.105 , pp. 2668-2674
    • Nagengast, A.J.1    Braun, D.A.2    Wolpert, D.M.3
  • 76
    • 81555216199
    • Risk-sensitivity in sensorimotor control
    • doi:10.3389/fnhum.2011.00001
    • Braun DA, Nagengast AJ, Wolpert DM. 2011 Risk-sensitivity in sensorimotor control. Front. Hum. Neurosci. 5, 1. (doi:10.3389/fnhum.2011.00001).
    • (2011) Front. Hum. Neurosci. , vol.5 , pp. 1
    • Braun, D.A.1    Nagengast, A.J.2    Wolpert, D.M.3
  • 77
    • 84866932531
    • Risk-sensitivity in Bayesian sensorimotor integration
    • doi:10.1371/journal.pcbi.1002698
    • Grau-Moya J, Ortega PA, Braun DA. 2012 Risk-sensitivity in Bayesian sensorimotor integration. PLoS Comput. Biol. 8, e1002698. (doi:10.1371/journal.pcbi.1002698).
    • (2012) PLoS Comput. Biol. , vol.8
    • Grau-Moya, J.1    Ortega, P.A.2    Braun, D.A.3
  • 80
    • 79551503171
    • A generalized path integral approach to reinforcement learning
    • Theodorou E, Buchli J, Schaal S. 2010 A generalized path integral approach to reinforcement learning. J. Mach. Learn. Res. 11, 3137-3181.
    • (2010) J. Mach. Learn. Res. , vol.11 , pp. 3137-3181
    • Theodorou, E.1    Buchli, J.2    Schaal, S.3
  • 81
    • 84877282363
    • On stochastic optimal control and reinforcement learning by approximate inference
    • Sydney, Australia
    • Rawlik K, Toussaint M, Vijayakumar S. 2012 On stochastic optimal control and reinforcement learning by approximate inference. In Proc. Robotics: Science and Systems, Sydney, Australia.
    • (2012) Proc. Robotics: Science and Systems
    • Rawlik, K.1    Toussaint, M.2    Vijayakumar, S.3
  • 82
    • 0024057250
    • Entropy formulation for optimal and adaptive control
    • doi:10.1109/9.1287
    • Saridis G. 1988 Entropy formulation for optimal and adaptive control. IEEE Trans. Automat. Control 33, 713-721. (doi:10.1109/9.1287).
    • (1988) IEEE Trans. Automat. Control , vol.33 , pp. 713-721
    • Saridis, G.1
  • 83
    • 79051470133
    • An information-theoretic approach to interactive learning
    • doi:10.1209/0295-5075/85/28005
    • Still S. 2009 An information-theoretic approach to interactive learning. Europhys. Lett. 85, 28005. (doi:10.1209/0295-5075/85/28005).
    • (2009) Europhys. Lett. , vol.85 , pp. 28005
    • Still, S.1
  • 85
    • 0347901758
    • Entropy and search theory
    • (eds CR Smith, WT Grandy), Dordrecht, The Netherlands: D. Reidel
    • Jaynes ET. 1985 Entropy and search theory. In Maximum entropy and Bayesian methods in inverse problems (eds CR Smith, WT Grandy). Dordrecht, The Netherlands: D. Reidel.
    • (1985) Maximum Entropy and Bayesian Methods in Inverse Problems
    • Jaynes, E.T.1
  • 87
    • 33644652516
    • Time, space, and energy in reversible computing
    • Ischia, Italy
    • Vitanyi PMB. 2005 Time, space, and energy in reversible computing. In Proc. 2nd ACM Conf. on Computing Frontiers, Ischia, Italy, pp. 435-444.
    • (2005) Proc. 2nd ACM Conf. on Computing Frontiers , pp. 435-444
    • Vitanyi, P.M.B.1
  • 88
    • 56449086189
    • The probabilistic nature of preferential choice
    • doi:10.1037/a0013646
    • Rieskamp J. 2008 The probabilistic nature of preferential choice. J. Exp. Psychol.: Learn. Mem. Cogn. 34, 1446-1465. (doi:10.1037/a0013646).
    • (2008) J. Exp. Psychol.: Learn. Mem. Cogn. , vol.34 , pp. 1446-1465
    • Rieskamp, J.1
  • 89
    • 77953260848
    • States versus rewards: Dissociable neural prediction error signals underlying model-based and model-free reinforcement learning
    • doi:10.1016/j.neuron.2010.04.016
    • Gläscher J, Daw N, Dayan P, O'Doherty JP. 2010 States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning. Neuron 66, 585-595. (doi:10.1016/j.neuron.2010.04.016).
    • (2010) Neuron , vol.66 , pp. 585-595
    • Gläscher, J.1    Daw, N.2    Dayan, P.3    O'Doherty, J.P.4
  • 90
    • 84859323549
    • Model-based learning and the contribution of the orbitofrontal cortex to the model-free world
    • doi:10.1111/j.1460-9568.2011.07982.x
    • McDannald MA, Takahashi YK, Lopatina N, Pietras BW, Jones JL, Schoenbaum G. 2012 Model-based learning and the contribution of the orbitofrontal cortex to the model-free world. Eur. J. Neurosci. 35, 991-996. (doi:10.1111/j.1460-9568.2011.07982.x).
    • (2012) Eur. J. Neurosci. , vol.35 , pp. 991-996
    • McDannald, M.A.1    Takahashi, Y.K.2    Lopatina, N.3    Pietras, B.W.4    Jones, J.L.5    Schoenbaum, G.6
  • 92
    • 84988300338
    • Dynamics of Bayesian updating with dependent data and misspecified models
    • doi:10.1214/09-EJS485
    • Shalizi CR. 2009 Dynamics of Bayesian updating with dependent data and misspecified models. Electron. J. Statist. 3, 1039-1074. (doi:10.1214/09-EJS485).
    • (2009) Electron. J. Statist. , vol.3 , pp. 1039-1074
    • Shalizi, C.R.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.