SCOPUS 정보 검색 플랫폼

Frontiers in Psychology

Volumn 1, Issue NOV, 2010, Pages

Credit assignment in multiple goal embodied visuomotor behavior

(2) Rothkopf, Constantin A a Ballard, Dana H b

a FRANKFURT INSTITUTE FOR ADVANCED STUDIES (Germany)

b University of Texas at Austin (United States)

Author keywords

Credit assignment; Learning; Modules; Reinforcement; Reward

Indexed keywords

EID: 80054091681 PISSN: None EISSN: 16641078 Source Type: Journal
DOI: 10.3389/fpsyg.2010.00173 Document Type: Article

Times cited : (31)

References (60)

1
- 0004208648
- Cambridge, MA: Harvard University Press
- Anderson, J. (1983). The Architecture of Cognition. Cambridge, MA: Harvard University Press.
- (1983) The Architecture of Cognition
- Anderson, J.¹

2
- 0003869829
- Cambridge, MA: MIT Press
- Arkin, R. (1998). Behavior Based Robotics. Cambridge, MA: MIT Press.
- (1998) Behavior Based Robotics
- Arkin, R.¹

3
- 0033170833
- Animation control for real-time virtual humans
- Badler, N., Palmer, M., and Bindiganavale, R. (1999). Animation control for real-time virtual humans. Commun. ACM 42, 64-73.
- (1999) Commun. ACM , vol.42 , pp. 64-73
- Badler, N.¹ Palmer, M.² Bindiganavale, R.³

4
- 0003462092
- New York, NY: Oxford University Press
- Badler, N. I., Phillips, C. B., and Webber, B. L. (1993). Simulating Humans: Computer Graphics Animation and Control. New York, NY: Oxford University Press.
- (1993) Simulating Humans: Computer Graphics Animation and Control
- Badler, N.I.¹ Phillips, C.B.² Webber, B.L.³

5
- 0031429913
- Deictic codes for the embodiment of cognition
- Ballard, D. H., Hayhoe, M. M., Pook, P. K., and Rao, R. P. N. (1997). Deictic codes for the embodiment of cognition. Behav. Brain Sci. 20, 723-767.
- (1997) Behav. Brain Sci. , vol.20 , pp. 723-767
- Ballard, D.H.¹ Hayhoe, M.M.² Pook, P.K.³ Rao, R.P.N.⁴

6
- 66149104409
- Simulation, situated conceptualization, and prediction
- Barsalou, L. (2009). Simulation, situated conceptualization, and prediction. Phil. Trans. R. Soc. B 364, 1281-1289.
- (2009) Phil. Trans. R. Soc. B , vol.364 , pp. 1281-1289
- Barsalou, L.¹

7
- 0141988716
- Recent advances in hierarchical reinforcement learning
- Barto, A. G., and Mahadevan, S. (2003). Recent advances in hierarchical reinforcement learning. Discrete Event Dyn. Syst. 13, 41-77.
- (2003) Discrete Event Dyn. Syst. , vol.13 , pp. 41-77
- Barto, A.G.¹ Mahadevan, S.²

8
- 0031524077
- Experiences with an architecture for intelligent, reactive agents
- Bonasso, R. P., Firby, J. R., Gat, E., Kortenkamp, D., Miller, D. D. P., and Slack, M. G. (1997). Experiences with an architecture for intelligent, reactive agents. J. Exp. Theor. Artif. Intell. 9, 237-256.
- (1997) J. Exp. Theor. Artif. Intell. , vol.9 , pp. 237-256
- Bonasso, R.P.¹ Firby, J.R.² Gat, E.³ Kortenkamp, D.⁴ Miller, D.D.P.⁵ Slack, M.G.⁶

9
- 0022688781
- A robust layered control system for a mobile robot
- Brooks, R. (1986). A robust layered control system for a mobile robot. IEEE J. Robot. Autom. 2, 14-23.
- (1986) IEEE J. Robot. Autom. , vol.2 , pp. 14-23
- Brooks, R.¹

10
- 84880894382
- Modularity and design in reactive intelligence
- Seattle, WA
- Bryson, J. J., and Stein, L. A. (2001). "Modularity and design in reactive intelligence," in International Joint Conference on Artificial Intelligence, Seattle, WA.
- (2001) International Joint Conference on Artificial Intelligence
- Bryson, J.J.¹ Stein, L.A.²

11
- 28044450875
- Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
- Daw, N. D., Niv, Y., and Dayan, P. (2005). Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat. Neurosci. 8, 1704-1711.
- (2005) Nat. Neurosci. , vol.8 , pp. 1704-1711
- Daw, N.D.¹ Niv, Y.² Dayan, P.³

12
- 33745223257
- Cortical substrates for exploratory decisions in humans
- Daw, N. D., O'Doherty, J. P., Dayan, P., Seymour, B., and Dolan, R. J. (2006). Cortical substrates for exploratory decisions in humans. Nature 441, 876-879.
- (2006) Nature , vol.441 , pp. 876-879
- Daw, N.D.¹ O'Doherty, J.P.² Dayan, P.³ Seymour, B.⁴ Dolan, R.J.⁵

13
- 0001234682
- Feudal reinforcement learning
- (San Francisco, CA: Morgan Kaufmann Publishers Inc.)
- Dayan, P., and Hinton, G. E. (1992). "Feudal reinforcement learning," in Advances in Neural Information Processing Systems, (San Francisco, CA: Morgan Kaufmann Publishers Inc.) 5, 271-278.
- (1992) Advances in Neural Information Processing Systems , vol.5 , pp. 271-278
- Dayan, P.¹ Hinton, G.E.²

14
- 0034248853
- Stochastic dynamic programming with factored representations
- Dearden, R. Boutilier, C., and Goldszmidt, M. (2000). Stochastic dynamic programming with factored representations. Artif. Intell. 121, 49-107.
- (2000) Artif. Intell. , vol.121 , pp. 49-107
- Dearden, R.¹ Boutilier, C.² Goldszmidt, M.³

15
- 0036618011
- Multiple model-based reinforcement learning
- Doya, K., Samejima, K., Katagiri K., and Kawato, M. (2002). Multiple model-based reinforcement learning. Neural Comput. 14, 1347-1369.
- (2002) Neural Comput , vol.14 , pp. 1347-1369
- Doya, K.¹ Samejima, K.² Katagiri, K.³ Kawato, M.⁴

16
- 0005334538
- An architecture for vision and action
- (Montreal, Canada; Morgan Kaufmann Publishers Inc.)
- Firby, R. J., Kahn, R. E., Prokopowicz, P. N., and Swain, M. J. (1995). "An architecture for vision and action," in International Joint Conference on Artificial Intelligence (Montreal, Canada; Morgan Kaufmann Publishers Inc.), 72-79.
- (1995) International Joint Conference on Artificial Intelligence , pp. 72-79
- Firby, R.J.¹ Kahn, R.E.² Prokopowicz, P.N.³ Swain, M.J.⁴

17
- 77951632406
- Taxing executive processes does not necessarily increase impulsive decision making
- Franco-Watkins, A. M., Rickard, T. C., and Pashler, H. (2010). Taxing executive processes does not necessarily increase impulsive decision making. Exp. Psychol. 57, 193-201.
- (2010) Exp. Psychol. , vol.57 , pp. 193-201
- Franco-Watkins, A.M.¹ Rickard, T.C.² Pashler, H.³

18
- 77956582789
- Embodiment as a unifying perspective for psychology
- Glenberg, A. M. (2010). Embodiment as a unifying perspective for psychology. Wiley Interdiscip. Rev. Cogn. Sci. 1, 586-596.
- (2010) Wiley Interdiscip. Rev. Cogn. Sci. , vol.1 , pp. 586-596
- Glenberg, A.M.¹

19
- 0042932360
- Encoding predictive reward value in human amygdala and orbitofrontal cortex
- Gottfried, J. A., O'Doherty, J., and Dolan, R. J. (2003). Encoding predictive reward value in human amygdala and orbitofrontal cortex. Science 301, 1104-1107.
- (2003) Science , vol.301 , pp. 1104-1107
- Gottfried, J.A.¹ O'Doherty, J.² Dolan, R.J.³

20
- 4544318426
- Efficient solution algorithms for factored MDPs
- Guestrin, C. E., Koller, D., Parr, R., and Venkataraman, S. (2003). Efficient solution algorithms for factored MDPs. J. Artif. Intell. Res. 19, 399-468.
- (2003) J. Artif. Intell. Res. , vol.19 , pp. 399-468
- Guestrin, C.E.¹ Koller, D.² Parr, R.³ Venkataraman, S.⁴

21
- 33644858743
- Different neural correlates of reward expectation and reward expectation error in the putamen and caudate nucleus during stimulus-action-reward association learning
- Haruno, M., and Kawato, M. (2006). Different neural correlates of reward expectation and reward expectation error in the putamen and caudate nucleus during stimulus-action-reward association learning. J. Neurophysiol. 95, 948-959.
- (2006) J. Neurophysiol. , vol.95 , pp. 948-959
- Haruno, M.¹ Kawato, M.²

22
- 52049093182
- New insights on the subcortical representation of reward
- Hikosaka, O., Bromberg-Martin, E., Hong, S., and Matsumoto, M. (2008). New insights on the subcortical representation of reward. Curr. Opin. Neurobiol. 18, 203-208.
- (2008) Curr. Opin. Neurobiol. , vol.18 , pp. 203-208
- Hikosaka, O.¹ Bromberg-Martin, E.² Hong, S.³ Matsumoto, M.⁴

23
- 0007914441
- Action selection methods using reinforcement learning
- eds P. Maes, M. Mataric, J.-A. Meyer, J. Pollack, and S. W. Wilson (Cambridge, MA: MIT Press, Bradford Books)
- Humphrys, M. (1996). "Action selection methods using reinforcement learning," in From Animals to Animats 4: Proceedings of the Fourth International Conference on Simulation of Adaptive Behavior, eds P. Maes, M. Mataric, J.-A. Meyer, J. Pollack, and S. W. Wilson (Cambridge, MA: MIT Press, Bradford Books), 135-144.
- (1996) From Animals to Animats 4 Proceedings of the Fourth International Conference on Simulation of Adaptive Behavior , pp. 135-144
- Humphrys, M.¹

24
- 0005551545
- PhD thesis, University of Rochester, Rochester
- Karlsson, J. (1997). Learning to Solve Multiple Goals. PhD thesis, University of Rochester, Rochester.
- (1997) Learning to Solve Multiple Goals
- Karlsson, J.¹

25
- 0023422739
- Soar: an architecture for general intelligence
- Laird, J. E., Newell, A., and Rosenblum, P. S. (1987). Soar: an architecture for general intelligence. Artif. Intell. 33, 1-64.
- (1987) Artif. Intell. , vol.33 , pp. 1-64
- Laird, J.E.¹ Newell, A.² Rosenblum, P.S.³

26
- 33646365064
- Learning recursive control programs from problem solving
- Langley, P., and Choi, D. (2006). Learning recursive control programs from problem solving. J. Mach. Learn. Res. 7, 493-518.
- (2006) J. Mach. Learn. Res. , vol.7 , pp. 493-518
- Langley, P.¹ Choi, D.²

27
- 0030778788
- The capacity of visual working memory for features and conjunctions
- Luck, S. J., and Vogel, E. K. (1997). The capacity of visual working memory for features and conjunctions. Nature 390, 279-281.
- (1997) Nature , vol.390 , pp. 279-281
- Luck, S.J.¹ Vogel, E.K.²

28
- 0031632806
- AAAI/IAAI
- Meuleau, N., Hauskrecht, M., Kim, K.-E., Peshkin, L., Kaelbling, L., Dean, T., and Boutilier, C. (1998). "Solving very large weakly coupled Markov decision processes," in AAAI/IAAI, 165-172.
- (1998) Solving very large weakly coupled Markov decision processes , pp. 165-172
- Meuleau, N.¹ Hauskrecht, M.² Kim, K.-E.³ Peshkin, L.⁴ Kaelbling, L.⁵ Dean, T.⁶ Boutilier, C.⁷

29
- 33747585633
- Midbrain dopamine neurons encode decisions for future action
- Morris, G., Nevet, A., Arkadir, D., Vaadia, E., and Bergman, H. (2006). Midbrain dopamine neurons encode decisions for future action. Nat. Neurosci. 9, 1057-1063.
- (2006) Nat. Neurosci. , vol.9 , pp. 1057-1063
- Morris, G.¹ Nevet, A.² Arkadir, D.³ Vaadia, E.⁴ Bergman, H.⁵

30
- 0004243597
- New York: Appleton-Century-Crofts
- Neisser, U. (1967). Cognitive Psychology. New York: Appleton-Century-Crofts.
- (1967) Cognitive Psychology
- Neisser, U.¹

31
- 0004291277
- Cambridge, MA: Harvard University Press
- Newell, A. (1990). Unified Theories of Cognition. Cambridge, MA: Harvard University Press.
- (1990) Unified Theories of Cognition
- Newell, A.¹

32
- 0141596576
- Policy invariance under reward transformations: theory and application to reward shaping
- Bled, Slovenia
- Ng, A. Y., Harada, D., and Russell, S. (1999). "Policy invariance under reward transformations: theory and application to reward shaping," in Proceedings of the Sixteenth International Conference on Machine Learning, Bled, Slovenia.
- (1999) Proceedings of the Sixteenth International Conference on Machine Learning
- Ng, A.Y.¹ Harada, D.² Russell, S.³

33
- 84898956770
- Reinforcement learning with hierarchies of machines
- M. I. Jordan, M. J. Kearns, and S. A. Solla. (Cambridge, MA: MIT Press)
- Parr, R., and Russell, S. (1997). "Reinforcement learning with hierarchies of machines," in Advances in Neural Information Processing Systems, M. I. Jordan, M. J. Kearns, and S. A. Solla. (Cambridge, MA: MIT Press), 1043-1049.
- (1997) Advances in Neural Information Processing Systems , pp. 1043-1049
- Parr, R.¹ Russell, S.²

34
- 33748302924
- Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans
- Pessiglione, M., Seymour, B., Flandin, G., Dolan, R. J., and Frith, C. D. (2006). Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans. Nature 442, 1042-1045.
- (2006) Nature , vol.442 , pp. 1042-1045
- Pessiglione, M.¹ Seymour, B.² Flandin, G.³ Dolan, R.J.⁴ Frith, C.D.⁵

35
- 77952544599
- Neural computations associated with goal-directed choice
- Rangel, A., and Hare, T. (2010). Neural computations associated with goal-directed choice. Curr. Opin. Neurobiol. 20, 262-270.
- (2010) Curr. Opin. Neurobiol. , vol.20 , pp. 262-270
- Rangel, A.¹ Hare, T.²

36
- 33746062361
- Authoring content in the pat algebra tutor
- Ritter, S., Anderson, J. R., Cytrynowicz, M., and Medvedeva, O. (1998). Authoring content in the pat algebra tutor. J. Interact. Media Educ. 98, 1-30.
- (1998) J. Interact. Media Educ. , vol.98 , pp. 1-30
- Ritter, S.¹ Anderson, J.R.² Cytrynowicz, M.³ Medvedeva, O.⁴

37
- 65349126422
- Image statistics at the point of gaze during human navigation
- Rothkopf, C. A., and Ballard, D. H. (2009). Image statistics at the point of gaze during human navigation. Vis. Neurosci. 26, 81-92.
- (2009) Vis. Neurosci. , vol.26 , pp. 81-92
- Rothkopf, C.A.¹ Ballard, D.H.²

38
- 84867041360
- Learning and coordinating reper-toires of behaviors: credit assignment and module activation
- Eds G. Baldassarre and M. Mirolli (in press)
- Rothkopf, C. A., and Ballard, D. H. (2010). "Learning and coordinating reper-toires of behaviors: credit assignment and module activation," in Intrinsically Motivated Cumulative Learning in Natural and Artificial Systems, Eds G. Baldassarre and M. Mirolli (in press).
- (2010) Intrinsically Motivated Cumulative Learning in Natural and Artificial Systems
- Rothkopf, C.A.¹ Ballard, D.H.²

39
- 0036152936
- Learning words from sights and sounds: a computational model
- Roy, D. K., and Pentland, A. P. (2002). Learning words from sights and sounds: a computational model. Cogn. Sci. 26, 113-146.
- (2002) Cogn. Sci. , vol.26 , pp. 113-146
- Roy, D.K.¹ Pentland, A.P.²

40
- 1942484759
- Q-decomposition for reinforcement learning agents
- Washington, DC
- Russell, S., and Zimdars, A. (2003). "Q-decomposition for reinforcement learning agents," in Proceedings of the International Conference on Machine Learning, Washington, DC.
- (2003) Proceedings of the International Conference on Machine Learning
- Russell, S.¹ Zimdars, A.²

41
- 32844474095
- Reinforcement learning with factored states and actions
- Sallans, B., and Hinton, G. E. (2004). Reinforcement learning with factored states and actions. J. Mach. Learn. Res. 5, 1063-1088.
- (2004) J. Mach. Learn. Res. , vol.5 , pp. 1063-1088
- Sallans, B.¹ Hinton, G.E.²

42
- 0742324926
- Inter-module credit assignment in modular reinforcement learning
- Samejima, K., Doya, K., and Kawato, M. (2003). Inter-module credit assignment in modular reinforcement learning. Neural Netw. 16, 985-994.
- (2003) Neural Netw , vol.16 , pp. 985-994
- Samejima, K.¹ Doya, K.² Kawato, M.³

43
- 0034576323
- Multiple reward signals in the brain
- Schultz, W. (2000). Multiple reward signals in the brain. Nat. Rev. Neurosci. 1, 199-207.
- (2000) Nat. Rev. Neurosci. , vol.1 , pp. 199-207
- Schultz, W.¹

44
- 0030896968
- A neural substrate of prediction and reward
- Schultz, W., Dayan, P., and Montague, P. R. (1997). A neural substrate of prediction and reward. Science 275, 1593-1599.
- (1997) Science , vol.275 , pp. 1593-1599
- Schultz, W.¹ Dayan, P.² Montague, P.R.³

45
- 0035189004
- What controls attention in natural environments?
- Shinoda, H., Hayhoe, M. M., and Shrivastava, A. (2001). What controls attention in natural environments? Vis. Res. 41, 3535-3546.
- (2001) Vis. Res , vol.41 , pp. 3535-3546
- Shinoda, H.¹ Hayhoe, M.M.² Shrivastava, A.³

46
- 84899022377
- How to dynamically merge Markov decision processes
- Singh, S., and Cohn, D. (1998). How to dynamically merge Markov decision processes. Neural Inf. Process. Syst. 10, 1057-1063.
- (1998) Neural Inf. Process. Syst. , vol.10 , pp. 1057-1063
- Singh, S.¹ Cohn, D.²

47
- 70149110301
- Multiple-goal reinforcement learning with modular sarsa(0)
- Acapulco
- Sprague, N., and Ballard, D. (2003). "Multiple-goal reinforcement learning with modular sarsa(0)," in International Joint Conference on Artificial Intelligence, Acapulco.
- (2003) International Joint Conference on Artificial Intelligence
- Sprague, N.¹ Ballard, D.²

48
- 85013237108
- Modeling embodied visual behaviors
- Sprague, N., Ballard, D., and Robinson, A. (2007). Modeling embodied visual behaviors. ACM Trans. Appl. Percept. 4, 1-23.
- (2007) ACM Trans. Appl. Percept. , vol.4 , pp. 1-23
- Sprague, N.¹ Ballard, D.² Robinson, A.³

49
- 33846607265
- chapter 4. Cambridge: Cambridge University Press
- Sun, R. (2006). Cognition and Multi-Agent Interaction, chapter 4. Cambridge: Cambridge University Press, 79-99.
- (2006) Cognition and Multi-Agent Interaction , pp. 79-99
- Sun, R.¹

50
- 0004102479
- Cambridge, MA: MIT Press
- Sutton, R. S., and Barto, A. G. (1998). Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

51
- 0033170372
- Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning
- Sutton, R. S., Precup, D., and Singh, S. P. (1999). Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning. Artif. Intell. 112, 181-211.
- (1999) Artif. Intell , vol.112 , pp. 181-211
- Sutton, R.S.¹ Precup, D.² Singh, S.P.³

52
- 0002648372
- Artificial life for computer graphics
- Terzopoulos, D. (1999). Artificial life for computer graphics. Commun. ACM 42, 32-42.
- (1999) Commun. ACM , vol.42 , pp. 32-42
- Terzopoulos, D.¹

53
- 0001049378
- Artificial fishes: autonomous locomotion, perception, behavior, and learning in a simulated physical world
- Terzopoulos, D., Tu, X., and Grzeszczuk, R. (1994). Artificial fishes: autonomous locomotion, perception, behavior, and learning in a simulated physical world. Artif. Life, 1, 327-351.
- (1994) Artif. Life , vol.1 , pp. 327-351
- Terzopoulos, D.¹ Tu, X.² Grzeszczuk, R.³

54
- 58149442669
- Cognitive maps in rats and men
- Tolman, E. C. (1948). Cognitive maps in rats and men. Psychol. Rev. 55, 189-208.
- (1948) Psychol. Rev. , vol.55 , pp. 189-208
- Tolman, E.C.¹

55
- 0018878142
- A feature-integration theory of attention
- Treisman, A. M. (1980). A feature-integration theory of attention. Cogn. Psychol. 12, 97-136.
- (1980) Cogn. Psychol. , vol.12 , pp. 97-136
- Treisman, A.M.¹

56
- 0027961585
- Why are small and large numbers enumerated differently? A limited-capacity preattentive stage in vision
- Trick, L. M., and Pylyshyn, Z. W. (1994). Why are small and large numbers enumerated differently? A limited-capacity preattentive stage in vision. Psychol. Rev. 101, 80-102.
- (1994) Psychol. Rev. , vol.101 , pp. 80-102
- Trick, L.M.¹ Pylyshyn, Z.W.²

57
- 0021700041
- Visual routines
- Ullman, S. (1984). Visual routines. Cognition 18, 97-157.
- (1984) Cognition , vol.18 , pp. 97-157
- Ullman, S.¹

58
- 80054969173
- Intrinsically motivated hierarchical skill learning in structured environments
- Vigorito, C. M., and Barto, A. G. (2010). Intrinsically motivated hierarchical skill learning in structured environments. IEEE Trans. Auton. Ment. Dev. 2, 132-143.
- (2010) IEEE Trans. Auton. Ment. Dev. , vol.2 , pp. 132-143
- Vigorito, C.M.¹ Barto, A.G.²

59
- 4344577338
- Behavioral dynamics of human locomotion
- Warren, W. H., and Fajen, B. R. (2004). Behavioral dynamics of human locomotion. Ecol. Psychol. 16, 61-66.
- (2004) Ecol. Psychol. , vol.16 , pp. 61-66
- Warren, W.H.¹ Fajen, B.R.²

60
- 0004049893
- PhD thesis, University of Cambridge, Cambridge
- Watkins, C. J. C. H. (1989). Learning from delayed rewards. PhD thesis, University of Cambridge, Cambridge.
- (1989) Learning from delayed rewards
- Watkins, C.J.C.H.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.