메뉴 건너뛰기




Volumn 9783642323751, Issue , 2013, Pages 257-278

Deciding which skill to learn when: Temporal-difference competence-based intrinsic motivation (TD-CB-IM)

Author keywords

[No Author keywords available]

Indexed keywords

KNOWLEDGE BASED SYSTEMS; MERGERS AND ACQUISITIONS; REINFORCEMENT LEARNING; MOTIVATION;

EID: 84904871924     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1007/978-3-642-32375-1_11     Document Type: Chapter
Times cited : (10)

References (57)
  • 1
    • 84966716963 scopus 로고    scopus 로고
    • A modular neural-network model of the basal ganglia's role in learning and selecting motor behaviours
    • Altmann, E.M., Cleermans, A., Schunn, C.D, Gray, W.D. (eds.) Fairfax, Virgina, USA, 26-29 July Lawrence Erlbaum, Mahwah (2001)
    • Baldassarre, G.: A modular neural-network model of the basal ganglia's role in learning and selecting motor behaviours. In: Altmann, E.M., Cleermans, A., Schunn, C.D, Gray, W.D. (eds.) Proceedings of the Fourth International Conference on Cognitive Modeling (ICCM2001), pp. 37-42. Fairfax, Virgina, USA, 26-29 July 2001. Lawrence Erlbaum, Mahwah (2001)
    • (2001) Proceedings of the Fourth International Conference on Cognitive Modeling (ICCM2001) , pp. 37-42
    • Baldassarre, G.1
  • 2
    • 0842276977 scopus 로고    scopus 로고
    • A modular neural-network model of the basal ganglia's role in learning and selecting motor behaviours
    • Special Issue Dynamic and Recurrent Neural Networks
    • Baldassarre, G.: A modular neural-network model of the basal ganglia's role in learning and selecting motor behaviours. J. Cogn. Syst. Res. 3(2), 5-13. Special Issue Dynamic and Recurrent Neural Networks (2002a)
    • (2002) J. Cogn. Syst. Res. , vol.3 , Issue.2 , pp. 5-13
    • Baldassarre, G.1
  • 4
    • 80054971108 scopus 로고    scopus 로고
    • What are intrinsic motivations? A biological perspective
    • Cangelosi, A., Triesch, J., Fasel, I., Rohlfing, K., Nori, F., Oudeyer, P.-Y., Schlesinger, M, Nagai, Y. (eds.) Frankfurt, Germany, 24-27 August IEEE, Piscataway (2011)
    • Baldassarre, G.: What are intrinsic motivations? a biological perspective. In: Cangelosi, A., Triesch, J., Fasel, I., Rohlfing, K., Nori, F., Oudeyer, P.-Y., Schlesinger, M, Nagai, Y. (eds.) Proceedings of the International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob-2011), pp. E1-E8. Frankfurt, Germany, 24-27 August, 2011. IEEE, Piscataway (2011)
    • (2011) Proceedings of the International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob-2011) , pp. E1-E8
    • Baldassarre, G.1
  • 5
    • 84893542099 scopus 로고    scopus 로고
    • Intrinsically motivated action-outcome learning and goal-based action recall: A system-level bio-constrained computational model
    • in press
    • Baldassarre, G., Mannella, F., Fiore, V.G., Redgrave, P., Gurney, K., Mirolli, M.: Intrinsically motivated action-outcome learning and goal-based action recall: A system-level bio-constrained computational model. Neural Netw. (2012, in press)
    • (2012) Neural Netw.
    • Baldassarre, G.1    Mannella, F.2    Fiore, V.G.3    Redgrave, P.4    Gurney, K.5    Mirolli, M.6
  • 7
    • 33749651693 scopus 로고    scopus 로고
    • Intrinsically motivated learning of hierarchical collections of skills
    • La Jolla, CA, 20-22 October, 2004. IEEE, Piscataway
    • Barto, A., Singh, S., Chentanez, N.: Intrinsically motivated learning of hierarchical collections of skills. In: International Conference on Developmental Learning (ICDL2004). La Jolla, CA, 20-22 October, 2004. IEEE, Piscataway (2004)
    • (2004) International Conference on Developmental Learning (ICDL2004)
    • Barto, A.1    Singh, S.2    Chentanez, N.3
  • 8
    • 0141988716 scopus 로고    scopus 로고
    • Recent advances in hierarchical reinforcement learning
    • Barto, A.G., Mahadevan, S.: Recent advances in hierarchical reinforcement learning. Discr. Event Dyn. Syst. 13(4), 341-379 (2003)
    • (2003) Discr. Event Dyn. Syst. , vol.13 , Issue.4 , pp. 341-379
    • Barto, A.G.1    Mahadevan, S.2
  • 9
    • 70350566799 scopus 로고    scopus 로고
    • Hierarchically organized behavior and its neural foundations: A reinforcement-learning perspective
    • Botvinick, M.M., Niv, Y., Barto, A.: Hierarchically organized behavior and its neural foundations: A reinforcement-learning perspective. Cognition 113(3), 262-280 (2008)
    • (2008) Cognition , vol.113 , Issue.3 , pp. 262-280
    • Botvinick, M.M.1    Niv, Y.2    Barto, A.3
  • 10
    • 80054994182 scopus 로고    scopus 로고
    • A bio-inspired hierarchical reinforcement learning architecture for modeling learning of multiple skills with continuous state and actions
    • Kuipers, B., Shultz, T., Stoytchev, A., Yu, C. (eds.) Ann Arbor, MI, USA, 18-21 August, 2010 IEEE, Piscataway
    • Caligiore, D., Mirolli, M., Parisi, D., Baldassarre, G.: A bio-inspired hierarchical reinforcement learning architecture for modeling learning of multiple skills with continuous state and actions. In: Kuipers, B., Shultz, T., Stoytchev, A., Yu, C. (eds.) IEEE International Conference on Development and Learning (ICDL2010). Ann Arbor, MI, USA, 18-21 August, 2010 IEEE, Piscataway (2010)
    • (2010) IEEE International Conference on Development and Learning (ICDL2010)
    • Caligiore, D.1    Mirolli, M.2    Parisi, D.3    Baldassarre, G.4
  • 11
    • 0035285941 scopus 로고    scopus 로고
    • Extrinsic rewards and intrinsic motivation in education: Reconsidered once again
    • Deci, E., Koestner, R., Ryan, R.: Extrinsic rewards and intrinsic motivation in education: Reconsidered once again. Rev. Educ. Res. 71(1), 1-27 (2001)
    • (2001) Rev. Educ. Res. , vol.71 , Issue.1 , pp. 1-27
    • Deci, E.1    Koestner, R.2    Ryan, R.3
  • 12
  • 13
    • 34047255425 scopus 로고    scopus 로고
    • Evolutionary development of hierarchical learning structures
    • Elfwing, S., Uchibe, E., Doya, K., Christensen, H.: Evolutionary development of hierarchical learning structures. IEEE Trans. Evol. Comput. 11(2), 249-264 (2007)
    • (2007) IEEE Trans. Evol. Comput. , vol.11 , Issue.2 , pp. 249-264
    • Elfwing, S.1    Uchibe, E.2    Doya, K.3    Christensen, H.4
  • 14
    • 0009762657 scopus 로고
    • Learning and satiation of response in intrinsically motivated complex puzzle performance by monkeys
    • Harlow, H.F.: Learning and satiation of response in intrinsically motivated complex puzzle performance by monkeys. J. Comp. Physiol. Psychol. 43, 289-294 (1950)
    • (1950) J. Comp. Physiol. Psychol. , vol.43 , pp. 289-294
    • Harlow, H.F.1
  • 15
    • 80052879447 scopus 로고    scopus 로고
    • Learning generalizable control programs
    • Hart, S., Grupen, R.: Learning generalizable control programs. IEEE Trans. Auton. Mental Dev. 3(1), 216-231 (2011)
    • (2011) IEEE Trans. Auton. Mental Dev. , vol.3 , Issue.1 , pp. 216-231
    • Hart, S.1    Grupen, R.2
  • 16
    • 84929056927 scopus 로고    scopus 로고
    • Intrinsically motivated affordance discovery and modeling
    • Baldassarre, G., Mirolli, M. (eds.) Springer, Berlin this volume
    • Hart, S., Grupen, R.: Intrinsically motivated affordance discovery and modeling. In: Baldassarre, G., Mirolli, M. (eds.) Intrinsically Motivated Learning in Natural and Artificial Systems. Springer, Berlin (2012, this volume)
    • (2012) Intrinsically Motivated Learning in Natural and Artificial Systems
    • Hart, S.1    Grupen, R.2
  • 17
    • 0002861883 scopus 로고
    • A model of how the basal ganglia generate and use neural signals that predict reinforcement
    • Houk, J.C., Davids, J.L., Beiser, D.G. (eds.) The MIT Press, Cambridge
    • Houk, J.C., Adams, J.L., Barto, A.G.: A model of how the basal ganglia generate and use neural signals that predict reinforcement. In: Houk, J.C., Davids, J.L., Beiser, D.G. (eds.) Models of Information Processing in the Basal Ganglia, pp. 249-270. The MIT Press, Cambridge (1995)
    • (1995) Models of Information Processing in the Basal Ganglia , pp. 249-270
    • Houk, J.C.1    Adams, J.L.2    Barto, A.G.3
  • 19
    • 79952169043 scopus 로고    scopus 로고
    • Empowerment for continuous agent-environment systems
    • Jung, T., Polani, D., Stone, P.: Empowerment for continuous agent-environment systems. Adap. Behav. 19(1), 16-39 (2011)
    • (2011) Adap. Behav. , vol.19 , Issue.1 , pp. 16-39
    • Jung, T.1    Polani, D.2    Stone, P.3
  • 20
    • 0036592029 scopus 로고    scopus 로고
    • Dopamine: Generalization and bonuses
    • Kakade, S., Dayan, P.: Dopamine: Generalization and bonuses. Neural Netw. 15(4-6), 549-559 (2002)
    • (2002) Neural Netw. , vol.15 , Issue.4-6 , pp. 549-559
    • Kakade, S.1    Dayan, P.2
  • 21
    • 85088976837 scopus 로고    scopus 로고
    • Search of the neural circuits of intrinsic motivation
    • Kaplan, F., Oudeyer, P.-Y.: In: Search of the neural circuits of intrinsic motivation. Front. Neurosci. 1, 225-236 (2007)
    • (2007) Front. Neurosci. , vol.1 , pp. 225-236
    • Kaplan, F.1    Oudeyer, P.-Y.2
  • 22
    • 27144496638 scopus 로고    scopus 로고
    • Empowerment: A universal agent-centric measure of control
    • Edinburg UK, 2-4 September
    • Klyubin, A., Polani, D., Nehaniv, C.: Empowerment: A universal agent-centric measure of control. In: The 2005 IEEE Congress on Evolutionary Computation, Vol. 1, pp. 128-135. Edinburg UK, 2-4 September, (2005)
    • (2005) The 2005 IEEE Congress on Evolutionary Computation , vol.1 , pp. 128-135
    • Klyubin, A.1    Polani, D.2    Nehaniv, C.3
  • 24
    • 80055020279 scopus 로고    scopus 로고
    • Artificial curiosity with planning for autonomous perceptual and cognitive development
    • Cangelosi, A., Triesch, J., Fasel, I., Rohlfing, K., Nori, F., Oudeyer, P.-Y., Schlesinger, M., Nagai, Y. (eds.) IEEE, Frankfurt, Germany, 24-27 August Piscataway (2011)
    • Luciw, M., Graziano, V., Ring, M., Schmidhuber, J.: Artificial curiosity with planning for autonomous perceptual and cognitive development. In: Cangelosi, A., Triesch, J., Fasel, I., Rohlfing, K., Nori, F., Oudeyer, P.-Y., Schlesinger, M., Nagai, Y. (eds.) IEEE International Conference on Development and Learning (ICDL2011), pp. E1-8. IEEE, Frankfurt, Germany, 24-27 August, 2011. Piscataway (2011)
    • (2011) IEEE International Conference on Development and Learning (ICDL2011) , pp. E1-8
    • Luciw, M.1    Graziano, V.2    Ring, M.3    Schmidhuber, J.4
  • 25
    • 77957064197 scopus 로고
    • Catastrophic interference in connectionist networks: The sequential learning problem
    • Bower, G.H. (ed.) Academic Press, San Diego
    • McCloskey, M., Cohen, N.: Catastrophic interference in connectionist networks: The sequential learning problem. In: Bower, G.H. (ed.) The Psychology of Learning and Motivation, Vol. 24, pp. 109-165. Academic Press, San Diego (1989)
    • (1989) The Psychology of Learning and Motivation , vol.24 , pp. 109-165
    • McCloskey, M.1    Cohen, N.2
  • 27
    • 79955471193 scopus 로고    scopus 로고
    • Modular and hierarchically modular organization of brain networks
    • Meunier, D., Lambiotte, R., Bullmore, E.T.: Modular and hierarchically modular organization of brain networks. Front. Neurosci. 4, 200 (2010)
    • (2010) Front. Neurosci. , vol.4 , pp. 200
    • Meunier, D.1    Lambiotte, R.2    Bullmore, E.T.3
  • 28
    • 84929053357 scopus 로고    scopus 로고
    • Functions and mechanisms of intrinsic motivations: The knowledge versus competence distinction
    • Baldassarre, G., Mirolli, M. (eds.) Springer, Berlin this volume
    • Mirolli, M., Baldassarre, G.: Functions and mechanisms of intrinsic motivations: The knowledge versus competence distinction. In: Baldassarre, G., Mirolli, M. (eds.) Intrinsically Motivated Learning in Natural and Artificial Systems. Springer, Berlin (2012, this volume)
    • (2012) Intrinsically Motivated Learning in Natural and Artificial Systems
    • Mirolli, M.1    Baldassarre, G.2
  • 29
    • 84929057911 scopus 로고    scopus 로고
    • Phasic dopamine as a prediction error of intrinsic and extrinsic reinforcement driving both action acquisition and reward maximization: A simulated robotic study
    • submitted
    • Mirolli, M., Santucci, V.G., Baldassarre, G.: Phasic dopamine as a prediction error of intrinsic and extrinsic reinforcement driving both action acquisition and reward maximization: A simulated robotic study. Neural Netw. (2012, submitted)
    • (2012) Neural Netw.
    • Mirolli, M.1    Santucci, V.G.2    Baldassarre, G.3
  • 30
    • 84872861664 scopus 로고    scopus 로고
    • Intrinsically motivated learning of real world sensorimotor skills with developmental constraints
    • Baldassarre, G., Mirolli, M. (eds.) Springer, Berlin this volume
    • Oudeyer, P.-Y., Banares, A., Frédéric, K.: Intrinsically motivated learning of real world sensorimotor skills with developmental constraints. In: Baldassarre, G., Mirolli, M. (eds.) Intrinsically Motivated Learning in Natural and Artificial Systems. Springer, Berlin (2012, this volume)
    • (2012) Intrinsically Motivated Learning in Natural and Artificial Systems
    • Oudeyer, P.-Y.1    Banares, A.2    Frédéric, K.3
  • 31
    • 75149137813 scopus 로고    scopus 로고
    • What is intrinsic motivation? A typology of computational approaches
    • Oudeyer, P.-Y., Kaplan, F.: What is intrinsic motivation? a typology of computational approaches. Front. Neurorobot. 1, 6 (2007)
    • (2007) Front. Neurorobot , vol.1 , pp. 6
    • Oudeyer, P.-Y.1    Kaplan, F.2
  • 32
    • 34047267520 scopus 로고    scopus 로고
    • Intrinsic motivation systems for autonomous mental development
    • Oudeyer, P.-Y., Kaplan, F., Hafner, V.V.: Intrinsic motivation systems for autonomous mental development. IEEE Trans. Evol. Comput. 11(2), 265-286 (2007)
    • (2007) IEEE Trans. Evol. Comput. , vol.11 , Issue.2 , pp. 265-286
    • Oudeyer, P.-Y.1    Kaplan, F.2    Hafner, V.V.3
  • 33
    • 14344250461 scopus 로고    scopus 로고
    • Policyblocks: An algorithm for creating useful macro-actions in reinforcement learning
    • Sammut, C., Hoffmann, A.G. (eds.) Sydney, Australia, 8-12 July Morgan Kaufmann, San Francisco (2002)
    • Pickett, M., Barto, A.: Policyblocks: An algorithm for creating useful macro-actions in reinforcement learning. In: Sammut, C., Hoffmann, A.G. (eds.) Proceedings of the Nineteenth International Conference on Machine Learning, pp. 506-513. Sydney, Australia, 8-12 July 2002. Morgan Kaufmann, San Francisco (2002)
    • (2002) Proceedings of the Nineteenth International Conference on Machine Learning , pp. 506-513
    • Pickett, M.1    Barto, A.2
  • 34
    • 33751184634 scopus 로고    scopus 로고
    • The short-latency dopamine signal: A role in discovering novel actions?
    • Redgrave, P., Gurney, K.: The short-latency dopamine signal: A role in discovering novel actions? Nat. Rev. Neurosci. 7(12), 967-975 (2006)
    • (2006) Nat. Rev. Neurosci. , vol.7 , Issue.12 , pp. 967-975
    • Redgrave, P.1    Gurney, K.2
  • 36
    • 0002209063 scopus 로고    scopus 로고
    • Intrinsic and extrinsic motivations: Classic definitions and new directions
    • Ryan, R., Deci, E.: Intrinsic and extrinsic motivations: Classic definitions and new directions. Contemp. Educ. Psychol. 25, 54-67 (2000)
    • (2000) Contemp. Educ. Psychol. , vol.25 , pp. 54-67
    • Ryan, R.1    Deci, E.2
  • 37
    • 80055017940 scopus 로고    scopus 로고
    • Biological cumulative learning through intrinsic motivations: A simulated robotic study on development of visually-guided reaching
    • Johansson, B., Sahin, E., Balkenius, C. (eds.) Lund, Sweden. Lund: Lund University Cognitive Studies
    • Santucci, V.G., Baldassarre, G., Mirolli, M.: Biological cumulative learning through intrinsic motivations: A simulated robotic study on development of visually-guided reaching. In: Johansson, B., Sahin, E., Balkenius, C. (eds.) Proceedings of the Tenth International Conference on Epigenetic Robotics (EpiRob2010), pp. 121-128. Lund, Sweden. Lund: Lund University Cognitive Studies Vol. 149 (2010)
    • (2010) Proceedings of the Tenth International Conference on Epigenetic Robotics (EpiRob2010) , vol.149 , pp. 121-128
    • Santucci, V.G.1    Baldassarre, G.2    Mirolli, M.3
  • 38
    • 38049093767 scopus 로고    scopus 로고
    • Evolution and learning in an intrinsically motivated reinforcement learning robot
    • Almeida e Costa Fernando, Rocha, L.M., Costa, E., Harvey, I., Coutinho, A. (eds.) Advances in Artificial Life Lisbon, Portugal, 10-14 September Lecture Notes in Artificial Intelligence Springer, Berlin (2007)
    • Schembri, M., Mirolli, M., Baldassarre, G.: Evolution and learning in an intrinsically motivated reinforcement learning robot. In: Almeida e Costa Fernando, Rocha, L.M., Costa, E., Harvey, I., Coutinho, A. (eds.) Advances in Artificial Life. Proceedings of the 9th European Conference on Artificial Life (ECAL2007), Lisbon, Portugal, 10-14 September 2007. Lecture Notes in Artificial Intelligence, Vol. 4648, pp. 294-333. Springer, Berlin (2007a)
    • (2007) Proceedings of the 9th European Conference on Artificial Life (ECAL2007) , vol.4648 , pp. 294-333
    • Schembri, M.1    Mirolli, M.2    Baldassarre, G.3
  • 39
    • 79958838807 scopus 로고    scopus 로고
    • Evolving childhood's length and learning parameters in an intrinsically motivated reinforcement learning robot
    • Berthouze, L., Dhristiopher, P.G., Littman, M., Kozima, H., Balkenius, C. (eds.) Lund, Sweden. Lund: Lund University Cognitive Studies Vol. 149
    • Schembri, M., Mirolli, M., Baldassarre, G.: Evolving childhood's length and learning parameters in an intrinsically motivated reinforcement learning robot. In: Berthouze, L., Dhristiopher, P.G., Littman, M., Kozima, H., Balkenius, C. (eds.) Proceedings of the Seventh International Conference on Epigenetic Robotics, Vol. 134, pp. 141-148. Lund, Sweden. Lund: Lund University Cognitive Studies Vol. 149 (2007b)
    • (2007) Proceedings of the Seventh International Conference on Epigenetic Robotics , vol.134 , pp. 141-148
    • Schembri, M.1    Mirolli, M.2    Baldassarre, G.3
  • 40
    • 84872764045 scopus 로고    scopus 로고
    • Evolving internal reinforcers for an intrinsically motivated reinforcement-learning robot
    • Demiris, Y., Mareschal, D., Scassellati, B., Weng, J. (eds.) London, UK, 11-13 July IEEE, Piscataway (2007)
    • Schembri, M., Mirolli, M., Baldassarre, G.: Evolving internal reinforcers for an intrinsically motivated reinforcement-learning robot. In: Demiris, Y., Mareschal, D., Scassellati, B., Weng, J. (eds.) Proceedings of the 6th International Conference on Development and Learning, pp. E1-6. London, UK, 11-13 July 2007. IEEE, Piscataway (2007c)
    • (2007) Proceedings of the 6th International Conference on Development and Learning , pp. E1-6
    • Schembri, M.1    Mirolli, M.2    Baldassarre, G.3
  • 42
    • 2442467081 scopus 로고
    • A possibility for implementing curiosity and boredom in model-building neural controllers
    • Meyer, J.-A., Wilson, S. (eds.) Paris, France, December MIT, Cambridge
    • Schmidhuber, J.: A possibility for implementing curiosity and boredom in model-building neural controllers. In: Meyer, J.-A., Wilson, S. (eds.) From Animals to Animats: Proceedings of the First International Conference on Simulation of Adaptive Behavior, Paris, France, December, 1990 pp. 222-227, MIT, Cambridge (1991b)
    • (1990) From Animals to Animats: Proceedings of the First International Conference on Simulation of Adaptive Behavior , pp. 222-227
    • Schmidhuber, J.1
  • 43
    • 77956578648 scopus 로고    scopus 로고
    • Formal theory of creativity, fun, and intrinsic motivation (1990-2010)
    • Schmidhuber, J.: Formal theory of creativity, fun, and intrinsic motivation (1990-2010). IEEE Trans. Auton. Mental Dev. 2(3), 230-247 (2010)
    • (2010) IEEE Trans. Auton. Mental Dev. , vol.2 , Issue.3 , pp. 230-247
    • Schmidhuber, J.1
  • 44
    • 84929056929 scopus 로고    scopus 로고
    • Maximizing fun by creating data with easily reducible subjective complexity
    • Baldassarre, G., Mirolli, M. (eds.) Springer, Berlin this volume
    • Schmidhuber, J.: Maximizing fun by creating data with easily reducible subjective complexity. In: Baldassarre, G., Mirolli, M. (eds.) Intrinsically Motivated Learning in Natural and Artificial Systems. Springer, Berlin (2012, this volume)
    • (2012) Intrinsically Motivated Learning in Natural and Artificial Systems
    • Schmidhuber, J.1
  • 45
    • 0037057755 scopus 로고    scopus 로고
    • Getting formal with dopamine and reward
    • Schultz, W.: Getting formal with dopamine and reward. Neuron 36(2), 241-263 (2002)
    • (2002) Neuron , vol.36 , Issue.2 , pp. 241-263
    • Schultz, W.1
  • 46
    • 84899031920 scopus 로고    scopus 로고
    • Intrinsically motivated reinforcement learning
    • Saul, L.K., Weiss, Y., Bottou, L. (eds.) Vancouver, British Columbia, Canada, 13-18 December 2004. MIT, Cambridge
    • Singh, S., Barto, A., Chentanez, N.: Intrinsically motivated reinforcement learning. In: Saul, L.K., Weiss, Y., Bottou, L. (eds.). Advances in Neural Information Processing Systems 17: Proceedings of the 2004 Conference. Vancouver, British Columbia, Canada, 13-18 December 2004. MIT, Cambridge (2005)
    • (2005) Advances in Neural Information Processing Systems 17: Proceedings of the 2004 Conference
    • Singh, S.1    Barto, A.2    Chentanez, N.3
  • 47
    • 79953822184 scopus 로고    scopus 로고
    • Intrinsically motivated reinforcement learning: An evolutionary perspective
    • Singh, S., Lewis, R., Barto, A., Sorg, J.: Intrinsically motivated reinforcement learning: An evolutionary perspective. IEEE Trans. Auton. Mental Dev. 2(2), 70-82 (2010)
    • (2010) IEEE Trans. Auton. Mental Dev. , vol.2 , Issue.2 , pp. 70-82
    • Singh, S.1    Lewis, R.2    Barto, A.3    Sorg, J.4
  • 48
    • 78149251512 scopus 로고    scopus 로고
    • Competence progress intrinsic motivation
    • Kuipers, B., Shultz, T., Stoytchev, A., Yu, C. (eds.) Ann Arbor, MI, USA, 18-21 August, 2010. IEEE, Piscataway
    • Stout, A., Barto, A.G.: Competence progress intrinsic motivation. In: Kuipers, B., Shultz, T., Stoytchev, A., Yu, C. (eds.) IEEE International Conference on Development and Learning (ICDL2010). Ann Arbor, MI, USA, 18-21 August, 2010. IEEE, Piscataway (2010)
    • (2010) IEEE International Conference on Development and Learning (ICDL2010)
    • Stout, A.1    Barto, A.G.2
  • 49
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • Sutton, R., Precup, D., Singh, S.: Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artif. Intell. 112, 181-211 (1999)
    • (1999) Artif. Intell. , vol.112 , pp. 181-211
    • Sutton, R.1    Precup, D.2    Singh, S.3
  • 51
    • 68949157375 scopus 로고    scopus 로고
    • Transfer learning for reinforcement learning domains: A survey
    • Taylor, M., Stone, P.: Transfer learning for reinforcement learning domains: A survey. J. Mach. Learn. Res. 10, 1633-1685 (2009)
    • (2009) J. Mach. Learn. Res. , vol.10 , pp. 1633-1685
    • Taylor, M.1    Stone, P.2
  • 52
    • 33749882712 scopus 로고
    • Finding structure in reinforcement learning
    • Tesauro, G., Touretzky, D, Leen, T. (eds.) Denver, Colorado, USA MIT, Cambridge
    • Thrun, S., Schwartz, A.: Finding structure in reinforcement learning. In: Tesauro, G., Touretzky, D, Leen, T. (eds.) Advances in Neural Information Processing Systems 7 (NIPS1994), Denver, Colorado, USA, pp. 385-392. MIT, Cambridge (1995)
    • (1995) Advances in Neural Information Processing Systems 7 (NIPS1994) , pp. 385-392
    • Thrun, S.1    Schwartz, A.2
  • 53
    • 80054969173 scopus 로고    scopus 로고
    • Intrinsically motivated hierarchical skill learning in structured environments
    • Vigorito, C., Barto, A.: Intrinsically motivated hierarchical skill learning in structured environments. IEEE Trans. Auton. Mental Dev. 2(2), 132-143 (2010)
    • (2010) IEEE Trans. Auton. Mental Dev. , vol.2 , Issue.2 , pp. 132-143
    • Vigorito, C.1    Barto, A.2
  • 54
    • 33845802539 scopus 로고    scopus 로고
    • Action in development
    • von Hofsten, C.: Action in development. Dev. Sci. 10(1), 54-60 (2007)
    • (2007) Dev. Sci. , vol.10 , Issue.1 , pp. 54-60
    • Von Hofsten, C.1
  • 56
    • 33749411161 scopus 로고
    • Motivation reconsidered: The concept of competence
    • White, R.W.: Motivation reconsidered: The concept of competence. Psychol. Rev. 66, 297-333 (1959)
    • (1959) Psychol. Rev. , vol.66 , pp. 297-333
    • White, R.W.1
  • 57
    • 0033362601 scopus 로고    scopus 로고
    • Evolving artificial neural networks
    • Yao, X.: Evolving artificial neural networks. In: Proceedings of the IEEE, Vol. 87, pp. 1423-1447. (1999)
    • (1999) Proceedings of the IEEE , vol.87 , pp. 1423-1447
    • Yao, X.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.