Volume 9783642398759, Issue , 2014, Pages 13-46

Behavioral hierarchy: Exploration and representation

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL AGENTS; BUILDING BLOCKS; COMPLEX DOMAINS; EXPLORATORY BEHAVIOURS; HIERARCHICAL REINFORCEMENT LEARNING; MULTIPLE LEVELS; NATURAL AGENTS;

EID: 84907552069     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1007/978-3-642-39875-9_2     Document Type: Chapter
Times cited: 19

References (100)
  • 1
    • Alur, R., McDougall, M., Yang, Z. (2002). Exploiting behavioral hierarchy for efficient model checking. In E. Brinksma & K. G. Larsen (Eds.), Computer aided verification: 14th international conference, proceedings (Lecture notes in computer science) (pp. 338-342). Berlin: Springer.
  • 2
    • Amarel, S. (1981). Problems of representation in heuristic problem solving: related issues in the development of expert systems. Technical Report CBM-TR-118, Laboratory for Computer Science, Rutgers University, New Brunswick, NJ.
  • 3
    • Anderson, J. R. (2004). An integrated theory of mind. Psychological Review, 111, 1036-1060.
  • 5
    • Bakker, B., & Schmidhuber, J. (2004). Hierarchical reinforcement learning based on subgoal discovery and subpolicy specialization. In F. Groen, N. Amato, A. Bonarini, E. Yoshida, B. Kröse (Eds.), Proceedings of the 8th conference on intelligent autonomous systems, IAS-8 (pp. 438-445). Amsterdam, The Netherlands: IOS.
  • 7
    • Barto, A., Singh, S., Chentanez, N. (2004). Intrinsically motivated learning of hierarchical collections of skills. In J. Triesch & T. Jebara (Eds.), Proceedings of the 2004 international conference on development and learning (pp. 112-119). UCSD Institute for Neural Computation.
  • 8
    • Barto, A. G. (2012). Intrinsic motivation and reinforcement learning. In G. Baldassarre & M. Mirolli (Eds.), Intrinsically motivated learning in natural and artificial systems. Berlin: Springer.
  • 10
    • Bellman, R. E. (1957). Dynamic programming. Princeton: Princeton University Press.
  • 12
    • Botvinick, M. M., & Plaut, D. C. (2004). Doing without schema hierarchies: A recurrent connectionist approach to normal and impaired routine sequential action. Psychological Review, 111, 395-429.
  • 13
    • Botvinick, M. M., Niv, Y., Barto, A. G. (2009). Hierarchically organized behavior and its neural foundations: A reinforcement-learning perspective. Cognition, 113, 262-280.
  • 14
    • Boutilier, C., Dearden, R., Goldszmidt, M. (2000). Stochastic dynamic programming with factored representations. Artificial Intelligence, 121, 49-107.
  • 20
    • Dean, T. L., & Kanazawa, K. (1989). A model for reasoning about persistence and causation. Computational Intelligence, 5, 142-150.
  • 21
    • Degris, T., Sigaud, O., Wuillemin, P. H. (2006). Learning the structure of factored Markov decision processes in reinforcement learning problems. In W. W. Cohen & A. Moore (Eds.), Machine learning, proceedings of the twenty-third international conference (ICML 2006). ACM international conference proceeding series (vol. 148, pp. 257-264). New York: ACM.
  • 22
    • Dietterich, T. G. (2000a). Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13, 227-303.
  • 23
    • Dietterich, T. G. (2000b). State abstraction in MAXQ hierarchical reinforcement learning. In S. A. Solla, T. K. Leen, K.-R. Müller (Eds.), Advances in neural information processing systems 12 (pp. 994-1000). Cambridge: MIT.
  • 24
    • Digney, B. (1996). Emergent hierarchical control structures: learning reactive/hierarchical relationships in reinforcement environments. In P. Maes, M. Mataric, J.-A. Meyer, J. Pollack, S. W. Wilson (Eds.), From animals to animats 4: proceedings of the fourth international conference on simulation of adaptive behavior (pp. 363-372). Cambridge: MIT.
  • 25
    • Diuk, C., Li, L., Leffler, B. (2009). The adaptive k-meteorologists problem and its application to structure learning and feature selection in reinforcement learning. In A. P. Danyluk, L. Bottou, M. L. Littman (Eds.), Proceedings of the 26th annual international conference on machine learning, ICML 2009. ACM international conference proceeding series (vol. 382, pp. 249-256). New York: ACM.
  • 29
    • Hart, S., & Grupen, R. (2011). Learning generalizable control programs. IEEE Transactions on Autonomous Mental Development, 3, 216-231. Special Issue on Representations and Architectures for Cognitive Systems.
  • 30
    • Hart, S., & Grupen, R. (2012). Intrinsically motivated affordance discovery and modeling. In G. Baldassarre & M. Mirolli (Eds.), Intrinsically motivated learning in natural and artificial systems. Berlin: Springer.
  • 31
    • Heckerman, D., Geiger, D., Chickering, D. (1995). Learning Bayesian networks: the combination of knowledge and statistical data. Machine Learning, 20, 197-243.
  • 32
    • Hengst, B. (2002). Discovering hierarchy in reinforcement learning with HEXQ. In C. Sammut & A. G. Hoffmann (Eds.), Machine learning, proceedings of the nineteenth international conference (ICML 2002) (pp. 243-250). San Francisco: Morgan Kaufmann.
  • 33
    • Huber, M., & Grupen, R. A. (1997). A feedback control structure for on-line learning tasks. Robotics and Autonomous Systems, 22, 303-315.
  • 34
    • Gibson, J. (1977). The theory of affordances. In R. Shaw & J. Bransford (Eds.), Perceiving, acting, and knowing: toward an ecological psychology (pp. 67-82). Hillsdale: Lawrence Erlbaum.
  • 37
    • Jonsson, A., & Barto, A. G. (2007). Active learning of dynamic Bayesian networks in Markov decision processes. In I. Miguel & W. Ruml (Eds.), Abstraction, reformulation, and approximation: 7th international symposium, SARA 2007, Whistler, Canada, July 18-21, 2007, proceedings (Lecture notes in computer science, vol. 4612, pp. 273-284). Berlin: Springer.
  • 38
    • Konidaris, G., & Barto, A. (2007). Building portable options: Skill transfer in reinforcement learning. In M. Veloso (Ed.), IJCAI 2007, proceedings of the 20th international joint conference on artificial intelligence, Hyderabad, India, 6-12 January 2007 (pp. 895-900). Menlo Park: AAAI Press.
  • 39
    • Konidaris, G., & Barto, A. (2009a). Efficient skill learning using abstraction selection. In C. Boutilier (Ed.), IJCAI 2009, proceedings of the 21st international joint conference on artificial intelligence, Pasadena, California, USA, 11-17 July 2009 (pp. 1107-1112). Menlo Park: AAAI Press.
  • 40
    • Konidaris, G., & Barto, A. (2009b). Skill discovery in continuous reinforcement learning domains using skill chaining. In Y. Bengio, D. Schuurmans, J. Lafferty, C. Williams, A. Culotta (Eds.), Advances in neural information processing systems 22 (pp. 1015-1023). NIPS Foundation.
  • 45
    • Konidaris, G. D. (2011). Autonomous robot skill acquisition. PhD thesis, Computer Science, University of Massachusetts Amherst.
  • 47
    • Langley, P., Choi, D., Rogers, S. (2009). Acquisition of hierarchical reactive skills in a unified cognitive architecture. Cognitive Systems Research, 10, 316-332.
  • 49
    • Lashley, K. S. (1951). The problem of serial order in behavior. In L. A. Jeffress (Ed.), Cerebral mechanisms in behavior: the Hixon symposium (pp. 112-136). New York: Wiley.
  • 50
    • Lewis, F. L., & Vrabie, D. (2009). Reinforcement learning and adaptive dynamic programming for feedback control. IEEE Circuits and Systems Magazine, 9, 32-50. IEEE Circuits and Systems Society.
  • 53
    • Mahadevan, S. (2009). Learning representation and control in Markov decision processes: new frontiers. Foundations and trends in machine learning (vol. 1). Hanover: Now Publishers Inc.
  • 54
    • Mannor, S., Menache, I., Hoze, A., Klein, U. (2004). Dynamic abstraction in reinforcement learning via clustering. In C. E. Brodley (Ed.), Machine learning, proceedings of the twenty-first international conference (ICML 2004). ACM international conference proceeding series (vol. 69, pp. 560-567). New York: ACM.
  • 56
    • McGovern, A., & Barto, A. (2001). Automatic discovery of subgoals in reinforcement learning using diverse density. In C. E. Brodley & A. P. Danyluk (Eds.), Proceedings of the eighteenth international conference on machine learning (ICML 2001) (pp. 361-368). San Francisco: Morgan Kaufmann.
  • 57
    • Mehta, N., Natarajan, S., Tadepalli, P. (2008). Transfer in variable-reward hierarchical reinforcement learning. Machine Learning, 73, 289-312.
  • 58
    • Menache, I., Mannor, S., Shimkin, N. (2002). Q-Cut - Dynamic discovery of sub-goals in reinforcement learning. In Machine learning: ECML 2002, 13th European conference on machine learning. Lecture notes in computer science (vol. 2430, pp. 295-306). Berlin: Springer.
  • 60
    • Mugan, J., & Kuipers, B. (2009). Autonomously learning an action hierarchy using a learned qualitative state representation. In C. Boutilier (Ed.), IJCAI 2009, proceedings of the 21st international joint conference on artificial intelligence, Pasadena, California, USA, 11-17 July 2009 (pp. 1175-1180). Menlo Park: AAAI Press.
  • 62
    • Neumann, G., Maass, W., Peters, J. (2009). Learning complex motions by sequencing simpler motion templates. In A. P. Danyluk, L. Bottou, M. L. Littman (Eds.), Proceedings of the 26th annual international conference on machine learning, ICML 2009. ACM international conference proceeding series (vol. 382, pp. 753-760). New York: ACM.
  • 63
    • Newell, A., Shaw, J. C., Simon, H. A. (1963). GPS, a program that simulates human thought. In J. Feldman (Ed.), Computers and thought (pp. 279-293). New York: McGraw-Hill.
  • 64
    • Niekum, S., & Barto, A. G. (2011). Clustering via Dirichlet process mixture models for portable skill discovery. In J. Shawe-Taylor, R. Zemel, P. Bartlett, F. Pereira, K. Weinberger (Eds.), Advances in neural information processing systems 24 (NIPS) (pp. 1818-1826). Curran Associates.
  • 65
    • Osentoski, S., & Mahadevan, S. (2010). Basis function construction for hierarchical reinforcement learning. In W. van der Hoek, G. A. Kaminka, Y. Lespérance, M. Luck, S. Sen (Eds.), 9th international conference on autonomous agents and multiagent systems (AAMAS 2010) (pp. 747-754). International Foundation for Autonomous Agents and MultiAgent Systems (IFAAMAS).
  • 70
    • Pickett, M., & Barto, A. G. (2002). PolicyBlocks: An algorithm for creating useful macro-actions in reinforcement learning. In C. Sammut & A. Hoffmann (Eds.), Machine learning, proceedings of the nineteenth international conference (ICML 2002) (pp. 506-513). San Francisco: Morgan Kaufmann.
  • 71
    • Ravindran, B., & Barto, A. G. (2002). Model minimization in hierarchical reinforcement learning. In S. Koenig & R. C. Holte (Eds.), Abstraction, reformulation and approximation, 5th international symposium, SARA 2002, Kananaskis, Alberta, Canada, 2-4 August 2002, proceedings. Lecture notes in computer science (vol. 2371, pp. 196-211). Berlin: Springer.
  • 72
    • Ryan, R. M., & Deci, E. L. (2000). Intrinsic and extrinsic motivations: classic definitions and new directions. Contemporary Educational Psychology, 25, 54-67.
  • 73
    • Sacerdoti, E. D. (1974). Planning in a hierarchy of abstraction spaces. Artificial Intelligence, 5, 115-135.
  • 76
    • Schneider, D. W., & Logan, G. D. (2006). Hierarchical control of cognitive processes: switching tasks in sequences. Journal of Experimental Psychology: General, 135, 623-640.
  • 78
    • Simon, H. A. (2005). The structure of complexity in an evolving world: the role of near decomposability. In W. Callebaut & D. Rasskin-Gutman (Eds.), Modularity: understanding the development and evolution of natural complex systems (pp. ix-xiii). Cambridge: MIT.
  • 79
    • Şimşek, Ö., & Barto, A. (2004). Using relative novelty to identify useful temporal abstractions in reinforcement learning. In C. E. Brodley (Ed.), Machine learning, proceedings of the twenty-first international conference (ICML 2004). ACM international conference proceeding series (vol. 69, pp. 751-758). New York: ACM.
  • 80
    • Şimşek, Ö., & Barto, A. (2009). Skill characterization based on betweenness. In D. Koller, D. Schuurmans, Y. Bengio, L. Bottou (Eds.), Advances in neural information processing systems 21, proceedings of the twenty-second annual conference on neural information processing systems (pp. 1497-1504). Red Hook: Curran Associates, Inc.
  • 81
    • Şimşek, Ö., Wolfe, A. P., Barto, A. (2005). Identifying useful subgoals in reinforcement learning by local graph partitioning. In L. D. Raedt & S. Wrobel (Eds.), Machine learning, proceedings of the twenty-second international conference (ICML 2005). ACM international conference proceeding series (vol. 119, pp. 816-823). New York: ACM.
  • 83
    • Singh, S., Lewis, R. L., Barto, A. G., Sorg, J. (2010). Intrinsically motivated reinforcement learning: An evolutionary perspective. IEEE Transactions on Autonomous Mental Development, 2, 70-82. Special issue on Active Learning and Intrinsically Motivated Exploration in Robots: Advances and Challenges.
  • 84
    • Soni, V., & Singh, S. (2006). Reinforcement learning of hierarchical skills on the Sony Aibo robot. In L. Smith, O. Sporns, C. Yu, M. Gasser, C. Breazeal, G. Deak, J. Weng (Eds.), Fifth international conference on development and learning (ICDL). Bloomington, IN.
  • 88
    • Sutton, R. S., Precup, D., Singh, S. (1999). Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112, 181-211.
  • 89
    • Taylor, M. E., & Stone, P. (2009). Transfer learning for reinforcement learning domains: A survey. Journal of Machine Learning Research, 10, 1633-1685.
  • 90
    • Taylor, M. E., Stone, P., Liu, Y. (2007). Transfer learning via inter-task mappings for temporal difference learning. Journal of Machine Learning Research, 8, 2125-2167.
  • 93
    • Tesauro, G. J. (1994). TD-gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation, 6, 215-219.
  • 96
    • Torrey, L., Shavlik, J., Walker, J., Maclin, R. (2008). Relational macros for transfer in reinforcement learning. In H. Blockeel, J. Ramon, J. Shavlik, P. Tadepalli (Eds.), Inductive logic programming: 17th international conference, ILP 2007. Lecture notes in computer science (vol. 4894, pp. 254-268). Berlin: Springer.
  • 97
    • van Seijen, H., Whiteson, S., Kester, L. (2007). Switching between representations in reinforcement learning. In R. Babuska & F. C. A. Groen (Eds.), Interactive collaborative information systems. Studies in computational intelligence (vol. 281, pp. 65-84). Berlin: Springer.
  • 98
    • Vigorito, C., & Barto, A. G. (2010). Intrinsically motivated hierarchical skill learning in structured environments. IEEE Transactions on Autonomous Mental Development, 2, 83-90. Special issue on Active Learning and Intrinsically Motivated Exploration in Robots: Advances and Challenges.
  • 100
    • White, R. W. (1959). Motivation reconsidered: the concept of competence. Psychological Review, 66, 297-333.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.