SCOPUS Information Retrieval Platform

Adaptive Behavior

Volume 13, Issue 1, 2005, Pages 5-32

An architecture for behavior-based reinforcement learning

(2) Konidaris, G. D. (a); Hayes, G. M. (a)

(a) University of Edinburgh (United Kingdom)

Author keywords

Artificial intelligence; Layered learning; Reinforcement learning; Robotics

Indexed keywords

EID: 26444468603 PISSN: 10597123 EISSN: None Source Type: Journal
DOI: 10.1177/105971230501300101 Document Type: Article

Times cited : (25)

References (39)

1
- 0025449341
- What are plans for?
- P. Maes (Ed.), Cambridge, MA: MIT Press
- Agre, P., & Chapman, D. (1990). What are plans for? In P. Maes (Ed.), New architectures for autonomous agents: Task-level decomposition and emergent functionality. Cambridge, MA: MIT Press.
- (1990) New Architectures for Autonomous Agents: Task-level Decomposition and Emergent Functionality
- Agre, P.¹ Chapman, D.²

2
- 0003601068
- (Technical Report GIT-CC-97-11). College of Computing, Georgia Institute of Technology
- Balch, T. (1997a). Clay: Integrating motor schemas and reinforcement learning (Technical Report GIT-CC-97-11). College of Computing, Georgia Institute of Technology.
- (1997) Clay: Integrating Motor Schemas and Reinforcement Learning
- Balch, T.¹

3
- 26444584292
- Integrating RL and behavior-based control for soccer
- Berlin: Springer-Verlag
- Balch, T. (1997b). Integrating RL and behavior-based control for soccer. RoboCup-97: Proceedings of the First Robot World Cup Soccer Games and Conferences. Berlin: Springer-Verlag.
- (1997) RoboCup-97: Proceedings of the First Robot World Cup Soccer Games and Conferences
- Balch, T.¹

4
- 26444506193
- Reward and diversity in multirobot foraging
- S. Sen and J. M. Vidal (Eds.)
- Balch, T. (1999). Reward and diversity in multirobot foraging. In S. Sen and J. M. Vidal (Eds.), Proceedings of the IJCAI Workshop on Agents Learning About, From and With Other Agents.
- (1999) Proceedings of the IJCAI Workshop on Agents Learning About, from and with Other Agents
- Balch, T.¹

5
- 0029210635
- Learning to act using real-time dynamic programming
- Barto, A., Bradtke, S., & Singh, S. (1995). Learning to act using real-time dynamic programming. Artificial Intelligence, 72, 81-138.
- (1995) Artificial Intelligence , vol.72 , pp. 81-138
- Barto, A.¹ Bradtke, S.² Singh, S.³

6
- 0003636164
- Englewood Cliffs, NJ: Prentice Hall
- Bertsekas, D., & Tsitsiklis, J. (1989). Parallel and distributed computation: Numerical methods. Englewood Cliffs, NJ: Prentice Hall.
- (1989) Parallel and Distributed Computation: Numerical Methods
- Bertsekas, D.¹ Tsitsiklis, J.²

7
- 27144476780
- Planning is just a way of avoiding figuring out what to do next
- R. Brooks (Ed.), Cambridge, MA: MIT Press
- Brooks, R. (1987). Planning is just a way of avoiding figuring out what to do next. In R. Brooks (Ed.), Cambrian intelligence: The early history of the new AI (pp. 103-110). Cambridge, MA: MIT Press.
- (1987) Cambrian Intelligence: The Early History of the New AI , pp. 103-110
- Brooks, R.¹

8
- 0010535077
- Intelligence without representation
- J. Haugeland (Ed.), Cambridge, MA: MIT Press
- Brooks, R. (1991a). Intelligence without representation. In J. Haugeland (Ed.), Mind design II (pp. 395-420). Cambridge, MA: MIT Press.
- (1991) Mind Design II , pp. 395-420
- Brooks, R.¹

9
- 0009351684
- The role of learning in autonomous robots
- M. K. Warmuth and L. G. Valiant (Eds.), San Francisco, CA: Morgan Kaufmann
- Brooks, R. (1991b). The role of learning in autonomous robots. In M. K. Warmuth and L. G. Valiant (Eds.), Proceedings of the Fourth Annual Workshop on Computational Learning Theory (COLT '91) (pp. 5-10). San Francisco, CA: Morgan Kaufmann.
- (1991) Proceedings of the Fourth Annual Workshop on Computational Learning Theory (COLT '91) , pp. 5-10
- Brooks, R.¹

10
- 35248894899
- Modularity and specialized learning: Reexamining behavior-based artificial intelligence
- M. Butz, P. Gérard, & O. Sigaud (Eds.), Berlin: Springer
- Bryson, J. (2002). Modularity and specialized learning: Reexamining behavior-based artificial intelligence. In M. Butz, P. Gérard, & O. Sigaud (Eds.), Proceedings of the Workshop on Adaptive Behavior in Anticipatory Learning Systems. Berlin: Springer.
- (2002) Proceedings of the Workshop on Adaptive Behavior in Anticipatory Learning Systems
- Bryson, J.¹

11
- 0035301619
- Topological simultaneous localization and mapping (SLAM): Towards exact localization without explicit localization
- Choset, H., & Nagatani, K. (2001). Topological simultaneous localization and mapping (SLAM): Towards exact localization without explicit localization. IEEE Transactions on Robotics and Automation, 17(2), 125-137.
- (2001) IEEE Transactions on Robotics and Automation , vol.17 , Issue.2 , pp. 125-137
- Choset, H.¹ Nagatani, K.²

12
- 79953249396
- Learning in a state of confusion: Perceptual aliasing in grid world navigation
- U. Nehmzow and C. Melhuish (Eds.), London, UK: IEE
- Crook, P., & Hayes, G. (2003). Learning in a state of confusion: Perceptual aliasing in grid world navigation. In U. Nehmzow and C. Melhuish (Eds.), Proceedings of the 4th British Conference on (Mobile) Robotics: Towards Intelligent Mobile Robots (TIMR 2003). London, UK: IEE.
- (2003) Proceedings of the 4th British Conference on (Mobile) Robotics: Towards Intelligent Mobile Robots (TIMR 2003)
- Crook, P.¹ Hayes, G.²

13
- 0004782095
- Learning hierarchical control structures for multiple tasks and changing environments
- R. Pfeifer, B. Blumberg, J. Meyer, & S. Wilson (Eds.), Cambridge, MA: MIT Press
- Digney, B. (1998). Learning hierarchical control structures for multiple tasks and changing environments. In R. Pfeifer, B. Blumberg, J. Meyer, & S. Wilson (Eds.), From Animals to Animats 5: Proceedings of the Fifth International Conference on Simulation of Adaptive Behavior (pp. 321-330). Cambridge, MA: MIT Press.
- (1998) From Animals to Animats 5: Proceedings of the Fifth International Conference on Simulation of Adaptive Behavior , pp. 321-330
- Digney, B.¹

14
- 0004136810
- Using local information in a non-local way for mapping graph-like worlds
- R. Bajcsy (Ed.) San Francisco, CA: Morgan Kaufmann
- Dudek, G., Freedman, P., & Hadjres, S. (1993). Using local information in a non-local way for mapping graph-like worlds. In R. Bajcsy (Ed.) Proceedings of the International Joint Conference of Artificial Intelligence (pp. 1639-1647). San Francisco, CA: Morgan Kaufmann.
- (1993) Proceedings of the International Joint Conference of Artificial Intelligence , pp. 1639-1647
- Dudek, G.¹ Freedman, P.² Hadjres, S.³

15
- 85152517921
- An approach to anytime learning
- D. H. Sleeman and P. Edwards (Eds.), San Francisco, CA: Morgan Kaufmann
- Grefenstette, J., & Ramsey, C. (1992). An approach to anytime learning. In D. H. Sleeman and P. Edwards (Eds.), Proceedings of the Ninth International Conference on Machine Learning (pp. 189-195). San Francisco, CA: Morgan Kaufmann.
- (1992) Proceedings of the Ninth International Conference on Machine Learning , pp. 189-195
- Grefenstette, J.¹ Ramsey, C.²

16
- 0011714199
- D. Phil. thesis, School of Cognitive and Computing Sciences, University of Sussex
- Harvey, I. (1995). The artificial evolution of adaptive behaviour. D. Phil. thesis, School of Cognitive and Computing Sciences, University of Sussex.
- (1995) The Artificial Evolution of Adaptive Behaviour
- Harvey, I.¹

17
- 0007914441
- Action selection methods using reinforcement learning
- P. Maes, M. Matarić, J.-A. Meyer, J. Pollack, & S. Wilson (Eds.), Cambridge, MA: MIT Press
- Humphrys, M. (1996). Action selection methods using reinforcement learning. In P. Maes, M. Matarić, J.-A. Meyer, J. Pollack, & S. Wilson (Eds.), From Animals to Animats 4: The Fourth International Conference on the Simulation of Adaptive Behaviour (SAB-96) (pp. 135-144). Cambridge, MA: MIT Press.
- (1996) From Animals to Animats 4: The Fourth International Conference on the Simulation of Adaptive Behaviour (SAB-96) , pp. 135-144
- Humphrys, M.¹

18
- 26444579198
- Lausanne, Switzerland
- K-Team SA (1999a). Khepera K213 vision turret user manual. Lausanne, Switzerland.
- (1999) Khepera K213 Vision Turret User Manual

19
- 26444582589
- Lausanne, Switzerland
- K-Team SA (1999b). Khepera user manual. Lausanne, Switzerland.
- (1999) Khepera User Manual

20
- 0003527079
- Berlin: Springer-Verlag
- Kohonen, T. (1989). Self-organization and associative memory (3rd ed.). Berlin: Springer-Verlag.
- (1989) Self-organization and Associative Memory (3rd Ed.)
- Kohonen, T.¹

21
- 26444470752
- Master's thesis, School of Informatics, University of Edinburgh
- Konidaris, G. (2003). Behaviour-based reinforcement learning. Master's thesis, School of Informatics, University of Edinburgh.
- (2003) Behaviour-based Reinforcement Learning
- Konidaris, G.¹

22
- 84976813028
- Learning to coordinate behaviors
- T. Dietterich and W. Swartout (Eds.), Cambridge, MA
- Maes, P., & Brooks, R. (1990). Learning to coordinate behaviors. In T. Dietterich and W. Swartout (Eds.), Proceedings of the Eighth National Conference on Artificial Intelligence (pp. 796-802). Cambridge, MA.
- (1990) Proceedings of the Eighth National Conference on Artificial Intelligence , pp. 796-802
- Maes, P.¹ Brooks, R.²

23
- 0026880130
- Automatic programming of behavior-based robots using reinforcement learning
- Mahadevan, S., & Connell, J. (1992). Automatic programming of behavior-based robots using reinforcement learning. Artificial Intelligence, 55(2-3), 311-365.
- (1992) Artificial Intelligence , vol.55 , Issue.2-3 , pp. 311-365
- Mahadevan, S.¹ Connell, J.²

24
- 0036789790
- A self-organising network that grows when required
- Marsland, S., Shapiro, J., & Nehmzow, U. (2002). A self-organising network that grows when required. Neural Networks, 15(8-9), 1041-1058.
- (2002) Neural Networks , vol.15 , Issue.8-9 , pp. 1041-1058
- Marsland, S.¹ Shapiro, J.² Nehmzow, U.³

25
- 84957895797
- Reward functions for accelerated learning
- W. W. Cohen and H. Hirsh (Eds.), San Francisco, CA: Morgan Kaufmann
- Matarić, M. (1994). Reward functions for accelerated learning. In W. W. Cohen and H. Hirsh (Eds.), Proceedings of the Eleventh International Conference on Machine Learning (pp. 181-189). San Francisco, CA: Morgan Kaufmann.
- (1994) Proceedings of the Eleventh International Conference on Machine Learning , pp. 181-189
- Matarić, M.¹

26
- 0030647149
- Reinforcement learning in the multi-robot domain
- Matarić, M. (1997). Reinforcement learning in the multi-robot domain. Autonomous Robots, 4(1), 73-83.
- (1997) Autonomous Robots , vol.4 , Issue.1 , pp. 73-83
- Matarić, M.¹

27
- 26444496413
- Learning a distributed map representation based on navigation behaviors
- R. Brooks (Ed.), Cambridge, Massachusetts: The MIT Press
- Matarić, M., & Brooks, R. (1990). Learning a distributed map representation based on navigation behaviors. In R. Brooks (Ed.), Cambrian intelligence : The early history of the new AI. Cambridge, Massachusetts: The MIT Press.
- (1990) Cambrian Intelligence: The Early History of the New AI
- Matarić, M.¹ Brooks, R.²

28
- 0004255908
- London, UK: McGraw-Hill
- Mitchell, T. (1997). Machine learning. London, UK: McGraw-Hill.
- (1997) Machine Learning
- Mitchell, T.¹

29
- 0004156494
- Evolutionary algorithms for reinforcement learning
- Moriarty, D., Schultz, A., & Grefenstette, J. (1999). Evolutionary algorithms for reinforcement learning. Journal of Artificial Intelligence Research, 11.
- (1999) Journal of Artificial Intelligence Research , vol.11
- Moriarty, D.¹ Schultz, A.² Grefenstette, J.³

30
- 84898304094
- Polarization compass for robot navigation
- D. Polani, J. Kim, & T. Martinetz (Eds.), Berlin: Akademische Verlagsgesellschaft Aka
- Schmolke, A., & Mallot, H. (2002). Polarization compass for robot navigation. In D. Polani, J. Kim, & T. Martinetz (Eds.), The Fifth German Workshop on Artificial Life (pp. 163-167). Berlin: Akademische Verlagsgesellschaft Aka.
- (2002) The Fifth German Workshop on Artificial Life , pp. 163-167
- Schmolke, A.¹ Mallot, H.²

31
- 0001898381
- Practical reinforcement learning in continuous spaces
- P. Langley (Ed.), San Francisco, CA: Morgan Kaufmann
- Smart, W., & Kaelbling, L. (2000). Practical reinforcement learning in continuous spaces. In P. Langley (Ed.), Proceedings of the Seventeenth International Conference on Machine Learning (pp. 903-910). San Francisco, CA: Morgan Kaufmann.
- (2000) Proceedings of the Seventeenth International Conference on Machine Learning , pp. 903-910
- Smart, W.¹ Kaelbling, L.²

32
- 0036790898
- Applications of the self-organising map to reinforcement learning
- Smith, A. J. (2002). Applications of the self-organising map to reinforcement learning. Neural Networks, 15, 1107-1124.
- (2002) Neural Networks , vol.15 , pp. 1107-1124
- Smith, A.J.¹

33
- 84974678409
- Layered learning
- R. Lopez de Mantaras and E. Plaza (Eds.), Berlin: Springer
- Stone, P., & Veloso, M. (2000). Layered learning. In R. Lopez de Mantaras and E. Plaza (Eds.), Proceedings of the 11th European Conference on Machine Learning (pp. 369-381). Berlin: Springer.
- (2000) Proceedings of the 11th European Conference on Machine Learning , pp. 369-381
- Stone, P.¹ Veloso, M.²

34
- 0011200414
- Reinforcement learning architectures for animats
- J. Meyer, & S. Wilson (Eds.), Cambridge, MA: MIT Press
- Sutton, R. (1990). Reinforcement learning architectures for animats. In J. Meyer, & S. Wilson (Eds.), From animals to animats: Proceedings of the International Conference on Simulation of Adaptive Behavior (pp. 288-296). Cambridge, MA: MIT Press.
- (1990) From Animals to Animats: Proceedings of the International Conference on Simulation of Adaptive Behavior , pp. 288-296
- Sutton, R.¹

35
- 85152618928
- Planning by incremental dynamic programming
- L. Birnbaum and G. Collins (Eds.), San Francisco, CA: Morgan Kaufmann
- Sutton, R. (1991). Planning by incremental dynamic programming. In L. Birnbaum and G. Collins (Eds.), Proceedings of the Ninth Conference on Machine Learning (pp. 353-357). San Francisco, CA: Morgan Kaufmann.
- (1991) Proceedings of the Ninth Conference on Machine Learning , pp. 353-357
- Sutton, R.¹

36
- 0004102479
- Cambridge, MA: MIT Press
- Sutton, R., & Barto, A. (1998). Reinforcement learning: An introduction. Cambridge, MA: MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

37
- 26444556750
- Reinforcement landmark learning
- R. Pfeifer, B. Blumberg, J. Meyer, & S. Wilson (Eds.), Cambridge, MA: MIT Press
- Toombs, S., Phillips, W., & Smith, L. (1998). Reinforcement landmark learning. In R. Pfeifer, B. Blumberg, J. Meyer, & S. Wilson (Eds.), From animals to animats 5: Proceedings of the Fifth International Conference on Simulation of Adaptive Behavior (pp. 205-212). Cambridge, MA: MIT Press.
- (1998) From Animals to Animats 5: Proceedings of the Fifth International Conference on Simulation of Adaptive Behavior , pp. 205-212
- Toombs, S.¹ Phillips, W.² Smith, L.³

38
- 34249833101
- Q-learning
- Watkins, C., & Dayan, P. (1992). Q-learning. Machine Learning, 8, 279-292.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

39
- 1142280955
- Concurrent layered learning
- J. S. Rosenschein, M. Wooldridge, T. Sandholm and M. Yokoo (Eds.), New York, NY: ACM Press
- Whiteson, S., & Stone, P. (2003). Concurrent layered learning. In J. S. Rosenschein, M. Wooldridge, T. Sandholm and M. Yokoo (Eds.), Proceedings of the Second International Joint Conference on Autonomous Agents and Multi-Agent Systems (pp. 193-200). New York, NY: ACM Press.
- (2003) Proceedings of the Second International Joint Conference on Autonomous Agents and Multi-Agent Systems , pp. 193-200
- Whiteson, S.¹ Stone, P.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.