메뉴 건너뛰기




Volumn 13, Issue 1, 2005, Pages 5-32

An architecture for behavior-based reinforcement learning

Author keywords

Artificial intelligence; Layered learning; Reinforcement learning; Robotics

Indexed keywords


EID: 26444468603     PISSN: 10597123     EISSN: None     Source Type: Journal    
DOI: 10.1177/105971230501300101     Document Type: Article
Times cited : (25)

References (39)
  • 2
  • 5
    • 0029210635 scopus 로고
    • Learning to act using real-time dynamic programming
    • Barto, A., Bradtke, S., & Singh, S. (1995). Learning to act using real-time dynamic programming. Artificial Intelligence, 72, 81-138.
    • (1995) Artificial Intelligence , vol.72 , pp. 81-138
    • Barto, A.1    Bradtke, S.2    Singh, S.3
  • 7
    • 27144476780 scopus 로고
    • Planning is just a way of avoiding figuring out what to do next
    • R. Brooks (Ed.), Cambridge, MA: MIT Press
    • Brooks, R. (1987). Planning is just a way of avoiding figuring out what to do next. In R. Brooks (Ed.), Cambrian intelligence: The early history of the new AI (pp. 103-110). Cambridge, MA: MIT Press.
    • (1987) Cambrian Intelligence: The Early History of the New AI , pp. 103-110
    • Brooks, R.1
  • 8
    • 0010535077 scopus 로고
    • Intelligence without representation
    • J. Haugeland (Ed.), Cambridge, MA: MIT Press
    • Brooks, R. (1991a). Intelligence without representation. In J. Haugeland (Ed.), Mind design II (pp. 395-420). Cambridge, MA: MIT Press.
    • (1991) Mind Design II , pp. 395-420
    • Brooks, R.1
  • 9
    • 0009351684 scopus 로고
    • The role of learning in autonomous robots
    • M. K. Warmuth and L. G. Valiant (Eds.), San Francisco, CA: Morgan Kauffman
    • Brooks, R. (1991b). The role of learning in autonomous robots. In M. K. Warmuth and L. G. Valiant (Eds.), Proceedings of the Fourth Annual Workshop on Computational Learning Theory (COLT '91) (pp. 5-10). San Francisco, CA: Morgan Kauffman.
    • (1991) Proceedings of the Fourth Annual Workshop on Computational Learning Theory (COLT '91) , pp. 5-10
    • Brooks, R.1
  • 10
    • 35248894899 scopus 로고    scopus 로고
    • Modularity and specialized learning: Reexamining behavior-based artificial intelligence
    • M. Butz, P. Gérard, & O. Sigaud (Eds.), Berlin: Springer
    • Bryson, J. (2002). Modularity and specialized learning: Reexamining behavior-based artificial intelligence. In M. Butz, P. Gérard, & O. Sigaud (Eds.), Proceedings of the Workshop on Adaptive Behavior in Anticipatory Learning Systems. Berlin: Springer.
    • (2002) Proceedings of the Workshop on Adaptive Behavior in Anticipatory Learning Systems
    • Bryson, J.1
  • 11
    • 0035301619 scopus 로고    scopus 로고
    • Topological simultaneous localization and mapping (SLAM): Towards exact localization without explicit localization
    • Choset, H., & Nagatani, K. (2001). Topological simultaneous localization and mapping (SLAM): Towards exact localization without explicit localization. IEEE Transactions on Robotics and Automation, 17(2), 125-137.
    • (2001) IEEE Transactions on Robotics and Automation , vol.17 , Issue.2 , pp. 125-137
    • Choset, H.1    Nagatani, K.2
  • 13
    • 0004782095 scopus 로고    scopus 로고
    • Learning hierarchical control structures for multiple tasks and changing environments
    • R. Pfeifer, B. Blumberg, J. Meyer, & S. Wilson (Eds.), Cambridge, MA: MIT Press
    • Digney, B. (1998). Learning hierarchical control structures for multiple tasks and changing environments. In R. Pfeifer, B. Blumberg, J. Meyer, & S. Wilson (Eds.), From Animals to Animats 5: Proceedings of the Fifth International Conference on Simulation of Adaptive Behavior (pp. 321-330). Cambridge, MA: MIT Press.
    • (1998) From Animals to Animats 5: Proceedings of the Fifth International Conference on Simulation of Adaptive Behavior , pp. 321-330
    • Digney, B.1
  • 16
    • 0011714199 scopus 로고
    • D. Phil. thesis, School of Cognitive and Computing Sciences, University of Sussex
    • Harvey, I. (1995). The artificial evolution of adaptive behaviour. D. Phil. thesis, School of Cognitive and Computing Sciences, University of Sussex.
    • (1995) The Artificial Evolution of Adaptive Behaviour
    • Harvey, I.1
  • 19
    • 26444582589 scopus 로고    scopus 로고
    • Lausanne, Switzerland
    • K-Team SA (1999b). Khepera user manual. Lausanne, Switzerland.
    • (1999) Khepera User Manual
  • 21
    • 26444470752 scopus 로고    scopus 로고
    • Master's thesis, School of Informatics, University of Edinburgh
    • Konidaris, G. (2003). Behaviour-based reinforcement learning. Master's thesis, School of Informatics, University of Edinburgh.
    • (2003) Behaviour-based Reinforcement Learning
    • Konidaris, G.1
  • 23
    • 0026880130 scopus 로고
    • Automatic programming of behavior-based robots using reinforcement learning
    • Mahadevan, S., & Connell, J. (1992). Automatic programming of behavior-based robots using reinforcement learning. Artificial Intelligence, 55(2-3), 311-365.
    • (1992) Artificial Intelligence , vol.55 , Issue.2-3 , pp. 311-365
    • Mahadevan, S.1    Connell, J.2
  • 24
    • 0036789790 scopus 로고    scopus 로고
    • A self-organising network that grows when required
    • Marsland, S., Shapiro, J., & Nehmzow, U. (2002). A self-organising network that grows when required. Neural Networks, 15(8-9), 1041-1058.
    • (2002) Neural Networks , vol.15 , Issue.8-9 , pp. 1041-1058
    • Marsland, S.1    Shapiro, J.2    Nehmzow, U.3
  • 25
    • 84957895797 scopus 로고
    • Reward functions for accelerated learning
    • W. W. Cohen and H. Hirsh (Eds.), San Francisco, CA: Morgan Kaufmann
    • Matarić, M. (1994). Reward functions for accelerated learning. In W. W. Cohen and H. Hirsh (Eds.), Proceedings of the Eleventh International Conference on Machine Learning (pp. 181-189). San Francisco, CA: Morgan Kaufmann.
    • (1994) Proceedings of the Eleventh International Conference on Machine Learning , pp. 181-189
    • Matarić, M.1
  • 26
    • 0030647149 scopus 로고    scopus 로고
    • Reinforcement learning in the multi-robot domain
    • Matarić, M. (1997). Reinforcement learning in the multi-robot domain. Autonomous Robots, 4(1), 73-83.
    • (1997) Autonomous Robots , vol.4 , Issue.1 , pp. 73-83
    • Matarić, M.1
  • 27
    • 26444496413 scopus 로고
    • Learning a distributed map representation based on navigation behaviors
    • R. Brooks (Ed.), Cambridge, Massachusetts: The MIT Press
    • Matarić, M., & Brooks, R. (1990). Learning a distributed map representation based on navigation behaviors. In R. Brooks (Ed.), Cambrian intelligence : The early history of the new AI. Cambridge, Massachusetts: The MIT Press.
    • (1990) Cambrian Intelligence: The Early History of the New AI
    • Matarić, M.1    Brooks, R.2
  • 28
    • 0004255908 scopus 로고    scopus 로고
    • London, UK: McGraw-Hill
    • Mitchell, T. (1997). Machine learning. London, UK: McGraw-Hill. 42
    • (1997) Machine Learning , vol.42
    • Mitchell, T.1
  • 30
    • 84898304094 scopus 로고    scopus 로고
    • Polarization compass for robot navigation
    • D. Polani, J. Kim, & T. Martinetz (Eds.), Berlin: Akademische Verlagsgesellschaft Aka
    • Schmolke, A., & Mallot, H. (2002). Polarization compass for robot navigation. In D. Polani, J. Kim, & T. Martinetz (Eds.), The Fifth German Workshop on Artificial Life (pp. 163-167). Berlin: Akademische Verlagsgesellschaft Aka.
    • (2002) The Fifth German Workshop on Artificial Life , pp. 163-167
    • Schmolke, A.1    Mallot, H.2
  • 32
    • 0036790898 scopus 로고    scopus 로고
    • Applications of the self-organising map to reinforcement learning
    • Smith, A. J. (2002). Applications of the self-organising map to reinforcement learning. Neural Networks, 15, 1107-1124.
    • (2002) Neural Networks , vol.15 , pp. 1107-1124
    • Smith, A.J.1
  • 35
    • 85152618928 scopus 로고
    • Planning by incremental dynamic programming
    • L. Birnbaum and G. Collins (Eds.), San Francisco, CA: Morgan Kaufmann
    • Sutton, R. (1991). Planning by incremental dynamic programming. In L. Birnbaum and G. Collins (Eds.), Proceedings of the Ninth Conference on Machine Learning (pp. 353-357). San Francisco, CA: Morgan Kaufmann.
    • (1991) Proceedings of the Ninth Conference on Machine Learning , pp. 353-357
    • Sutton, R.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.