메뉴 건너뛰기




Volumn 5, Issue 3-4, 1997, Pages 365-390

Measuring the effectiveness of reinforcement learning for behavior-based robots

Author keywords

Behavior based architectures; Reinforcement learning; Robot learning

Indexed keywords


EID: 0031287710     PISSN: 10597123     EISSN: None     Source Type: Journal    
DOI: 10.1177/105971239700500307     Document Type: Article
Times cited : (5)

References (19)
  • 1
    • 0022688781 scopus 로고
    • A robust layered control system for a mobile robot
    • Brooks, R. (1986). A robust layered control system for a mobile robot. IEEE Journal of Robotics and Automation, RA-2(1), 14-23.
    • (1986) IEEE Journal of Robotics and Automation , vol.RA-2 , Issue.1 , pp. 14-23
    • Brooks, R.1
  • 2
    • 0018480749 scopus 로고
    • The ubiquitous B tree
    • Comer, D. (1979). The ubiquitous B tree. ACM Computing Surveys, 11(2), 121-137.
    • (1979) ACM Computing Surveys , vol.11 , Issue.2 , pp. 121-137
    • Comer, D.1
  • 3
    • 0001041553 scopus 로고
    • Rapid task learning for real robots
    • J. H. Connell & S. Mahadevan (Eds.), Norwell, MA: Kluwer Academic
    • Connell, J., & Mahadevan, S. (1993). Rapid task learning for real robots. In J. H. Connell & S. Mahadevan (Eds.), Robot learning. Norwell, MA: Kluwer Academic.
    • (1993) Robot Learning
    • Connell, J.1    Mahadevan, S.2
  • 4
    • 0029326107 scopus 로고
    • Alecsys and the autonomouse: Learning to control a real robot by distributed classifier systems
    • Dorigo, M. (1995). Alecsys and the autonomouse: Learning to control a real robot by distributed classifier systems. Machine Learning, 19(3), 209-240.
    • (1995) Machine Learning , vol.19 , Issue.3 , pp. 209-240
    • Dorigo, M.1
  • 5
    • 0028739953 scopus 로고
    • Robot shaping: Developing autonomous agents through learning
    • Dorigo, M., & Colombetti, M. (1994). Robot shaping: Developing autonomous agents through learning. Artificial Intelligence, 71(2), 321-370.
    • (1994) Artificial Intelligence , vol.71 , Issue.2 , pp. 321-370
    • Dorigo, M.1    Colombetti, M.2
  • 10
    • 0010853273 scopus 로고
    • To discount or not to discount in reinforcement learning: A case study comparing R-learning and Q-learning
    • New Brunswick, NJ. San Mateo, CA: Morgan Kaufmann
    • Mahadevan, S. (1994). To discount or not to discount in reinforcement learning: A case study comparing R-learning and Q-learning. In Proceedings of the Eleventh International Conference on Machine Learning, New Brunswick, NJ. San Mateo, CA: Morgan Kaufmann.
    • (1994) Proceedings of the Eleventh International Conference on Machine Learning
    • Mahadevan, S.1
  • 11
    • 0029752592 scopus 로고    scopus 로고
    • Average reward reinforcement learning: Foundations, algorithms, and empirical results
    • Mahadevan, S. (1996). Average reward reinforcement learning: Foundations, algorithms, and empirical results. Machine Learning, 22, 159-196.
    • (1996) Machine Learning , vol.22 , pp. 159-196
    • Mahadevan, S.1
  • 12
    • 0026880130 scopus 로고
    • Automatic programming of behavior-based robots using reinforcement learning
    • Mahadevan, S., & Connell, J. (1992). Automatic programming of behavior-based robots using reinforcement learning. Artificial Intelligence, 55, 311-365.
    • (1992) Artificial Intelligence , vol.55 , pp. 311-365
    • Mahadevan, S.1    Connell, J.2
  • 15
    • 0001027894 scopus 로고
    • Transfer of learning by composing solutions of elemental sequential tasks
    • Singh, S. P. (1992). Transfer of learning by composing solutions of elemental sequential tasks. Machine Learning, 8(3-4), 323-339.
    • (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 323-339
    • Singh, S.P.1
  • 19
    • 0003326518 scopus 로고
    • Learning multiple goal behavior via task decomposition and dynamic policy merging
    • J. H. Connell and S. Mahadevan (Eds.), Norwell, MA: Kluwer Academic
    • Whitehead, S. D., Karlsson, J., & Tenenberg, J. (1993). Learning multiple goal behavior via task decomposition and dynamic policy merging. In J. H. Connell and S. Mahadevan (Eds.), Robot learning. Norwell, MA: Kluwer Academic.
    • (1993) Robot Learning
    • Whitehead, S.D.1    Karlsson, J.2    Tenenberg, J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.