SCOPUS 정보 검색 플랫폼

Volumn 5, Issue 3-4, 1997, Pages 365-390

Measuring the effectiveness of reinforcement learning for behavior-based robots

Author keywords

Behavior based architectures; Reinforcement learning; Robot learning

Indexed keywords

EID: 0031287710 PISSN: 10597123 EISSN: None Source Type: Journal
DOI: 10.1177/105971239700500307 Document Type: Article

Times cited : (5)

References (19)

1
- 0022688781
- A robust layered control system for a mobile robot
- Brooks, R. (1986). A robust layered control system for a mobile robot. IEEE Journal of Robotics and Automation, RA-2(1), 14-23.
- (1986) IEEE Journal of Robotics and Automation , vol.RA-2 , Issue.1 , pp. 14-23
- Brooks, R.¹

2
- 0018480749
- The ubiquitous B tree
- Comer, D. (1979). The ubiquitous B tree. ACM Computing Surveys, 11(2), 121-137.
- (1979) ACM Computing Surveys , vol.11 , Issue.2 , pp. 121-137
- Comer, D.¹

4
- 0029326107
- Alecsys and the autonomouse: Learning to control a real robot by distributed classifier systems
- Dorigo, M. (1995). Alecsys and the autonomouse: Learning to control a real robot by distributed classifier systems. Machine Learning, 19(3), 209-240.
- (1995) Machine Learning , vol.19 , Issue.3 , pp. 209-240
- Dorigo, M.¹

5
- 0028739953
- Robot shaping: Developing autonomous agents through learning
- Dorigo, M., & Colombetti, M. (1994). Robot shaping: Developing autonomous agents through learning. Artificial Intelligence, 71(2), 321-370.
- (1994) Artificial Intelligence , vol.71 , Issue.2 , pp. 321-370
- Dorigo, M.¹ Colombetti, M.²

6
- 0003753118
- Englewood Cliffs, NJ: Prentice-Hall
- Hilgard, E. R., & Bower, G. H. (1975). Theories of learning (4th ed.). Englewood Cliffs, NJ: Prentice-Hall.
- (1975) Theories of Learning (4th Ed.)
- Hilgard, E.R.¹ Bower, G.H.²

8
- 0029679044
- Reinforcement learning: A survey
- Kaelbling, L. P., Littman, M. L., & Moore, A. W. (1996). Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4, 237-285.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

11
- 0029752592
- Average reward reinforcement learning: Foundations, algorithms, and empirical results
- Mahadevan, S. (1996). Average reward reinforcement learning: Foundations, algorithms, and empirical results. Machine Learning, 22, 159-196.
- (1996) Machine Learning , vol.22 , pp. 159-196
- Mahadevan, S.¹

12
- 0026880130
- Automatic programming of behavior-based robots using reinforcement learning
- Mahadevan, S., & Connell, J. (1992). Automatic programming of behavior-based robots using reinforcement learning. Artificial Intelligence, 55, 311-365.
- (1992) Artificial Intelligence , vol.55 , pp. 311-365
- Mahadevan, S.¹ Connell, J.²

15
- 0001027894
- Transfer of learning by composing solutions of elemental sequential tasks
- Singh, S. P. (1992). Transfer of learning by composing solutions of elemental sequential tasks. Machine Learning, 8(3-4), 323-339.
- (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 323-339
- Singh, S.P.¹

16
- 0003522149
- Unpublished doctoral thesis, Cambridge University, Cambridge, UK
- Watkins, C. J. (1989). Models of delayed reinforcement learning. Unpublished doctoral thesis, Cambridge University, Cambridge, UK.
- (1989) Models of Delayed Reinforcement Learning
- Watkins, C.J.¹

17
- 34249833101
- Q-learning
- Watkins, C. J., & Dayan, P. (1992). Q-learning. Machine Learning, 8(3), 279-292.
- (1992) Machine Learning , vol.8 , Issue.3 , pp. 279-292
- Watkins, C.J.¹ Dayan, P.²

18
- 0003619736
- Unpublished doctoral thesis, University of Rochester, Rochester, NY
- Whitehead, S. D. (1992). Reinforcement learning for the adaptive control of perception and action. Unpublished doctoral thesis, University of Rochester, Rochester, NY.
- (1992) Reinforcement Learning for the Adaptive Control of Perception and Action
- Whitehead, S.D.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.