SCOPUS 정보 검색 플랫폼

ISA Transactions

Volumn 43, Issue 2, 2004, Pages 217-230

Reinforcement learning algorithms for robotic navigation in dynamic environments

(2) Yen, Gary G a Hickey, Travis W a

a Oklahoma State University (United States)

Author keywords

Dynamic environment; Navigation; Obstacle avoidance; Reinforcement learning

Indexed keywords

COLLISION AVOIDANCE; COMPUTATION THEORY; COMPUTER SIMULATION; FUZZY CONTROL; HIERARCHICAL SYSTEMS; LEARNING ALGORITHMS; NAVIGATION;

DYNAMIC ENVIRONMENT; HIERARCHICAL STRUCTURE; REINFORCEMENT LEARNING;

ROBOTICS;

ALGORITHM; ARTICLE; ARTIFICIAL INTELLIGENCE; AUTOMATED PATTERN RECOGNITION; COMPUTER SIMULATION; ENVIRONMENT; EVALUATION; LEARNING; LOCOMOTION; METHODOLOGY; ORIENTATION; PHYSIOLOGY; REINFORCEMENT; ROBOTICS; THEORETICAL MODEL;

ALGORITHMS; ARTIFICIAL INTELLIGENCE; COMPUTER SIMULATION; ENVIRONMENT; LEARNING; LOCOMOTION; MODELS, THEORETICAL; ORIENTATION; PATTERN RECOGNITION, AUTOMATED; REINFORCEMENT (PSYCHOLOGY); ROBOTICS;

EID: 2142647859 PISSN: 00190578 EISSN: None Source Type: Journal
DOI: 10.1016/s0019-0578(07)60032-9 Document Type: Article

Times cited : (32)

References (22)

1
- 0004049893
- Ph.D dissertation, Cambridge University, Cambridge, England
- Watkins, C. J. C. H., Learning from Delayed Rewards. Ph.D dissertation, Cambridge University, Cambridge, England, 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.J.C.H.¹

2
- 0004102479
- MIT Press, Cambridge, MA
- Sutton, R. S. and Barto, A. G., Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

3
- 0025529853
- Advances in reinforcement learning and their implications for intelligent control
- Whitehead, S. D., Sutton, R. S., and Ballard, D. H., Advances in reinforcement learning and their implications for intelligent control. Proceedings of IEEE International Symposium on Intelligent Control, 1990, pp. 1289-1297.
- (1990) Proceedings of IEEE International Symposium on Intelligent Control , pp. 1289-1297
- Whitehead, S.D.¹ Sutton, R.S.² Ballard, D.H.³

4
- 0029276036
- Temporal difference learning and TD-Gammon
- Tesauro, G. J., Temporal difference learning and TD-Gammon. Commun. ACM 38, 58-68 (1995).
- (1995) Commun. ACM , vol.38 , pp. 58-68
- Tesauro, G.J.¹

5
- 0033347508
- A dynamic channel assignment policy through Q-learning
- Nie, J. and Haykin, S., A dynamic channel assignment policy through Q-learning. IEEE Trans. Neural Netw. 10, 1443-1455 (1999).
- (1999) IEEE Trans. Neural Netw. , vol.10 , pp. 1443-1455
- Nie, J.¹ Haykin, S.²

6
- 0029277469
- A sensor-based navigation for a mobile robot using fuzzy logic and reinforcement learning
- Beom, H. R. and Cho, H. S., A sensor-based navigation for a mobile robot using fuzzy logic and reinforcement learning. IEEE Trans. Syst. Man Cybern. 25, 464-477 (1995).
- (1995) IEEE Trans. Syst. Man Cybern. , vol.25 , pp. 464-477
- Beom, H.R.¹ Cho, H.S.²

7
- 0032289291
- Dynamical categories and control policy selection
- Coelho, J. A., Araujo, E. G., Huber, M., and Grupen, R. A., Dynamical categories and control policy selection. Proceedings of IEEE International Symposium on Intelligent Control, 1998, pp. 459-464.
- (1998) Proceedings of IEEE International Symposium on Intelligent Control , pp. 459-464
- Coelho, J.A.¹ Araujo, E.G.² Huber, M.³ Grupen, R.A.⁴

8
- 0016873783
- The apparent conflict between estimation and control - A survey of the two-armed problem
- Wirten, I. H., The apparent conflict between estimation and control - A survey of the two-armed problem. J. Franklin Inst. 301, 161-189 (1976).
- (1976) J. Franklin Inst. , vol.301 , pp. 161-189
- Wirten, I.H.¹

9
- 0034874034
- A framework for the adaptive transfer of robot skill knowledge using reinforcement learning agents
- Malak, R. J. and Khosla, P. K., A framework for the adaptive transfer of robot skill knowledge using reinforcement learning agents. Proceedings of IEEE International Conference on Robotics and Automation, 2001, pp. 1994-2001.
- (2001) Proceedings of IEEE International Conference on Robotics and Automation , pp. 1994-2001
- Malak, R.J.¹ Khosla, P.K.²

10
- 0003487482
- Athena Scientific, Belmont, MA
- Bertsekas, D. P. and Tsitsiklis, J. N., Neural Dynamic Programming. Athena Scientific, Belmont, MA, 1996.
- (1996) Neural Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

11
- 0004280606
- MIT Press, Cambridge, MA
- Kaelbling, L. P., Learning in Embedded Systems. MIT Press, Cambridge, MA, 1993.
- (1993) Learning in Embedded Systems
- Kaelbling, L.P.¹

12
- 2142764562
- Sutton, R. S., editor, A Special Issue of Machine Learning on Reinforcement Learning, Volume 8. Machine Learning, 1992, Also published as Reinforcement Learning, Kluwer Academic Press, Boston, MA, 1992.
- (1992) A Special Issue of Machine Learning on Reinforcement Learning, Volume 8. Machine Learning , vol.8
- Sutton, R.S.¹

13
- 0004007508
- Kluwer Academic Press, Boston, MA
- Sutton, R. S., editor, A Special Issue of Machine Learning on Reinforcement Learning, Volume 8. Machine Learning, 1992, Also published as Reinforcement Learning, Kluwer Academic Press, Boston, MA, 1992.
- (1992) Reinforcement Learning

14
- 2142656526
- Kaelbling, L. P., editor A Special Issue of Machine Learning on Reinforcement Learning, Vol. 22, 1996.
- (1996) A Special Issue of Machine Learning on Reinforcement Learning , vol.22
- Kaelbling, L.P.¹

15
- 0033307299
- Reactive navigation of a mobile robot using a hierarchical set of learning agents
- Davesne, F. and Barret, C., Reactive navigation of a mobile robot using a hierarchical set of learning agents. Proceedings of Intelligent Robots and Systems Conference, 1999, pp. 482-487.
- (1999) Proceedings of Intelligent Robots and Systems Conference , pp. 482-487
- Davesne, F.¹ Barret, C.²

16
- 0034449143
- Fuzzy landmark-based localization for a legged robot
- Buschka, P., Saffiotti, A., and Wasik, Z., Fuzzy landmark-based localization for a legged robot. Proceedings of Intelligent Robots and Systems Conference, 2000, pp. 1205-1210.
- (2000) Proceedings of Intelligent Robots and Systems Conference , pp. 1205-1210
- Buschka, P.¹ Saffiotti, A.² Wasik, Z.³

17
- 0033279889
- Reactive navigation in dynamic environment using a multisensor predictor
- Song, K. T. and Chang, C. C., Reactive navigation in dynamic environment using a multisensor predictor. IEEE Trans. Syst. Man Cybern. 29, 870-880 (1999).
- (1999) IEEE Trans. Syst. Man Cybern. , vol.29 , pp. 870-880
- Song, K.T.¹ Chang, C.C.²

18
- 0032287655
- A neuro-fuzzy controller for mobile robot navigation and multirobot convoying
- Ng, K. C. and Trivedi, M. M., A neuro-fuzzy controller for mobile robot navigation and multirobot convoying. IEEE Trans. Syst. Man Cybern. 28, 829-840 (1998).
- (1998) IEEE Trans. Syst. Man Cybern. , vol.28 , pp. 829-840
- Ng, K.C.¹ Trivedi, M.M.²

19
- 0003584577
- Prentice Hall, Upper Saddle River, NJ
- Russell, S. and Norvig, P., Artificial Intelligence: A Modern Approach. Prentice Hall, Upper Saddle River, NJ, 1995.
- (1995) Artificial Intelligence: A Modern Approach
- Russell, S.¹ Norvig, P.²

20
- 0002351106
- An empirical investigation of optimization in dynamic environments using the cellular genetic algorithm
- Kirley, M. and Green, D. G., An empirical investigation of optimization in dynamic environments using the cellular genetic algorithm. Proceedings of the Genetic and Evolutionary Computation Conference, 2000, pp. 11-18.
- (2000) Proceedings of the Genetic and Evolutionary Computation Conference , pp. 11-18
- Kirley, M.¹ Green, D.G.²

21
- 0033692820
- Active multimodel control for dynamic maneuver optimization in unmanned air vehicles
- Godbole, D., Samad, T., and Gopal, V., Active multimodel control for dynamic maneuver optimization in unmanned air vehicles. Proceedings of IEEE International Conference on Robotics and Automation, 2000, pp. 1257-1262.
- (2000) Proceedings of IEEE International Conference on Robotics and Automation , pp. 1257-1262
- Godbole, D.¹ Samad, T.² Gopal, V.³

22
- 85012688561
- Princeton University Press, Princeton, NJ
- Bellman, R. E., Dynamic Programming. Princeton University Press, Princeton, NJ, 1957.
- (1957) Dynamic Programming
- Bellman, R.E.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.