SCOPUS 정보 검색 플랫폼

IEEE International Conference on Intelligent Robots and Systems

Volumn , Issue , 2006, Pages 2656-2662

Q-RAN: A constructive reinforcement learning approach for robot behavior learning

(4) Li, Jun a Lilienthal, Achim a Martínez Marín, Tomás b Duckett, Tom c

a ÖREBRO UNIVERSITY (Sweden)

b UNIVERSITY OF ALICANTE (Spain)

c UNIVERSITY OF LINCOLN (United Kingdom)

Author keywords

[No Author keywords available]

Indexed keywords

CONTROL THEORY; DOCKING; MOBILE ROBOTS; RESOURCE ALLOCATION;

FUNCTION APPROXIMATOR; LAYERED ARCHITECTURE; RESOURCE ALLOCATING NETWORK (RAN);

REINFORCEMENT LEARNING;

EID: 34250630005 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/IROS.2006.281986 Document Type: Conference Paper

Times cited : (11)

References (22)

1
- 0031074521
- Locally weighted learning
- C. Atkeson, A. Moore, and S. Schaal. Locally weighted learning. Artificial Intelligence Review, 11(4):76-113, 1997.
- (1997) Artificial Intelligence Review , vol.11 , Issue.4 , pp. 76-113
- Atkeson, C.¹ Moore, A.² Schaal, S.³

2
- 0031272154
- An integrated architecture for learning of reactive behaviors based on dynamic cell structures
- J. Bruske, I. Ahrns, and G. Sommer. An integrated architecture for learning of reactive behaviors based on dynamic cell structures. Robotics and Autonomous Systems, 22(2):87-101, 1998.
- (1998) Robotics and Autonomous Systems , vol.22 , Issue.2 , pp. 87-101
- Bruske, J.¹ Ahrns, I.² Sommer, G.³

3
- 34250688497
- K. R. Dixon, R. J. Malak, and P. K. Khosla. Incorporating prior knowledge and previously learned information into reinforcement learning agents. Technical report, Carnegie Mellon University, 2000.
- K. R. Dixon, R. J. Malak, and P. K. Khosla. Incorporating prior knowledge and previously learned information into reinforcement learning agents. Technical report, Carnegie Mellon University, 2000.

4
- 0346242001
- PhD thesis, The Australian National University
- C. Gaskett. Q-Learning for Robot Control. PhD thesis, The Australian National University, 2002.
- (2002) Q-Learning for Robot Control
- Gaskett, C.¹

5
- 10044221078
- An efficient sequential learning algorithm for growing and pruning RBF (GAP-RBF) networks
- Dec
- G. B. Huang, P. Saratchandran, and N. Sundararajan. An efficient sequential learning algorithm for growing and pruning RBF (GAP-RBF) networks. IEEE Trans. System, Man, And Cybernetics-Part B: Cybernetics., 34(6):2284-2292, Dec. 2004.
- (2004) IEEE Trans. System, Man, And Cybernetics-Part B: Cybernetics , vol.34 , Issue.6 , pp. 2284-2292
- Huang, G.B.¹ Saratchandran, P.² Sundararajan, N.³

6
- 78751539213
- Q-learning with a growing RBF network for behavior learning in mobile robotics
- Cambridge, USA, Nov
- J. Li and T. Duckett. Q-learning with a growing RBF network for behavior learning in mobile robotics. In Proceedings of the IASTED International Conference on Robotics and Applications (RA 2005), Cambridge, USA, Nov. 2005.
- (2005) Proceedings of the IASTED International Conference on Robotics and Applications (RA 2005)
- Li, J.¹ Duckett, T.²

7
- 79960712675
- Vision-based docking for biomimetic wheeled robots
- Prague, Czech Republic, July
- I. R. Manchester and A. V. Savkin. Vision-based docking for biomimetic wheeled robots. In 16th IFAC world congress, Prague, Czech Republic, July 2005.
- (2005) 16th IFAC world congress
- Manchester, I.R.¹ Savkin, A.V.²

8
- 33846128114
- Fast reinforcement learning for vision-guided mobile robots
- Barcelona, Spain
- T. Martínez-Marín and T. Duckett. Fast reinforcement learning for vision-guided mobile robots. In Proc. IEEE International Conference on Robotics and Automation (ICRA 2005), Barcelona, Spain, 2005.
- (2005) Proc. IEEE International Conference on Robotics and Automation (ICRA 2005)
- Martínez-Marín, T.¹ Duckett, T.²

9
- 33745885802
- Using prior knowledge to improve reinforcement learning in mobile robotics
- UK
- D. L. Moreno, C. V. Regueiro, R. Iglesias, and S. Barro. Using prior knowledge to improve reinforcement learning in mobile robotics. In Towards Autonomous Robotics Systems (TAROS04), UK, 2004.
- (2004) Towards Autonomous Robotics Systems (TAROS04)
- Moreno, D.L.¹ Regueiro, C.V.² Iglesias, R.³ Barro, S.⁴

10
- 0001071040
- A resource allocating network for function interpolation
- J. Platt. A resource allocating network for function interpolation. Neural Computa., 3:213-225, 1991.
- (1991) Neural Computa , vol.3 , pp. 213-225
- Platt, J.¹

11
- 22944448066
- Sparse distributed memories for on-line value-based reinforcement learning
- B. Ratitch and D. Precup. Sparse distributed memories for on-line value-based reinforcement learning. In ECML-2004, pages 347-358, 2004.
- (2004) ECML-2004 , pp. 347-358
- Ratitch, B.¹ Precup, D.²

12
- 1942516829
- Combining TD-learning with cascade-correlation networks
- Washington DC
- F. Rivest and D. Precup. Combining TD-learning with cascade-correlation networks. In Proceedings of the Twentieth International Conference on Machine Learning (ICML-2003). Washington DC, 2003.
- (2003) Proceedings of the Twentieth International Conference on Machine Learning (ICML-2003)
- Rivest, F.¹ Precup, D.²

13
- 0032865893
- Exploration tuned reinforcement function
- J. M. Santos and C. Touzet. Exploration tuned reinforcement function. Neurocomputing, 28(1-3):93-105, 1999.
- (1999) Neurocomputing , vol.28 , Issue.1-3 , pp. 93-105
- Santos, J.M.¹ Touzet, C.²

14
- 0032041134
- Learning from innate behaviors: A quantitative evaluation of neural network controllers
- N. E. Sharkey. Learning from innate behaviors: a quantitative evaluation of neural network controllers. Machine Learning, 31:115-139, 1998.
- (1998) Machine Learning , vol.31 , pp. 115-139
- Sharkey, N.E.¹

15
- 17644403231
- The MIT Press, Cambridge, Massachusetts
- R. Siegwart and I. R. Nourbakhsh. Introduction to Autonomous Mobile Robots. The MIT Press, Cambridge, Massachusetts, 2004.
- (2004) Introduction to Autonomous Mobile Robots
- Siegwart, R.¹ Nourbakhsh, I.R.²

16
- 0036058423
- Effective reinforcement learning for mobile robots
- May 11-15
- W. D. Smart and L. P. Kaelbling. Effective reinforcement learning for mobile robots. In International Conference on Robotics and Automation, May 11-15 2002.
- (2002) International Conference on Robotics and Automation
- Smart, W.D.¹ Kaelbling, L.P.²

17
- 0004102479
- MIT Press
- R. S. Sutton and A. Barto. Reinforcement Learning, an introduction. MIT Press, 1998.
- (1998) Reinforcement Learning, an introduction
- Sutton, R.S.¹ Barto, A.²

18
- 0003205434
- Extending visual servoing techniques to nonholonomic mobile robots
- G. Hager, D. Kriegman. and S. Morse, editors, Springer-Verlag
- D. P. Tsakiris, P. Rives, and C. Samson. Extending visual servoing techniques to nonholonomic mobile robots. In G. Hager, D. Kriegman. and S. Morse, editors, The Conference of Vision and Control, Lecture Notes in Control and Information Systems. Springer-Verlag, 1998.
- (1998) The Conference of Vision and Control, Lecture Notes in Control and Information Systems
- Tsakiris, D.P.¹ Rives, P.² Samson, C.³

19
- 0004049893
- PhD thesis, University of Cambridge
- C. J. C. H. Watkins. Learning from Delayed Rewards. PhD thesis, University of Cambridge, 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.J.C.H.¹

20
- 3242674212
- Robot docking with neural vision and reinforcement
- C. Weber, S. Wermter, and A. Zochios. Robot docking with neural vision and reinforcement. Knowledge-Based Systems, 17:165-172, 2004.
- (2004) Knowledge-Based Systems , vol.17 , pp. 165-172
- Weber, C.¹ Wermter, S.² Zochios, A.³

21
- 33644576426
- Developmental robotics: Theory and experiments
- J. Weng. Developmental robotics: Theory and experiments. International Journal of Humanoid Robotics. 1(2):199-236, 2004.
- (2004) International Journal of Humanoid Robotics , vol.1 , Issue.2 , pp. 199-236
- Weng, J.¹

22
- 0000337576
- Simple statistical gradient-following algorithm for connectionist reinforcement learning
- R. J. Williams. Simple statistical gradient-following algorithm for connectionist reinforcement learning. Machine Learning, 8:229-256, 1992.
- (1992) Machine Learning , vol.8 , pp. 229-256
- Williams, R.J.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.