Volume , Issue , 2006, Pages 2656-2662

Q-RAN: A constructive reinforcement learning approach for robot behavior learning

Author keywords

[No Author keywords available]

Indexed keywords

CONTROL THEORY; DOCKING; MOBILE ROBOTS; RESOURCE ALLOCATION

EID: 34250630005     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IROS.2006.281986     Document Type: Conference Paper
Times cited: 11

References (22)
  • 2
    • 0031272154
    • An integrated architecture for learning of reactive behaviors based on dynamic cell structures
    • J. Bruske, I. Ahrns, and G. Sommer. An integrated architecture for learning of reactive behaviors based on dynamic cell structures. Robotics and Autonomous Systems, 22(2):87-101, 1998.
    • (1998) Robotics and Autonomous Systems , vol.22 , Issue.2 , pp. 87-101
    • Bruske, J.1    Ahrns, I.2    Sommer, G.3
  • 3
    • 34250688497
    • K. R. Dixon, R. J. Malak, and P. K. Khosla. Incorporating prior knowledge and previously learned information into reinforcement learning agents. Technical report, Carnegie Mellon University, 2000.
  • 4
    • 0346242001
    • PhD thesis, The Australian National University
    • C. Gaskett. Q-Learning for Robot Control. PhD thesis, The Australian National University, 2002.
    • (2002) Q-Learning for Robot Control
    • Gaskett, C.1
  • 7
    • 79960712675
    • Vision-based docking for biomimetic wheeled robots
    • Prague, Czech Republic, July
    • I. R. Manchester and A. V. Savkin. Vision-based docking for biomimetic wheeled robots. In 16th IFAC world congress, Prague, Czech Republic, July 2005.
    • (2005) 16th IFAC world congress
    • Manchester, I.R.1    Savkin, A.V.2
  • 10
    • 0001071040
    • A resource allocating network for function interpolation
    • J. Platt. A resource allocating network for function interpolation. Neural Computation, 3:213-225, 1991.
    • (1991) Neural Computation , vol.3 , pp. 213-225
    • Platt, J.1
  • 11
    • 22944448066
    • Sparse distributed memories for on-line value-based reinforcement learning
    • B. Ratitch and D. Precup. Sparse distributed memories for on-line value-based reinforcement learning. In ECML-2004, pages 347-358, 2004.
    • (2004) ECML-2004 , pp. 347-358
    • Ratitch, B.1    Precup, D.2
  • 13
    • 0032865893
    • Exploration tuned reinforcement function
    • J. M. Santos and C. Touzet. Exploration tuned reinforcement function. Neurocomputing, 28(1-3):93-105, 1999.
    • (1999) Neurocomputing , vol.28 , Issue.1-3 , pp. 93-105
    • Santos, J.M.1    Touzet, C.2
  • 14
    • 0032041134
    • Learning from innate behaviors: A quantitative evaluation of neural network controllers
    • N. E. Sharkey. Learning from innate behaviors: a quantitative evaluation of neural network controllers. Machine Learning, 31:115-139, 1998.
    • (1998) Machine Learning , vol.31 , pp. 115-139
    • Sharkey, N.E.1
  • 20
    • 3242674212
    • Robot docking with neural vision and reinforcement
    • C. Weber, S. Wermter, and A. Zochios. Robot docking with neural vision and reinforcement. Knowledge-Based Systems, 17:165-172, 2004.
    • (2004) Knowledge-Based Systems , vol.17 , pp. 165-172
    • Weber, C.1    Wermter, S.2    Zochios, A.3
  • 21
    • 33644576426
    • Developmental robotics: Theory and experiments
    • J. Weng. Developmental robotics: Theory and experiments. International Journal of Humanoid Robotics, 1(2):199-236, 2004.
    • (2004) International Journal of Humanoid Robotics , vol.1 , Issue.2 , pp. 199-236
    • Weng, J.1
  • 22
    • 0000337576
    • Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • R. J. Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8:229-256, 1992.
    • (1992) Machine Learning , vol.8 , pp. 229-256
    • Williams, R.J.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.