



Volume 24, Issue 1-2, 1998, Pages 17-32

Design, analysis and comparison of robot learners

Author keywords

Experimental methodology; Q learning; Reinforcement learning; Robot learning

Indexed keywords

COMPUTATIONAL COMPLEXITY; DECISION THEORY; LEARNING ALGORITHMS;

EID: 0032131047     PISSN: 09218890     EISSN: None     Source Type: Journal
DOI: 10.1016/S0921-8890(98)00019-0     Document Type: Article
Times cited: 4

References (20)
  • 2
    • 0002355083
    • Connectionist learning for control
    • W.T. Miller, R.S. Sutton, P.J. Werbos (Eds.), MIT Press, Cambridge, MA
    • A.G. Barto, Connectionist learning for control, in: W.T. Miller, R.S. Sutton, P.J. Werbos (Eds.), Neural Networks for Control, MIT Press, Cambridge, MA, 1990, pp. 5-58.
    • (1990) Neural Networks for Control, pp. 5-58
    • Barto, A.G.
  • 7
    • 84885587394
    • Benchmarks for mobile robotics?
    • Manchester University, School of Computer Science, Available as Technical Report UMCS-97-9-1
    • J. Hallam, G. Hayes, Benchmarks for mobile robotics? in: Towards Intelligent Mobile Robots: Scientific methods in mobile robotics, Manchester University, School of Computer Science, Available as Technical Report UMCS-97-9-1, 1997.
    • (1997) Towards Intelligent Mobile Robots: Scientific Methods in Mobile Robotics
    • Hallam, J.; Hayes, G.
  • 8
    • 0346389056
    • Robolearn 97: An international workshop on evaluating robot learning
    • Department of Computer Science, State University of New York at Buffalo, April
    • H. Hexmoor, Robolearn 97: An international workshop on evaluating robot learning, Technical Report TR 97-03, Department of Computer Science, State University of New York at Buffalo, April 1997.
    • (1997) Technical Report TR 97-03
    • Hexmoor, H.
  • 9
    • 0346389059
    • Unpublished Masters Thesis, University of Edinburgh, Department of Artificial Intelligence, September
    • J. Hoar, Reinforcement learning applied to a real robot task, Unpublished Masters Thesis, University of Edinburgh, Department of Artificial Intelligence, September 1996.
    • (1996) Reinforcement Learning Applied to a Real Robot Task
    • Hoar, J.
  • 10
    • 0004280606
    • Ph.D. Thesis, Department of Computer Science, Stanford
    • L.P. Kaelbling, Learning in embedded systems, Ph.D. Thesis, Department of Computer Science, Stanford, 1990.
    • (1990) Learning in Embedded Systems
    • Kaelbling, L.P.
  • 12
    • 84957798150
    • Evaluation of learning performance of situated embodied agents
    • Morgan Kaufmann, Los Altos, CA
    • M. Mataric, Evaluation of learning performance of situated embodied agents, in: Proceedings of the Third European Conference on Artificial Life, Morgan Kaufmann, Los Altos, CA, 1995, pp. 579-589.
    • (1995) Proceedings of the Third European Conference on Artificial Life, pp. 579-589
    • Mataric, M.
  • 14
    • 0029753630
    • Reinforcement learning with replacing eligibility traces
    • S. Singh, R. Sutton, Reinforcement learning with replacing eligibility traces, Machine Learning 22 (1996) 123-158.
    • (1996) Machine Learning, vol. 22, pp. 123-158
    • Singh, S.; Sutton, R.
  • 15
    • 0003617454
    • Ph.D. Thesis, University of Massachusetts, School of Computer and Information Sciences
    • R.S. Sutton, Temporal credit assignment in reinforcement learning, Ph.D. Thesis, University of Massachusetts, School of Computer and Information Sciences, 1984.
    • (1984) Temporal Credit Assignment in Reinforcement Learning
    • Sutton, R.S.
  • 17
    • 0004049893
    • Thesis, University of Cambridge, King's College, Cambridge, UK, May
    • C.J.C.H. Watkins, Learning from delayed rewards, Thesis, University of Cambridge, King's College, Cambridge, UK, May 1989.
    • (1989) Learning from Delayed Rewards
    • Watkins, C.J.C.H.
  • 18
    • 0346389058
    • Learning to perceive and act
    • Department of Computer Science, University of Rochester, June
    • S.D. Whitehead, D. Ballard, Learning to perceive and act, Technical Report TR-331 (revised), Department of Computer Science, University of Rochester, June 1990.
    • (1990) Technical Report TR-331 (Revised)
    • Whitehead, S.D.; Ballard, D.
  • 19
    • 0029250080
    • Reinforcement learning in non-Markov decision processes
    • S. Whitehead, L.-J. Lin, Reinforcement learning in non-Markov decision processes, Artificial Intelligence 73 (1995) 271-306.
    • (1995) Artificial Intelligence, vol. 73, pp. 271-306
    • Whitehead, S.; Lin, L.-J.
  • 20
    • 0347019040
    • Investigating the behaviour of Q(λ)
    • Department of Artificial Intelligence, Edinburgh University, January, Presented at the IEE Colloquia on Self Learning Robots, February 12, 1996, London
    • J. Wyatt, G. Hayes, J. Hallam, Investigating the behaviour of Q(λ), Technical Report 783, Department of Artificial Intelligence, Edinburgh University, January 1996, Presented at the IEE Colloquia on Self Learning Robots, February 12, 1996, London.
    • (1996) Technical Report, vol. 783
    • Wyatt, J.; Hayes, G.; Hallam, J.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.