SCOPUS Information Retrieval Platform

IEEE International Conference on Intelligent Robots and Systems

Volume 1, 2003, Pages 430-435

A Robot that Reinforcement-Learns to Identify and Memorize Important Previous Observations

(4) Bakker, Bram a,b; Zhumatiy, Viktor a; Gruener, Gabriel c; Schmidhuber, Jürgen a

a DALLE MOLLE INSTITUTE FOR ARTIFICIAL INTELLIGENCE IDSIA (Switzerland)

b UNIVERSITY OF AMSTERDAM (Netherlands)

c CSEM (Switzerland)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER NETWORKS; EXTRAPOLATION; INFORMATION RETRIEVAL; LEARNING ALGORITHMS; MARKOV PROCESSES; NEURAL NETWORKS; ROBOT LEARNING; SENSOR DATA FUSION; TRAJECTORIES; VECTORS;

ADVANTAGE LEARNING; REINFORCEMENT LEARNING (RL) ALGORITHMS;

ROBOTICS;

EID: 0346149797
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited: 49

References (12)

1
- 84899015857
- Reinforcement learning with Long Short-Term Memory
- B. Bakker. Reinforcement learning with Long Short-Term Memory. In NIPS 14. 2002.
- (2002) NIPS , vol.14
- Bakker, B.¹

2
- 0346242070
- Reinforcement learning in partially observable mobile robot domains using unsupervised event extraction
- B. Bakker, F. Linåker, and J. Schmidhuber. Reinforcement learning in partially observable mobile robot domains using unsupervised event extraction. In Proc. IROS'02, 2002.
- (2002) Proc. IROS'02
- Bakker, B.¹ Linåker, F.² Schmidhuber, J.³

3
- 0003996286
- Multi-player residual advantage learning with general function approximation
- M. E. Harmon and L. C. Baird. Multi-player residual advantage learning with general function approximation. TR, Wright-Patterson Air Force Base, 1996.
- (1996) TR, Wright-Patterson Air Force Base
- Harmon, M.E.¹ Baird, L.C.²

4
- 0031573117
- Long Short-Term Memory
- S. Hochreiter and J. Schmidhuber. Long Short-Term Memory. Neural Computation, 9 (8):1735-1780, 1997.
- (1997) Neural Computation , vol.9 , Issue.8 , pp. 1735-1780
- Hochreiter, S.¹ Schmidhuber, J.²

5
- 0029679044
- Reinforcement learning: A survey
- L. P. Kaelbling, M. L. Littman, and A. W. Moore. Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4:237-285, 1996.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

6
- 0000123778
- Self-improving reactive agents based on reinforcement learning, planning, and teaching
- L.-J. Lin. Self-improving reactive agents based on reinforcement learning, planning, and teaching. Machine Learning, 8:293-321, 1992.
- (1992) Machine Learning , vol.8 , pp. 293-321
- Lin, L.-J.¹

7
- 0000162290
- Reinforcement learning with hidden states
- MIT Press
- L.-J. Lin and T. Mitchell. Reinforcement learning with hidden states. In Proc. of the 2nd Int. Conf. on Simulation of Adaptive Behavior. MIT Press, 1993.
- (1993) Proc. of the 2nd Int. Conf. on Simulation of Adaptive Behavior
- Lin, L.-J.¹ Mitchell, T.²

8
- 84880884411
- Mobile robot learning of delayed response tasks through event extraction: A solution to the road sign problem and beyond
- F. Linåker and H. Jacobsson. Mobile robot learning of delayed response tasks through event extraction: A solution to the road sign problem and beyond. In Proc. IJCAI'2001, 2001.
- (2001) Proc. IJCAI'2001
- Linåker, F.¹ Jacobsson, H.²

9
- 0348132947
- An optimization-based categorization of reinforcement learning environments
- MIT Press
- M. L. Littman. An optimization-based categorization of reinforcement learning environments. In Proc. of the 2nd Int. Conf. on Simulation of Adaptive Behavior. MIT Press, 1993.
- (1993) Proc. of the 2nd Int. Conf. on Simulation of Adaptive Behavior
- Littman, M.L.¹

10
- 0002242826
- Learning to use selective attention and short-term memory in sequential tasks
- R. A. McCallum. Learning to use selective attention and short-term memory in sequential tasks. In Proc. 4th Int. Conf. on Simulation of Adaptive Behavior, 1996.
- (1996) Proc. 4th Int. Conf. on Simulation of Adaptive Behavior
- McCallum, R.A.¹

11
- 0034293945
- Embedding connectionist autonomous agents in time: The road sign problem
- R. Rylatt and C. Czarnecki. Embedding connectionist autonomous agents in time: The road sign problem. Neural Processing Letters, 12:145-158, 2000.
- (2000) Neural Processing Letters , vol.12 , pp. 145-158
- Rylatt, R.¹ Czarnecki, C.²

12
- 85132026293
- Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
- Richard S. Sutton. Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. In Proc. ICML 7, 1990.
- (1990) Proc. ICML , vol.7
- Sutton, R.S.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.