SCOPUS Information Retrieval Platform

Volume 21, Issue 10, 2007, Pages 1215-1229

Reinforcement learning of a continuous motor sequence with hidden states

Author keywords

Actor critic method; Pendulum swing up; Perceptual aliasing problem; Recurrent neural network; Reinforcement learning

Indexed keywords

ALGORITHMS; BEHAVIORAL RESEARCH; NEUROLOGY; PARAMETER ESTIMATION; ACTOR-CRITIC METHOD; PENDULUM SWING-UP; PERCEPTUAL ALIASING PROBLEM; RECURRENT NEURAL NETWORK; REINFORCEMENT LEARNING

EID: 34547543601 PISSN: 01691864 EISSN: 15685535 Source Type: Journal
DOI: 10.1163/156855307781389365 Document Type: Article

Times cited: 13

References (18)

1
- 23144448134
- Novelty and reinforcement learning in the value system of developmental robots
- X. Huang and J. Weng, Novelty and reinforcement learning in the value system of developmental robots, in: Proc. 2nd Int. Workshop on Epigenetic Robotics (2002).
- (2002) Proc. 2nd Int. Workshop on Epigenetic Robotics
- Huang, X.¹ Weng, J.²

2
- 0033148990
- Cooperative behavior acquisition for mobile robots in dynamically changing real worlds via vision-based reinforcement learning and development
- M. Asada, E. Uchibe and K. Hosoda, Cooperative behavior acquisition for mobile robots in dynamically changing real worlds via vision-based reinforcement learning and development, Artif. Intell. 110, 275-292 (1999).
- (1999) Artif. Intell , vol.110 , pp. 275-292
- Asada, M.¹ Uchibe, E.² Hosoda, K.³

3
- 0000123778
- Self-improving reactive agents based on reinforcement learning
- L.-J. Lin, Self-improving reactive agents based on reinforcement learning, Machine Learn. 8, 293-321 (1992).
- (1992) Machine Learn , vol.8 , pp. 293-321
- Lin, L.-J.¹

4
- 0020970738
- Neuronlike adaptive elements that can solve difficult learning control problems
- A. G. Barto, R. S. Sutton and C. W. Anderson, Neuronlike adaptive elements that can solve difficult learning control problems, IEEE Trans. Syst. Man Cybernet. 13, 834-846 (1983).
- (1983) IEEE Trans. Syst. Man Cybernet , vol.13 , pp. 834-846
- Barto, A.G.¹ Sutton, R.S.² Anderson, C.W.³

5
- 3242752134
- Evolving the neural controller for a robotic arm able to grasp objects on the basis of tactile sensors
- R. Bianco and S. Nolfi, Evolving the neural controller for a robotic arm able to grasp objects on the basis of tactile sensors, Adapt. Behav. 12, 37-45 (2004).
- (2004) Adapt. Behav , vol.12 , pp. 37-45
- Bianco, R.¹ Nolfi, S.²

6
- 0025600638
- A stochastic reinforcement learning algorithm for learning real-valued functions
- V. Gullapalli, A stochastic reinforcement learning algorithm for learning real-valued functions, Neural Networks 3, 671-692 (1990).
- (1990) Neural Networks , vol.3 , pp. 671-692
- Gullapalli, V.¹

7
- 0033629916
- Reinforcement learning in continuous time and space
- K. Doya, Reinforcement learning in continuous time and space, Neural Comput. 12, 219-245 (2000).
- (2000) Neural Comput , vol.12 , pp. 219-245
- Doya, K.¹

8
- 0000162290
- Reinforcement learning with hidden state
- L.-J. Lin and T. M. Mitchell, Reinforcement learning with hidden state, in: Proc. 2nd Int. Conf. on Simulation of Adaptive Behavior (1993).
- (1993) Proc. 2nd Int. Conf. on Simulation of Adaptive Behavior
- Lin, L.-J.¹ Mitchell, T.M.²

9
- 0030164858
- Model-based learning for mobile robot navigation from the dynamical system perspective
- J. Tani, Model-based learning for mobile robot navigation from the dynamical system perspective, IEEE Trans. Syst. Man Cybernet. B 26, 421-436 (1996).
- (1996) IEEE Trans. Syst. Man Cybernet. B , vol.26 , pp. 421-436
- Tani, J.¹

10
- 0003932121
- PhD Thesis, University of Rochester
- A. Kachites McCallum, Reinforcement learning with selective perception and hidden state, PhD Thesis, University of Rochester (1995).
- (1995) Reinforcement learning with selective perception and hidden state
- Kachites McCallum, A.¹

11
- 33847202724
- Learning to predict by the methods of temporal difference
- R. S. Sutton, Learning to predict by the methods of temporal difference, Machine Learn. 3, 9-44 (1988).
- (1988) Machine Learn , vol.3 , pp. 9-44
- Sutton, R.S.¹

13
- 0001202594
- A learning algorithm for continually running fully recurrent neural networks
- R. J. Williams and D. Zipser, A learning algorithm for continually running fully recurrent neural networks, Neural Comput. 1, 270-280 (1989).
- (1989) Neural Comput , vol.1 , pp. 270-280
- Williams, R.J.¹ Zipser, D.²

14
- 44049116478
- Forward models: Supervised learning with a distal teacher
- M. I. Jordan and D. E. Rumelhart, Forward models: supervised learning with a distal teacher, Cognitive Sci. 16, 307-354 (1992).
- (1992) Cognitive Sci , vol.16 , pp. 307-354
- Jordan, M.I.¹ Rumelhart, D.E.²

16
- 0032220772
- An interpretation of the "self" from the dynamical system perspective: A constructivist approach
- J. Tani, An interpretation of the "self" from the dynamical system perspective: a constructivist approach, Consciousness Studies 5 (1998).
- (1998) Consciousness Studies , vol.5
- Tani, J.¹

17
- 0344154963
- Strategy learning with multilayer connectionist representations
- C. W. Anderson, Strategy learning with multilayer connectionist representations, in: Proc. 4th Int. Workshop on Machine Learning, pp. 103-114 (1987).
- (1987) Proc. 4th Int. Workshop on Machine Learning , pp. 103-114
- Anderson, C.W.¹

18
- 0028392483
- Learning long-term dependencies with gradient descent is difficult
- Y. Bengio, P. Simard and P. Frasconi, Learning long-term dependencies with gradient descent is difficult, IEEE Trans. Neural Networks 5, 157-166 (1994).
- (1994) IEEE Trans. Neural Networks , vol.5 , pp. 157-166
- Bengio, Y.¹ Simard, P.² Frasconi, P.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.