SCOPUS 정보 검색 플랫폼

Volumn 61, Issue 7, 2013, Pages 694-703

Transfer learning with Partially Constrained Models: Application to reinforcement learning of linked multicomponent robot system control

(3) Fernandez Gauna, Borja a Lopez Guede, Jose Manuel a Graña, Manuel a

a UNIVERSITY OF THE BASQUE COUNTRY UPV EHU (Spain)

Author keywords

Hose transportation; Linked multicomponent robotic systems; Reinforcement learning; Transfer learning

Indexed keywords

CONSTRAINED SYSTEMS; HIERARCHICAL APPROACH; MARKOV DECISION PROCESSES; PHYSICAL CONSTRAINTS; ROBOTIC SYSTEMS; STATE-VALUE FUNCTIONS; TRANSFER LEARNING; UNDER-CONSTRAINED SYSTEMS;

HOSE; LEARNING ALGORITHMS; MARKOV PROCESSES;

REINFORCEMENT LEARNING;

EID: 84878315635 PISSN: 09218890 EISSN: None Source Type: Journal
DOI: 10.1016/j.robot.2012.07.020 Document Type: Article

Times cited : (18)

References (37)

1
- 0004102479
- MIT Press
- R.S. Sutton, and A.G. Barto Reinforcement Learning: An Introduction 1998 MIT Press
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

2
- 0006221144
- Vision-based behavior acquisition for a shooting robot by using a reinforcement learning
- M. Asada, S. Noda, S. Tawaratsumida, K. Hosoda, Vision-based behavior acquisition for a shooting robot by using a reinforcement learning, in: Proceedings of IAPR/IEEE Workshop on Visual Behaviors-1994, pp. 112-118, 1994.
- (1994) Proceedings of IAPR/IEEE Workshop on Visual Behaviors-1994 , pp. 112-118
- Asada, M.¹ Noda, S.² Tawaratsumida, S.³ Hosoda, K.⁴

3
- 0030647149
- Reinforcement learning in the multi-robot domain
- M.J. Mataric Reinforcement learning in the multi-robot domain Autonomous Robots 4 1997 73 83
- (1997) Autonomous Robots , vol.4 , pp. 73-83
- Mataric, M.J.¹

4
- 84878276986
- Modular learning systems for soccer robot
- Y. Takahashi, M. Asada, Modular learning systems for soccer robot, in: Proceedings of the Fourth International Symposium on Human and Artificial Intelligence Systems, pp. 370-375, 2004.
- (2004) Proceedings of the Fourth International Symposium on Human and Artificial Intelligence Systems , pp. 370-375
- Takahashi, Y.¹ Asada, M.²

5
- 84878320217
- Modular reinforcement learning: An application to a real robot task
- A. Birk, J. Demiris, LNCS Springer Berlin Heidelberg
- Z. Kalmar, C. Szepesvari, and A. Lorincz Modular reinforcement learning: an application to a real robot task A. Birk, J. Demiris, Learning Robots LNCS vol. 1545 1998 Springer Berlin Heidelberg 29 45
- (1998) Learning Robots , vol.1545 , pp. 29-45
- Kalmar, Z.¹ Szepesvari, C.² Lorincz, A.³

6
- 77954595557
- On the potential contributions of hybrid intelligent approaches to multicomponent robotic system development
- R.J. Duro, M. Graña, and J. de Lope On the potential contributions of hybrid intelligent approaches to multicomponent robotic system development Information Sciences 180 14 2010 2635 2648
- (2010) Information Sciences , vol.180 , Issue.14 , pp. 2635-2648
- Duro, R.J.¹ Graña, M.² De Lope, J.³

7
- 77954575291
- Linked multicomponent robotic systems: Basic assessment of linking element dynamical effect
- E. Corchado, M. Graña, A. Savio, Springer Verlag
- B. Fernandez-Gauna, J.M. Lopez-Guede, and E. Zulueta Linked multicomponent robotic systems: basic assessment of linking element dynamical effect E. Corchado, M. Graña, A. Savio, Hybrid Artificial Intelligence Systems, Part I, Vol. 6076 2010 Springer Verlag 73 79
- (2010) Hybrid Artificial Intelligence Systems, Part I, Vol. 6076 , pp. 73-79
- Fernandez-Gauna, B.¹ Lopez-Guede, J.M.² Zulueta, E.³

8
- 79952160640
- Learning hose transport control with Q-learning
- B. Fernandez-Gauna, J.M. Lopez-Guede, E. Zulueta, and M. Graña Learning hose transport control with Q-learning Neural Network World 20 7 2010 913 923
- (2010) Neural Network World , vol.20 , Issue.7 , pp. 913-923
- Fernandez-Gauna, B.¹ Lopez-Guede, J.M.² Zulueta, E.³ Graña, M.⁴

9
- 84878294079
- Modular q-learning with state-action vetoes for linked multi-component robotic systems
- (in press)
- B. Fernandez-Gauna, J.M. Lopez-Guede, M. Graña, Modular q-learning with state-action vetoes for linked multi-component robotic systems, International Journal of Applied Mathematics and Computer Science (2012) (in press).
- (2012) International Journal of Applied Mathematics and Computer Science
- Fernandez-Gauna, B.¹ Lopez-Guede, J.M.² Graña, M.³

10
- 68949157375
- Transfer learning for reinforcement learning domains: A survey
- M.E. Taylor, and P. Stone Transfer learning for reinforcement learning domains: a survey Journal of Machine Learning Research 10 1 2009 1633 1685
- (2009) Journal of Machine Learning Research , vol.10 , Issue.1 , pp. 1633-1685
- Taylor, M.E.¹ Stone, P.²

11
- 77954599219
- Linked multi-component mobile robots: Modeling, simulation and control
- Z. Echegoyen, I. Villaverde, R. Moreno, M. Graña, and A. d'Anjou Linked multi-component mobile robots: modeling, simulation and control Robotics and Autonomous Systems 58 12 2010 1292 1305
- (2010) Robotics and Autonomous Systems , vol.58 , Issue.12 , pp. 1292-1305
- Echegoyen, Z.¹ Villaverde, I.² Moreno, R.³ Graña, M.⁴ D'Anjou, A.⁵

12
- 79951649734
- Los Alamitos, CA. USA
- H. Qin, D. Terzopoulos, D-nurbs: a physics-based framework for geometric design, technical report, Los Alamitos, CA. USA, 1996.
- (1996) D-nurbs: A Physics-based Framework for Geometric Design, Technical Report
- Qin, H.¹ Terzopoulos, D.²

13
- 38649101898
- Geometrically exact dynamic splines
- A. Theetten, L. Grisoni, C. Andriot, and B. Barsky Geometrically exact dynamic splines Computer-Aided Design 40 1 2008 35 48
- (2008) Computer-Aided Design , vol.40 , Issue.1 , pp. 35-48
- Theetten, A.¹ Grisoni, L.² Andriot, C.³ Barsky, B.⁴

14
- 0003953834
- Springer-Verlag
- S.S. Antman Nonlinear Problems of Elasticity 1995 Springer-Verlag
- (1995) Nonlinear Problems of Elasticity
- Antman, S.S.¹

15
- 0003439836
- Kluwer
- M.B. Rubin Cosserat Theories: Shells, Rods and Points 2000 Kluwer
- (2000) Cosserat Theories: Shells, Rods and Points
- Rubin, M.B.¹

16
- 34249833101
- C. Watkins, P. Dayan, Technical note: Q-learning, in: Machine Learning, vol. 8, pp. 279-292, 1992.
- (1992) Technical Note: Q-learning, In: Machine Learning , vol.8 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

17
- 84942867726
- An overview of maxq hierarchical reinforcement learning
- Berthe Choueiry, Toby Walsh, Lecture Notes in Computer Science Springer Berlin Heidelberg
- T. Dietterich An overview of maxq hierarchical reinforcement learning Berthe Choueiry, Toby Walsh, Abstraction, Reformulation, and Approximation Lecture Notes in Computer Science vol. 1864 2000 Springer Berlin Heidelberg 26 44
- (2000) Abstraction, Reformulation, and Approximation , vol.1864 , pp. 26-44
- Dietterich, T.¹

18
- 22944471767
- Model approximation for hexq hierarchical reinforcement learning
- B. Hengst, Model approximation for hexq hierarchical reinforcement learning, in: ECML 2004, pp. 144-155, 2004.
- (2004) ECML 2004 , pp. 144-155
- Hengst, B.¹

19
- 38149025031
- Multi-robot cooperation based on hierarchical reinforcement learning
- X. Cheng, J. Shen, H. Liu, and G. Gu Multi-robot cooperation based on hierarchical reinforcement learning Lecture Notes in Computer Science 4489 2007 90 97
- (2007) Lecture Notes in Computer Science , vol.4489 , pp. 90-97
- Cheng, X.¹ Shen, J.² Liu, H.³ Gu, G.⁴

20
- 31144477417
- Risk-sensitive reinforcement learning applied to control under constraints
- P. Geibel, and F. Wysotzki Risk-sensitive reinforcement learning applied to control under constraints Journal of Artificial Intelligence Research 24 2005 81 108
- (2005) Journal of Artificial Intelligence Research , vol.24 , pp. 81-108
- Geibel, P.¹ Wysotzki, F.²

21
- 33750372439
- Reinforcement learning for MDPs with constraints
- Johannes Fürnkranz, Tobias Scheffer, Myra Spiliopoulou, Lecture Notes in Computer Science Springer
- P. Geibel Reinforcement learning for MDPs with constraints Johannes Fürnkranz, Tobias Scheffer, Myra Spiliopoulou, ECML Lecture Notes in Computer Science vol. 4212 2006 Springer 646 653
- (2006) ECML , vol.4212 , pp. 646-653
- Geibel, P.¹

22
- 13444290317
- Reinforcement learning with bounded risk
- Morgan Kaufmann
- P. Geibel Reinforcement learning with bounded risk Proceedings of the Eighteenth International Conference on Machine Learning 2001 Morgan Kaufmann 162 169
- (2001) Proceedings of the Eighteenth International Conference on Machine Learning , pp. 162-169
- Geibel, P.¹

23
- 85120861483
- Consideration of risk in reinforcement learning
- M. Heger, Consideration of risk in reinforcement learning, in: XI. International Machine Learning Conference, 1994.
- (1994) XI. International Machine Learning Conference
- Heger, M.¹

24
- 31844444663
- Exploration and apprenticeship learning in reinforcement learning
- P. Abbeel, A.Y. Ng, Exploration and apprenticeship learning in reinforcement learning, in: Proceedings of 21st International Conference on Machine Learning, ICML, pp. 1-8, 2005.
- (2005) Proceedings of 21st International Conference on Machine Learning, ICML , pp. 1-8
- Abbeel, P.¹ Ng, A.Y.²

25
- 79956136559
- Safe exploration for reinforcement learning
- A. Hans, D. Schneegaß, A.M. Schäfer, S. Udluft, Safe exploration for reinforcement learning, in: ESANN, pp. 143-148, 2008.
- (2008) ESANN , pp. 143-148
- Hans, A.¹ Schneegaß, D.² Schäfer, A.M.³ Udluft, S.⁴

26
- 0033170372
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- R. Sutton, D. Precup, and S. Singh Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning Artificial Intelligence 112 1999 181 211
- (1999) Artificial Intelligence , vol.112 , pp. 181-211
- Sutton, R.¹ Precup, D.² Singh, S.³

27
- 0344752303
- Training and tracking in robotics
- Morgan Kaufmann Publishers Inc. San Francisco, CA, USA
- O.G. Selfridge, R.S. Sutton, and A.G. Barto Training and tracking in robotics Proceedings of the 9th International Joint Conference on Artificial Intelligence - Volume 1 1985 Morgan Kaufmann Publishers Inc. San Francisco, CA, USA 670 672
- (1985) Proceedings of the 9th International Joint Conference on Artificial Intelligence - Volume 1 , pp. 670-672
- Selfridge, O.G.¹ Sutton, R.S.² Barto, A.G.³

28
- 56049125072
- Transfer of samples in batch reinforcement learning
- A. Lazaric, M. Restelli, A. Bonarini A., Transfer of samples in batch reinforcement learning, in: Proceedings of the 25th Annual ICML, pp. 544-551, 2008.
- (2008) Proceedings of the 25th Annual ICML , pp. 544-551
- Lazaric, A.¹ Restelli, M.² Bonarini A, A.³

29
- 58349096666
- Proto-transfer learning in Markov decision processes using spectral methods
- K. Ferguson, S. Mahadevan, Proto-transfer learning in Markov decision processes using spectral methods, in: ICML Workshop on Transfer Learning, 2006.
- (2006) ICML Workshop on Transfer Learning
- Ferguson, K.¹ Mahadevan, S.²

30
- 0036927201
- State abstraction for programmable reinforcement learning agents
- AAAI Press
- D. Andre, and S.J. Russell State abstraction for programmable reinforcement learning agents Proceedings of the Eighteenth National Conference on Artificial Intelligence 2002 AAAI Press 119 125
- (2002) Proceedings of the Eighteenth National Conference on Artificial Intelligence , pp. 119-125
- Andre, D.¹ Russell, S.J.²

31
- 51849132434
- Representation transfer for reinforcement learning
- M.E. Taylor, P. Stone, Representation transfer for reinforcement learning, in: AAAI 2007 Fall Symposium on Computational Approaches to Representation Change during Learning and Development, 2007.
- (2007) AAAI 2007 Fall Symposium on Computational Approaches to Representation Change during Learning and Development
- Taylor, M.E.¹ Stone, P.²

32
- 34547997175
- Cross-domain transfer for reinforcement learning
- M.E. Taylor, P. Stone, Cross-domain transfer for reinforcement learning, in: Proceedings of the Twenty-Fourth International Conference on Machine Learning, pp. 879-886, 2007.
- (2007) Proceedings of the Twenty-Fourth International Conference on Machine Learning , pp. 879-886
- Taylor, M.E.¹ Stone, P.²

33
- 84880803349
- Generalizing plans to new environments in relational MDPs
- C. Guestrin, D. Koller, C. Gearhart, N. Kanodia, Generalizing plans to new environments in relational MDPs, in: International Joint Conference on Artificial Intelligence, IJCAI-03, pp. 1003-1010, 2003.
- (2003) International Joint Conference on Artificial Intelligence, IJCAI-03 , pp. 1003-1010
- Guestrin, C.¹ Koller, D.² Gearhart, C.³ Kanodia, N.⁴

34
- 77953487641
- Transfer learning in reinforcement learning problems through partial policy recycling
- Springer-Verlag
- J. Ramon, K. Driessens, and T. Croonenborghs Transfer learning in reinforcement learning problems through partial policy recycling Proceedings of The 18th European Conf. on Machine Learning 2007 Springer-Verlag
- (2007) Proceedings of the 18th European Conf. on Machine Learning
- Ramon, J.¹ Driessens, K.² Croonenborghs, T.³

35
- 66149098681
- Learning relational options for inductive transfer in relational reinforcement learning
- Tom Croonenborghs, Kurt Driessens, Maurice Bruynooghe, Learning relational options for inductive transfer in relational reinforcement learning, in: Proceedings of the Seventeenth Conference on Inductive Logic Programming, 2007.
- (2007) Proceedings of the Seventeenth Conference on Inductive Logic Programming
- Croonenborghs, T.¹ Driessens, K.² Bruynooghe, M.³

36
- 84861670983
- State abstraction discovery from irrelevant state variables
- N.K. Jong, State abstraction discovery from irrelevant state variables, in: Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, pp. 752-757, 2005.
- (2005) Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence , pp. 752-757
- Jong, N.K.¹

37
- 0021412814
- Hierarchically structured systems
- M. Graña, and F.J. Torrealdea Hierarchically structured systems European Journal of Operational Research 25 1986 20 26
- (1986) European Journal of Operational Research , vol.25 , pp. 20-26
- Graña, M.¹ Torrealdea, F.J.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.