SCOPUS 정보 검색 플랫폼

Adaptive Behavior

Volumn 16, Issue 6, 2008, Pages 400-412

Co-evolution of shaping rewards and meta-parameters in reinforcement learning

(4) Elfwing, Stefan a,b Uchibe, Eiji b Doya, Kenji b Christensen, Henrik I a

a ROYAL INSTITUTE OF TECHNOLOGY (Sweden)

b OKINAWA INSTITUTE OF SCIENCE AND TECHNOLOGY GRADUATE UNIVERSITY (Japan)

Author keywords

Genetic algorithms; Meta parameters; Reinforcement learning; Shaping rewards

Indexed keywords

EID: 55949119833 PISSN: 10597123 EISSN: 17412633 Source Type: Journal
DOI: 10.1177/1059712308092835 Document Type: Article

Times cited : (24)

References (26)

1
- 0000500817
- Interactions between learning and evolution
- C. G. Langton, C. Taylor, C. D. Farmer, & S. Rasmussen (Eds.), Redwood City, CA: Addison-Wesley.
- Ackley, D.H., & Littman, M.L. (1991). Interactions between learning and evolution. In C. G. Langton, C. Taylor, C. D. Farmer, & S. Rasmussen (Eds.), Artificial Life II: Santa Fe Institute Studies in the Sciences of Complexity (Vol. 10, pp. 487-509). Redwood City, CA: Addison-Wesley.
- (1991) Artificial Life II: Santa Fe Institute Studies in the Sciences of Complexity , pp. 487-509
- Ackley, D.H.¹ Littman, M.L.²

2
- 0003886788
- 5th ed.). Englewood Cliffs, NJ: Prentice-Hall.
- Bower, G.H., & Hilgard, E.R. (1981). Theories of learning (5 th ed.). Englewood Cliffs, NJ: Prentice-Hall.
- (1981) Theories of Learning
- Bower, G.H.¹ Hilgard, E.R.²

3
- 21344434798
- The cyber rodent project: Exploration of adaptive mechanisms for self-preservation and self-reproduction
- Doya, K., & Uchibe, E. (2005). The cyber rodent project: Exploration of adaptive mechanisms for self-preservation and self-reproduction. Adaptive Behavior, 13 (2). 149-160.
- (2005) Adaptive Behavior , vol.13 , Issue.2 , pp. 149-160
- Doya, K.¹ Uchibe, E.²

4
- 27144533954
- Biologically inspired embodied evolution of survival
- Piscataway, NJ: IEEE.
- Elfwing, S., Uchibe, U., Doya, K., & Christensen, H.I. (2005). Biologically inspired embodied evolution of survival. In Proceedings of the IEEE Congress on Evolutionary Computation, CEC2005 (Vol. 3, pp. 2210-2216). Piscataway, NJ: IEEE.
- (2005) Proceedings of the IEEE Congress on Evolutionary Computation, CEC2005 , pp. 2210-2216
- Elfwing, S.¹ Uchibe, U.² Doya, K.³ Christensen, H.I.⁴

5
- 55949130483
- Elfwing, S., Uchibe, U., Doya, K., & Christensen, H.I. (2007 a). Darwinian embodied evolution of the learning ability for survival. Manuscript submitted for publication.
- (2007) Darwinian Embodied Evolution of the Learning Ability for Survival
- Elfwing, S.¹ Uchibe, U.² Doya, K.³ Christensen, H.I.⁴

6
- 34047255425
- Evolutionary development of hierarchical learning structures
- Elfwing, S., Uchibe, U., Doya, K., & Christensen, H.I. (2007 b). Evolutionary development of hierarchical learning structures. IEEE Transactions on Evolutionary Computation, 11 (2). 249-264.
- (2007) IEEE Transactions on Evolutionary Computation , vol.11 , Issue.2 , pp. 249-264
- Elfwing, S.¹ Uchibe, U.² Doya, K.³ Christensen, H.I.⁴

7
- 0346149798
- Evolution of meta-parameters in reinforcement learning algorithm
- Piscataway, NJ: IEEE.
- Eriksson, A., Capi, G., & Doya, K. (2003). Evolution of meta-parameters in reinforcement learning algorithm. In Proceedings of the IEEE/RSJ Conference on Intelligent Robots and Systems, IROS2003 (pp. 412-417). Piscataway, NJ: IEEE.
- (2003) Proceedings of the IEEE/RSJ Conference on Intelligent Robots and Systems, IROS2003 , pp. 412-417
- Eriksson, A.¹ Capi, G.² Doya, K.³

8
- 33749243349
- Autonomous shaping: Knowledge transfer in reinforcement learning
- New York: ACM.
- Konidaris, G., & Barto, A. (2006). Autonomous shaping: Knowledge transfer in reinforcement learning. In Proceedings of the International Conference on Machine Learning, ICML2006 (pp. 489-496). New York: ACM.
- (2006) Proceedings of the International Conference on Machine Learning, ICML2006 , pp. 489-496
- Konidaris, G.¹ Barto, A.²

9
- 1942482706
- Reinforcement learning and shaping: Encouraging intended behaviors
- San Francisco, CA: Morgan Kaufmann.
- Laud, A., & DeJong, G. (2002). Reinforcement learning and shaping: Encouraging intended behaviors. In Proceedings of the International Conference on Machine Learning, ICML2002 (pp. 355-362). San Francisco, CA: Morgan Kaufmann.
- (2002) Proceedings of the International Conference on Machine Learning, ICML2002 , pp. 355-362
- Laud, A.¹ Dejong, G.²

10
- 1942484890
- The influence of reward on the speed of reinforcement learning: An analysis of shaping
- San Francisco, CA: Morgan Kaufmann.
- Laud, A., & DeJong, G. (2003). The influence of reward on the speed of reinforcement learning: An analysis of shaping. In Proceedings of the International Conference on Machine Learning, ICML2003 (pp. 440-447). San Francisco, CA: Morgan Kaufmann.
- (2003) Proceedings of the International Conference on Machine Learning, ICML2003 , pp. 440-447
- Laud, A.¹ Dejong, G.²

11
- 34547964974
- Automatic shaping and decomposition of reward functions
- New York: ACM.
- Marthi, B. (2007). Automatic shaping and decomposition of reward functions. In Proceedings of the International Conference on Machine Learning, ICML2007 (pp. 601-608). New York: ACM.
- (2007) Proceedings of the International Conference on Machine Learning, ICML2007 , pp. 601-608
- Marthi, B.¹

12
- 84957895797
- Reward functions for accelerated learning
- New York: ACM.
- Mataric, M. (1994). Reward functions for accelerated learning. In Proceedings of the International Conference on Machine Learning, ICML1994 (pp. 181-189). New York: ACM.
- (1994) Proceedings of the International Conference on Machine Learning, ICML1994 , pp. 181-189
- Mataric, M.¹

13
- 0030647149
- Reinforcement learning in the multirobot domain
- Mataric, M. (1997). Reinforcement learning in the multirobot domain. Autonomous Robots, 4 (1). 73-83.
- (1997) Autonomous Robots , vol.4 , Issue.1 , pp. 73-83
- Mataric, M.¹

14
- 0004156494
- Evolutionary algorithms for reinforcement learning
- Moriarty, D.E., Schultz, A.C., & Grefenstette, J.J. (1999). Evolutionary algorithms for reinforcement learning. Journal of Artificial Intelligence Research, 11, 241-276.
- (1999) Journal of Artificial Intelligence Research , vol.11 , pp. 241-276
- Moriarty, D.E.¹ Schultz, A.C.² Grefenstette, J.J.³

15
- 0141596576
- Policy invariance under reward transformations: Theory and application to reward shaping
- San Francisco, CA: Morgan Kaufmann.
- Ng, A.Y., Harada, D., & Russell, S.J. (1999). Policy invariance under reward transformations: Theory and application to reward shaping. In Proceedings of the International Conference on Machine Learning, ICML1999 (pp. 278-287). San Francisco, CA: Morgan Kaufmann.
- (1999) Proceedings of the International Conference on Machine Learning, ICML1999 , pp. 278-287
- Ng, A.Y.¹ Harada, D.² Russell, S.J.³

16
- 0003655029
- Cambridge, MA: MIT Press.
- Nolfi, S., & Floreano, D. (2000). Evolutionary robotics. The biology, intelligence, and technology of self-organizing machines. Cambridge, MA: MIT Press.
- (2000) Evolutionary Robotics. the Biology, Intelligence, and Technology of Self-organizing Machines
- Nolfi, S.¹ Floreano, D.²

17
- 1642401055
- Learning to drive a bicycle using reinforcement learning and shaping
- San Francisco, CA: Morgan Kaufmann.
- Randløv, J., & Alstrøm, P. (1998). Learning to drive a bicycle using reinforcement learning and shaping. In Proceedings of the International Conference on Machine Learning, ICML1998. San Francisco, CA: Morgan Kaufmann.
- (1998) Proceedings of the International Conference on Machine Learning, ICML1998
- Randløv, J.¹ Alstrøm, P.²

18
- 0003636089
- Cambridge University Engineering Department.
- Rummery, G.A., & Niranjan, M. (1994). On-line Q-learning using connectionist systems. Technical Report CUED/F-INFENG/ TR 166, Cambridge University Engineering Department.
- (1994) On-line Q-learning Using Connectionist Systems. Technical Report CUED/F-INFENG/ TR 166
- Rummery, G.A.¹ Niranjan, M.²

19
- 0029753630
- Reinforcement learning with replacing eligibility traces
- Singh, S.P., & Sutton, R.S. (1996). Reinforcement learning with replacing eligibility traces. Machine Learning, 22 (1-3). 123-158.
- (1996) Machine Learning , vol.22 , Issue.1-3 , pp. 123-158
- Singh, S.P.¹ Sutton, R.S.²

20
- 0003880401
- Englewood Cliffs, NJ: Prentice-Hall.
- Skinner, B.F. (1938). The behavior of organisms: An experimental analysis. Englewood Cliffs, NJ: Prentice-Hall.
- (1938) The Behavior of Organisms: An Experimental Analysis
- Skinner, B.F.¹

21
- 0042125798
- Effcient reinforcement learning through evolving neural network topologies
- San Francisco, CA: Morgan Kaufmann.
- Stanley, K.O., & Miikkulainen, R. (2002). Effcient reinforcement learning through evolving neural network topologies. In Proceedings of the Genetic and Evolutionary Computation Conference, GECCO2002 (pp. 569-577). San Francisco, CA: Morgan Kaufmann.
- (2002) Proceedings of the Genetic and Evolutionary Computation Conference, GECCO2002 , pp. 569-577
- Stanley, K.O.¹ Miikkulainen, R.²

22
- 85156221438
- Generalization in reinforcement learning: Successful examples using sparse coarse coding
- D. S. Touretzky, M. C. Mozer, & M. E. Hasselmo (Eds.), Cambridge, MA: MIT Press.
- Sutton, R.S. (1996). Generalization in reinforcement learning: Successful examples using sparse coarse coding. In D. S. Touretzky, M. C. Mozer, & M. E. Hasselmo (Eds.), Advances in Neural Information Processing Systems 8 (pp. 1038-1044). Cambridge, MA: MIT Press.
- (1996) Advances in Neural Information Processing Systems 8 , pp. 1038-1044
- Sutton, R.S.¹

23
- 0004102479
- Cambridge, MA: MIT Press.
- Sutton, R.S., & Barto, A. (1998). Reinforcement learning: An introduction. Cambridge, MA: MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.²

24
- 0037197641
- Embodied evolution: Distributing an evolutionary algorithm in a population of robots
- Watson, R., Ficici, S., & Pollack, J. (2002). Embodied evolution: Distributing an evolutionary algorithm in a population of robots. Robotics and Autonomous Systems, 39 (1). 1-18.
- (2002) Robotics and Autonomous Systems , vol.39 , Issue.1 , pp. 1-18
- Watson, R.¹ Ficici, S.² Pollack, J.³

25
- 33646714634
- Evolutionary function approximation for reinforcement learning
- Whiteson, S., & Stone, P. (2006). Evolutionary function approximation for reinforcement learning. Journal of Machine Learning Research, 7, 877-917.
- (2006) Journal of Machine Learning Research , vol.7 , pp. 877-917
- Whiteson, S.¹ Stone, P.²

26
- 27344453198
- Potential-based shaping and Q-value initialization are equivalent
- Wiewiora, E. (2003). Potential-based shaping and Q-value initialization are equivalent. Journal of Artificial Intelligence Research, 19, 205-208.
- (2003) Journal of Artificial Intelligence Research , vol.19 , pp. 205-208
- Wiewiora, E.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.