Volume 16, Issue 6, 2008, Pages 400-412

Co-evolution of shaping rewards and meta-parameters in reinforcement learning

Author keywords

Genetic algorithms; Meta parameters; Reinforcement learning; Shaping rewards


EID: 55949119833     PISSN: 10597123     EISSN: 17412633     Source Type: Journal    
DOI: 10.1177/1059712308092835     Document Type: Article
Times cited: 24

References (26)
  • 1
    • 0000500817
    • Interactions between learning and evolution
    • Ackley, D.H., & Littman, M.L. (1991). Interactions between learning and evolution. In C. G. Langton, C. Taylor, J. D. Farmer, & S. Rasmussen (Eds.), Artificial Life II: Santa Fe Institute Studies in the Sciences of Complexity (Vol. 10, pp. 487-509). Redwood City, CA: Addison-Wesley.
    • (1991) Artificial Life II: Santa Fe Institute Studies in the Sciences of Complexity , pp. 487-509
    • Ackley, D.H.1    Littman, M.L.2
  • 3
    • 21344434798
    • The cyber rodent project: Exploration of adaptive mechanisms for self-preservation and self-reproduction
    • Doya, K., & Uchibe, E. (2005). The cyber rodent project: Exploration of adaptive mechanisms for self-preservation and self-reproduction. Adaptive Behavior, 13(2), 149-160.
    • (2005) Adaptive Behavior , vol.13 , Issue.2 , pp. 149-160
    • Doya, K.1    Uchibe, E.2
  • 10
    • 1942484890
    • The influence of reward on the speed of reinforcement learning: An analysis of shaping
    • Laud, A., & DeJong, G. (2003). The influence of reward on the speed of reinforcement learning: An analysis of shaping. In Proceedings of the International Conference on Machine Learning, ICML2003 (pp. 440-447). San Francisco, CA: Morgan Kaufmann.
    • (2003) Proceedings of the International Conference on Machine Learning, ICML2003 , pp. 440-447
    • Laud, A.1    DeJong, G.2
  • 13
    • 0030647149
    • Reinforcement learning in the multirobot domain
    • Mataric, M. (1997). Reinforcement learning in the multirobot domain. Autonomous Robots, 4(1), 73-83.
    • (1997) Autonomous Robots , vol.4 , Issue.1 , pp. 73-83
    • Mataric, M.1
  • 15
    • 0141596576
    • Policy invariance under reward transformations: Theory and application to reward shaping
    • Ng, A.Y., Harada, D., & Russell, S.J. (1999). Policy invariance under reward transformations: Theory and application to reward shaping. In Proceedings of the International Conference on Machine Learning, ICML1999 (pp. 278-287). San Francisco, CA: Morgan Kaufmann.
    • (1999) Proceedings of the International Conference on Machine Learning, ICML1999 , pp. 278-287
    • Ng, A.Y.1    Harada, D.2    Russell, S.J.3
  • 19
    • 0029753630
    • Reinforcement learning with replacing eligibility traces
    • Singh, S.P., & Sutton, R.S. (1996). Reinforcement learning with replacing eligibility traces. Machine Learning, 22(1-3), 123-158.
    • (1996) Machine Learning , vol.22 , Issue.1-3 , pp. 123-158
    • Singh, S.P.1    Sutton, R.S.2
  • 22
    • 85156221438
    • Generalization in reinforcement learning: Successful examples using sparse coarse coding
    • Sutton, R.S. (1996). Generalization in reinforcement learning: Successful examples using sparse coarse coding. In D. S. Touretzky, M. C. Mozer, & M. E. Hasselmo (Eds.), Advances in Neural Information Processing Systems 8 (pp. 1038-1044). Cambridge, MA: MIT Press.
    • (1996) Advances in Neural Information Processing Systems 8 , pp. 1038-1044
    • Sutton, R.S.1
  • 24
    • 0037197641
    • Embodied evolution: Distributing an evolutionary algorithm in a population of robots
    • Watson, R., Ficici, S., & Pollack, J. (2002). Embodied evolution: Distributing an evolutionary algorithm in a population of robots. Robotics and Autonomous Systems, 39(1), 1-18.
    • (2002) Robotics and Autonomous Systems , vol.39 , Issue.1 , pp. 1-18
    • Watson, R.1    Ficici, S.2    Pollack, J.3
  • 25
    • 33646714634
    • Evolutionary function approximation for reinforcement learning
    • Whiteson, S., & Stone, P. (2006). Evolutionary function approximation for reinforcement learning. Journal of Machine Learning Research, 7, 877-917.
    • (2006) Journal of Machine Learning Research , vol.7 , pp. 877-917
    • Whiteson, S.1    Stone, P.2
  • 26
    • 27344453198
    • Potential-based shaping and Q-value initialization are equivalent
    • Wiewiora, E. (2003). Potential-based shaping and Q-value initialization are equivalent. Journal of Artificial Intelligence Research, 19, 205-208.
    • (2003) Journal of Artificial Intelligence Research , vol.19 , pp. 205-208
    • Wiewiora, E.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.