메뉴 건너뛰기




Volumn 3809 LNAI, Issue , 2005, Pages 113-122

Global versus local constructive function approximation for on-line reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATION THEORY; ARTIFICIAL INTELLIGENCE; COMPUTER SCIENCE; FUNCTIONS; LEARNING ALGORITHMS; LEARNING SYSTEMS; RESOURCE ALLOCATION;

EID: 33745625397     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/11589990_14     Document Type: Conference Paper
Times cited : (9)

References (23)
  • 1
    • 33847202724 scopus 로고
    • Learning to predict by the methods of temporal differences
    • Sutton, R.S. (1988). Learning to predict by the methods of temporal differences, Machine Learning, Vol. 3, pp 9-44.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 3
    • 0029276036 scopus 로고
    • Temporal difference learning and TD-Gammon
    • Tesauro, G. J. (1995), Temporal difference learning and TD-Gammon, Communications of the ACM. 38(3), pp.58-68.
    • (1995) Communications of the ACM , vol.38 , Issue.3 , pp. 58-68
    • Tesauro, G.J.1
  • 4
    • 85156221438 scopus 로고    scopus 로고
    • Generalisation in reinforcement learning: Successful examples using sparse coarse coding
    • Touretzky D.S., Mozer M.C., & Hasselmo M.E. (Eds.). Cambridge, MA: The MIT Press
    • Sutton R.S. (1996). Generalisation in reinforcement learning: Successful examples using sparse coarse coding. In Touretzky D.S., Mozer M.C., & Hasselmo M.E. (Eds.). Advances in Neural Information Processing Systems: Proceedings of the 1995 Conference (1038-1044). Cambridge, MA: The MIT Press.
    • (1996) Advances in Neural Information Processing Systems: Proceedings of the 1995 Conference , pp. 1038-1044
    • Sutton, R.S.1
  • 13
    • 33745316384 scopus 로고    scopus 로고
    • Reinforcement learning using cascade-correlation neural networks
    • McGill University
    • Bellemare, M.G., Precup, D. and Rivest, F. (2004), Reinforcement Learning Using Cascade-Correlation Neural Networks, Technical Report RL-3.04, McGill University.
    • (2004) Technical Report , vol.RL-3.04
    • Bellemare, M.G.1    Precup, D.2    Rivest, F.3
  • 14
    • 0001071040 scopus 로고
    • A resource-allocating network for function interpolation
    • Platt, J. (1991). "A Resource-Allocating Network for Function Interpolation." Neural Computation 3: 213-225.
    • (1991) Neural Computation , vol.3 , pp. 213-225
    • Platt, J.1
  • 17
    • 0030108039 scopus 로고    scopus 로고
    • The Cascade-correlation learning: A projection pursuit learning perspective
    • Hwang, J.-H., S.-S. You, et al. (1996). "The Cascade-Correlation Learning: A Projection Pursuit Learning Perspective." IEEE Transactions on Neural Networks 7(2): 278-288.
    • (1996) IEEE Transactions on Neural Networks , vol.7 , Issue.2 , pp. 278-288
    • Hwang, J.-H.1    You, S.-S.2
  • 18
    • 0031193357 scopus 로고    scopus 로고
    • Investigation of the CasCor family of learning algorithms
    • Prechelt, L. (1997). Investigation of the CasCor Family of Learning Algorithms, in Neural Networks, 10 (5) : 885-896.
    • (1997) Neural Networks , vol.10 , Issue.5 , pp. 885-896
    • Prechelt, L.1
  • 19
    • 0036825537 scopus 로고    scopus 로고
    • Evaluation of constructive neural networks with cascaded architectures
    • Lahnajarvi, J. J.T., Lehtokangas, M.I., Saarinen, J.P.P., (2002). Evaluation of constructive neural networks with cascaded architectures, in Neurocomputing 48: 573-607.
    • (2002) Neurocomputing , vol.48 , pp. 573-607
    • Lahnajarvi, J.J.T.1    Lehtokangas, M.I.2    Saarinen, J.P.P.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.