메뉴 건너뛰기




Volumn 33, Issue 1, 1998, Pages 105-115

Fast Online Q(λ)

Author keywords

Lazy learning; Online Q( ); Q learning; Reinforcement learning; TD( )

Indexed keywords

ALGORITHMS; APPROXIMATION THEORY; ERROR ANALYSIS; MARKOV PROCESSES; MATHEMATICAL OPERATORS; ONLINE SYSTEMS; PROBABILITY DISTRIBUTIONS; COMPUTATIONAL COMPLEXITY; LEARNING ALGORITHMS; TABLE LOOKUP;

EID: 0032182997     PISSN: 08856125     EISSN: None     Source Type: Journal    
DOI: 10.1023/A:1007562800292     Document Type: Article
Times cited : (64)

References (21)
  • 1
    • 0016556021 scopus 로고
    • A new approach to manipulator control: The cerebellar model articulation controller (CMAC)
    • Albus, J.S. (1975). A new approach to manipulator control: The cerebellar model articulation controller (CMAC). Dynamic Systems, Measurement and Control, 97, 220-227.
    • (1975) Dynamic Systems, Measurement and Control , vol.97 , pp. 220-227
    • Albus, J.S.1
  • 5
    • 0010878888 scopus 로고
    • (Technical Report IRIDIA-94-14). Université Libre de Bruxelles
    • Caironi, P.V.C., & Dorigo, M. (1994). Training Q-agents (Technical Report IRIDIA-94-14). Université Libre de Bruxelles.
    • (1994) Training Q-agents
    • Caironi, P.V.C.1    Dorigo, M.2
  • 6
    • 0007512578 scopus 로고
    • Truncating temporal differences: On the efficient implementation of TD(λ) for reinforcement learning
    • Cichosz, P. (1995). Truncating temporal differences: On the efficient implementation of TD(λ) for reinforcement learning. Journal of Artificial Intelligence Research, 2, 287-318.
    • (1995) Journal of Artificial Intelligence Research , vol.2 , pp. 287-318
    • Cichosz, P.1
  • 7
    • 0347763086 scopus 로고
    • Supervised learning with growing cell structures
    • J. Cowan, G. Tesauro, & J. Alspector (Eds.), San Mateo, CA: Morgan Kaufmann
    • Fritzke, B. (1994). Supervised learning with growing cell structures. In J. Cowan, G. Tesauro, & J. Alspector (Eds.), Advances in neural information processing systems (Vol.6, pp. 255-262). San Mateo, CA: Morgan Kaufmann.
    • (1994) Advances in Neural Information Processing Systems , vol.6 , pp. 255-262
    • Fritzke, B.1
  • 8
    • 0029751419 scopus 로고    scopus 로고
    • The effect of representation and knowledge on goal-directed exploration with reinforcement learning algorithms
    • Koenig, S., & Simmons, R.G. (1996). The effect of representation and knowledge on goal-directed exploration with reinforcement learning algorithms. Machine Learning, 22, 228-250.
    • (1996) Machine Learning , vol.22 , pp. 228-250
    • Koenig, S.1    Simmons, R.G.2
  • 11
    • 0000955979 scopus 로고    scopus 로고
    • Incremental multi-step Q-learning
    • Peng, J., & Williams, R. (1996). Incremental multi-step Q-learning. Machine Learning, 22, 283-290.
    • (1996) Machine Learning , vol.22 , pp. 283-290
    • Peng, J.1    Williams, R.2
  • 13
    • 0029753630 scopus 로고    scopus 로고
    • Reinforcement learning with replacing eligibility traces
    • Singh, S., & Sutton, R. (1996). Reinforcement learning with replacing eligibility traces. Machine Learning, 22, 123-158.
    • (1996) Machine Learning , vol.22 , pp. 123-158
    • Singh, S.1    Sutton, R.2
  • 14
    • 33847202724 scopus 로고
    • Learning to predict by the methods of temporal differences
    • Sutton, R.S. (1988). Learning to predict by the methods of temporal differences. Machine Learning, 3, 9-44.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 15
    • 0000723997 scopus 로고    scopus 로고
    • Generalization in reinforcement learning: Successful examples using sparse coarse coding
    • D.S. Touretzky, M.C. Mozer, & M.E. Hasselmo (Eds.), Cambridge, MA: MIT Press
    • Sutton, R.S. (1996). Generalization in reinforcement learning: Successful examples using sparse coarse coding. In D.S. Touretzky, M.C. Mozer, & M.E. Hasselmo (Eds.), Advances in neural information processing systems, (Vol. 8, pp. 1033-1045). Cambridge, MA: MIT Press.
    • (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1033-1045
    • Sutton, R.S.1
  • 16
    • 2542485629 scopus 로고
    • Practical issues in temporal difference learning
    • D.S., Lippman, J.E. Moody, & D.S Touretzky (Eds.), San Mateo, CA: Morgan Kaufmann
    • Tesauro, G. (1992). Practical issues in temporal difference learning. In D.S., Lippman, J.E. Moody, & D.S Touretzky (Eds.), Advances in neural information processing systems (Vol. 4, pp. 259-266). San Mateo, CA: Morgan Kaufmann.
    • (1992) Advances in Neural Information Processing Systems , vol.4 , pp. 259-266
    • Tesauro, G.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.