메뉴 건너뛰기




Volumn , Issue , 2011, Pages 128-134

Higher order Q-Learning

Author keywords

Artificial intelligence; Bayesian methods; Higher Order Learning; Intelligent agent; Machine learning; Q learning; Reinforcement learning; Statistical relational learning

Indexed keywords

BAYESIAN METHODS; HIGHER ORDER; MACHINE-LEARNING; Q-LEARNING; STATISTICAL RELATIONAL LEARNING;

EID: 80052206811     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ADPRL.2011.5967385     Document Type: Conference Paper
Times cited : (5)

References (24)
  • 3
    • 16244388049 scopus 로고    scopus 로고
    • Local bandit approximation for optimal learning problems
    • The MIT Press
    • Duff, M, and Barto, Andrew (1997). Local bandit approximation for optimal learning problems. Advances in Neural Information Processing Systems, volume 9, pp. 1019-1025. The MIT Press.
    • (1997) Advances in Neural Information Processing Systems , vol.9 , pp. 1019-1025
    • Duff, M.1    Andrew, B.2
  • 5
    • 0004007508 scopus 로고    scopus 로고
    • Reinforcement learning
    • The MIT Encyclopedia of the Cognitive Sciences MIT Press
    • Sutton, R.S. (1999). Reinforcement Learning. In Rob Wilson and Frank Keil (Eds.) The MIT Encyclopedia of the Cognitive Sciences, MIT Press.
    • (1999) Rob Wilson and Frank Keil (Eds.)
    • Sutton, R.S.1
  • 8
    • 0032090684 scopus 로고    scopus 로고
    • Enhanced hypertext categorization using hyperlinks
    • Chakrabarti, S., Dom, B., & Indyk, P. (1998). Enhanced Hypertext Classification Using Hyper-Links, In Proceedings of ACM SIGMOD Conference, pp. 307-318. (Pubitemid 128655978)
    • (1998) SIGMOD Record , vol.27 , Issue.2 , pp. 307-318
    • Chakrabarti, S.1    Dom, B.2    Indyk, P.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.