Volume 2, 2012, Pages 836-844

Regularized off-policy TD-learning

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMIC FRAMEWORK; COMPUTATIONAL COSTS; CONVEX REGULARIZATIONS; LOW COMPUTATIONAL COMPLEXITY; NON-SMOOTH CONVEX OPTIMIZATIONS; SADDLE-POINT FORMULATIONS; SPARSE REPRESENTATION; THEORETICAL AND EXPERIMENTAL

EID: 84877748309     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 59

References (21)
  • 1
    • 17444361978
    • Non-Euclidean restricted memory level method for large-scale convex optimization
    • A. Ben-Tal and A. Nemirovski. Non-Euclidean restricted memory level method for large-scale convex optimization. Mathematical Programming, 102(3):407-456, 2005.
    • (2005) Mathematical Programming, vol. 102, issue 3, pp. 407-456
    • Ben-Tal, A.; Nemirovski, A.
  • 18
    • 0035273403
    • Online learning control by association and reinforcement
    • J. Si and Y. Wang. Online learning control by association and reinforcement. IEEE Transactions on Neural Networks, 12:264-276, 2001.
    • (2001) IEEE Transactions on Neural Networks, vol. 12, pp. 264-276
    • Si, J.; Wang, Y.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.