SCOPUS 정보 검색 플랫폼

Volumn 3607 LNAI, Issue , 2005, Pages 321-331

Feature-discovering approximate value iteration methods

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATION THEORY; CLASSIFICATION (OF INFORMATION); MARKOV PROCESSES; PROBLEM SOLVING; STATE SPACE METHODS;

BELLMAN ERROR; MARKOV DECISION PROCESSES; VALUE ITERATION;

ITERATIVE METHODS;

EID: 26944495251 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/11527862_25 Document Type: Conference Paper

Times cited : (5)

References (11)

1
- 84968468700
- Polynomial approximation - A new computational technique in dynamic programming
- R. Bellman, R. Kalaba, and B. Kotkin. Polynomial approximation - a new computational technique in dynamic programming. Math. Comp., 17(8):155-161, 1963.
- (1963) Math. Comp. , vol.17 , Issue.8 , pp. 155-161
- Bellman, R.¹ Kalaba, R.² Kotkin, B.³

2
- 0003487482
- Athena Scientific
- D. P. Bertsekas and J. N. Tsitsiklis. Neuro-Dynamic Programming. Athena Scientific, 1996.
- (1996) Neuro-dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

3
- 0004255908
- McGraw-Hill
- T. M. Mitchell. Machine Learning. McGraw-Hill, 1997.
- (1997) Machine Learning
- Mitchell, T.M.¹

4
- 26944448728
- Greedy linear value-approximation for factored markov decision processes
- R. Patrascu, P. Poupart, D. Schuurmans, C. Boutilier, and C. Guestrin. Greedy linear value-approximation for factored markov decision processes. In AAAI, 2002.
- (2002) AAAI
- Patrascu, R.¹ Poupart, P.² Schuurmans, D.³ Boutilier, C.⁴ Guestrin, C.⁵

5
- 0003500248
- Morgan Kaufmann
- J. R. Quinlan. C4.5: Programs for Machine Learning. Morgan Kaufmann, 1993.
- (1993) C4.5: Programs for Machine Learning
- Quinlan, J.R.¹

6
- 33847202724
- Learning to predict by the methods of temporal differences
- R. S. Sutton. Learning to predict by the methods of temporal differences. MLJ, 3:9-44, 1988.
- (1988) MLJ , vol.3 , pp. 9-44
- Sutton, R.S.¹

7
- 0004007508
- MIT Press
- R. S. Sutton and A. G. Barto. Reinforcement Learning. MIT Press, 1998.
- (1998) Reinforcement Learning
- Sutton, R.S.¹ Barto, A.G.²

8
- 0029276036
- Temporal difference learning and td-gammon
- G. Tesauro. Temporal difference learning and td-gammon. Comm. ACM, 38(3):58-68, 1995.
- (1995) Comm. ACM , vol.38 , Issue.3 , pp. 58-68
- Tesauro, G.¹

10
- 0002278965
- Adaptive switching circuits
- B. Widrow and M. E. Hoff Jr. Adaptive switching circuits. IRE WESCON Convention Record, pages 96-104, 1960.
- (1960) IRE WESCON Convention Record , pp. 96-104
- Widrow, B.¹ Hoff Jr., M.E.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.