SCOPUS 정보 검색 플랫폼

ICML 2006 - Proceedings of the 23rd International Conference on Machine Learning

Volumn 2006, Issue , 2006, Pages 49-56

Relational temporal difference learning

(3) Asgharbeygi, Nima a Stracuzzi, David a Langley, Pat a

a STANFORD UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

DECISION THEORY; FUNCTION EVALUATION; GAME THEORY; HIERARCHICAL SYSTEMS; MARKOV PROCESSES; MULTI AGENT SYSTEMS;

GENERAL GAME PLAYING REPOSITORY; MULTI AGENT MARKOV DECISION PROBLEMS; NONRELATIONAL METHODS; RELATIONAL TEMPORAL DIFFERENCE LEARNING;

LEARNING SYSTEMS;

EID: 33749265162 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (4)

References (18)

1
- 10044266681
- State space reduction for hierarchical reinforcement learning
- Miami Beach, FL
- Asadi, M., & Huber, M. (2004). State space reduction for hierarchical reinforcement learning. Proceedings of the Seventeenth International FLAIRS Conference (pp. 509-514). Miami Beach, FL.
- (2004) Proceedings of the Seventeenth International FLAIRS Conference , pp. 509-514
- Asadi, M.¹ Huber, M.²

2
- 26944452430
- Guiding inference through relational reinforcement learning
- Bonn
- Asgharbeygi, N., & Nejati, N. (2005). Guiding inference through relational reinforcement learning. Proceedings of the Fifteenth International Conference on Inductive Logic Programming (pp. 20-37). Bonn.
- (2005) Proceedings of the Fifteenth International Conference on Inductive Logic Programming , pp. 20-37
- Asgharbeygi, N.¹ Nejati, N.²

3
- 0002882372
- Knightcap: A chess program that learns by combining TD(A) with game-tree search
- Madison, WI: Morgan Kaufmann
- Baxter, J., Trigdell, A., & Weaver, L. (1998). Knightcap: A chess program that learns by combining TD(A) with game-tree search. Proceedings of the Fifteenth International Conference on Machine Learning (pp. 28-36). Madison, WI: Morgan Kaufmann.
- (1998) Proceedings of the Fifteenth International Conference on Machine Learning , pp. 28-36
- Baxter, J.¹ Trigdell, A.² Weaver, L.³

4
- 84880891360
- Symbolic dynamic programming for first order MDPs
- Seattle, Washington
- Boutilier, C., Reiter, R., & Price, B. (2001). Symbolic dynamic programming for first order MDPs. Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence (pp. 690-697). Seattle, Washington.
- (2001) Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence , pp. 690-697
- Boutilier, C.¹ Reiter, R.² Price, B.³

5
- 0002278788
- Hierarchical reinforcement learning with the MAXQ value function decomposition
- Dietterich, T. G. (2000). Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13, 227-303.
- (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
- Dietterich, T.G.¹

6
- 0002034653
- Efficient mining of emerging patterns: Discovering trends and differences
- San Diego, CA
- Dong, G., & Li, J. (1999). Efficient mining of emerging patterns: Discovering trends and differences. Proceedings of the Fifth International Conference on Knowledge Discovery and Data Mining (pp. 43-52). San Diego, CA.
- (1999) Proceedings of the Fifth International Conference on Knowledge Discovery and Data Mining , pp. 43-52
- Dong, G.¹ Li, J.²

7
- 84948172455
- Speeding up relational reinforcement learning through the use of an incremental first order decision tree learning
- Freiburg, Germany
- Driessens, K., Ramon, J., & Blockeel, H. (2001). Speeding up relational reinforcement learning through the use of an incremental first order decision tree learning. Proceedings of the Twelfth European Conference on Machine Learning (pp. 97-108). Freiburg, Germany.
- (2001) Proceedings of the Twelfth European Conference on Machine Learning , pp. 97-108
- Driessens, K.¹ Ramon, J.² Blockeel, H.³

8
- 0035312760
- Relational reinforcement learning
- Dzeroski, S., Raedt, L. D., & Driessens, K. (2001). Relational reinforcement learning. Machine Learning, 43, 7-52.
- (2001) Machine Learning , vol.43 , pp. 7-52
- Dzeroski, S.¹ Raedt, L.D.² Driessens, K.³

9
- 65849284789
- Automatic feature generation for problem solving systems
- Aberdeen, Scotland
- Fawcett, T., & Utgoff, P. E. (1992). Automatic feature generation for problem solving systems. Proceedings of the Ninth International Workshop on Machine Learning (pp. 144-153). Aberdeen, Scotland.
- (1992) Proceedings of the Ninth International Workshop on Machine Learning , pp. 144-153
- Fawcett, T.¹ Utgoff, P.E.²

10
- 13444258086
- Learning domain-specific control knowledge from random walks
- Whistler, British Columbia
- Fern, A., Yoon, S. W., & Givan, R. (2004). Learning domain-specific control knowledge from random walks. Proceedings of the Fourteenth International Conference on Automated Planning and Scheduling (pp. 191-199). Whistler, British Columbia.
- (2004) Proceedings of the Fourteenth International Conference on Automated Planning and Scheduling , pp. 191-199
- Fern, A.¹ Yoon, S.W.² Givan, R.³

11
- 84880803349
- Generalizing plans to new environments in relational mdps
- Acapulco, Mexico
- Guestrin, C., Koller, D., Gearhart, C., & Kanodia, N. (2003). Generalizing plans to new environments in relational mdps. Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence (pp. 1003-1010). Acapulco, Mexico.
- (2003) Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence , pp. 1003-1010
- Guestrin, C.¹ Koller, D.² Gearhart, C.³ Kanodia, N.⁴

12
- 0029679044
- Reinforcement learning: A survey
- Kaelbling, L. P., Littman, M. L., & Moore, A. W. (1996). Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4, 237-285.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

13
- 84898646291
- Chess neighborhoods, function combination, and reinforcement learning
- Hamamatsu, Japan
- Levinson, R., & Weber, R. (2000). Chess neighborhoods, function combination, and reinforcement learning. Proceedings of the Second International Conference on Computers and Games (pp. 133-150). Hamamatsu, Japan.
- (2000) Proceedings of the Second International Conference on Computers and Games , pp. 133-150
- Levinson, R.¹ Weber, R.²

14
- 85149834820
- Markov games as a framework for multi-agent reinforcement learning
- New Brunswick, NJ: Morgan Kaufmann
- Littman, M. L. (1994). Markov games as a framework for multi-agent reinforcement learning. Proceedings of the Eleventh International Conference on Machine Learning (pp. 157-163). New Brunswick, NJ: Morgan Kaufmann.
- (1994) Proceedings of the Eleventh International Conference on Machine Learning , pp. 157-163
- Littman, M.L.¹

15
- 0004102479
- Cambridge, MA: MIT Press
- Sutton, R. S., & Barto, A. G. (1998). Reinforcement learning: An introduction. Cambridge, MA: MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

16
- 0033570798
- A unified analysis of value-function-based reinforcement learning algorithms
- Szepesvari, C., & Littman, M. (1999). A unified analysis of value-function-based reinforcement learning algorithms. Neural Computation, 11, 2017-2060.
- (1999) Neural Computation , vol.11 , pp. 2017-2060
- Szepesvari, C.¹ Littman, M.²

17
- 26944455336
- Relational reinforcement learning: An overview
- Banff, Alberta
- Tadepalli, P., Givan, R., & Driessens, K. (2004). Relational reinforcement learning: An overview. Proceedings of the ICML'04 Workshop on Relational Reinforcement Learning (pp. 1-9). Banff, Alberta.
- (2004) Proceedings of the ICML'04 Workshop on Relational Reinforcement Learning , pp. 1-9
- Tadepalli, P.¹ Givan, R.² Driessens, K.³

18
- 0000985504
- TD-Gammon, a self-teaching backgammon program
- Tesauro, G. (1994). TD-Gammon, a self-teaching backgammon program. Neural Computation, 6, 215-219.
- (1994) Neural Computation , vol.6 , pp. 215-219
- Tesauro, G.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.