메뉴 건너뛰기




Volumn 4630 LNCS, Issue , 2007, Pages 122-134

Feature construction for reinforcement learning in hearts

Author keywords

[No Author keywords available]

Indexed keywords

GAME THEORY; LEARNING SYSTEMS; LINEAR REGRESSION; RANDOM PROCESSES;

EID: 38049011913     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-540-75538-8_11     Document Type: Conference Paper
Times cited : (43)

References (16)
  • 1
    • 0002882372 scopus 로고    scopus 로고
    • Knightcap: A Chess Program that Learns by Combining TD(λ) with Game-Tree Search
    • Morgan Kaufmann, San Francisco, CA
    • Baxter, J., Trigdell, A., Weaver, L.: Knightcap: a Chess Program that Learns by Combining TD(λ) with Game-Tree Search. In: Proc. 15th International Conf. on Machine Learning, pp. 28-36. Morgan Kaufmann, San Francisco, CA (1998)
    • (1998) Proc. 15th International Conf. on Machine Learning , pp. 28-36
    • Baxter, J.1    Trigdell, A.2    Weaver, L.3
  • 2
    • 84956863737 scopus 로고    scopus 로고
    • Buro, M.: From Simple Features to Sophisticated Evaluation Functions. In: van den Herik, J., Iida, H. (eds.) CG 1998. LNCS, 1558, pp. 126-145. Springer, Heidelberg (1999)
    • Buro, M.: From Simple Features to Sophisticated Evaluation Functions. In: van den Herik, J., Iida, H. (eds.) CG 1998. LNCS, vol. 1558, pp. 126-145. Springer, Heidelberg (1999)
  • 4
    • 38049061595 scopus 로고    scopus 로고
    • Fürnkranz, J., Pfahringer, B., Kaindl, H., Kramer, S.: Learning to Use Operational Advice. In: Proc. of the 14th European Conference on A.I. (2000)
    • Fürnkranz, J., Pfahringer, B., Kaindl, H., Kramer, S.: Learning to Use Operational Advice. In: Proc. of the 14th European Conference on A.I. (2000)
  • 7
    • 38049000794 scopus 로고    scopus 로고
    • Luckhardt, C., Irani, K.: An Algorithmic Solution of N-Person Games. In: AAAI-86, 1, pp. 158-162 (1986)
    • Luckhardt, C., Irani, K.: An Algorithmic Solution of N-Person Games. In: AAAI-86, vol. 1, pp. 158-162 (1986)
  • 9
    • 38049009946 scopus 로고    scopus 로고
    • Two Search Techniques for Imperfect Information Games and Application to Hearts
    • University of Massachusetts Technical Report, pp
    • Perkins, T.: Two Search Techniques for Imperfect Information Games and Application to Hearts. University of Massachusetts Technical Report, pp. 98-71 (1998)
    • (1998) , pp. 98-71
    • Perkins, T.1
  • 11
    • 0013528313 scopus 로고    scopus 로고
    • Scaling Reinforcement Learning toward RoboCup Soccer
    • Morgan Kaufmann, San Francisco,CA
    • Stone, P., Sutton, R.S.: Scaling Reinforcement Learning toward RoboCup Soccer. In: Proc. 18th ICML, pp. 537-544. Morgan Kaufmann, San Francisco,CA (2001)
    • (2001) Proc. 18th ICML , pp. 537-544
    • Stone, P.1    Sutton, R.S.2
  • 13
    • 34247179642 scopus 로고    scopus 로고
    • Sturtevant, N.R., Bowling, M.H.: Robust Game Play against Unknown Opponents. In: AAMAS-2006, pp. 713-719 (2006)
    • Sturtevant, N.R., Bowling, M.H.: Robust Game Play against Unknown Opponents. In: AAMAS-2006, pp. 713-719 (2006)
  • 14
    • 38049011837 scopus 로고    scopus 로고
    • Sturtevant, N.R., Korf, R.E.: On Pruning Techniques for Multi-Player Games. In: AAAI-2000 (2000)
    • Sturtevant, N.R., Korf, R.E.: On Pruning Techniques for Multi-Player Games. In: AAAI-2000 (2000)
  • 16
    • 0029276036 scopus 로고
    • Temporal Difference Learning and TD-Gammon
    • Tesauro, G.: Temporal Difference Learning and TD-Gammon. Communications of the ACM 38(3), 58-68 (1995)
    • (1995) Communications of the ACM , vol.38 , Issue.3 , pp. 58-68
    • Tesauro, G.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.