Volume , Issue , 2011, Pages 120-127

Protecting against evaluation overfitting in empirical reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

EMPIRICAL EVALUATIONS; OVERFITTING; PROOF OF CONCEPT; TEST EVALUATION; TILE CODING;

EID: 79958854955     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ADPRL.2011.5967363     Document Type: Conference Paper
Times cited : (117)

References (34)
  • 1
    • 1642452775
    • Machine learning as an experimental science
    • P. Langley, "Machine learning as an experimental science," Machine Learning, vol. 3, no. 1, pp. 5-8, 1988.
    • (1988) Machine Learning , vol.3 , Issue.1 , pp. 5-8
    • Langley, P.1
  • 2
    • 25844525642
    • How evaluation guides AI research
    • P. Cohen and A. Howe, "How evaluation guides AI research," AI Magazine, vol. 9, no. 4, pp. 35-43, 1988.
    • (1988) AI Magazine , vol.9 , Issue.4 , pp. 35-43
    • Cohen, P.1    Howe, A.2
  • 3
    • 0029350748
    • Artificial intelligence: An empirical science
    • H. Simon, "Artificial intelligence: an empirical science," Artificial Intelligence, vol. 77, no. 1, pp. 95-127, 1995.
    • (1995) Artificial Intelligence , vol.77 , Issue.1 , pp. 95-127
    • Simon, H.1
  • 5
    • 85156221438
    • Generalization in reinforcement learning: Successful examples using sparse coarse coding
    • R. Sutton, "Generalization in reinforcement learning: Successful examples using sparse coarse coding," in Advances in Neural Information Processing Systems 8, 1996, pp. 1038-1044.
    • (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1038-1044
    • Sutton, R.1
  • 7
    • 0032163123
    • On Method Overfitting
    • E. Falkenauer, "On method overfitting," Journal of Heuristics, vol. 4, pp. 281-287, 1998. (Pubitemid 128513805)
    • (1998) Journal of Heuristics , vol.4 , Issue.3 , pp. 281-287
    • Falkenauer, E.1
  • 10
    • 0035283313
    • Robust classification for imprecise environments
    • DOI 10.1023/A:1007601015854
    • F. Provost and T. Fawcett, "Robust classification for imprecise environments," Machine Learning, vol. 42, no. 3, pp. 203-231, 2001. (Pubitemid 32188799)
    • (2001) Machine Learning , vol.42 , Issue.3 , pp. 203-231
    • Provost, F.1    Fawcett, T.2
  • 11
    • 0000459353
    • The Lack of a Priori Distinctions between Learning Algorithms
    • D. H. Wolpert, "The lack of a priori distinctions between learning algorithms," Neural Computation, vol. 8, no. 7, pp. 1341-1390, 1996. (Pubitemid 126449973)
    • (1996) Neural Computation , vol.8 , Issue.7 , pp. 1341-1390
    • Wolpert, D.H.1
  • 14
    • 79951878534
    • The reinforcement learning competitions
    • S. Whiteson, B. Tanner, and A. White, "The reinforcement learning competitions," AI Magazine, vol. 31, no. 2, pp. 81-94, 2010.
    • (2010) AI Magazine , vol.31 , Issue.2 , pp. 81-94
    • Whiteson, S.1    Tanner, B.2    White, A.3
  • 18
    • 14344277592
    • A model of inductive bias learning
    • J. Baxter, "A model of inductive bias learning," J. of AI Research, vol. 12, pp. 149-198, 2000.
    • (2000) J. of AI Research , vol.12 , pp. 149-198
    • Baxter, J.1
  • 19
    • 0004704216
    • Limitations on conclusions using scales of measurement
    • ch. 18
    • F. S. Roberts, "Limitations on conclusions using scales of measurement," in Operations Research and The Public Sector, 1994, vol. 6, ch. 18, pp. 621-671.
    • (1994) Operations Research and the Public Sector , vol.6 , pp. 621-671
    • Roberts, F.S.1
  • 20
    • 0034247206
    • Multiboosting: A technique for combining boosting and wagging
    • G. Webb, "Multiboosting: A technique for combining boosting and wagging," Machine learning, vol. 40, no. 2, pp. 159-196, 2000.
    • (2000) Machine Learning , vol.40 , Issue.2 , pp. 159-196
    • Webb, G.1
  • 21
    • 29644438050
    • Statistical comparisons of classifiers over multiple data sets
    • J. Demšar, "Statistical comparisons of classifiers over multiple data sets," The Journal of Machine Learning Research, vol. 7, p. 30, 2006.
    • (2006) The Journal of Machine Learning Research , vol.7 , pp. 30
    • Demšar, J.1
  • 22
    • 27144463192
    • On comparing classifiers: Pitfalls to avoid and a recommended approach
    • S. L. Salzberg, "On comparing classifiers: Pitfalls to avoid and a recommended approach," Data Mining and Knowledge Discovery, vol. 1, pp. 317-327, 1997.
    • (1997) Data Mining and Knowledge Discovery , vol.1 , pp. 317-327
    • Salzberg, S.L.1
  • 24
    • 0030306228
    • The Copeland method
    • D. Saari and V. Merlin, "The Copeland method," Economic Theory, vol. 8, no. 1, pp. 51-76, 1996.
    • (1996) Economic Theory , vol.8 , Issue.1 , pp. 51-76
    • Saari, D.1    Merlin, V.2
  • 26
    • 0029362452
    • Testing heuristics: We have it all wrong
    • J. N. Hooker, "Testing heuristics: We have it all wrong," Journal of Heuristics, vol. 1, pp. 33-42, 1995.
    • (1995) Journal of Heuristics , vol.1 , pp. 33-42
    • Hooker, J.N.1
  • 28
    • 0002340903
    • The replacement of general-purpose learning models with adaptively specialized learning modules
    • Cambridge, MA: MIT Press
    • C. R. Gallistel, "The replacement of general-purpose learning models with adaptively specialized learning modules," in The Cognitive Neurosciences. Cambridge, MA: MIT Press, 2000, pp. 1179-1191.
    • (2000) The Cognitive Neurosciences , pp. 1179-1191
    • Gallistel, C.R.1
  • 31
    • 68949157375
    • Transfer learning for reinforcement learning domains: A survey
    • M. E. Taylor and P. Stone, "Transfer learning for reinforcement learning domains: A survey," JMLR, vol. 10, no. 1, pp. 1633-1685, 2009.
    • (2009) JMLR , vol.10 , Issue.1 , pp. 1633-1685
    • Taylor, M.E.1    Stone, P.2
  • 33
    • 70449370276
    • RL-Glue : Language-independent software for reinforcement-learning experiments
    • September
    • B. Tanner and A. White, "RL-Glue : Language-independent software for reinforcement-learning experiments," Journal of Machine Learning Research, vol. 10, pp. 2133-2136, September 2009.
    • (2009) Journal of Machine Learning Research , vol.10 , pp. 2133-2136
    • Tanner, B.1    White, A.2
  • 34
    • 0028424239
    • Improving generalization with active learning
    • D. Cohn, L. Atlas, and R. Ladner, "Improving generalization with active learning," Machine Learning, vol. 15, no. 2, pp. 201-221, 1994.
    • (1994) Machine Learning , vol.15 , Issue.2 , pp. 201-221
    • Cohn, D.1    Atlas, L.2    Ladner, R.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.