SCOPUS 정보 검색 플랫폼

IEEE SSCI 2011: Symposium Series on Computational Intelligence - ADPRL 2011: 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning

Volumn , Issue , 2011, Pages 120-127

Protecting against evaluation overfitting in empirical reinforcement learning

(4) Whiteson, Shimon a Tanner, Brian b Taylor, Matthew E c Stone, Peter d

a UNIVERSITY OF AMSTERDAM (Netherlands)

b UNIVERSITY OF ALBERTA (Canada)

c LAFAYETTE COLLEGE (United States)

d University of Texas at Austin (United States)

Author keywords

[No Author keywords available]

Indexed keywords

EMPIRICAL EVALUATIONS; OVERFITTING; PROOF OF CONCEPT; TEST EVALUATION; TILE CODING;

ARTIFICIAL INTELLIGENCE; DYNAMIC PROGRAMMING;

REINFORCEMENT LEARNING;

EID: 79958854955 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ADPRL.2011.5967363 Document Type: Conference Paper

Times cited : (117)

References (34)

1
- 1642452775
- Machine learning as an experimental science
- P. Langley, "Machine learning as an experimental science," Machine Learning, vol. 3, no. 1, pp. 5-8, 1988.
- (1988) Machine Learning , vol.3 , Issue.1 , pp. 5-8
- Langley, P.¹

2
- 25844525642
- How evaluation guides AI research
- P. Cohen and A. Howe, "How evaluation guides AI research," AI Magazine, vol. 9, no. 4, pp. 35-43, 1988.
- (1988) AI Magazine , vol.9 , Issue.4 , pp. 35-43
- Cohen, P.¹ Howe, A.²

3
- 0029350748
- Artificial intelligence: An empirical science
- H. Simon, "Artificial intelligence: an empirical science," Artificial Intelligence, vol. 77, no. 1, pp. 95-127, 1995.
- (1995) Artificial Intelligence , vol.77 , Issue.1 , pp. 95-127
- Simon, H.¹

4
- 36948999941
- [Online]
- A. Asuncion and D. Newman, "UCI machine learning repository," 2007. [Online]. Available: http://www.ics.uci.edu/~mlearn/MLRepository.html
- (2007) UCI Machine Learning Repository
- Asuncion, A.¹ Newman, D.²

5
- 85156221438
- Generalization in reinforcement learning: Successful examples using sparse coarse coding
- R. Sutton, "Generalization in reinforcement learning: Successful examples using sparse coarse coding," in Advances in Neural Information Processing Systems 8, 1996, pp. 1038-1044.
- (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1038-1044
- Sutton, R.¹

6
- 80052257141
- Warning: Statistical benchmarking is addictive. Kicking the habit in machine learning
- C. Drummond and N. Japkowicz, "Warning: Statistical benchmarking is addictive. Kicking the habit in machine learning," Journal of Experimental and Theoretical Artificial Intelligence, 2009.
- (2009) Journal of Experimental and Theoretical Artificial Intelligence
- Drummond, C.¹ Japkowicz, N.²

7
- 0032163123
- On Method Overfitting
- E. Falkenauer, "On method overfitting," Journal of Heuristics, vol. 4, pp. 281-287, 1998. (Pubitemid 128513805)
- (1998) Journal of Heuristics , vol.4 , Issue.3 , pp. 281-287
- Falkenauer, E.¹

8
- 51949112889
- Springer
- J. Ponce, T. Berg, M. Everingham, D. Forsyth, M. Herbert, S. Lazebnik, M. Marszalek, C. Schmid, B. Russel, A. Torralba, C. Williams, J. Zhang, and A. Zisserman, Dataset Issues in Object Recognition. Springer, 2007.
- (2007) Dataset Issues in Object Recognition
- Ponce, J.¹ Berg, T.² Everingham, M.³ Forsyth, D.⁴ Herbert, M.⁵ Lazebnik, S.⁶ Marszalek, M.⁷ Schmid, C.⁸ Russel, B.⁹ Torralba, A.¹⁰ Williams, C.¹¹ Zhang, J.¹² Zisserman, A.¹³

9
- 80052252611
- [Online]
- J. Langford, "Clever methods of overfitting," 2005. [Online]. Available: http://hunch.net/?p=22
- (2005) Clever Methods of Overfitting
- Langford, J.¹

10
- 0035283313
- Robust classification for imprecise environments
- DOI 10.1023/A:1007601015854
- F. Provost and T. Fawcett, "Robust classification for imprecise environments," Machine Learning, vol. 42, no. 3, pp. 203-231, 2001. (Pubitemid 32188799)
- (2001) Machine Learning , vol.42 , Issue.3 , pp. 203-231
- Provost, F.¹ Fawcett, T.²

11
- 0000459353
- The Lack of a Priori Distinctions between Learning Algorithms
- D. H. Wolpert, "The lack of a priori distinctions between learning algorithms," Neural Computation, vol. 8, no. 7, pp. 1341-1390, 1996. (Pubitemid 126449973)
- (1996) Neural Computation , vol.8 , Issue.7 , pp. 1341-1390
- Wolpert, D.H.¹

12
- 0001700171
- A Markov decision process
- R. E. Bellman, "A Markov decision process," Journal of Mathematical Mechanics, vol. 6, pp. 679-684, 1957.
- (1957) Journal of Mathematical Mechanics , vol.6 , pp. 679-684
- Bellman, R.E.¹

13
- 72749107057
- Neuroevolutionary reinforcement learning for generalized helicopter control
- R. Koppejan and S. Whiteson, "Neuroevolutionary reinforcement learning for generalized helicopter control," in GECCO 2009: Proceedings of the Genetic and Evolutionary Computation Conference, 2009, pp. 145-152.
- (2009) GECCO 2009: Proceedings of the Genetic and Evolutionary Computation Conference , pp. 145-152
- Koppejan, R.¹ Whiteson, S.²

14
- 79951878534
- The reinforcement learning competitions
- S. Whiteson, B. Tanner, and A. White, "The reinforcement learning competitions," AI Magazine, vol. 31, no. 2, pp. 81-94, 2010.
- (2010) AI Magazine , vol.31 , Issue.2 , pp. 81-94
- Whiteson, S.¹ Tanner, B.² White, A.³

15
- 34547994508
- Multi-task reinforcement learning: A hierarchical Bayesian approach
- A. Wilson, A. Fern, S. Ray, and P. Tadepalli, "Multi-task reinforcement learning: a hierarchical Bayesian approach," in Proceedings of the 24th International Conference on Machine Learning, 2007, pp. 1015-1022.
- (2007) Proceedings of the 24th International Conference on Machine Learning , pp. 1015-1022
- Wilson, A.¹ Fern, A.² Ray, S.³ Tadepalli, P.⁴

16
- 33749251297
- An analytic solution to discrete Bayesian reinforcement learning
- P. Poupart, N. Vlassis, J. Hoey, and K. Regan, "An analytic solution to discrete Bayesian reinforcement learning," in Proceedings of the Twenty- Third International Conference on Machine Learning, 2006.
- (2006) Proceedings of the Twenty- Third International Conference on Machine Learning
- Poupart, P.¹ Vlassis, N.² Hoey, J.³ Regan, K.⁴

17
- 0003901612
- Norwell, MA, USA: Kluwer
- S. Thrun and L. Pratt, Eds., Learning to learn. Norwell, MA, USA: Kluwer, 1998.
- (1998) Learning to Learn
- Thrun, S.¹ Pratt, L.²

18
- 14344277592
- A model of inductive bias learning
- J. Baxter, "A model of inductive bias learning," J. of AI Research, vol. 12, pp. 149-198, 2000.
- (2000) J. of AI Research , vol.12 , pp. 149-198
- Baxter, J.¹

19
- 0004704216
- Limitations on conclusions using scales of measurement
- ch. 18
- F. S. Roberts, "Limitations on conclusions using scales of measurement," in Operations Research and The Public Sector, 1994, vol. 6, ch. 18, pp. 621-671.
- (1994) Operations Research and the Public Sector , vol.6 , pp. 621-671
- Roberts, F.S.¹

20
- 0034247206
- Multiboosting: A technique for combining boosting and wagging
- G. Webb, "Multiboosting: A technique for combining boosting and wagging," Machine learning, vol. 40, no. 2, pp. 159-196, 2000.
- (2000) Machine Learning , vol.40 , Issue.2 , pp. 159-196
- Webb, G.¹

21
- 29644438050
- Statistical comparisons of classifiers over multiple data sets
- J. Demšar, "Statistical comparisons of classifiers over multiple data sets," The Journal of Machine Learning Research, vol. 7, p. 30, 2006.
- (2006) The Journal of Machine Learning Research , vol.7 , pp. 30
- Demšar, J.¹

22
- 27144463192
- On comparing classifiers: Pitfalls to avoid and a recommended approach
- S. L. Salzberg, "On comparing classifiers: Pitfalls to avoid and a recommended approach," Data Mining and Knowledge Discovery, vol. 1, pp. 317-327, 1997.
- (1997) Data Mining and Knowledge Discovery , vol.1 , pp. 317-327
- Salzberg, S.L.¹

23
- 0001178140
- The statistical sign test
- W. J. Dixon and A. M. Mood, "The statistical sign test," Journal of the American Statistical Association, vol. 41, no. 236, pp. 557-566, 1946.
- (1946) Journal of the American Statistical Association , vol.41 , Issue.236 , pp. 557-566
- Dixon, W.J.¹ Mood, A.M.²

24
- 0030306228
- The Copeland method
- D. Saari and V. Merlin, "The Copeland method," Economic Theory, vol. 8, no. 1, pp. 51-76, 1996.
- (1996) Economic Theory , vol.8 , Issue.1 , pp. 51-76
- Saari, D.¹ Merlin, V.²

25
- 72049115729
- August, [Online]
- Y. Koren, "The BellKor solution to the Netflix grand prize," August 2009. [Online]. Available: http://www.netflixprize.com/assets/ GrandPrize2009-BPC-BellKor.pdf
- (2009) The BellKor Solution to the Netflix Grand Prize
- Koren, Y.¹

26
- 0029362452
- Testing heuristics: We have it all wrong
- J. N. Hooker, "Testing heuristics: We have it all wrong," Journal of Heuristics, vol. 1, pp. 33-42, 1995.
- (1995) Journal of Heuristics , vol.1 , pp. 33-42
- Hooker, J.N.¹

27
- 80052232335
- Why (PO)MDPs lose for spatial tasks and what to do about it
- T. Lane and W. Smart, "Why (PO)MDPs lose for spatial tasks and what to do about it," in Proceedings of the ICML 2005 Workshop on Rich Representations for Reinforcement Learning, 2005.
- (2005) Proceedings of the ICML 2005 Workshop on Rich Representations for Reinforcement Learning
- Lane, T.¹ Smart, W.²

28
- 0002340903
- The replacement of general-purpose learning models with adaptively specialized learning modules
- Cambridge, MA: MIT Press
- C. R. Gallistel, "The replacement of general-purpose learning models with adaptively specialized learning modules," in The Cognitive Neurosciences. Cambridge, MA: MIT Press, 2000, pp. 1179-1191.
- (2000) The Cognitive Neurosciences , pp. 1179-1191
- Gallistel, C.R.¹

29
- 33749243349
- Autonomous shaping: Knowledge transfer in reinforcement learning
- ACM
- G. Konidaris and A. Barto, "Autonomous shaping: Knowledge transfer in reinforcement learning," in Proceedings of the 23rd International Conference on Machine learning. ACM, 2006, pp. 489-496.
- (2006) Proceedings of the 23rd International Conference on Machine Learning , pp. 489-496
- Konidaris, G.¹ Barto, A.²

30
- 77955914197
- Multi-task evolutionary shaping without prespecified representations
- July
- M. Snel and S. Whiteson, "Multi-task evolutionary shaping without prespecified representations," in GECCO 2010: Proceedings of the Genetic and Evolutionary Computation Conference, July 2010, pp. 1031-1038.
- (2010) GECCO 2010: Proceedings of the Genetic and Evolutionary Computation Conference , pp. 1031-1038
- Snel, M.¹ Whiteson, S.²

31
- 68949157375
- Transfer learning for reinforcement learning domains: A survey
- M. E. Taylor and P. Stone, "Transfer learning for reinforcement learning domains: A survey," JMLR, vol. 10, no. 1, pp. 1633-1685, 2009.
- (2009) JMLR , vol.10 , Issue.1 , pp. 1633-1685
- Taylor, M.E.¹ Stone, P.²

32
- 80052250874
- A novel benchmark methodology and data repository for real-life reinforcement learning
- poster at the
- A. Nouri, M. L. Littman, L. Li, R. Parr, C. Painter-Wakefield, and G. Taylor, "A novel benchmark methodology and data repository for real-life reinforcement learning," 2009, poster at the Multidisciplinary Symposium on Reinforcement Learning.
- (2009) Multidisciplinary Symposium on Reinforcement Learning
- Nouri, A.¹ Littman, M.L.² Li, L.³ Parr, R.⁴ Painter-Wakefield, C.⁵ Taylor, G.⁶

33
- 70449370276
- RL-Glue : LLanguage-independent software for reinforcement-learning experiments
- September
- B. Tanner and A. White, "RL-Glue : Language-independent software for reinforcement-learning experiments," Journal of Machine Learning Research, vol. 10, pp. 2133-2136, September 2009.
- (2009) Journal of Machine Learning Research , vol.10 , pp. 2133-2136
- Tanner, B.¹ White, A.²

34
- 0028424239
- Improving generalization with active learning
- D. Cohn, L. Atlas, and R. Ladner, "Improving generalization with active learning," Machine Learning, vol. 15, no. 2, pp. 201-221, 1994.
- (1994) Machine Learning , vol.15 , Issue.2 , pp. 201-221
- Cohn, D.¹ Atlas, L.² Ladner, R.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.