SCOPUS Information Retrieval Platform

Journal of Experimental and Theoretical Artificial Intelligence

Volume 22, Issue 1, 2010, Pages 67-80

Warning: Statistical benchmarking is addictive. Kicking the habit in machine learning

(2) Drummond, Chris (a); Japkowicz, Nathalie (b)

a NATIONAL RESEARCH COUNCIL CANADA (Canada)

b UNIVERSITY OF OTTAWA (Canada)

Author keywords

Algorithm evaluation; Benchmarking; Machine learning; Null hypothesis tests

Indexed keywords

ALGORITHM EVALUATION; ALGORITHM PERFORMANCE; DATA SETS; MACHINE LEARNING COMMUNITIES; MACHINE-LEARNING; NULL HYPOTHESIS; SET OF RULES;

BENCHMARKING; LEARNING SYSTEMS; STATISTICAL TESTS;

LEARNING ALGORITHMS;

EID: 77951200774 PISSN: 0952813X EISSN: 13623079 Source Type: Journal
DOI: 10.1080/09528130903010295 Document Type: Article

Times cited : (29)

References (48)

1
- 33750939148
- The UCI KDD archive of large data sets for data mining research and experimentation
- Bay, S.D., Kibler, D., Pazzani, M.J., and Smyth, P. (2000), 'The UCI KDD Archive of Large Data Sets for Data Mining Research and Experimentation', SIGKDD Explorations, 2(2), 81-85.
- (2000) SIGKDD Explorations , vol.2 , Issue.2 , pp. 81-85
- Bay, S.D.¹ Kibler, D.² Pazzani, M.J.³ Smyth, P.⁴

2
- 0003408496
- Irvine, CA, USA: University of California
- Blake, C.L., and Merz, C.J. (1998), UCI Repository of Machine Learning Databases, Irvine, CA, USA: University of California. www.ics.uci.edu/~mlearn/MLRepository.html
- (1998) UCI Repository of Machine Learning Databases
- Blake, C.L.¹ Merz, C.J.²

3
- 0031191630
- The use of the area under the ROC curve in the evaluation of machine learning algorithms
- Bradley, A.P. (1997), 'The Use of the Area Under the ROC Curve in the Evaluation of Machine Learning Algorithms', Pattern Recognition, 30(7), 1145-1159.
- (1997) Pattern Recognition , vol.30 , Issue.7 , pp. 1145-1159
- Bradley, A.P.¹

4
- 77951168132
- Physicist found guilty of misconduct
- DOI:10.1038/news020923-9
- Brumfiel, G. (2002). Physicist Found Guilty of Misconduct. Published online in Nature DOI:10.1038/news020923-9.
- (2002) Published Online in Nature
- Brumfiel, G.¹

5
- 24944532359
- Inference for data visualization
- Buja, A., and Cook, D. (1999). Inference for Data Visualization. Talk given at the Joint Statistical Meetings.
- (1999) Talk Given at the Joint Statistical Meetings
- Buja, A.¹ Cook, D.²

6
- 12244279570
- Data mining in metric space: An empirical analysis of supervised learning performance criteria
- New York
- Caruana, R., and Niculescu-Mizil, A. (2004), 'Data Mining in Metric Space: An Empirical Analysis of Supervised Learning Performance Criteria', in Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, pp. 69-78.
- (2004) Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , pp. 69-78
- Caruana, R.¹ Niculescu-Mizil, A.²

7
- 33749254096
- An empirical comparison of supervised learning algorithms
- Banff, Alberta, Canada
- Caruana, R., and Niculescu-Mizil, A. (2006), 'An Empirical Comparison of Supervised Learning Algorithms', in Proceedings of the Twenty-Third International Conference on Machine Learning, Banff, Alberta, Canada, pp. 161-168.
- (2006) Proceedings of the Twenty-Third International Conference on Machine Learning , pp. 161-168
- Caruana, R.¹ Niculescu-Mizil, A.²

8
- 0031829312
- Précis of statistical significance: Rationale, validity, and utility
- Chow, S.L. (1998), 'Précis of Statistical Significance: Rationale, Validity, and Utility', Behavioral and Brain Sciences, 21, 169-239.
- (1998) Behavioral and Brain Sciences , vol.21 , pp. 169-239
- Chow, S.L.¹

9
- 0039802908
- The earth is round (p < .05)
- Cohen, J. (1994), 'The Earth is Round (p < .05)', American Psychologist, 49, 997-1003.
- (1994) American Psychologist , vol.49 , pp. 997-1003
- Cohen, J.¹

10
- 25844525642
- How evaluation guides AI research
- Cohen, P., and Howe, A. (1988), 'How Evaluation Guides AI Research', AI Magazine, 9(4), 35-43.
- (1988) AI Magazine , vol.9 , Issue.4 , pp. 35-43
- Cohen, P.¹ Howe, A.²

11
- 33846460845
- Breakthrough of the year: Breakdown of the year: Scientific fraud
- Couzin, J. (2006), 'Breakthrough of the Year: Breakdown of the Year: Scientific Fraud', Science, 314(5807), p. 1853.
- (2006) Science , vol.314 , Issue.5807 , pp. 1853
- Couzin, J.¹

12
- 33749249600
- The relationship between precision-recall and ROC curves
- Davis, J., and Goadrich, M. (2006), 'The Relationship Between Precision-recall and ROC Curves', in Proceedings of the Twenty Third International Conference on Machine Learning, pp. 233-240.
- (2006) Proceedings of the Twenty Third International Conference on Machine Learning , pp. 233-240
- Davis, J.¹ Goadrich, M.²

13
- 77951187255
- Drummond, C., Elazmeh, W., and Japkowicz, N. (eds.) American Association for Artificial Intelligence Technical Report WS-06-06, Menlo Park, CA, USA Drummond, C., Elazmeh, W., Japkowicz, N., and Macskassy, S.A. (eds.) American Association for Artificial Intelligence. Technical Report WS-07-105 Menlo Park, CA, USA
- Drummond, C., Elazmeh, W., and Japkowicz, N. (eds.) (2006), Proceedings of the Twenty-first National Conference on Artificial Intelligence: Workshop on Evaluation Methods for Machine Learning. American Association for Artificial Intelligence Technical Report WS-06-06, Menlo Park, CA, USA.
- (2006) Proceedings of the Twenty-first National Conference on Artificial Intelligence: Workshop on Evaluation Methods for Machine Learning

14
- 77951170115
- Drummond, C., Elazmeh, W., Japkowicz, N., and Macskassy, S.A. (eds.)
- Drummond, C., Elazmeh, W., Japkowicz, N., and Macskassy, S.A. (eds.) (2008), Proceedings of the Twenty-Fifth International Conference on Machine Learning: Evaluation Methods for Machine Learning III. http://www.site.uottawa.ca/ICML08WS
- (2008) Proceedings of the Twenty-Fifth International Conference on Machine Learning: Evaluation Methods for Machine Learning III

15
- 33845986998
- Learning to live with false alarms
- Drummond, C., and Holte, R.C. (2005), 'Learning to Live with False Alarms', in Proceedings of the Eleventh International Conference on Knowledge Discovery and Data Mining: Workshop on Data Mining Methods for Anomaly Detection, pp. 21-24.
- (2005) Proceedings of the Eleventh International Conference on Knowledge Discovery and Data Mining: Workshop on Data Mining Methods for Anomaly Detection , pp. 21-24
- Drummond, C.¹ Holte, R.C.²

16
- 33748991193
- Cost curves: An improved method for visualizing classifier performance
- DOI 10.1007/s10994-006-8199-5
- Drummond, C., and Holte, R.C. (2006), 'Cost Curves: An Improved Method for Visualizing Classifier Performance', Machine Learning, 65(1), 95-130. (Pubitemid 44451195)
- (2006) Machine Learning , vol.65 , Issue.1 , pp. 95-130
- Drummond, C.¹ Holte, R.C.²

17
- 2342622786
- Leave-one-out error, stability, and generalisation of voting combination of classifiers
- Evgeniou, T., Pontil, M., and Elisseeff, A. (2004), 'Leave-one-out Error, Stability, and Generalisation of Voting Combination of Classifiers', Machine Learning, 55(1), 71-97.
- (2004) Machine Learning , vol.55 , Issue.1 , pp. 71-97
- Evgeniou, T.¹ Pontil, M.² Elisseeff, A.³

18
- 77951199174
- Ferri, C., Flach, P., Hernández-Orallo, J., and Lachiche, N. (eds.)
- Ferri, C., Flach, P., Hernández-Orallo, J., and Lachiche, N. (eds.) (2004), Proceedings of the Sixteenth European Conference on Artificial Intelligence: Workshop on ROC Analysis in AI. http://www.dsic.upv.es/~flip/ROCAI2004/accepted.html
- (2004) Proceedings of the Sixteenth European Conference on Artificial Intelligence: Workshop on ROC Analysis in AI

19
- 77951181351
- Scientists behaving badly
- DOI:10.1038/news040301-9
- Giles, J. (2004). Scientists Behaving Badly. Published online in Nature DOI:10.1038/news040301-9.
- (2004) Published Online in Nature
- Giles, J.¹

20
- 33846018349
- A review of machine learning at AAAI-87
- Greiner, R., Silver, B., Becker, S., and Gruninger, M. (1988), 'A Review of Machine Learning at AAAI-87', Machine Learning, 3(1), 79-92.
- (1988) Machine Learning , vol.3 , Issue.1 , pp. 79-92
- Greiner, R.¹ Silver, B.² Becker, S.³ Gruninger, M.⁴

21
- 0002812166
- In praise of the null hypothesis statistical test
- Hagen, R.L. (1997), 'In Praise of the Null Hypothesis Statistical Test', American Psychologist, 52, 15-24.
- (1997) American Psychologist , vol.52 , pp. 15-24
- Hagen, R.L.¹

22
- 84993697599
- The notion of the hypothetical universe
- (Chap. 4), eds. Denton E. Morrison and Ramon E. Henkel, Chicago: Aldine
- Hagood, M.J. (1941), 'The Notion of the Hypothetical Universe', in The Significance Test Controversy: A Reader (Chap. 4), eds. Denton E. Morrison and Ramon E. Henkel, Chicago: Aldine, pp. 65-78.
- (1941) The Significance Test Controversy: A Reader , pp. 65-78
- Hagood, M.J.¹

23
- 33745886270
- Classifier technology and the illusion of progress
- Hand, D.J. (2006), 'Classifier Technology and the Illusion of Progress', Statistical Science, 21(1), 1-15.
- (2006) Statistical Science , vol.21 , Issue.1 , pp. 1-15
- Hand, D.J.¹

24
- 0003704318
- Irvine, CA, USA: Department of Information and Computer Science, University of California
- Hettich, S., and Bay, S.D. (1999), The UCI KDD Archive. Irvine, CA, USA: Department of Information and Computer Science, University of California. http://kdd.ics.uci.edu
- (1999) The UCI KDD Archive
- Hettich, S.¹ Bay, S.D.²

25
- 0027580356
- Very simple classification rules perform well on most commonly used datasets
- Holte, R.C. (1993), 'Very Simple Classification Rules Perform Well on Most Commonly Used Datasets', Machine Learning, 11(1), 63-91.
- (1993) Machine Learning , vol.11 , Issue.1 , pp. 63-91
- Holte, R.C.¹

26
- 65649085468
- Constructing new and better evaluation measures for machine learning
- Huang, J., and Ling, C.X. (2007), 'Constructing New and Better Evaluation Measures for Machine Learning', in Proceedings of the Twentieth International Joint Conference on Artificial Intelligence, pp. 859-864.
- (2007) Proceedings of the Twentieth International Joint Conference on Artificial Intelligence , pp. 859-864
- Huang, J.¹ Ling, C.X.²

27
- 33845990606
- Why question machine learning evaluation methods? (An illustrative review of the shortcomings of current methods)
- Evaluation Methods for Machine Learning - Papers from the AAAI Workshop, Technical Report
- Japkowicz, N. (2006), 'Why Question Machine Learning Evaluation Methods? (An illustrative review of the shortcomings of current methods)', in Proceedings of the Twenty-First National Conference on Artificial Intelligence: Workshop on Evaluation Methods for Machine Learning, AAAI Technical Report WS-06-06, pp. 6-11. (Pubitemid 46053316)
- (2006) AAAI Workshop - Technical Report , vol.WS-06-06 , pp. 6-11
- Japkowicz, N.¹

28
- 0003771282
- Machine learning as an experimental science
- Kibler, D., and Langley, P. (1988), 'Machine Learning as an Experimental Science', in Proceedings of the Third European Working Session on Learning, pp. 81-92.
- (1988) Proceedings of the Third European Working Session on Learning , pp. 81-92
- Kibler, D.¹ Langley, P.²

29
- 84974776937
- Guest editor's introduction: The comprehensibility manifesto
- Kodratoff, Y. (1994), 'Guest Editor's Introduction: The Comprehensibility Manifesto', AI Communications, 7(2), 83-85.
- (1994) AI Communications , vol.7 , Issue.2 , pp. 83-85
- Kodratoff, Y.¹

30
- 0003945869
- Chicago: University of Chicago Press
- Kuhn, T. (1962), The Structure of Scientific Revolutions, Chicago: University of Chicago Press.
- (1962) The Structure of Scientific Revolutions
- Kuhn, T.¹

31
- 1642452775
- Machine learning as an experimental science
- Langley, P. (1988), 'Machine Learning as an Experimental Science', Machine Learning, 3, 5-8.
- (1988) Machine Learning , vol.3 , pp. 5-8
- Langley, P.¹

32
- 10344252975
- Pittsburgh, PA, USA: Department of Statistics, Carnegie Mellon University.
- Meyer, M., and Vlachos, P. (1989), Statlib. Pittsburgh, PA, USA: Department of Statistics, Carnegie Mellon University. http://lib.stat.cmu.edu/
- (1989) Statlib
- Meyer, M.¹ Vlachos, P.²

33
- 0002543260
- Discovering classification rules using variable-valued logic system VL1
- Michalski, R.S. (1973), 'Discovering Classification Rules using Variable-valued Logic System VL1', in Proceedings of the Third International Joint Conference on Artificial Intelligence, pp. 162-172.
- (1973) Proceedings of the Third International Joint Conference on Artificial Intelligence , pp. 162-172
- Michalski, R.S.¹

34
- 0000942050
- A theory and methodology of inductive learning
- Michalski, R.S. (1983), 'A Theory and Methodology of Inductive Learning', Artificial Intelligence, 20, 111-161.
- (1983) Artificial Intelligence , vol.20 , pp. 111-161
- Michalski, R.S.¹

35
- 0002056465
- Version spaces: A candidate elimination approach to rule learning
- Mitchell, T. (1977), 'Version Spaces: A Candidate Elimination Approach to Rule Learning', in Proceedings of the Fifth International Joint Conference on Artificial Intelligence, pp. 305-310.
- (1977) Proceedings of the Fifth International Joint Conference on Artificial Intelligence , pp. 305-310
- Mitchell, T.¹

36
- 38349193795
- A framework for generating data to simulate changing environments
- Narasimhamurthy, A., and Kuncheva, L. (2007), 'A Framework for Generating Data to Simulate Changing Environments', in Proceedings of the Twenty-Fifth IASTED International Multi-Conference on Artificial Intelligence and Applications, pp. 384-389.
- (2007) Proceedings of the Twenty-Fifth IASTED International Multi-Conference on Artificial Intelligence and Applications , pp. 384-389
- Narasimhamurthy, A.¹ Kuncheva, L.²

37
- 77951151630
- Oblinger, D., Lau, T., Gil, Y., and Bauer, M. (eds.) American Association for Artificial Intelligence. Technical Report WS-05-104 Menlo Park, CA, USA
- Oblinger, D., Lau, T., Gil, Y., and Bauer, M. (eds.) (2005), Proceedings of the Twentieth National Conference on Artificial Intelligence: Workshop on Human Comprehensible Machine Learning. American Association for Artificial Intelligence. Technical Report WS-05-104 Menlo Park, CA, USA.
- (2005) Proceedings of the Twentieth National Conference on Artificial Intelligence: Workshop on Human Comprehensible Machine Learning

38
- 85041528332
- Reducing misclassification costs
- Pazzani, M., Merz, C., Murphy, P., Ali, K., Hume, T., and Brunk, C. (1994), 'Reducing Misclassification Costs', in Proceedings of the Eleventh International Conference on Machine Learning, pp. 217-225.
- (1994) Proceedings of the Eleventh International Conference on Machine Learning , pp. 217-225
- Pazzani, M.¹ Merz, C.² Murphy, P.³ Ali, K.⁴ Hume, T.⁵ Brunk, C.⁶

39
- 6144253216
- Strong inference
- Platt, J.R. (1964), 'Strong Inference', Science, 146(3642), 347-353.
- (1964) Science , vol.146 , Issue.3642 , pp. 347-353
- Platt, J.R.¹

40
- 0035283313
- Robust classification for imprecise environments
- DOI 10.1023/A:1007601015854
- Provost, F., and Fawcett, T. (2001), 'Robust Classification for Imprecise Environments', Machine Learning, 42, 203-231. (Pubitemid 32188799)
- (2001) Machine Learning , vol.42 , Issue.3 , pp. 203-231
- Provost, F.¹ Fawcett, T.²

41
- 0002900357
- The case against accuracy estimation for comparing induction algorithms
- Provost, F., Fawcett, T., and Kohavi, R. (1998), 'The Case Against Accuracy Estimation for Comparing Induction Algorithms', in Proceedings of the Fifteenth International Conference on Machine Learning, pp. 43-48.
- (1998) Proceedings of the Fifteenth International Conference on Machine Learning , pp. 43-48
- Provost, F.¹ Fawcett, T.² Kohavi, R.³

42
- 0032001170
- Learning in the "Real World"
- Saitta, L., and Neri, F. (1998), 'Learning in the "Real World"', Machine Learning, 30(2-3), 133-163.
- (1998) Machine Learning , vol.30 , Issue.2-3 , pp. 133-163
- Saitta, L.¹ Neri, F.²

43
- 27144463192
- On comparing classifiers: Pitfalls to avoid and a recommended approach
- Salzberg, S.L. (1997), 'On Comparing Classifiers: Pitfalls to Avoid and a Recommended Approach', Data Mining and Knowledge Discovery, 1, 317-327.
- (1997) Data Mining and Knowledge Discovery , vol.1 , pp. 317-327
- Salzberg, S.L.¹

44
- 0027708867
- Artificial intelligence as an experimental science
- Simon, H.A. (1993), 'Artificial Intelligence as an Experimental Science', in Proceedings of the Eleventh National Conference on Artificial Intelligence, p. 853.
- (1993) Proceedings of the Eleventh National Conference on Artificial Intelligence , pp. 853
- Simon, H.A.¹

45
- 0029350748
- Artificial intelligence: An empirical science
- Simon, H.A. (1995), 'Artificial Intelligence: An Empirical Science', Artificial Intelligence, 77, 95-127.
- (1995) Artificial Intelligence , vol.77 , pp. 95-127
- Simon, H.A.¹

46
- 77951152241
- Thearling, K. and Stein, R. (eds.)
- Thearling, K. and Stein, R. (eds.) (1998), Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining: Workshop on Keys to the Commercial Success of Data Mining. http://www.thearling.com/workshop/workshop.htm
- (1998) Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining: Workshop on Keys to the Commercial Success of Data Mining

47
- 77951162108
- Irvine, CA, USA: University of California
- Tsang, D. (2000). Social Science Data Archive Libraries, Irvine, CA, USA: University of California. http://data.lib.uci.edu/
- (2000) Social Science Data Archive Libraries
- Tsang, D.¹

48
- 0003854151
- Reading, MA, USA: Addison-Wesley
- Tukey, J.W. (1977), Exploratory Data Analysis. Reading, MA, USA: Addison-Wesley.
- (1977) Exploratory Data Analysis
- Tukey, J.W.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.