메뉴 건너뛰기




Volumn 22, Issue 1, 2010, Pages 67-80

Warning: Statistical benchmarking is addictive. Kicking the habit in machine learning

Author keywords

Algorithm evaluation; Benchmarking; Machine learning; Null hypothesis tests

Indexed keywords

ALGORITHM EVALUATION; ALGORITHM PERFORMANCE; DATA SETS; MACHINE LEARNING COMMUNITIES; MACHINE-LEARNING; NULL HYPOTHESIS; SET OF RULES;

EID: 77951200774     PISSN: 0952813X     EISSN: 13623079     Source Type: Journal    
DOI: 10.1080/09528130903010295     Document Type: Article
Times cited : (29)

References (48)
  • 1
    • 33750939148 scopus 로고    scopus 로고
    • The UCI KDD archive of large data sets for data mining research and experimentation
    • Bay, S.D., Kibler, D., Pazzani, M.J., and Smyth, P. (2000), 'The UCI KDD Archive of Large Data Sets for Data Mining Research and Experimentation', SIGKDD Explorations, 2(2), 81-85.
    • (2000) SIGKDD Explorations , vol.2 , Issue.2 , pp. 81-85
    • Bay, S.D.1    Kibler, D.2    Pazzani, M.J.3    Smyth, P.4
  • 3
    • 0031191630 scopus 로고    scopus 로고
    • The use of the area under the ROC curve in the evaluation of machine learning algorithms
    • Bradley, A.P. (1997), 'The Use of the Area Under the ROC Curve in the Evaluation of Machine Learning Algorithms', Pattern Recognition, 30(7), 1145-1159.
    • (1997) Pattern Recognition , vol.30 , Issue.7 , pp. 1145-1159
    • Bradley, A.P.1
  • 4
    • 77951168132 scopus 로고    scopus 로고
    • Physicist found guilty of misconduct
    • DOI:10.1038/news020923-9
    • Brumfiel, G. (2002). Physicist Found Guilty of Misconduct. Published online in Nature DOI:10.1038/news020923-9.
    • (2002) Published Online in Nature
    • Brumfiel, G.1
  • 8
    • 0031829312 scopus 로고    scopus 로고
    • Precise of statistical significance: Rationale, validity, and utility
    • Chow, S.L. (1998), 'Precise of Statistical Significance: Rationale, Validity, and Utility', Behavioral and Brain Sciences, 21, 169-239.
    • (1998) Behavioral and Brain Sciences , vol.21 , pp. 169-239
    • Chow, S.L.1
  • 9
    • 0039802908 scopus 로고
    • The earth is round (p4.05)
    • Cohen, J. (1994), 'The Earth is Round (p4.05)', American Psychologist, 49, 997-1003.
    • (1994) American Psychologist , vol.49 , pp. 997-1003
    • Cohen, J.1
  • 10
    • 25844525642 scopus 로고
    • How evaluation guides ai research
    • Cohen, P., and Howe, A. (1988), 'How Evaluation Guides AI Research', AI Magazine, 9(4), 35-43.
    • (1988) AI Magazine , vol.9 , Issue.4 , pp. 35-43
    • Cohen, P.1    Howe, A.2
  • 11
    • 33846460845 scopus 로고    scopus 로고
    • Breakthrough of the year: Breakdown of the year: Scientific fraud
    • Couzin, J. (2006), 'Breakthrough of the Year: Breakdown of the Year: Scientific Fraud', Science, 314(5807), p. 1853.
    • (2006) Science , vol.314 , Issue.5807 , pp. 1853
    • Couzin, J.1
  • 13
    • 77951187255 scopus 로고    scopus 로고
    • Drummond, C., Elazmeh, W., and Japkowicz, N. (eds.) American Association for Artificial Intelligence Technical Report WS-06-06, Menlo Park, CA, USA Drummond, C., Elazmeh, W., Japkowicz, N., and Macskassy, S.A. (eds.) American Association for Artificial Intelligence. Technical Report WS-07-105 Menlo Park, CA, USA
    • Drummond, C., Elazmeh, W., and Japkowicz, N. (eds.) (2006), Proceedings of the Twenty-first National Conference on Artificial Intelligence: Workshop on Evaluation Methods for Machine Learning. American Association for Artificial Intelligence Technical Report WS-06-06, Menlo Park, CA, USA.
    • (2006) Proceedings of the Twenty-first National Conference on Artificial Intelligence: Workshop on Evaluation Methods for Machine Learning
  • 16
    • 33748991193 scopus 로고    scopus 로고
    • Cost curves: An improved method for visualizing classifier performance
    • DOI 10.1007/s10994-006-8199-5
    • Drummond, C., and Holte, R.C. (2006), 'Cost Curves: An Improved Method for Visualizing Classifier Performance', Machine Learning, 65(1), 95-130. (Pubitemid 44451195)
    • (2006) Machine Learning , vol.65 , Issue.1 , pp. 95-130
    • Drummond, C.1    Holte, R.C.2
  • 17
    • 2342622786 scopus 로고    scopus 로고
    • Leave-one-out error, stability, and generalisation of voting combination of classifiers
    • Evgeniou, T., Pontil, M., and Elisseeff, A. (2004), 'Leave-one-out Error, Stability, and Generalisation of Voting Combination of Classifiers', Machine Learning, 55(1), 71-97.
    • (2004) Machine Learning , vol.55 , Issue.1 , pp. 71-97
    • Evgeniou, T.1    Pontil, M.2    Elisseeff, A.3
  • 19
    • 77951181351 scopus 로고    scopus 로고
    • Scientists behaving badly
    • DOI:10.1038/news040301-9
    • Giles, J. (2004). Scientists Behaving Badly. Published online in Nature DOI:10.1038/news040301-9.
    • (2004) Published Online in Nature
    • Giles, J.1
  • 21
    • 0002812166 scopus 로고    scopus 로고
    • In praise of the null hypothesis statistical test
    • Hagen, R.L. (1997), 'In Praise of the Null Hypothesis Statistical Test', American Psychologist, 52, 15-24.
    • (1997) American Psychologist , vol.52 , pp. 15-24
    • Hagen, R.L.1
  • 22
    • 84993697599 scopus 로고
    • The notion of the hypothetical universe
    • (Chap. 4), eds. Denton E. Morrison and Ramon E. Henkel, Chicago: Aldine
    • Hagood, M.J. (1941), 'The Notion of the Hypothetical Universe', in The Significance Test Controversy: A Reader (Chap. 4), eds. Denton E. Morrison and Ramon E. Henkel, Chicago: Aldine, pp. 65-78.
    • (1941) The Significance Test Controversy: A Reader , pp. 65-78
    • Hagood, M.J.1
  • 23
    • 33745886270 scopus 로고    scopus 로고
    • Classifier technology and the illusion of progress
    • Hand, D.J. (2006), 'Classifier Technology and the Illusion of Progress', Statistical Science, 21(1), 1-15.
    • (2006) Statistical Science , vol.21 , Issue.1 , pp. 1-15
    • Hand, D.J.1
  • 24
    • 0003704318 scopus 로고    scopus 로고
    • Irvine, CA, USA: Department of Information and Computer Science, University of California
    • Hettich, S., and Bay, S.D. (1999), The UCI KDD Archive. Irvine, CA, USA: Department of Information and Computer Science, University of California. http://kdd.ics.uci.edu
    • (1999) The UCI KDD Archive
    • Hettich, S.1    Bay, S.D.2
  • 25
    • 0027580356 scopus 로고
    • Very simple classification rules perform well on most commonly used datasets
    • Holte, R.C. (1993), 'Very Simple Classification Rules Perform Well on Most Commonly Used Datasets', Machine Learning, 11(1), 63-91.
    • (1993) Machine Learning , vol.11 , Issue.1 , pp. 63-91
    • Holte, R.C.1
  • 27
    • 33845990606 scopus 로고    scopus 로고
    • Why question machine learning evaluation methods? (An illustrative review of the shortcomings of current methods)
    • Evaluation Methods for Machine Learning - Papers from the AAAI Workshop, Technical Report
    • Japkowicz, N. (2006), 'Why Question Machine Learning Evaluation Methods? (An illustrative review of the shortcomings of current methods)', in Proceedings of the Twenty-First National Conference on Artificial Intelligence: Workshop on Evaluation Methods for Machine Learning, AAAI Technical Report WS-06-06, pp. 6-11. (Pubitemid 46053316)
    • (2006) AAAI Workshop - Technical Report , vol.WS-06-06 , pp. 6-11
    • Japkowicz, N.1
  • 29
    • 84974776937 scopus 로고
    • Guest editor's introduction: The comprehensibility manifesto
    • Kodratoff, Y. (1994), 'Guest Editor's Introduction: The Comprehensibility Manifesto', AI Communications, 7(2), 83-85.
    • (1994) AI Communications , vol.7 , Issue.2 , pp. 83-85
    • Kodratoff, Y.1
  • 31
    • 1642452775 scopus 로고
    • Machine learning as an experimental science
    • Langley, P. (1988), 'Machine Learning as an Experimental Science', Machine Learning, 3, 5-8.
    • (1988) Machine Learning , vol.3 , pp. 5-8
    • Langley, P.1
  • 32
    • 10344252975 scopus 로고
    • Pittsburgh, PA, USA: Department of Statistics, Carnegie Mellon University.
    • Meyer, M., and Vlachos, P. (1989), Statlib. Pittsburgh, PA, USA: Department of Statistics, Carnegie Mellon University. http://lib.stat.cmu.edu/
    • (1989) Statlib
    • Meyer, M.1    Vlachos, P.2
  • 34
    • 0000942050 scopus 로고
    • A theory and methodology of inductive learning
    • Michalski, R.S. (1983), 'A Theory and Methodology of Inductive Learning', Artificial Intelligence, 20, 111-161.
    • (1983) Artificial Intelligence , vol.20 , pp. 111-161
    • Michalski, R.S.1
  • 39
    • 6144253216 scopus 로고
    • Strong inference
    • Platt, J.R. (1964), 'Strong Inference', Science, 146(3642), 347-353.
    • (1964) Science , vol.146 , Issue.3642 , pp. 347-353
    • Platt, J.R.1
  • 40
    • 0035283313 scopus 로고    scopus 로고
    • Robust classification for imprecise environments
    • DOI 10.1023/A:1007601015854
    • Provost, F., and Fawcett, T. (2001), 'Robust Classification for Imprecise Environments', Machine Learning, 42, 203-231. (Pubitemid 32188799)
    • (2001) Machine Learning , vol.42 , Issue.3 , pp. 203-231
    • Provost, F.1    Fawcett, T.2
  • 42
    • 0032001170 scopus 로고    scopus 로고
    • Learning in the 'Real World
    • Saitta, L., and Neri, F. (1998), 'Learning in the 'Real World', Machine Learning, 30(2-3), 133-163.
    • (1998) Machine Learning , vol.30 , Issue.2-3 , pp. 133-163
    • Saitta, L.1    Neri, F.2
  • 43
    • 27144463192 scopus 로고    scopus 로고
    • On comparing classifiers: Pitfalls to avoid and a recommended approach
    • Salzberg, S.L. (1997), 'On Comparing Classifiers: Pitfalls to Avoid and a Recommended Approach', Data Mining and Knowledge Discovery, 1, 317-327.
    • (1997) Data Mining and Knowledge Discovery , vol.1 , pp. 317-327
    • Salzberg, S.L.1
  • 45
    • 0029350748 scopus 로고
    • Artificial intelligence: An empirical science
    • Simon, H.A. (1995), 'Artificial Intelligence: An Empirical Science', Artificial Intelligence, 77, 95-127.
    • (1995) Artificial Intelligence , vol.77 , pp. 95-127
    • Simon, H.A.1
  • 47


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.