메뉴 건너뛰기




Volumn 116, Issue , 2016, Pages 133-145

Incorrect results in software engineering experiments: How to improve research practices

Author keywords

Controlled experiments; Empirical software engineering; Statistical hypothesis testing

Indexed keywords

PUBLISHING; SOFTWARE ENGINEERING; SURVEYS; TESTING;

EID: 84927142981     PISSN: 01641212     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.jss.2015.03.065     Document Type: Article
Times cited : (47)

References (56)
  • 3
    • 47649116254 scopus 로고    scopus 로고
    • False-positive results in cancer epidemiology: A plea for epistemological modesty
    • Bofetta P. et al. False-positive results in cancer epidemiology: a plea for epistemological modesty J. Nat. Canc. Inst. 100 14 2008 988 995
    • (2008) J. Nat. Canc. Inst. , vol.100 , Issue.14 , pp. 988-995
    • Bofetta, P.1
  • 4
    • 84876665206 scopus 로고    scopus 로고
    • Power failure: Why small sample size undermines the reliability of neuroscience
    • Button K.S. et al. Power failure: why small sample size undermines the reliability of neuroscience Nat. Rev. Neurosci. 14 5 2013 365 376
    • (2013) Nat. Rev. Neurosci. , vol.14 , Issue.5 , pp. 365-376
    • Button, K.S.1
  • 6
    • 0000626699 scopus 로고
    • Things I have learned (so far)
    • Cohen J. Things I have learned (so far) Am. Psychol. 45 12 1990 1304
    • (1990) Am. Psychol. , vol.45 , Issue.12 , pp. 1304
    • Cohen, J.1
  • 7
    • 11944272254 scopus 로고
    • A power primer
    • Cohen J. A power primer Psychol. Bull. 112 1 1992 155
    • (1992) Psychol. Bull. , vol.112 , Issue.1 , pp. 155
    • Cohen, J.1
  • 8
    • 84891940182 scopus 로고    scopus 로고
    • The new statistics why and how
    • Cumming G. The new statistics why and how Psychol. Sci. 25 1 2014 7 29
    • (2014) Psychol. Sci. , vol.25 , Issue.1 , pp. 7-29
    • Cumming, G.1
  • 9
    • 41049113064 scopus 로고    scopus 로고
    • Data splitting as a countermeasure against hypothesis fishing: With a case study of predictors for low back pain
    • Dahl F.A. et al. Data splitting as a countermeasure against hypothesis fishing: With a case study of predictors for low back pain Europ. J. Epidemiol. 23 4 2008 237 242
    • (2008) Europ. J. Epidemiol. , vol.23 , Issue.4 , pp. 237-242
    • Dahl, F.A.1
  • 10
    • 0027193829 scopus 로고
    • Misconduct in medical research
    • Dingell J.D. Misconduct in medical research N. Eng. J. Med. 328 22 1993 1610 1615
    • (1993) N. Eng. J. Med. , vol.328 , Issue.22 , pp. 1610-1615
    • Dingell, J.D.1
  • 11
    • 33747126100 scopus 로고    scopus 로고
    • A systematic review of statistical power in software engineering experiments
    • Dybå T., Kampenes V.B., Sjøberg D.I. A systematic review of statistical power in software engineering experiments Inf. Softw. Technol. 48 8 2006 745 755
    • (2006) Inf. Softw. Technol. , vol.48 , Issue.8 , pp. 745-755
    • Dybå, T.1    Kampenes, V.B.2    Sjøberg, D.I.3
  • 12
    • 19144370101 scopus 로고    scopus 로고
    • Evidence-based software engineering for practitioners
    • Dybå T., Kitchenham B., Jørgensen M. Evidence-based software engineering for practitioners IEEE Softw. 22 1 2005 58 65
    • (2005) IEEE Softw. , vol.22 , Issue.1 , pp. 58-65
    • Dybå, T.1    Kitchenham, B.2    Jørgensen, M.3
  • 13
    • 66849084202 scopus 로고    scopus 로고
    • How many scientists fabricate and falsify research? a systematic review and meta-analysis of survey data
    • Fanelli D. How many scientists fabricate and falsify research? a systematic review and meta-analysis of survey data PloS One 4 5 2009 e5738
    • (2009) PloS One , vol.4 , Issue.5 , pp. e5738
    • Fanelli, D.1
  • 14
    • 77956329449 scopus 로고    scopus 로고
    • "Positive" results increase down the hierarchy of the sciences
    • Fanelli D. "Positive" results increase down the hierarchy of the sciences PloS One 5 4 2010 e10068
    • (2010) PloS One , vol.5 , Issue.4 , pp. e10068
    • Fanelli, D.1
  • 15
    • 77956334138 scopus 로고    scopus 로고
    • Do pressures to publish increase scientists' bias? An empirical support from US States Data
    • Fanelli D. Do pressures to publish increase scientists' bias? an empirical support from US States Data PloS One 5 4 2010 e10271
    • (2010) PloS One , vol.5 , Issue.4 , pp. e10271
    • Fanelli, D.1
  • 16
    • 84856526329 scopus 로고    scopus 로고
    • Negative results are disappearing from most disciplines and countries
    • Fanelli D. Negative results are disappearing from most disciplines and countries Scientometrics 90 3 2012 891 904
    • (2012) Scientometrics , vol.90 , Issue.3 , pp. 891-904
    • Fanelli, D.1
  • 17
    • 84897614571 scopus 로고    scopus 로고
    • Research misconduct: A grand global challenge for the 21st century
    • Farthing M.J.G. Research misconduct: A grand global challenge for the 21st century J. Gastroen. Hepatol. 29 3 2014 422 427
    • (2014) J. Gastroen. Hepatol. , vol.29 , Issue.3 , pp. 422-427
    • Farthing, M.J.G.1
  • 18
    • 84858432861 scopus 로고    scopus 로고
    • Too good to be true: Publication bias in two prominent studies from experimental psychology
    • Francis G. Too good to be true: publication bias in two prominent studies from experimental psychology Psychon. Bull. Rev. 19 2 2012 151 156
    • (2012) Psychon. Bull. Rev. , vol.19 , Issue.2 , pp. 151-156
    • Francis, G.1
  • 19
    • 34247527774 scopus 로고    scopus 로고
    • Why most published research findings are false: Problems in the analysis
    • Goodman S., Greenland S. Why most published research findings are false: problems in the analysis PLoS Med. 4 4 2007 773
    • (2007) PLoS Med. , vol.4 , Issue.4 , pp. 773
    • Goodman, S.1    Greenland, S.2
  • 20
    • 76749129677 scopus 로고    scopus 로고
    • Effects of personality on pair programming
    • Hannay J.E. et al. Effects of personality on pair programming IEEE Trans. Softw. Eng. 36 1 2010 61 80
    • (2010) IEEE Trans. Softw. Eng. , vol.36 , Issue.1 , pp. 61-80
    • Hannay, J.E.1
  • 21
    • 0001815484 scopus 로고
    • Estimation of effect size under nonrandom sampling: The effects of censoring studies yielding statistically insignificant mean differences
    • Hedges L.V. Estimation of effect size under nonrandom sampling: the effects of censoring studies yielding statistically insignificant mean differences J. Educ. Behav. Stat. 9 1 1984 61 85
    • (1984) J. Educ. Behav. Stat. , vol.9 , Issue.1 , pp. 61-85
    • Hedges, L.V.1
  • 22
    • 84880811845 scopus 로고    scopus 로고
    • Why small low-powered studies are worse than large high-powered studies and how to protect against trivial findings in research: Comment on Friston (2012)
    • Ingre M. Why small low-powered studies are worse than large high-powered studies and how to protect against trivial findings in research: comment on Friston (2012) Neuroimage 81 2013 496 498
    • (2013) Neuroimage , vol.81 , pp. 496-498
    • Ingre, M.1
  • 23
    • 84879836789 scopus 로고    scopus 로고
    • Optimal type I and type II error pairs when the available sample size is fixed
    • Ioannidis J., Hozo I., Djulbegovic B. Optimal type I and type II error pairs when the available sample size is fixed J. Clin. Epidemiol. 66 8 2013 903 910
    • (2013) J. Clin. Epidemiol. , vol.66 , Issue.8 , pp. 903-910
    • Ioannidis, J.1    Hozo, I.2    Djulbegovic, B.3
  • 24
    • 33846563409 scopus 로고    scopus 로고
    • Why most published research findings are false
    • Ioannidis J.P.A. Why most published research findings are false PLoS Med. 2 8 2005 e124
    • (2005) PLoS Med. , vol.2 , Issue.8 , pp. e124
    • Ioannidis, J.P.A.1
  • 25
    • 50549104256 scopus 로고    scopus 로고
    • Why most discovered true associations are inflated
    • Ioannidis J.P.A. Why most discovered true associations are inflated Epidemiology 19 5 2008 640 648
    • (2008) Epidemiology , vol.19 , Issue.5 , pp. 640-648
    • Ioannidis, J.P.A.1
  • 26
    • 34548642946 scopus 로고    scopus 로고
    • An exploratory test for an excess of significant findings
    • Ioannidis J.P.A., Trikalinos T.A. An exploratory test for an excess of significant findings Clin. Trials 4 3 2007 245 253
    • (2007) Clin. Trials , vol.4 , Issue.3 , pp. 245-253
    • Ioannidis, J.P.A.1    Trikalinos, T.A.2
  • 27
    • 84861748584 scopus 로고    scopus 로고
    • Measuring the prevalence of questionable research practices with incentives for truth telling
    • John L.K., Loewenstein G., Prelec D. Measuring the prevalence of questionable research practices with incentives for truth telling Psychol. Sci. 23 5 2012 524 532
    • (2012) Psychol. Sci. , vol.23 , Issue.5 , pp. 524-532
    • John, L.K.1    Loewenstein, G.2    Prelec, D.3
  • 29
    • 80052484644 scopus 로고    scopus 로고
    • The role of non-exact replications in software engineering experiments
    • Juristo N., Vegas S. The role of non-exact replications in software engineering experiments Empir. Softw. Eng. 16 3 2011 295 324
    • (2011) Empir. Softw. Eng. , vol.16 , Issue.3 , pp. 295-324
    • Juristo, N.1    Vegas, S.2
  • 30
    • 34648846182 scopus 로고    scopus 로고
    • A systematic review of effect size in software engineering experiments
    • Kampenes V.B. et al. A systematic review of effect size in software engineering experiments Inf. Softw. Technol. 49 11 2007 1073 1086
    • (2007) Inf. Softw. Technol. , vol.49 , Issue.11 , pp. 1073-1086
    • Kampenes, V.B.1
  • 33
    • 84904902280 scopus 로고    scopus 로고
    • How trustworthy is the scientific literature in industrial and organizational psychology?
    • Kepes S., McDaniel M.A. How trustworthy is the scientific literature in industrial and organizational psychology? Ind. Organ. Psychol. 6 3 2013 252 268
    • (2013) Ind. Organ. Psychol. , vol.6 , Issue.3 , pp. 252-268
    • Kepes, S.1    McDaniel, M.A.2
  • 34
    • 0004029218 scopus 로고
    • Derivation of new readability formulas (automated readability index, fog count and flesch reading ease formula) for navy enlisted personnel
    • Kincaid J.P. et al. Derivation of new readability formulas (automated readability index, fog count and flesch reading ease formula) for navy enlisted personnel DTIC Document. 1975
    • (1975) DTIC Document.
    • Kincaid, J.P.1
  • 35
    • 0030517592 scopus 로고    scopus 로고
    • Practical significance: A concept whose time has come
    • Kirk R.E. Practical significance: a concept whose time has come Educ. Psychol. Meas. 56 5 1996 746 759
    • (1996) Educ. Psychol. Meas. , vol.56 , Issue.5 , pp. 746-759
    • Kirk, R.E.1
  • 36
    • 0036704729 scopus 로고    scopus 로고
    • Preliminary guidelines for empirical research in software engineering
    • Kitchenham B.A. et al. Preliminary guidelines for empirical research in software engineering IEEE Trans. Softw. Eng. 28 8 2002 721 734
    • (2002) IEEE Trans. Softw. Eng. , vol.28 , Issue.8 , pp. 721-734
    • Kitchenham, B.A.1
  • 37
    • 85004878786 scopus 로고
    • Estimating effect size: Bias resulting from the significance criterion in editorial decisions
    • Lane D.M., Dunlap W.P. Estimating effect size: bias resulting from the significance criterion in editorial decisions Br. J. Math. Stat. Psychol. 31 2 1978 107 112
    • (1978) Br. J. Math. Stat. Psychol. , vol.31 , Issue.2 , pp. 107-112
    • Lane, D.M.1    Dunlap, W.P.2
  • 40
    • 84867871106 scopus 로고    scopus 로고
    • A peculiar prevalence of p values just below.05
    • Masicampo E., Lalande D.R. A peculiar prevalence of p values just below.05 Q. J. Exp. Psychol. 65 11 2012 2271 2279
    • (2012) Q. J. Exp. Psychol. , vol.65 , Issue.11 , pp. 2271-2279
    • Masicampo, E.1    Lalande, D.R.2
  • 41
    • 0031119290 scopus 로고    scopus 로고
    • Statistical power and its subcomponents-missing and misunderstood concepts in empirical software engineering research
    • Miller J. et al. Statistical power and its subcomponents-missing and misunderstood concepts in empirical software engineering research Inf. Softw. Technol. 39 4 1997 285 295
    • (1997) Inf. Softw. Technol. , vol.39 , Issue.4 , pp. 285-295
    • Miller, J.1
  • 42
    • 84898688933 scopus 로고    scopus 로고
    • Research practices that can prevent an inflation of false-positive rates
    • Murayama K., Pekrun R., Fiedler K. Research practices that can prevent an inflation of false-positive rates Pers. Soc. Psychol. Rev. 18 2014 107 118 10.1177/1088868313496330 ISSN 1532-7957
    • (2014) Pers. Soc. Psychol. Rev. , vol.18 , pp. 107-118
    • Murayama, K.1    Pekrun, R.2    Fiedler, K.3
  • 44
    • 80055088241 scopus 로고    scopus 로고
    • Believe it or not: How much can we rely on published data on potential drug targets?
    • Prinz F., Schlange T., Asadullah K. Believe it or not: how much can we rely on published data on potential drug targets? Nat. Rev. Drug Discov. 10 9 2011 712
    • (2011) Nat. Rev. Drug Discov. , vol.10 , Issue.9 , pp. 712
    • Prinz, F.1    Schlange, T.2    Asadullah, K.3
  • 45
    • 84874429676 scopus 로고    scopus 로고
    • The ironic effect of significant results on the credibility of multiple-study articles
    • Schimmack U. The ironic effect of significant results on the credibility of multiple-study articles Psychol. Methods 17 4 2012 551 566
    • (2012) Psychol. Methods , vol.17 , Issue.4 , pp. 551-566
    • Schimmack, U.1
  • 46
    • 84903176990 scopus 로고    scopus 로고
    • Researcher bias: The use of machine learning in software defect prediction
    • Shepperd M., Bowes D., Hall T. Researcher bias: the use of machine learning in software defect prediction IEEE Trans. Softw. Eng. 40 6 2014 603 616
    • (2014) IEEE Trans. Softw. Eng. , vol.40 , Issue.6 , pp. 603-616
    • Shepperd, M.1    Bowes, D.2    Hall, T.3
  • 47
    • 80555145867 scopus 로고    scopus 로고
    • False-positive psychology undisclosed flexibility in data collection and analysis allows presenting anything as significant
    • Simmons J.P., Nelson L.D., Simonsohn U. False-positive psychology undisclosed flexibility in data collection and analysis allows presenting anything as significant Psychol. Sci. 22 11 2011 1359 1366
    • (2011) Psychol. Sci. , vol.22 , Issue.11 , pp. 1359-1366
    • Simmons, J.P.1    Nelson, L.D.2    Simonsohn, U.3
  • 48
    • 27644501818 scopus 로고    scopus 로고
    • A survey of controlled experiments in software engineering
    • Sjøberg D. et al. A survey of controlled experiments in software engineering IEEE Trans. Softw. Eng. 31 9 2005 733 753
    • (2005) IEEE Trans. Softw. Eng. , vol.31 , Issue.9 , pp. 733-753
    • Sjøberg, D.1
  • 50
    • 0030024848 scopus 로고    scopus 로고
    • False-positive results in clinical trials: Multiple significance tests and the problem of unreported comparisons
    • Tannock I.F. False-positive results in clinical trials: multiple significance tests and the problem of unreported comparisons J. Natl. Cancer Inst. 88 3-4 1996 206 207
    • (1996) J. Natl. Cancer Inst. , vol.88 , Issue.3-4 , pp. 206-207
    • Tannock, I.F.1
  • 51
    • 33745847757 scopus 로고
    • Belief in the law of small numbers
    • Tversky A., Kahneman D. Belief in the law of small numbers Psychol. Bull. 76 2 1971 105
    • (1971) Psychol. Bull. , vol.76 , Issue.2 , pp. 105
    • Tversky, A.1    Kahneman, D.2
  • 52
    • 84898460382 scopus 로고    scopus 로고
    • Why publishing everything is more effective than selective publishing of statistically significant results
    • van Assen M.A. et al. Why publishing everything is more effective than selective publishing of statistically significant results PloS One 9 1 2014 e84896
    • (2014) PloS One , vol.9 , Issue.1 , pp. e84896
    • Van Assen, M.A.1
  • 53
    • 1642295096 scopus 로고    scopus 로고
    • Assessing the probability that a positive report is false: An approach for molecular epidemiology studies
    • Wacholder S. et al. Assessing the probability that a positive report is false: an approach for molecular epidemiology studies J. Natl. Cancer Inst. 96 6 2004 434 442
    • (2004) J. Natl. Cancer Inst. , vol.96 , Issue.6 , pp. 434-442
    • Wacholder, S.1
  • 54
    • 80051813363 scopus 로고    scopus 로고
    • Statistical evidence in experimental psychology an empirical comparison using 855 t tests
    • Wetzels R. et al. Statistical evidence in experimental psychology an empirical comparison using 855 t tests Perspect. Psychol. Sci. 6 3 2011 291 298
    • (2011) Perspect. Psychol. Sci. , vol.6 , Issue.3 , pp. 291-298
    • Wetzels, R.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.