Volume , Issue , 2011, Pages 1-10

A practical guide for using statistical tests to assess randomized algorithms in software engineering

Author keywords

bonferroni adjustment; confidence interval; effect size; non parametric test; parametric test; statistical difference; survey; systematic review

Indexed keywords

BONFERRONI ADJUSTMENT; CONFIDENCE INTERVAL; EFFECT SIZE; NON-PARAMETRIC TEST; PARAMETRIC TEST; STATISTICAL DIFFERENCES; SYSTEMATIC REVIEW

EID: 79959871222     PISSN: 02705257     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1985793.1985795     Document Type: Conference Paper
Times cited: 886

References (60)
  • 2
    • 0035892566
    • An evolutionary approach to estimating software development projects
    • DOI 10.1016/S0950-5849(01)00193-8, PII S0950584901001938
    • J. Aguilar-Ruiz, I. Ramos, J. C. Riquelme, and M. Toro. An evolutionary approach to estimating software development projects. Information and Software Technology, 43:875-882, 2001. (Pubitemid 33050967)
    • (2001) Information and Software Technology , vol.43 , Issue.14 , pp. 875-882
    • Aguilar-Ruiz, J.S.1    Ramos, I.2    Riquelme, J.C.3    Toro, M.4
  • 4
    • 70349271023
    • Full theoretical runtime analysis of alternating variable method on the triangle classification problem
    • A. Arcuri. Full theoretical runtime analysis of alternating variable method on the triangle classification problem. In International Symposium on Search Based Software Engineering (SSBSE), pages 113-121, 2009.
    • (2009) International Symposium on Search Based Software Engineering (SSBSE) , pp. 113-121
    • Arcuri, A.1
  • 7
    • 55749099654
    • A novel co-evolutionary approach to automatic software bug fixing
    • A. Arcuri and X. Yao. A novel co-evolutionary approach to automatic software bug fixing. In IEEE Congress on Evolutionary Computation (CEC), pages 162-168, 2008.
    • (2008) IEEE Congress on Evolutionary Computation (CEC) , pp. 162-168
    • Arcuri, A.1    Yao, X.2
  • 12
    • 0000020125
    • On the origins of the .05 level of statistical significance
    • M. Cowles and C. Davis. On the origins of the .05 level of statistical significance. American Psychologist, 37(5):553-558, 1982.
    • (1982) American Psychologist , vol.37 , Issue.5 , pp. 553-558
    • Cowles, M.1    Davis, C.2
  • 15
    • 33747126100
    • A systematic review of statistical power in software engineering experiments
    • T. Dybå, V. Kampenes, and D. Sjøberg. A systematic review of statistical power in software engineering experiments. Information and Software Technology (IST), 48(8):745-755, 2006.
    • (2006) Information and Software Technology (IST) , vol.48 , Issue.8 , pp. 745-755
    • Dybå, T.1    Kampenes, V.2    Sjøberg, D.3
  • 16
    • 77956815759
    • Wilcoxon-Mann-Whitney or t-test? On assumptions for hypothesis tests and multiple interpretations of decision rules
    • M. Fay and M. Proschan. Wilcoxon-Mann-Whitney or t-test? On assumptions for hypothesis tests and multiple interpretations of decision rules. Statistics Surveys, 4:1-39, 2010.
    • (2010) Statistics Surveys , vol.4 , pp. 1-39
    • Fay, M.1    Proschan, M.2
  • 19
    • 3042527872
    • Escaping the Bonferroni iron claw in ecological studies
    • DOI 10.1111/j.0030-1299.2004.13046.x
    • L. García. Escaping the Bonferroni iron claw in ecological studies. Oikos, 105(3):657-663, 2004. (Pubitemid 38826088)
    • (2004) Oikos , vol.105 , Issue.3 , pp. 657-663
    • Garcia, L.V.1
  • 23
    • 0033564491
    • Toward evidence-based medical statistics. 1: The P value fallacy
    • S. Goodman. Toward evidence-based medical statistics. 1: The P value fallacy. Annals of Internal Medicine, 130(12):995-1004, 1999.
    • (1999) Annals of Internal Medicine , vol.130 , Issue.12 , pp. 995-1004
    • Goodman, S.1
  • 26
    • 77950626862
    • A theoretical and empirical study of search based testing: Local, global and hybrid search
    • M. Harman and P. McMinn. A theoretical and empirical study of search based testing: Local, global and hybrid search. IEEE Transactions on Software Engineering (TSE), 36(2):226-247, 2010.
    • (2010) IEEE Transactions on Software Engineering (TSE) , vol.36 , Issue.2 , pp. 226-247
    • Harman, M.1    McMinn, P.2
  • 28
    • 34648846182
    • A systematic review of effect size in software engineering experiments
    • DOI 10.1016/j.infsof.2007.02.015, PII S0950584907000195
    • V. Kampenes, T. Dybå, J. Hannay, and D. Sjøberg. A systematic review of effect size in software engineering experiments. Information and Software Technology (IST), 49(11-12):1073-1086, 2007. (Pubitemid 47464910)
    • (2007) Information and Software Technology , vol.49 , Issue.11-12 , pp. 1073-1086
    • Kampenes, V.B.1    Dyba, T.2    Hannay, J.E.3    Sjoberg, D.I.K.4
  • 32
    • 11844289599
    • A multiobjective module-order model for software quality enhancement
    • DOI 10.1109/TEVC.2004.837108
    • T. Khoshgoftaar, Y. Liu, and N. Seliya. A multiobjective module-order model for software quality enhancement. IEEE Transactions on Evolutionary Computation (TEC), 8(6):593-608, 2004. (Pubitemid 40085397)
    • (2004) IEEE Transactions on Evolutionary Computation , vol.8 , Issue.6 , pp. 593-608
    • Khoshgoftaar, T.M.1    Liu, Y.2    Seliya, N.3
  • 41
    • 3142725712
    • Search-based software test data generation: A survey
    • P. McMinn. Search-based software test data generation: A survey. Software Testing, Verification and Reliability, 14(2):105-156, 2004.
    • (2004) Software Testing, Verification and Reliability , vol.14 , Issue.2 , pp. 105-156
    • McMinn, P.1
  • 43
    • 33645833890
    • On the automatic modularization of software systems using the bunch tool
    • B. S. Mitchell and S. Mancoridis. On the automatic modularization of software systems using the bunch tool. IEEE Transactions on Software Engineering (TSE), 32(3):193-208, 2006.
    • (2006) IEEE Transactions on Software Engineering (TSE) , vol.32 , Issue.3 , pp. 193-208
    • Mitchell, B.S.1    Mancoridis, S.2
  • 45
    • 8144231087
    • A farewell to Bonferroni: The problems of low statistical power and publication bias
    • DOI 10.1093/beheco/arh107
    • S. Nakagawa. A farewell to Bonferroni: the problems of low statistical power and publication bias. Behavioral Ecology, 15(6):1044-1045, 2004. (Pubitemid 39471217)
    • (2004) Behavioral Ecology , vol.15 , Issue.6 , pp. 1044-1045
    • Nakagawa, S.1
  • 46
    • 34547900079
    • Effect size, confidence interval and statistical significance: A practical guide for biologists
    • DOI 10.1111/j.1469-185X.2007.00027.x
    • S. Nakagawa and I. Cuthill. Effect size, confidence interval and statistical significance: a practical guide for biologists. Biological Reviews, 82(4):591-605, 2007. (Pubitemid 47609922)
    • (2007) Biological Reviews , vol.82 , Issue.4 , pp. 591-605
    • Nakagawa, S.1    Cuthill, I.C.2
  • 47
    • 60449109741
    • Optimized Resource Allocation for Software Release Planning
    • A. Ngo-The and G. Ruhe. Optimized Resource Allocation for Software Release Planning. IEEE Transactions on Software Engineering (TSE), 35(1):109-123, 2009.
    • (2009) IEEE Transactions on Software Engineering (TSE) , vol.35 , Issue.1 , pp. 109-123
    • Ngo-The, A.1    Ruhe, G.2
  • 48
    • 0037297422
    • An analysis of the behavior of simplified evolutionary algorithms on trap functions
    • S. Nijssen and T. Back. An analysis of the behavior of simplified evolutionary algorithms on trap functions. IEEE Transactions on Evolutionary Computation (TEC), 7(1):11-22, 2003.
    • (2003) IEEE Transactions on Evolutionary Computation (TEC) , vol.7 , Issue.1 , pp. 11-22
    • Nijssen, S.1    Back, T.2
  • 49
    • 0032542897
    • What's wrong with Bonferroni adjustments
    • T. Perneger. What's wrong with Bonferroni adjustments. British Medical Journal, 316:1236-1238, 1998. (Pubitemid 28171344)
    • (1998) British Medical Journal , vol.316 , Issue.7139 , pp. 1236-1238
    • Perneger, T.V.1
  • 51
    • 5344244656
    • R: A Language and Environment for Statistical Computing
    • R Development Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria, 2008. ISBN 3-900051-07-0.
    • (2008) R: A Language and Environment for Statistical Computing
  • 53
    • 0028203106
    • Convergence analysis of canonical genetic algorithms
    • G. Rudolph. Convergence analysis of canonical genetic algorithms. IEEE Transactions on Neural Networks, 5(1):96-101, 1994.
    • (1994) IEEE Transactions on Neural Networks , vol.5 , Issue.1 , pp. 96-101
    • Rudolph, G.1
  • 54
    • 33745595610
    • The unequal variance t-test is an underused alternative to Student's t-test and the Mann-Whitney U test
    • DOI 10.1093/beheco/ark016
    • G. Ruxton. The unequal variance t-test is an underused alternative to Student's t-test and the Mann-Whitney U test. Behavioral Ecology, 17(4):688-690, 2006. (Pubitemid 43992440)
    • (2006) Behavioral Ecology , vol.17 , Issue.4 , pp. 688-690
    • Ruxton, G.D.1
  • 55
    • 11944273917
    • A more realistic look at the robustness and type II error properties of the t test to departures from population normality
    • S. Sawilowsky and R. Blair. A more realistic look at the robustness and type II error properties of the t test to departures from population normality. Psychological Bulletin, 111(2):352-360, 1992.
    • (1992) Psychological Bulletin , vol.111 , Issue.2 , pp. 352-360
    • Sawilowsky, S.1    Blair, R.2
  • 58
    • 0034411339
    • A critique and improvement of the CL common language effect size statistics of McGraw and Wong
    • A. Vargha and H. D. Delaney. A critique and improvement of the CL common language effect size statistics of McGraw and Wong. Journal of Educational and Behavioral Statistics, 25(2):101-132, 2000.
    • (2000) Journal of Educational and Behavioral Statistics , vol.25 , Issue.2 , pp. 101-132
    • Vargha, A.1    Delaney, H.D.2


* This information was extracted and analyzed by KISTI from Elsevier's SCOPUS database.