



Volume 9, Issue 2, 1999, Pages 165-181

If Statistical Significance Tests are Broken/Misused, What Practices Should Supplement or Replace Them?

Author keywords

effect size; hypothesis tests; null hypotheses; significance tests; statistical significance



EID: 0033447390     PISSN: 09593543     EISSN: None     Source Type: Journal    
DOI: 10.1177/095935439992006     Document Type: Article
Times cited : (82)

References (82)
  • 2
    • 84925916132
    • The case against statistical significance testing
    • Carver, R. (1978). The case against statistical significance testing. Harvard Educational Review, 48, 378–399.
    • (1978) Harvard Educational Review , vol.48 , pp. 378-399
    • Carver, R.1
  • 3
    • 0039625885
    • The case against statistical significance testing, revisited
    • Carver, R. (1993). The case against statistical significance testing, revisited. Journal of Experimental Education, 61, 287–292.
    • (1993) Journal of Experimental Education , vol.61 , pp. 287-292
    • Carver, R.1
  • 4
    • 0000948488
    • Significance test or effect size?
    • Chow, S.L. (1988). Significance test or effect size? Psychological Bulletin, 103, 105–110.
    • (1988) Psychological Bulletin , vol.103 , pp. 105-110
    • Chow, S.L.1
  • 6
    • 0000626699
    • Things I have learned (so far)
    • Cohen, J. (1990). Things I have learned (so far). American Psychologist, 45, 1304–1312.
    • (1990) American Psychologist , vol.45 , pp. 1304-1312
    • Cohen, J.1
  • 7
    • 0039802908
    • The earth is round (p < .05)
    • Cohen, J. (1994). The earth is round (p < .05). American Psychologist, 49, 997–1003.
    • (1994) American Psychologist , vol.49 , pp. 997-1003
    • Cohen, J.1
  • 8
    • 0000358366
    • Logic and purpose of significance testing
    • Cortina, J.M., & Dunlap, W.P. (1997). Logic and purpose of significance testing. Psychological Methods, 2, 161–172.
    • (1997) Psychological Methods , vol.2 , pp. 161-172
    • Cortina, J.M.1    Dunlap, W.P.2
  • 9
    • 0001578394
    • Beyond the two disciplines of psychology
    • Cronbach, L.J. (1975). Beyond the two disciplines of psychology. American Psychologist, 30, 116–127.
    • (1975) American Psychologist , vol.30 , pp. 116-127
    • Cronbach, L.J.1
  • 10
    • 0002029180
    • Another look at Meehl, Lakatos, and the scientific practices of psychologists
    • Dar, R. (1987). Another look at Meehl, Lakatos, and the scientific practices of psychologists. American Psychologist, 42, 145–151.
    • (1987) American Psychologist , vol.42 , pp. 145-151
    • Dar, R.1
  • 12
    • 0020748239
    • Computer-intensive methods in statistics
    • Diaconis, P., & Efron, B. (1983). Computer-intensive methods in statistics. Scientific American, 248(5), 116–130.
    • (1983) Scientific American , vol.248 , Issue.5 , pp. 116-130
    • Diaconis, P.1    Efron, B.2
  • 13
    • 21844512087
    • Significance tests die hard: The amazing persistence of a probabilistic misconception
    • Falk, R., & Greenbaum, C.W. (1995). Significance tests die hard: The amazing persistence of a probabilistic misconception. Theory & Psychology, 5, 75–98.
    • (1995) Theory & Psychology , vol.5 , pp. 75-98
    • Falk, R.1    Greenbaum, C.W.2
  • 14
    • 0001314489
    • The appropriate use of null hypothesis testing
    • Frick, R.W. (1996). The appropriate use of null hypothesis testing. Psychological Methods, 1, 379–390.
    • (1996) Psychological Methods , vol.1 , pp. 379-390
    • Frick, R.W.1
  • 15
    • 0001658151
    • Magnitude of experimental effect and a table for its rapid estimation
    • Friedman, H. (1968). Magnitude of experimental effect and a table for its rapid estimation. Psychological Bulletin, 70, 245–251.
    • (1968) Psychological Bulletin , vol.70 , pp. 245-251
    • Friedman, H.1
  • 17
    • 84973809808
    • Policy for the unpredictable (uncertainty research and policy)
    • Glass, G.V. (1979). Policy for the unpredictable (uncertainty research and policy). Educational Researcher, 8(9), 12–14.
    • (1979) Educational Researcher , vol.8 , Issue.9 , pp. 12-14
    • Glass, G.V.1
  • 18
    • 0016437295
    • Consequences of prejudice against the null hypothesis
    • Greenwald, A.G. (1975). Consequences of prejudice against the null hypothesis. Psychological Bulletin, 82, 1–20.
    • (1975) Psychological Bulletin , vol.82 , pp. 1-20
    • Greenwald, A.G.1
  • 19
    • 0029912508
    • Effect size and p-values: What should be reported and what should be replicated?
    • Greenwald, A.G., Gonzalez, R., Harris, R.J., & Guthrie, D. (1996). Effect size and p-values: What should be reported and what should be replicated? Psychophysiology, 33, 175–183.
    • (1996) Psychophysiology , vol.33 , pp. 175-183
    • Greenwald, A.G.1    Gonzalez, R.2    Harris, R.J.3    Guthrie, D.4
  • 20
    • 0002812166
    • In praise of the null hypothesis statistical test
    • Hagen, R.L. (1997). In praise of the null hypothesis statistical test. American Psychologist, 52, 15–24.
    • (1997) American Psychologist , vol.52 , pp. 15-24
    • Hagen, R.L.1
  • 22
    • 84970501261
    • Significance tests are not enough: The role of effect size estimation in theory corroboration
    • Harris, M.J. (1991). Significance tests are not enough: The role of effect size estimation in theory corroboration. Theory & Psychology, 1, 375–382.
    • (1991) Theory & Psychology , vol.1 , pp. 375-382
    • Harris, M.J.1
  • 23
    • 84965451891
    • 3rd ed. New York: Holt, Rinehart & Winston
    • Hays, W.L. (1981). Statistics (3rd ed.). New York: Holt, Rinehart & Winston.
    • (1981) Statistics
    • Hays, W.L.1
  • 25
    • 0040233434
    • Probability, uncertainty and the practice of statistics
    • Chichester: Wiley
    • Howson, C., & Urbach, P. (1994). Probability, uncertainty and the practice of statistics. In G. Wright & P. Ayton (Eds.), Subjective probability (pp. 39–51). Chichester: Wiley.
    • (1994) Subjective probability , pp. 39-51
    • Howson, C.1    Urbach, P.2    Wright, G.3    Ayton, P.4
  • 28
    • 0011661151
    • Pseudo-orthogonal and other analysis of variance designs involving individual-differences variables
    • Humphreys, L.G., & Fleishman, A. (1974). Pseudo-orthogonal and other analysis of variance designs involving individual-differences variables. Journal of Educational Psychology, 66, 464–472.
    • (1974) Journal of Educational Psychology , vol.66 , pp. 464-472
    • Humphreys, L.G.1    Fleishman, A.2
  • 29
    • 0009179149
    • Needed: A ban on the significance test
    • Hunter, J.E. (1997). Needed: A ban on the significance test. Psychological Science, 8, 3–7.
    • (1997) Psychological Science , vol.8 , pp. 3-7
    • Hunter, J.E.1
  • 30
    • 0030517592
    • Practical significance: A concept whose time has come
    • Kirk, R. (1996). Practical significance: A concept whose time has come. Educational and Psychological Measurement, 56, 746–759.
    • (1996) Educational and Psychological Measurement , vol.56 , pp. 746-759
    • Kirk, R.1
  • 31
    • 0001223190
    • Canonical correlation analysis: A general parametric significance testing system
    • Knapp, T.R. (1978). Canonical correlation analysis: A general parametric significance testing system. Psychological Bulletin, 85, 410–416.
    • (1978) Psychological Bulletin , vol.85 , pp. 410-416
    • Knapp, T.R.1
  • 32
    • 0027081733
    • Reporting the size of effects in research studies to facilitate assessment of practical or clinical significance
    • Kraemer, H.C. (1992). Reporting the size of effects in research studies to facilitate assessment of practical or clinical significance. Psychoneuroendocrinology, 17, 527–536.
    • (1992) Psychoneuroendocrinology , vol.17 , pp. 527-536
    • Kraemer, H.C.1
  • 33
    • 0001687428
    • Improving what is published: A model in search of an editor
    • Kupfersmid, J. (1988). Improving what is published: A model in search of an editor. American Psychologist, 43, 635–642.
    • (1988) American Psychologist , vol.43 , pp. 635-642
    • Kupfersmid, J.1
  • 34
  • 36
    • 0017816556
    • Theoretical risks and tabular asterisks: Sir Karl, Sir Ronald, and the slow progress of soft psychology
    • Meehl, P.E. (1978). Theoretical risks and tabular asterisks: Sir Karl, Sir Ronald, and the slow progress of soft psychology. Journal of Consulting and Clinical Psychology, 46, 806–834.
    • (1978) Journal of Consulting and Clinical Psychology , vol.46 , pp. 806-834
    • Meehl, P.E.1
  • 37
    • 0002212028
    • The problem is epistemology, not statistics: Replace significance tests by confidence intervals and quantify accuracy of risky numerical predictions
    • Mahwah, NJ: Erlbaum
    • Meehl, P.E. (1997). The problem is epistemology, not statistics: Replace significance tests by confidence intervals and quantify accuracy of risky numerical predictions. In L.L. Harlow, S.A. Mulaik, & J.H. Steiger (Eds.), What if there were no significance tests? (pp. 391–423). Mahwah, NJ: Erlbaum.
    • (1997) What if there were no significance tests? , pp. 391-423
    • Meehl, P.E.1    Harlow, L.L.2    Mulaik, S.A.3    Steiger, J.H.4
  • 40
    • 0000426690
    • Interpretation of significance levels and effect sizes by psychological researchers
    • Nelson, N., Rosenthal, R., & Rosnow, R.L. (1986). Interpretation of significance levels and effect sizes by psychological researchers. American Psychologist, 41, 1299–1301.
    • (1986) American Psychologist , vol.41 , pp. 1299-1301
    • Nelson, N.1    Rosenthal, R.2    Rosnow, R.L.3
  • 43
    • 0011852992
    • Planning educational research: Determining the necessary sample size
    • Olejnik, S.F. (1984). Planning educational research: Determining the necessary sample size. Journal of Experimental Education, 53, 40–48.
    • (1984) Journal of Experimental Education , vol.53 , pp. 40-48
    • Olejnik, S.F.1
  • 44
    • 0004603755
    • Measures of relationship strength in occupational therapy research
    • Ottenbacher, K. (1984). Measures of relationship strength in occupational therapy research. The Occupational Therapy Journal of Research, 4, 271–285.
    • (1984) The Occupational Therapy Journal of Research , vol.4 , pp. 271-285
    • Ottenbacher, K.1
  • 48
    • 0002039024
    • Reflections on statistical and substantive significance, with a slice of replication
    • Robinson, D., & Levin, J. (1997). Reflections on statistical and substantive significance, with a slice of replication. Educational Researcher, 26(5), 21–26.
    • (1997) Educational Researcher , vol.26 , Issue.5 , pp. 21-26
    • Robinson, D.1    Levin, J.2
  • 49
    • 33746341926
    • The ‘file drawer problem’ and tolerance for null results
    • Rosenthal, R. (1979). The ‘file drawer problem’ and tolerance for null results. Psychological Bulletin, 86, 638–641.
    • (1979) Psychological Bulletin , vol.86 , pp. 638-641
    • Rosenthal, R.1
  • 50
    • 0002650263
    • Effect sizes: Pearson’s correlation, its display via the BESD, and alternative indices
    • Rosenthal, R. (1991). Effect sizes: Pearson’s correlation, its display via the BESD, and alternative indices. American Psychologist, 46, 1086–1087.
    • (1991) American Psychologist , vol.46 , pp. 1086-1087
    • Rosenthal, R.1
  • 51
    • 0002718625
    • The interpretation of levels of significance by psychological researchers
    • Rosenthal, R., & Gaito, J. (1963). The interpretation of levels of significance by psychological researchers. Journal of Psychology, 55, 33–38.
    • (1963) Journal of Psychology , vol.55 , pp. 33-38
    • Rosenthal, R.1    Gaito, J.2
  • 52
    • 0000547980
    • Statistical procedures and the justification of knowledge in psychological science
    • Rosnow, R.L., & Rosenthal, R. (1989). Statistical procedures and the justification of knowledge in psychological science. American Psychologist, 44, 1276–1284.
    • (1989) American Psychologist , vol.44 , pp. 1276-1284
    • Rosnow, R.L.1    Rosenthal, R.2
  • 53
    • 0000130677
    • The fallacy of the null hypothesis significance test
    • Rozeboom, W.W. (1960). The fallacy of the null hypothesis significance test. Psychological Bulletin, 57, 416–428.
    • (1960) Psychological Bulletin , vol.57 , pp. 416-428
    • Rozeboom, W.W.1
  • 54
    • 0023694521
    • Evaluating the clinical significance of treatment effects: Norms and normality
    • Saunders, S.M., Howard, K.I., & Newman, F.L. (1988). Evaluating the clinical significance of treatment effects: Norms and normality. Behavioral Assessment, 10, 207–218.
    • (1988) Behavioral Assessment , vol.10 , pp. 207-218
    • Saunders, S.M.1    Howard, K.I.2    Newman, F.L.3
  • 55
    • 0000683206
    • The significance of statistical significance tests in marketing research
    • Sawyer, A.G., & Peter, J.P. (1983). The significance of statistical significance tests in marketing research. Journal of Marketing Research, 20, 122–123.
    • (1983) Journal of Marketing Research , vol.20 , pp. 122-123
    • Sawyer, A.G.1    Peter, J.P.2
  • 56
    • 0000724985
    • Statistical significance testing and cumulative knowledge in psychology: Implications for the training of researchers
    • Schmidt, F.L. (1996). Statistical significance testing and cumulative knowledge in psychology: Implications for the training of researchers. Psychological Methods, 1, 115–129.
    • (1996) Psychological Methods , vol.1 , pp. 115-129
    • Schmidt, F.L.1
  • 57
    • 0001994477
    • Eight common but false objections to the discontinuation of significance testing in the analysis of research data
    • Mahwah, NJ: Erlbaum
    • Schmidt, F.L., & Hunter, J.E. (1997). Eight common but false objections to the discontinuation of significance testing in the analysis of research data. In L.L. Harlow, S.A. Mulaik, & J.H. Steiger (Eds.), What if there were no significance tests? (pp. 37–64). Mahwah, NJ: Erlbaum.
    • (1997) What if there were no significance tests? , pp. 37-64
    • Schmidt, F.L.1    Hunter, J.E.2    Harlow, L.L.3    Mulaik, S.A.4    Steiger, J.H.5
  • 58
    • 21344480569
    • Confidence intervals and the scientific method: A case for Holm on the range
    • Serlin, R.C. (1993). Confidence intervals and the scientific method: A case for Holm on the range. Journal of Experimental Education, 61, 350–360.
    • (1993) Journal of Experimental Education , vol.61 , pp. 350-360
    • Serlin, R.C.1
  • 59
    • 0002086025
    • Chance and nonsense
    • Shaver, J. (1985). Chance and nonsense. Phi Delta Kappan, 67, 57–60.
    • (1985) Phi Delta Kappan , vol.67 , pp. 57-60
    • Shaver, J.1
  • 60
    • 0007307635
    • Psychologists debate accuracy of ‘significance test’
    • Shea, C. (1996). Psychologists debate accuracy of ‘significance test’. Chronicle of Higher Education, 42, A12, A16.
    • (1996) Chronicle of Higher Education , vol.42 , pp. A12, A16
    • Shea, C.1
  • 61
    • 84997941849
    • Should significance tests be banned? Introduction to a special section exploring the pros and cons
    • Shrout, P.E. (1997). Should significance tests be banned? Introduction to a special section exploring the pros and cons. Psychological Science, 8, 1–2.
    • (1997) Psychological Science , vol.8 , pp. 1-2
    • Shrout, P.E.1
  • 62
    • 21344494087
    • Evaluating results using corrected and uncorrected effect size estimates
    • Snyder, P.A., & Lawson, S. (1993). Evaluating results using corrected and uncorrected effect size estimates. Journal of Experimental Education, 61, 334–349.
    • (1993) Journal of Experimental Education , vol.61 , pp. 334-349
    • Snyder, P.A.1    Lawson, S.2
  • 63
    • 18244378268
    • Use of tests of statistical significance and other analytic choices in a school psychology journal: Review of practices and suggested alternatives
    • Snyder, P.A., & Thompson, B. (1998). Use of tests of statistical significance and other analytic choices in a school psychology journal: Review of practices and suggested alternatives. School Psychology Quarterly, 13, 335–348.
    • (1998) School Psychology Quarterly , vol.13 , pp. 335-348
    • Snyder, P.A.1    Thompson, B.2
  • 64
    • 0007116857
    • An epistemology of practical research
    • Strike, K.A. (1979). An epistemology of practical research. Educational Researcher, 8(1), 10–16.
    • (1979) Educational Researcher , vol.8 , Issue.1 , pp. 10-16
    • Strike, K.A.1
  • 65
    • 21144479456
    • DISCSTRA: A computer program that computes bootstrap resampling estimates of descriptive discriminant analysis function and structure coefficients and group centroids
    • Thompson, B. (1992a). DISCSTRA: A computer program that computes bootstrap resampling estimates of descriptive discriminant analysis function and structure coefficients and group centroids. Educational and Psychological Measurement, 52, 905–911.
    • (1992) Educational and Psychological Measurement , vol.52 , pp. 905-911
    • Thompson, B.1
  • 66
    • 34247491887
    • Two and one-half decades of leadership in measurement and evaluation
    • Thompson, B. (1992b). Two and one-half decades of leadership in measurement and evaluation. Journal of Counseling and Development, 70, 434–438.
    • (1992) Journal of Counseling and Development , vol.70 , pp. 434-438
    • Thompson, B.1
  • 67
    • 21344491489
    • The use of statistical significance tests in research: Bootstrap and other alternatives
    • Thompson, B. (1993). The use of statistical significance tests in research: Bootstrap and other alternatives. Journal of Experimental Education, 61, 361–377.
    • (1993) Journal of Experimental Education , vol.61 , pp. 361-377
    • Thompson, B.1
  • 68
    • 0037693159
    • The concept of statistical significance testing (An ERIC/AE Clearinghouse Digest #EDO-TM-94-1)
    • (ERIC Document Reproduction Service No. ED 366 654)
    • Thompson, B. (1994a). The concept of statistical significance testing (An ERIC/AE Clearinghouse Digest #EDO-TM-94-1). Measurement Update, 4, 5–6. (ERIC Document Reproduction Service No. ED 366 654)
    • (1994) Measurement Update , vol.4 , pp. 5-6
    • Thompson, B.1
  • 70
    • 84989612692
    • The pivotal role of replication in psychological research: Empirically evaluating the replicability of sample results
    • Thompson, B. (1994c). The pivotal role of replication in psychological research: Empirically evaluating the replicability of sample results. Journal of Personality, 62, 157–176.
    • (1994) Journal of Personality , vol.62 , pp. 157-176
    • Thompson, B.1
  • 71
    • 84973805638
    • Exploring the replicability of a study’s results: Bootstrap statistics for the multivariate case
    • Thompson, B. (1995). Exploring the replicability of a study’s results: Bootstrap statistics for the multivariate case. Educational and Psychological Measurement, 55, 84–94.
    • (1995) Educational and Psychological Measurement , vol.55 , pp. 84-94
    • Thompson, B.1
  • 72
    • 79957902492
    • AERA editorial policies regarding statistical significance testing: Three suggested reforms
    • Thompson, B. (1996). AERA editorial policies regarding statistical significance testing: Three suggested reforms. Educational Researcher, 25(2), 26–30.
    • (1996) Educational Researcher , vol.25 , Issue.2 , pp. 26-30
    • Thompson, B.1
  • 73
    • 0003241872
    • Editorial policies regarding statistical significance tests: Further comments
    • Thompson, B. (1997). Editorial policies regarding statistical significance tests: Further comments. Educational Researcher, 26(5), 29–32.
    • (1997) Educational Researcher , vol.26 , Issue.5 , pp. 29-32
    • Thompson, B.1
  • 75
    • 84993729912
    • Why ‘encouraging’ effect size reporting isn’t working: The etiology of researcher resistance to changing practices
    • Paper presented at the annual meeting of the Southwest Educational Research Association, Houston, TX, January (ERIC Document Reproduction Service No. ED 416 214)
    • Thompson, B. (1998b, January). Why ‘encouraging’ effect size reporting isn’t working: The etiology of researcher resistance to changing practices. Paper presented at the annual meeting of the Southwest Educational Research Association, Houston, TX. (ERIC Document Reproduction Service No. ED 416 214)
    • (1998)
    • Thompson, B.1
  • 78
    • 0032352330
    • Statistical significance and reliability analyses in recent JCD research articles
    • Thompson, B., & Snyder, P.A. (1998). Statistical significance and reliability analyses in recent JCD research articles. Journal of Counseling and Development, 76, 436–441.
    • (1998) Journal of Counseling and Development , vol.76 , pp. 436-441
    • Thompson, B.1    Snyder, P.A.2
  • 80
    • 0027270830
    • Supplementing tests of statistical significance: Variation accounted for
    • Young, M.A. (1993). Supplementing tests of statistical significance: Variation accounted for. Journal of Speech and Hearing Research, 36, 644–656.
    • (1993) Journal of Speech and Hearing Research , vol.36 , pp. 644-656
    • Young, M.A.1
  • 81
    • 84995076601
    • Contemporary issues in the analysis of data: A survey of 551 psychologists
    • Zuckerman, M., Hodgins, H.S., Zuckerman, A., & Rosenthal, R. (1993). Contemporary issues in the analysis of data: A survey of 551 psychologists. Psychological Science, 4, 49–53.
    • (1993) Psychological Science , vol.4 , pp. 49-53
    • Zuckerman, M.1    Hodgins, H.S.2    Zuckerman, A.3    Rosenthal, R.4
  • 82
    • 84993784247
    • Would the abolition of significance testing lead to better science?
    • Paper presented at the annual meeting of the American Educational Research Association, Chicago, IL, March
    • Zwick, R. (1997, March). Would the abolition of significance testing lead to better science? Paper presented at the annual meeting of the American Educational Research Association, Chicago, IL.
    • (1997)
    • Zwick, R.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.