메뉴 건너뛰기




Volumn 1, Issue , 2005, Pages 85-105

Agreement Statistics: Kappa Coefficients in Medical Research

Author keywords

Consensus; Kappa; Reliability; Validity

Indexed keywords

CATEGORICAL VARIABLES; CONSENSUS; INTRA CLASS; KAPPA; KAPPA COEFFICIENT; MEDICAL RESEARCH; MISLEADING INFORMATIONS; VALIDITY;

EID: 84879606023     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1002/0470023678.ch1c     Document Type: Chapter
Times cited : (10)

References (66)
  • 1
    • 58149409073 scopus 로고
    • Large sample variance of kappa in the case of different sets of raters
    • Fleiss JL, Nee JCM, Landis JR. Large sample variance of kappa in the case of different sets of raters. Psychological Bulletin 1979; 86:974-977.
    • (1979) Psychological Bulletin , vol.86 , pp. 974-977
    • Fleiss, J.L.1    Nee, J.C.M.2    Landis, J.R.3
  • 2
    • 33748063869 scopus 로고
    • Reliability of content analysis: the case of nominal scale coding
    • Scott WA. Reliability of content analysis: the case of nominal scale coding. Public Opinion Quarterly 1955; 321-325.
    • (1955) Public Opinion Quarterly , pp. 321-325
    • Scott, W.A.1
  • 4
    • 0021952883 scopus 로고
    • A bibliography of publications on observer variability
    • Feinstein AR. A bibliography of publications on observer variability. Journal of Chronic Diseases 1985; 38: 619-632.
    • (1985) Journal of Chronic Diseases , vol.38 , pp. 619-632
    • Feinstein, A.R1
  • 6
    • 0017152497 scopus 로고
    • Carpenter WT. On the methods and theory of reliability
    • Bartko JJ, Carpenter WT. On the methods and theory of reliability. Journal of Nervous and Mental Disease 1976; 163:307-317.
    • (1976) Journal of Nervous and Mental Disease , vol.163 , pp. 307-317
    • Bartko, J.J.1
  • 8
    • 0016750401 scopus 로고
    • Measuring agreement between two judges on the presence or absence of a trait
    • Fleiss JL. Measuring agreement between two judges on the presence or absence of a trait. Biometrics 1975; 31:651- 659.
    • (1975) Biometrics , vol.31 , pp. 651-659
    • Fleiss, J.L.1
  • 9
    • 84973728301 scopus 로고
    • A comparison of three indexes of agreement between observers: proportion of agreement, G-index, and kappa
    • Green SB. A comparison of three indexes of agreement between observers: proportion of agreement, G-index, and kappa. Educational and Psychological Measurement 1981; 41:1069 -1072.
    • (1981) Educational and Psychological Measurement , vol.41 , pp. 1069-1072
    • Green, S.B.1
  • 10
    • 0024240106 scopus 로고
    • Kappa coeficients in epidemiology: an appraisal of a reappaisal
    • Kraemer HC, Bloch DA. Kappa coeficients in epidemiology: an appraisal of a reappaisal. Journal of Clinical Epidemiology 1988; 41:959-968.
    • (1988) Journal of Clinical Epidemiology , vol.41 , pp. 959-968
    • Kraemer, H.C.1    Bloch, D.A.2
  • 11
    • 84995012914 scopus 로고
    • A review of statistical methods in the analysis of data arising from observer reliability studies (Part I)
    • Landis JR, Koch GG. A review of statistical methods in the analysis of data arising from observer reliability studies (Part I). Statistica Neerlandica 1975; 29:101-123.
    • (1975) Statistica Neerlandica , vol.29 , pp. 101-123
    • Landis, J.R.1    Koch, G.G.2
  • 12
    • 0001587595 scopus 로고
    • Measures of response agreement for qualitative data: some generalizations and alternatives
    • Light RJ. Measures of response agreement for qualitative data: some generalizations and alternatives. Psychological Bulletin 1971; 76:365-377.
    • (1971) Psychological Bulletin , vol.76 , pp. 365-377
    • Light, R.J.1
  • 14
    • 0031716722 scopus 로고    scopus 로고
    • Measurement reliability and agreement in psychiatry
    • Shrout PE. Measurement reliability and agreement in psychiatry. Statistical Methods in Medical Research 1998; 7:301-317.
    • (1998) Statistical Methods in Medical Research , vol.7 , pp. 301-317
    • Shrout, P.E.1
  • 18
    • 0034095523 scopus 로고    scopus 로고
    • Bias and prevalence effects on kappa viewed in terms of sensitivity and specificity
    • Hoehler FK. Bias and prevalence effects on kappa viewed in terms of sensitivity and specificity. Journal of Clinical Epidemiology 2000; 53:499 -503.
    • (2000) Journal of Clinical Epidemiology , vol.53 , pp. 499-503
    • Hoehler, F.K.1
  • 19
    • 0029935469 scopus 로고    scopus 로고
    • Behavior and interpretation of the k statistic: resolution of the two paradoxes
    • Lantz CA, Nebenzahl E. Behavior and interpretation of the k statistic: resolution of the two paradoxes. Journal of Clinical Epidemiology 1996; 49:431- 434.
    • (1996) Journal of Clinical Epidemiology , vol.49 , pp. 431-434
    • Lantz, C.A.1    Nebenzahl, E.2
  • 20
    • 0028043395 scopus 로고
    • Modelling observer agreement-an alternative to kappa
    • May SM. Modelling observer agreement-an alternative to kappa. Journal of Clinical Epidemiology 1994; 47:1315 -1324.
    • (1994) Journal of Clinical Epidemiology , vol.47 , pp. 1315-1324
    • May, S.M.1
  • 21
    • 3343019470 scopus 로고
    • Measuring nominal scale agreement among many raters
    • Fleiss JL. Measuring nominal scale agreement among many raters. Psychological Bulletin 1971; 76:378-382.
    • (1971) Psychological Bulletin , vol.76 , pp. 378-382
    • Fleiss, J.L.1
  • 22
    • 0019025893 scopus 로고
    • Extensions of the kappa coeficient
    • Kraemer HC. Extensions of the kappa coeficient. Biometrics 1980; 36:207 -216.
    • (1980) Biometrics , vol.36 , pp. 207-216
    • Kraemer, H.C.1
  • 23
    • 0024466276 scopus 로고
    • On assessing interrater agreement for multiple attribute responses
    • Kupper LL. On assessing interrater agreement for multiple attribute responses. Biometrics 1989; 45:957-967.
    • (1989) Biometrics , vol.45 , pp. 957-967
    • Kupper, L.L.1
  • 24
    • 0017360990 scopus 로고
    • The measurement of observer agreement for categorical data
    • Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics 1977; 33: 159-174.
    • (1977) Biometrics , vol.33 , pp. 159-174
    • Landis, J.R.1    Koch, G.G.2
  • 25
    • 0018486508 scopus 로고
    • Agreement measurement and the judgment process
    • Janes CL. Agreement measurement and the judgment process. Journal of Nervous and Mental Disease 1979; 167:343-347.
    • (1979) Journal of Nervous and Mental Disease , vol.167 , pp. 343-347
    • Janes, C.L.1
  • 26
    • 0021845522 scopus 로고
    • A proposed solution to the base rate problem in the kappa statistic
    • Spitznagel EL, Helzer JE. A proposed solution to the base rate problem in the kappa statistic. Archives of General Psychiatry 1985; 42:725-728.
    • (1985) Archives of General Psychiatry , vol.42 , pp. 725-728
    • Spitznagel, E.L.1    Helzer, J.E.2
  • 27
    • 0001513782 scopus 로고
    • A computer program for determining the significance of the difference betweeen pairs of independently derived values of kappa or weighted kappa
    • Cicchetti DV, Heavens RJ. A computer program for determining the significance of the difference betweeen pairs of independently derived values of kappa or weighted kappa. Educational and Psychological Measurement 1981; 41:189 -193.
    • (1981) Educational and Psychological Measurement , vol.41 , pp. 189-193
    • Cicchetti, D.V.1    Heavens, R.J.2
  • 28
    • 0023188960 scopus 로고
    • Sample size requirements for reliability studies
    • Donner A, Eliasziw M. Sample size requirements for reliability studies. Statistics in Medicine 1987; 6: 441-448.
    • (1987) Statistics in Medicine , vol.6 , pp. 441-448
    • Donner, A.1    Eliasziw, M.2
  • 29
    • 0030003468 scopus 로고    scopus 로고
    • Testing the homogeneity of kappa statistics
    • Donner A, Eliasziw M, Klar N. Testing the homogeneity of kappa statistics. Biometrics 1996; 52:176 -183.
    • (1996) Biometrics , vol.52 , pp. 176 -183
    • Donner, A.1    Eliasziw, M.2    Klar, N.3
  • 30
    • 0031897109 scopus 로고    scopus 로고
    • Sample size requirements for the comparison of two or more coeficients of inter-observer agreement
    • Donner A. Sample size requirements for the comparison of two or more coeficients of inter-observer agreement. Statistics in Medicine 1998; 17:1157-1168.
    • (1998) Statistics in Medicine , vol.17 , pp. 1157-1168
    • Donner, A.1
  • 31
    • 0001586045 scopus 로고
    • Ramifications of a population model for k as a coeficient of reliability
    • Kraemer HC. Ramifications of a population model for k as a coeficient of reliability. Psychometrika 1979; 44:461- 472.
    • (1979) Psychometrika , vol.44 , pp. 461-472
    • Kraemer, H.C.1
  • 32
    • 0025144804 scopus 로고
    • Maximum likelihood estimation of agreeement in the constant predictive probability model, and its relation to Cohen's kappa
    • Aickin M. Maximum likelihood estimation of agreeement in the constant predictive probability model, and its relation to Cohen's kappa. Biometrics 1990; 46:293-302.
    • (1990) Biometrics , vol.46 , pp. 293-302
    • Aickin, M.1
  • 33
    • 0017612723 scopus 로고
    • Coeficients of agreement between observers and their interpretations
    • Maxwell AE. Coeficients of agreement between observers and their interpretations. British Journal of Psychiatry 1977; 130:79-83.
    • (1977) British Journal of Psychiatry , vol.130 , pp. 79-83
    • Maxwell, A.E.1
  • 34
    • 0001234460 scopus 로고
    • Estimating false alarms and missed events from interobserver agreement: comment on Kaye
    • Kraemer HC. Estimating false alarms and missed events from interobserver agreement: comment on Kaye. Psychological Bulletin 1982; 92:749-754.
    • (1982) Psychological Bulletin , vol.92 , pp. 749-754
    • Kraemer, H.C.1
  • 35
  • 36
    • 0003459542 scopus 로고
    • Statistical Theories of Mental Test Scores
    • Addison-Wesley: Reading, MA
    • Lord FM, Novick MR. Statistical Theories of Mental Test Scores. Addison-Wesley: Reading, MA, 1968.
    • (1968)
    • Lord, F.M.1    Novick, M.R.2
  • 37
    • 0026957836 scopus 로고
    • Measurement of reliability for categorical data in medical research
    • Kraemer HC. Measurement of reliability for categorical data in medical research. Statistical Methods in Medical Research 1992; 1:183 -199.
    • (1992) Statistical Methods in Medical Research , vol.1 , pp. 183-199
    • Kraemer, H.C.1
  • 39
    • 0009747154 scopus 로고
    • Reliability of multiple classifications
    • Huynh H. Reliability of multiple classifications. Psychometrika 1978; 43:317-325.
    • (1978) Psychometrika , vol.43 , pp. 317-325
    • Huynh, H.1
  • 40
    • 58149412516 scopus 로고
    • Weighted kappa: nominal scale agreement with provision for scaled disagreement or partial credit
    • Cohen J. Weighted kappa: nominal scale agreement with provision for scaled disagreement or partial credit. Psychological Bulletin 1968; 70:213-229.
    • (1968) Psychological Bulletin , vol.70 , pp. 213-229
    • Cohen, J.1
  • 42
    • 0030971320 scopus 로고    scopus 로고
    • A hierarchical approach to inferences concerning interobserver agreement for multinomial data
    • Donner A, Eliasziw MA. A hierarchical approach to inferences concerning interobserver agreement for multinomial data. Statistics in Medicine 1997; 16:1097 -1106.
    • (1997) Statistics in Medicine , vol.16 , pp. 1097-1106
    • Donner, A.1    Eliasziw, M.A.2
  • 43
    • 0003877646 scopus 로고
    • Statistical Methods For Rates and Proportions
    • Wiley: New York
    • Fleiss JL. Statistical Methods For Rates and Proportions. Wiley: New York, 1981.
    • (1981)
    • Fleiss, J.L.1
  • 44
    • 84965886444 scopus 로고
    • The equivalence of weighted kappa and the intraclass correlation coeficient as measures of reliability
    • Fleiss JL, Cohen J. The equivalence of weighted kappa and the intraclass correlation coeficient as measures of reliability. Educational and Psychological Measurement 1973; 33:613-619.
    • (1973) Educational and Psychological Measurement , vol.33 , pp. 613-619
    • Fleiss, J.L.1    Cohen, J.2
  • 45
    • 0019978807 scopus 로고
    • Jackknifing functions of multinomial frequencies, with an application to a measure of concordance
    • Fleiss JL, Davies M. Jackknifing functions of multinomial frequencies, with an application to a measure of concordance. American Journal of Epidemiology 1982; 115:841- 845.
    • (1982) American Journal of Epidemiology , vol.115 , pp. 841-845
    • Fleiss, J.L.1    Davies, M.2
  • 46
    • 0026794310 scopus 로고
    • A goodness-of-fit approach to inference procedures for the kappa statistics: confidence interval construction, significance-testing and sample size estimation
    • Donner A, Eliasziw M. A goodness-of-fit approach to inference procedures for the kappa statistics: confidence interval construction, significance-testing and sample size estimation. Statistics in Medicine 1992; 11:1511-1519.
    • (1992) Statistics in Medicine , vol.11 , pp. 1511-1519
    • Donner, A.1    Eliasziw, M.2
  • 47
    • 0024513436 scopus 로고
    • 2x2 kappa coeficients: measures of agreement or association
    • Bloch DA, Kraemer HC. 2x2 kappa coeficients: measures of agreement or association. Biometrics 1989; 45:269 -287.
    • (1989) Biometrics , vol.45 , pp. 269-287
    • Bloch, D.A.1    Kraemer, H.C.2
  • 48
    • 0017697784 scopus 로고
    • A one-way components of variance model for categorical data
    • Landis JR, Koch GG. A one-way components of variance model for categorical data. Biometrics 1977; 33: 671-679.
    • (1977) Biometrics , vol.33 , pp. 671-679
    • Landis, J.R.1    Koch, G.G.2
  • 49
    • 0000165343 scopus 로고
    • Standard error of the kappa statistic
    • Hanley JA. Standard error of the kappa statistic. Psychological Bulletin 1987; 102:315-321.
    • (1987) Psychological Bulletin , vol.102 , pp. 315-321
    • Hanley, J.A.1
  • 50
    • 0003493777 scopus 로고
    • Statistical Methods for Research Workers
    • 2nd edn. Oliver & Boyd: London
    • Fisher RA. Statistical Methods for Research Workers, 2nd edn. Oliver & Boyd: London, 1928.
    • (1928)
    • Fisher, R.A.1
  • 51
    • 0002344794 scopus 로고
    • Bootstrap methods: another look at the jackknife
    • Efron B. Bootstrap methods: another look at the jackknife. Annals of Statistics 1979; 7:1-26.
    • (1979) Annals of Statistics , vol.7 , pp. 1-26
    • Efron, B.1
  • 52
    • 0009680641 scopus 로고
    • Confidence intervals for the interrater agreement measure kappa
    • Flack VF. Confidence intervals for the interrater agreement measure kappa. Communications in Statistics- Theory and Methods 1987; 16:953-968.
    • (1987) Communications in Statistics- Theory and Methods , vol.16 , pp. 953-968
    • Flack, V.F.1
  • 54
    • 0034654267 scopus 로고    scopus 로고
    • Interval estimation for Cohen's kappa as a measure of agreement
    • Blackman NJ-N, Koval JJ. Interval estimation for Cohen's kappa as a measure of agreement. Statistics in Medicine 2000; 19:723-741.
    • (2000) Statistics in Medicine , vol.19 , pp. 723-741
    • Blackman, N.J.-N.1    Koval, J.J.2
  • 55
    • 0031027246 scopus 로고    scopus 로고
    • Hypothesis testing and effect size estimation in clinical trials
    • Borenstein M. Hypothesis testing and effect size estimation in clinical trials. Annals of Allergy, Asthma, and Immunology 1997; 78:5 -16.
    • (1997) Annals of Allergy, Asthma, and Immunology , vol.78 , pp. 5-16
    • Borenstein, M.1
  • 56
    • 0003934402 scopus 로고
    • Evaluating Medical Tests: Objective and Quantitative Guidelines
    • Sage Publications: Newbury Park, CA
    • Kraemer HC. Evaluating Medical Tests: Objective and Quantitative Guidelines. Sage Publications: Newbury Park, CA, 1992.
    • (1992)
    • Kraemer, H.C.1
  • 57
    • 84973847627 scopus 로고
    • Kappa, measures of marginal symmetry and intraclass correlations
    • Collis GM. Kappa, measures of marginal symmetry and intraclass correlations. Educational and Psychological Measurement 1985; 45:55-62.
    • (1985) Educational and Psychological Measurement , vol.45 , pp. 55-62
    • Collis, G.M.1
  • 59
    • 84965920423 scopus 로고
    • Testing patterned hypotheses in multi-way contingency tables using weighted kappa and weighted chi square
    • Ross DC. Testing patterned hypotheses in multi-way contingency tables using weighted kappa and weighted chi square. Educational and Psychological Measurement 1977; 37:291- 307.
    • (1977) Educational and Psychological Measurement , vol.37 , pp. 291-307
    • Ross, D.C.1
  • 61
    • 0003802343 scopus 로고
    • Classification and Regression Trees
    • Wadsworth & Brooks/Cole Advanced Books & Software: Monterey, CA
    • Breiman L, Friedman JH, Olshen RA, Stone CJ. Classification and Regression Trees. Wadsworth & Brooks/Cole Advanced Books & Software: Monterey, CA, 1984.
    • (1984)
    • Breiman, L.1    Friedman, J.H.2    Olshen, R.A.3    Stone, C.J.4
  • 63
    • 0026538818 scopus 로고
    • How many raters? Toward the most reliable diagnostic consensus
    • Kraemer HC. How many raters? Toward the most reliable diagnostic consensus. Statistics in Medicine 1992; 11:317-331.
    • (1992) Statistics in Medicine , vol.11 , pp. 317-331
    • Kraemer, H.C.1
  • 64
    • 0031436477 scopus 로고    scopus 로고
    • What is the 'right' statistical measure of twin concordance (or diagnostic reliability and validity)?
    • Kraemer HC. What is the 'right' statistical measure of twin concordance (or diagnostic reliability and validity)? Archives of General Psychiatry 1997; 54:1121-1124.
    • (1997) Archives of General Psychiatry , vol.54 , pp. 1121-1124
    • Kraemer, H.C.1
  • 65
    • 0029065656 scopus 로고
    • Statistical issues in assessing comorbidity
    • Kraemer HC. Statistical issues in assessing comorbidity. Statistics in Medicine 1995; 14:721-733.
    • (1995) Statistics in Medicine , vol.14 , pp. 721-733
    • Kraemer, H.C.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.