메뉴 건너뛰기




Volumn 11, Issue 1, 2015, Pages 13-22

Mistakes and how to avoid mistakes in using intercoder reliability indices

Author keywords

Content analysis; Intercoder reliability; Misuse

Indexed keywords


EID: 84924625276     PISSN: 16141881     EISSN: 16142241     Source Type: Journal    
DOI: 10.1027/1614-2241/a000086     Document Type: Article
Times cited : (93)

References (55)
  • 1
    • 57349126313 scopus 로고    scopus 로고
    • Inter-coder agreement for computational linguistics
    • Artstein, R., & Poesio, M. (2008). Inter-coder agreement for computational linguistics. Computational Linguistics, 34, 555-596. doi: 10.1162/coli.07-034-R2v
    • (2008) Computational Linguistics , vol.34 , pp. 555-596
    • Artstein, R.1    Poesio, M.2
  • 2
    • 0040822661 scopus 로고
    • Communications through limited-response questioning
    • Bennett, E. M., Alpert, R., & Goldstein, A. C. (1954). Communications through limited-response questioning. Public Opinion Quarterly, 18, 303-308. doi: 10.1086/266520
    • (1954) Public Opinion Quarterly , vol.18 , pp. 303-308
    • Bennett, E.M.1    Alpert, R.2    Goldstein, A.C.3
  • 3
    • 0032884749 scopus 로고    scopus 로고
    • Measuring agreement in method comparison studies
    • Bland, J. M., & Altman, D. G. (1999). Measuring agreement in method comparison studies. Statistical Methods in Medical Research, 8, 135-160. doi: 10.1177/096228029900800204
    • (1999) Statistical Methods in Medical Research , vol.8 , pp. 135-160
    • Bland, J.M.1    Altman, D.G.2
  • 4
    • 0001057572 scopus 로고
    • Coefficient kappa: Some uses, misuses, and alternatives
    • Brennan, R., & Prediger, D. (1981). Coefficient kappa: Some uses, misuses, and alternatives. Educational and Psychological Measurement, 41, 687-699. doi: 10.1177/001316448104100307
    • (1981) Educational and Psychological Measurement , vol.41 , pp. 687-699
    • Brennan, R.1    Prediger, D.2
  • 6
    • 33751224822 scopus 로고
    • Convergent and discriminant validation by the multitrait-multimethod matrix
    • Campbell, D., & Fiske, D. (1959). Convergent and discriminant validation by the multitrait-multimethod matrix. Psychological Bulletin, 56, 81-105. doi: 10.1037/h0046016
    • (1959) Psychological Bulletin , vol.56 , pp. 81-105
    • Campbell, D.1    Fiske, D.2
  • 7
    • 84965150247 scopus 로고    scopus 로고
    • March. (1.3-4th ed.). Vienna, Austria: R Foundation for Statistical Computing
    • Canty, A., & Ripley, B. (2012, March). Boot: Bootstrap functions [Computer software manual] (1.3-4th ed.). Vienna, Austria: R Foundation for Statistical Computing. Retrieved from http://cran.r-project.org/web/packages/boot/index.html
    • (2012) Boot: Bootstrap Functions [Computer Software Manual]
    • Canty, A.1    Ripley, B.2
  • 8
    • 0025281244 scopus 로고
    • High agreement but low kappa: II. Resolving the paradoxes
    • Cicchetti, D., & Feinstein, A. (1990). High agreement but low kappa: II. Resolving the paradoxes. Journal of Clinical Epidemiology, 43, 551-558. doi: 10.1016/0895-4356(90)90159-M
    • (1990) Journal of Clinical Epidemiology , vol.43 , pp. 551-558
    • Cicchetti, D.1    Feinstein, A.2
  • 9
    • 84973587732 scopus 로고
    • A coefficient of agreement for nominal scales
    • Cohen, J. (1960). A coefficient of agreement for nominal scales. Educational and Psychological Measurement, 20, 37-46. doi: 10.1177/001316446002000104
    • (1960) Educational and Psychological Measurement , vol.20 , pp. 37-46
    • Cohen, J.1
  • 10
    • 58149412516 scopus 로고
    • Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit
    • Cohen, J. (1968). Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit. Psychological Bulletin, 70, 213-220. doi: 10.1037/h0026256
    • (1968) Psychological Bulletin , vol.70 , pp. 213-220
    • Cohen, J.1
  • 11
    • 0001059534 scopus 로고
    • Integration and generalization of kappas for multiple raters
    • Conger, A. (1980). Integration and generalization of kappas for multiple raters. Psychological Bulletin, 88, 322-328. doi: 10.1037/0033-2909.88.2.322
    • (1980) Psychological Bulletin , vol.88 , pp. 322-328
    • Conger, A.1
  • 12
    • 84977690570 scopus 로고
    • Theory of generalizability: A liberalization of reliability theory
    • Cronbach, L., Rajaratnam, N., & Gleser, G. (1963). Theory of generalizability: A liberalization of reliability theory. British Journal of Statistical Psychology, 16(2), 137-163. doi: 10.1111/j.2044-8317.1963.tb00206.x
    • (1963) British Journal of Statistical Psychology , vol.16 , Issue.2 , pp. 137-163
    • Cronbach, L.1    Rajaratnam, N.2    Gleser, G.3
  • 13
    • 0025286295 scopus 로고
    • High agreement but low kappa: I. The problems of two paradoxes
    • Feinstein, A. R., & Cicchetti, D. V. (1990). High agreement but low kappa: I. The problems of two paradoxes. Journal of Clinical Epidemiology, 43, 543-549. doi: 10.1016/0895-4356(90)90158-L
    • (1990) Journal of Clinical Epidemiology , vol.43 , pp. 543-549
    • Feinstein, A.R.1    Cicchetti, D.V.2
  • 14
    • 84879319237 scopus 로고    scopus 로고
    • Factors affecting intercoder reliability: A Monte Carlo experiment
    • Feng, G. C. (2013a). Factors affecting intercoder reliability: A Monte Carlo experiment. Quality and Quantity, 47, 2959-2982. doi: 10.1007/s11135-012-9745-9
    • (2013) Quality and Quantity , vol.47 , pp. 2959-2982
    • Feng, G.C.1
  • 15
    • 84879314629 scopus 로고    scopus 로고
    • Underlying determinants driving agreement among coders
    • Feng, G. C. (2013b). Underlying determinants driving agreement among coders. Quality and Quantity, 47, 2983-2997. doi: 10.1007/s11135-012-9807-z
    • (2013) Quality and Quantity , vol.47 , pp. 2983-2997
    • Feng, G.C.1
  • 16
    • 84902366470 scopus 로고    scopus 로고
    • Estimating intercoder reliability: A structural equation modeling approach
    • Feng, G. C. (2014). Estimating intercoder reliability: A structural equation modeling approach. Quality & Quantity, 48, 2355-2369. doi: 10.1007/s11135-014-0034-7
    • (2014) Quality & Quantity , vol.48 , pp. 2355-2369
    • Feng, G.C.1
  • 17
    • 0002217416 scopus 로고
    • A note on estimating the reliability of categorical data
    • Finn, R. (1970). A note on estimating the reliability of categorical data. Educational and Psychological Measurement, 30, 71-76. doi: 10.1177/001316447003000106
    • (1970) Educational and Psychological Measurement , vol.30 , pp. 71-76
    • Finn, R.1
  • 18
    • 3343019470 scopus 로고
    • Measuring nominal scale agreement among many raters
    • Fleiss, J. (1971). Measuring nominal scale agreement among many raters. Psychological Bulletin, 76, 378-382. doi: 10.1037/h0031619
    • (1971) Psychological Bulletin , vol.76 , pp. 378-382
    • Fleiss, J.1
  • 19
    • 33645066726 scopus 로고
    • Large sample standard errors of kappa and weighted kappa
    • Fleiss, J. L., Cohen, J., & Everitt, B. S. (1969). Large sample standard errors of kappa and weighted kappa. Psychological Bulletin, 72, 323-327. doi: 10.1037/h0028106
    • (1969) Psychological Bulletin , vol.72 , pp. 323-327
    • Fleiss, J.L.1    Cohen, J.2    Everitt, B.S.3
  • 20
    • 0001104909 scopus 로고    scopus 로고
    • The measurement of interrater agreement
    • J. L. Fleiss, B. Levin, & M. C. Paik (Eds.), 3rd ed., Hoboken, NJ: Wiley
    • Fleiss, J. L., Levin, B., & Paik, M. C. (2004). The measurement of interrater agreement. In J. L. Fleiss, B. Levin, & M. C. Paik (Eds.), Statistical methods for rates and proportions (3rd ed., pp. 598-626). Hoboken, NJ: Wiley.
    • (2004) Statistical Methods for Rates and Proportions , pp. 598-626
    • Fleiss, J.L.1    Levin, B.2    Paik, M.C.3
  • 22
    • 0242363530 scopus 로고    scopus 로고
    • Inter-rater reliability: Dependency on trait prevalence and marginal homogeneity
    • Gwet, K. (2002). Inter-rater reliability: Dependency on trait prevalence and marginal homogeneity. Statistical Methods for Inter-Rater Reliability Assessment Series, 2, 1-9. Retrieved from http://advancedanalyticsllc.com/irrhbk/research-papers/inter-rater-reliability-dependency.pdf
    • (2002) Statistical Methods for Inter-Rater Reliability Assessment Series , vol.2 , pp. 1-9
    • Gwet, K.1
  • 23
    • 44349101004 scopus 로고    scopus 로고
    • Computing inter-rater reliability and its variance in the presence of high agreement
    • Gwet, K. (2008). Computing inter-rater reliability and its variance in the presence of high agreement. The British Journal of Mathematical and Statistical Psychology, 61, 29-48. doi: 10.1348/000711006X126600
    • (2008) The British Journal of Mathematical and Statistical Psychology , vol.61 , pp. 29-48
    • Gwet, K.1
  • 25
    • 34250756132 scopus 로고    scopus 로고
    • Answering the call for a standard reliability measure for coding data
    • Hayes, A., & Krippendorff, K. (2007). Answering the call for a standard reliability measure for coding data. Communication Methods and Measures, 1, 77-89. doi: 10.1080/19312450709336664
    • (2007) Communication Methods and Measures , vol.1 , pp. 77-89
    • Hayes, A.1    Krippendorff, K.2
  • 28
    • 0000567061 scopus 로고
    • Intercoder reliability estimation approaches in marketing: A generalizability theory framework for quantitative data
    • Hughes, M. A., & Garrett, D. E. (1990). Intercoder reliability estimation approaches in marketing: A generalizability theory framework for quantitative data. Journal of Marketing Research, 27, 185-195. Retrieved from http://www.jstor.org/stable/3172845
    • (1990) Journal of Marketing Research , vol.27 , pp. 185-195
    • Hughes, M.A.1    Garrett, D.E.2
  • 29
  • 30
    • 0040822657 scopus 로고
    • On generalizations of the g index and the phi coefficient to nominal scales
    • Janson, S., & Vegelius, J. (1979). On generalizations of the g index and the phi coefficient to nominal scales. Multivariate Behavioral Research, 14, 255-269. doi: 10.1207/s15327906mbr14029
    • (1979) Multivariate Behavioral Research , vol.14 , pp. 255-269
    • Janson, S.1    Vegelius, J.2
  • 31
    • 84867053474 scopus 로고    scopus 로고
    • Mutual information as a measure of intercoder agreement
    • Klemens, B. (2012). Mutual information as a measure of intercoder agreement. Journal of Official Statistics, 28, 395-412. Retrieved from http://www.jos.nu/Articles/abstract.asp?article=283395
    • (2012) Journal of Official Statistics , vol.28 , pp. 395-412
    • Klemens, B.1
  • 32
    • 0000881830 scopus 로고
    • Content-analysis research: An examination of applications with directives for improving research reliability and objectivity
    • Kolbe, R. H., & Burnett, M. S. (1991). Content-analysis research: An examination of applications with directives for improving research reliability and objectivity. Journal of Consumer Research, 18, 243-250. Retrieved from http://www.jstor.org/stable/2489559
    • (1991) Journal of Consumer Research , vol.18 , pp. 243-250
    • Kolbe, R.H.1    Burnett, M.S.2
  • 33
    • 0000855257 scopus 로고
    • A disagreement about within-group agreement: Disentangling issues of consistency versus consensus
    • Kozlowski, S., & Hattrup, K. (1992). A disagreement about within-group agreement: Disentangling issues of consistency versus consensus. Journal of Applied Psychology, 77, 161-167. doi: 10.1037/0021-9010.77.2.161
    • (1992) Journal of Applied Psychology , vol.77 , pp. 161-167
    • Kozlowski, S.1    Hattrup, K.2
  • 34
    • 0002168731 scopus 로고
    • Bivariate agreement coefficients for reliability of data
    • Krippendorff, K. (1970). Bivariate agreement coefficients for reliability of data. Sociological Methodology, 2, 139-150. Retrieved from http://www.jstor.org/stable/270787
    • (1970) Sociological Methodology , vol.2 , pp. 139-150
    • Krippendorff, K.1
  • 36
    • 4043124032 scopus 로고    scopus 로고
    • Reliability in content analysis. Some common misconceptions and recommendations
    • Krippendorff, K. (2004b). Reliability in content analysis. Some common misconceptions and recommendations. Human Communication Research, 30, 411-433. doi: 10.1111/j.1468-2958.2004.tb00738.x
    • (2004) Human Communication Research , vol.30 , pp. 411-433
    • Krippendorff, K.1
  • 38
    • 79959281731 scopus 로고    scopus 로고
    • Agreement and information in the reliability of coding
    • Krippendorff, K. (2011). Agreement and information in the reliability of coding. Communication Methods and Measures, 5, 93-112. doi: 10.1080/19312458.2011.568376
    • (2011) Communication Methods and Measures , vol.5 , pp. 93-112
    • Krippendorff, K.1
  • 39
    • 85178017660 scopus 로고    scopus 로고
    • A dissenting view on so-called paradoxes of reliability coefficients
    • C. T. Salmon (Ed.), New York, NY: Routledge
    • Krippendorff, K. (2012). A dissenting view on so-called paradoxes of reliability coefficients. In C. T. Salmon (Ed.), Communication Yearbook (Vol. 36, pp. 481-500). New York, NY: Routledge.
    • (2012) Communication Yearbook , vol.36 , pp. 481-500
    • Krippendorff, K.1
  • 40
    • 0001587595 scopus 로고
    • Measures of response agreement for qualitative data: Some generalizations and alternatives
    • Light, R. J. (1971). Measures of response agreement for qualitative data: Some generalizations and alternatives. Psychological Bulletin, 76, 365-377. doi: 10.1037/h0031643
    • (1971) Psychological Bulletin , vol.76 , pp. 365-377
    • Light, R.J.1
  • 41
    • 0024521543 scopus 로고
    • A concordance correlation coefficient to evaluate reproducibility
    • Lin, L. (1989). A concordance correlation coefficient to evaluate reproducibility. Biometrics, 45, 255-268. doi: 10.2307/2532051
    • (1989) Biometrics , vol.45 , pp. 255-268
    • Lin, L.1
  • 42
    • 34347396549 scopus 로고    scopus 로고
    • A unified approach for assessing agreement for continuous and categorical data
    • Lin, L., Hedayat, A. S., & Wenting, W. (2007). A unified approach for assessing agreement for continuous and categorical data. Journal of Biopharmaceutical Statistics, 17, 629-652. doi: 10.1080/10543400701376498
    • (2007) Journal of Biopharmaceutical Statistics , vol.17 , pp. 629-652
    • Lin, L.1    Hedayat, A.S.2    Wenting, W.3
  • 43
    • 0036409443 scopus 로고    scopus 로고
    • Content analysis in mass communication: Assessment and reporting of intercoder reliability
    • Lombard, M., Snyder Duch, J., & Bracken, C. (2002). Content analysis in mass communication: Assessment and reporting of intercoder reliability. Human Communication Research, 28, 587-604. doi: 10.1093/hcr/28.4.587
    • (2002) Human Communication Research , vol.28 , pp. 587-604
    • Lombard, M.1    Snyder Duch, J.2    Bracken, C.3
  • 44
    • 0017612723 scopus 로고
    • Coefficients of agreement between observers and their interpretation
    • Maxwell, A. E. (1977). Coefficients of agreement between observers and their interpretation. The British Journal of Psychiatry, 130, 79-83. doi: 10.1192/bjp.130.1.79
    • (1977) The British Journal of Psychiatry , vol.130 , pp. 79-83
    • Maxwell, A.E.1
  • 45
    • 0000135659 scopus 로고    scopus 로고
    • Forming inferences about some intraclass correlation coefficients
    • McGraw, K., & Wong, S. (1996). Forming inferences about some intraclass correlation coefficients. Psychological Methods, 1, 30-46. doi: 10.1037/1082-989X.1.1.30
    • (1996) Psychological Methods , vol.1 , pp. 30-46
    • McGraw, K.1    Wong, S.2
  • 46
    • 0003231688 scopus 로고
    • The representational model and relevant research methods
    • I. de Sola Pool (Ed.), Urbana, IL: University of Illinois Press
    • Osgood, C. (1959). The representational model and relevant research methods. In I. de Sola Pool (Ed.), Trends in content analysis (pp. 33-88). Urbana, IL: University of Illinois Press.
    • (1959) Trends in Content Analysis , pp. 33-88
    • Osgood, C.1
  • 47
    • 0001220166 scopus 로고
    • Reliability of nominal data based on qualitative judgments
    • Perreault, J., William, D., & Leigh, L. E. (1989). Reliability of nominal data based on qualitative judgments. Journal of Marketing Research, 26(2), 135-148. Retrieved from http://www.jstor.org/stable/3172601
    • (1989) Journal of Marketing Research , vol.26 , Issue.2 , pp. 135-148
    • Perreault, J.1    William, D.2    Leigh, L.E.3
  • 49
    • 33748063869 scopus 로고
    • Reliability of content analysis: The case of nominal scale coding
    • Scott, W. (1955). Reliability of content analysis: The case of nominal scale coding. Public Opinion Quarterly, 19, 321-325. doi: 10.1086/266577
    • (1955) Public Opinion Quarterly , vol.19 , pp. 321-325
    • Scott, W.1
  • 50
    • 48249153186 scopus 로고
    • Intraclass correlations: Uses in assessing rater reliability
    • Shrout, P., & Fleiss, J. (1979). Intraclass correlations: Uses in assessing rater reliability. Psychological Bulletin, 86, 420-428. doi: 10.1037/0033-2909.86.2.420
    • (1979) Psychological Bulletin , vol.86 , pp. 420-428
    • Shrout, P.1    Fleiss, J.2
  • 51
    • 14744284163 scopus 로고    scopus 로고
    • The kappa statistic in reliability studies: Use, interpretation, and sample size requirements
    • Sim, J., & Wright, C. C. (2005). The kappa statistic in reliability studies: Use, interpretation, and sample size requirements. Physical Therapy, 85, 257-268. Retrieved from http://ptjournal.apta.org/content/85/3/257.abstract
    • (2005) Physical Therapy , vol.85 , pp. 257-268
    • Sim, J.1    Wright, C.C.2
  • 52
    • 85055898408 scopus 로고
    • The reliability of agreement in content analysis
    • Spiegelman, M., Terwilliger, C., & Fearing, F. (1953). The reliability of agreement in content analysis. The Journal of Social Psychology, 37, 175-187. Retrieved from http://doi.apa.org/?uid=1954-02550-001
    • (1953) The Journal of Social Psychology , vol.37 , pp. 175-187
    • Spiegelman, M.1    Terwilliger, C.2    Fearing, F.3
  • 53
    • 0001407154 scopus 로고
    • Interrater reliability and agreement of subjective judgments
    • Tinsley, H. E., & Weiss, D. J. (1975). Interrater reliability and agreement of subjective judgments. Journal of Counseling Psychology, 22, 358-376. doi: 10.1037/h0076640
    • (1975) Journal of Counseling Psychology , vol.22 , pp. 358-376
    • Tinsley, H.E.1    Weiss, D.J.2
  • 54
    • 85178051193 scopus 로고    scopus 로고
    • Assumptions behind inter-coder reliability indices
    • C. T. Salmon (Ed.), New York, NY: Routledge
    • Zhao, X., Liu, J. S., & Deng, K. (2012). Assumptions behind inter-coder reliability indices. In C. T. Salmon (Ed.), Communication Yearbook (Vol. 36, pp. 419-480). New York, NY: Routledge.
    • (2012) Communication Yearbook , vol.36 , pp. 419-480
    • Zhao, X.1    Liu, J.S.2    Deng, K.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.