Volume 82, Issue 6, 2014, Pages 1219-1227

Interrater agreement statistics with skewed data: Evaluation of alternatives to Cohen's kappa

Author keywords

Behavior observation; Diagnosis; Interrater agreement; Low base rate; Skew

Indexed keywords

ARTICLE; BEHAVIORAL SCIENCE; CLINICAL RESEARCH; COHEN KAPPA; HUMAN; INTERRATER AGREEMENT STATISTICS; INTERRATER RELIABILITY; KAPPA STATISTICS; MONTE CARLO METHOD; SIMULATION; OBSERVER VARIATION; REPRODUCIBILITY; STATISTICS;

EID: 84925639509     PISSN: 0022-006X     EISSN: 1939-2117     Source Type: Journal
DOI: 10.1037/a0037489     Document Type: Article
Times cited: 73
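The record above indexes an article on Cohen's kappa under skewed base rates. As a minimal illustration of the problem the article addresses (not code from the article itself), the sketch below computes kappa for a 2x2 agreement table and shows the well-known paradox: two tables with identical observed agreement yield very different kappa values when the base rate is skewed.

```python
# Minimal sketch (not from the article): Cohen's kappa for two raters
# making a yes/no judgment, and its sensitivity to skewed base rates.

def cohens_kappa(table):
    """Cohen's kappa for a 2x2 agreement table [[a, b], [c, d]],
    where a = both "yes", d = both "no", b and c = disagreements."""
    (a, b), (c, d) = table
    n = a + b + c + d
    p_o = (a + d) / n                      # observed agreement
    p_yes = ((a + b) / n) * ((a + c) / n)  # chance both say "yes"
    p_no = ((c + d) / n) * ((b + d) / n)   # chance both say "no"
    p_e = p_yes + p_no                     # chance-expected agreement
    return (p_o - p_e) / (1 - p_e)

# Balanced base rates: 90% observed agreement -> kappa = 0.8.
balanced = [[45, 5], [5, 45]]
# Skewed base rates: the same 90% observed agreement -> kappa ~= 0.44,
# because chance agreement p_e is inflated by the dominant category.
skewed = [[85, 5], [5, 5]]

print(round(cohens_kappa(balanced), 3))  # 0.8
print(round(cohens_kappa(skewed), 3))    # 0.444
```

This drop in kappa despite unchanged observed agreement is the "low base rate" behavior flagged in the author keywords, and the motivation for the alternative statistics the article evaluates.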

References (31)
  • 2. Bakeman, R., McArthur, D., Quera, V., & Robinson, B. F. (1997). Detecting sequential patterns and determining their reliability with fallible observers. Psychological Methods, 2, 357-370. doi:10.1037/1082-989X.2.4.357
  • 3. Burton, A., Altman, D. G., Royston, P., & Holder, R. L. (2006). The design of simulation studies in medical statistics. Statistics in Medicine, 25, 4279-4292. doi:10.1002/sim.2673
  • 4. Cicchetti, D. V. (1994). Guidelines, criteria, and rules of thumb for evaluating normed and standardized assessment instruments in psychology. Psychological Assessment, 6, 284-290. doi:10.1037/1040-3590.6.4.284
  • 6. Cohen, J. (1960). A coefficient of agreement for nominal scales. Educational and Psychological Measurement, 20, 37-46. doi:10.1177/00131644600
  • 7. Cohen, J. (1968). Weighted kappa: Nominal scale agreement with provision for scaled disagreement or partial credit. Psychological Bulletin, 70, 213-220. doi:10.1037/h0026256
  • 8. Feng, G. C. (2013). Factors affecting intercoder reliability: A Monte Carlo experiment. Quality & Quantity, 47, 2959-2982. doi:10.1007/s11135-012-9745-9
  • 10. …diagnostic system for personality disorders in an outpatient clinical sample. Journal of Abnormal Psychology, 122, 1057-1069. doi:10.1037/a0034878
  • 11. Gisev, N., Bell, J. S., & Chen, T. F. (2013). Interrater agreement and interrater reliability: Key concepts, approaches, and applications. Research in Social and Administrative Pharmacy, 9, 330-338. doi:10.1016/j.sapharm.2012.04.004
  • 12. Gwet, K. (2002). Kappa statistic is not satisfactory for assessing the extent of agreement between raters. Retrieved from http://www.agreestat.com/research-papers/kappa-statistic-is-not-satisfactory.pdf
  • 13. Gwet, K. L. (2008). Computing inter-rater reliability and its variance in the presence of high agreement. British Journal of Mathematical and Statistical Psychology, 61, 29-48. doi:10.1348/000711006X1
  • 16. Karelitz, T. M., & Budescu, D. V. (2013). The effect of the raters' marginal distributions on their matched agreement: A rescaling framework for interpreting kappa. Multivariate Behavioral Research, 48, 923-952. doi:10.1080/00273171.2013.830064
  • 19. Landis, J. R., & Koch, G. G. (1975a). A review of statistical methods in the analysis of data arising from observer reliability studies (Part I). Statistica Neerlandica, 29, 101-123. doi:10.1111/j.1467-9574.1975.tb00254.x
  • 20. Landis, J. R., & Koch, G. G. (1975b). A review of statistical methods in the analysis of data arising from observer reliability studies (Part II). Statistica Neerlandica, 29, 151-161. doi:10.1111/j.1467-9574.1975.tb00259.x
  • 21. Landis, J. R., & Koch, G. G. (1977). An application of hierarchical kappa-type statistics in the assessment of majority agreement among multiple observers. Biometrics, 33, 363-374. doi:10.2307/2529786
  • 22. Lorber, M. F. (2006). Can minimally trained observers provide valid global ratings? Journal of Family Psychology, 20, 335-338. doi:10.1037/0893-3200.20.2.335
  • 23. MacCallum, R. C., Zhang, S., Preacher, K. J., & Rucker, D. D. (2002). On the practice of dichotomization of quantitative variables. Psychological Methods, 7, 19-40. doi:10.1037/1082-989X.7.1.19
  • 24. McGraw, K. O., & Wong, S. P. (1996). Forming inferences about some intraclass correlation coefficients. Psychological Methods, 1, 30-46. doi:10.1037/1082-989X.1.1.30
  • 25. Mitchell, S. (1979). Interobserver agreement, reliability, and generalizability of data collected in observational studies. Psychological Bulletin, 86, 376-390. doi:10.1037/0033-2909.86.2.376
  • 26. Scott, W. A. (1955). Reliability of content analysis: The case of nominal scale coding. Public Opinion Quarterly, 19, 321-325. doi:10.1086/266577
  • 27. Simon, P. (2006). Including omission mistakes in the calculation of Cohen's kappa and an analysis of the coefficient's paradox features. Educational and Psychological Measurement, 66, 765-777. doi:10.1177/00131644052
  • 28. Slep, A. M. S., & Heyman, R. E. (2006). Creating and field-testing child maltreatment definitions: Improving the reliability of substantiation determinations. Child Maltreatment, 11, 217-236. doi:10.1177/10775595062
  • 29. Spitznagel, E. L., & Helzer, J. E. (1985). A proposed solution to the base rate problem in the kappa statistic. Archives of General Psychiatry, 42, 725-728. doi:10.1001/archpsyc.1985.01790300093
  • 30. Uebersax, J. S. (1987). Diversity of decision-making models and the measurement of interrater agreement. Psychological Bulletin, 101, 140-146. doi:10.1037/0033-2909.101.1.140
  • 31. Vach, W. (2005). The dependence of Cohen's kappa on the prevalence does not matter. Journal of Clinical Epidemiology, 58, 655-661. doi:10.1016/j.jclinepi.2004.02.021


* This information was extracted and analyzed by KISTI from Elsevier's SCOPUS database.