Volume 48, Issue 4, 2011, Pages 399-418

Rater effects on essay scoring: A multilevel analysis of severity drift, central tendency, and rater experience


EID: 84055198279     PISSN: 00220655     EISSN: 17453984     Source Type: Journal    
DOI: 10.1111/j.1745-3984.2011.00152.x     Document Type: Article
Times cited: 78

References (35)
  • 1
    • 84965402107
    • The effect of rater variables in the development of an occupation-specific language performance test
    • Brown, A. (1995). The effect of rater variables in the development of an occupation-specific language performance test. Language Testing, 12, 1-15.
    • (1995) Language Testing , vol.12 , pp. 1-15
    • Brown, A.1
  • 2
    • 0034195156
    • The stability of rater severity in large-scale assessment programs
    • Congdon, P. J., & McQueen, J. (2000). The stability of rater severity in large-scale assessment programs. Journal of Educational Measurement, 37, 163-178.
    • (2000) Journal of Educational Measurement , vol.37 , pp. 163-178
    • Congdon, P.J.1    McQueen, J.2
  • 3
    • 84930559584
    • Expertise in evaluating second language compositions
    • Cumming, A. (1990). Expertise in evaluating second language compositions. Language Testing, 7, 31-51.
    • (1990) Language Testing , vol.7 , pp. 31-51
    • Cumming, A.1
  • 4
    • 33745756490
    • Examining rater effects in TestDaF writing and speaking performance assessments: A many-facet Rasch analysis
    • Eckes, T. (2005). Examining rater effects in TestDaF writing and speaking performance assessments: A many-facet Rasch analysis. Language Assessment Quarterly, 2, 197-221.
    • (2005) Language Assessment Quarterly , vol.2 , pp. 197-221
    • Eckes, T.1
  • 5
    • 55249090887
    • Rater types in writing performance assessments: A classification approach to rater variability
    • Eckes, T. (2008). Rater types in writing performance assessments: A classification approach to rater variability. Language Testing, 25, 155-185.
    • (2008) Language Testing , vol.25 , pp. 155-185
    • Eckes, T.1
  • 6
    • 84988122960
    • Examining rater errors in the assessment of written composition with a many-faceted Rasch model
    • Engelhard, G. (1994). Examining rater errors in the assessment of written composition with a many-faceted Rasch model. Journal of Educational Measurement, 31, 93-112.
    • (1994) Journal of Educational Measurement , vol.31 , pp. 93-112
    • Engelhard, G.1
  • 10
    • 0035536108
    • Real-time feedback on rater drift in constructed-response items: An example from the Golden State examination
    • Hoskens, M., & Wilson, M. (2001). Real-time feedback on rater drift in constructed-response items: An example from the Golden State examination. Journal of Educational Measurement, 38, 121-145.
    • (2001) Journal of Educational Measurement , vol.38 , pp. 121-145
    • Hoskens, M.1    Wilson, M.2
  • 11
    • 0347672323
    • Analyzing ratings and training raters
    • Kingsbury, F. A. (1922). Analyzing ratings and training raters. Journal of Personnel Research, 1, 377-383.
    • (1922) Journal of Personnel Research , vol.1 , pp. 377-383
    • Kingsbury, F.A.1
  • 12
    • 34447098449
    • Re-training writing raters online: How does it compare with face-to-face training?
    • Knoch, U., Read, J., & von Randow, J. (2007). Re-training writing raters online: How does it compare with face-to-face training? Assessing Writing, 12, 26-43.
    • (2007) Assessing Writing , vol.12 , pp. 26-43
    • Knoch, U.1    Read, J.2    von Randow, J.3
  • 13
    • 33646351041
    • The stability of marker characteristics across tests of the same subject and across subjects
    • Lamprianou, I. (2006). The stability of marker characteristics across tests of the same subject and across subjects. Journal of Applied Measurement, 7, 192-205.
    • (2006) Journal of Applied Measurement , vol.7 , pp. 192-205
    • Lamprianou, I.1
  • 14
    • 84055201201
    • runmlwin: Stata module for fitting multilevel models in the MLwiN software package [Computer software]. Bristol, England: Centre for Multilevel Modeling, University of Bristol
    • Leckie, G., & Charlton, C. (2011). runmlwin: Stata module for fitting multilevel models in the MLwiN software package [Computer software]. Bristol, England: Centre for Multilevel Modeling, University of Bristol.
    • (2011)
    • Leckie, G.1    Charlton, C.2
  • 16
    • 80051652744
    • Understanding uncertainty in school league tables
    • Leckie, G., & Goldstein, H. (2011). Understanding uncertainty in school league tables. Fiscal Studies, 32, 207-224.
    • (2011) Fiscal Studies , vol.32 , pp. 207-224
    • Leckie, G.1    Goldstein, H.2
  • 18
    • 0025203168
    • Judge consistency and severity across grading periods
    • Lunz, M. E., & Stahl, J. A. (1990). Judge consistency and severity across grading periods. Evaluation & the Health Professions, 13, 425-444.
    • (1990) Evaluation & the Health Professions , vol.13 , pp. 425-444
    • Lunz, M.E.1    Stahl, J.A.2
  • 19
    • 11944265594
    • Validity of psychological assessment: Validation of inferences from persons' responses and performances as scientific inquiry into score meaning
    • Messick, S. (1995). Validity of psychological assessment: Validation of inferences from persons' responses and performances as scientific inquiry into score meaning. American Psychologist, 50, 741-749.
    • (1995) American Psychologist , vol.50 , pp. 741-749
    • Messick, S.1
  • 20
    • 71549124344
    • Monitoring rater performance over time: A framework for detecting differential accuracy and differential scale category use
    • Myford, C. M., & Wolfe, E. W. (2009). Monitoring rater performance over time: A framework for detecting differential accuracy and differential scale category use. Journal of Educational Measurement, 46, 371-389.
    • (2009) Journal of Educational Measurement , vol.46 , pp. 371-389
    • Myford, C.M.1    Wolfe, E.W.2
  • 22
    • 84055181717
    • Qualifying essay readers for an online scoring network (OSN) (Research Report). Princeton, NJ: Educational Testing Service
    • Powers, D., & Kubota, M. (1998). Qualifying essay readers for an online scoring network (OSN) (Research Report). Princeton, NJ: Educational Testing Service.
    • (1998)
    • Powers, D.1    Kubota, M.2
  • 23
    • 84055201200
    • MLwiN Version 2.1 [Computer software]. Bristol, England: Centre for Multilevel Modeling, University of Bristol
    • Rasbash, J., Charlton, C., Browne, W. J., Healy, M., & Cameron, B. (2009). MLwiN Version 2.1 [Computer software]. Bristol, England: Centre for Multilevel Modeling, University of Bristol.
    • (2009)
    • Rasbash, J.1    Charlton, C.2    Browne, W.J.3    Healy, M.4    Cameron, B.5
  • 24
    • 0001605592
    • Efficient analysis of mixed hierarchical and cross-classified random structures using a multilevel model
    • Rasbash, J., & Goldstein, H. (1994). Efficient analysis of mixed hierarchical and cross-classified random structures using a multilevel model. Journal of Educational and Behavioral Statistics, 19, 337-350.
    • (1994) Journal of Educational and Behavioral Statistics , vol.19 , pp. 337-350
    • Rasbash, J.1    Goldstein, H.2
  • 25
    • 0001585474
    • A crossed random effects model for unbalanced data with applications in cross-sectional and longitudinal research
    • Raudenbush, S. W. (1993). A crossed random effects model for unbalanced data with applications in cross-sectional and longitudinal research. Journal of Educational and Behavioral Statistics, 18, 321-349.
    • (1993) Journal of Educational and Behavioral Statistics , vol.18 , pp. 321-349
    • Raudenbush, S.W.1
  • 27
    • 66749159684
    • Is teaching experience necessary for reliable scoring of extended English questions?
    • Royal-Dawson, L., & Baird, J. (2009). Is teaching experience necessary for reliable scoring of extended English questions? Educational Measurement: Issues and Practice, 28(2), 2-8.
    • (2009) Educational Measurement: Issues and Practice , vol.28 , Issue.2 , pp. 2-8
    • Royal-Dawson, L.1    Baird, J.2
  • 28
    • 0001155553
    • Rating the ratings: Assessing the psychometric quality of rating data
    • Saal, F. E., Downey, R. G., & Lahey, M. A. (1980). Rating the ratings: Assessing the psychometric quality of rating data. Psychological Bulletin, 88, 413-428.
    • (1980) Psychological Bulletin , vol.88 , pp. 413-428
    • Saal, F.E.1    Downey, R.G.2    Lahey, M.A.3
  • 29
    • 0002475119
    • The effect of raters' background and training on the reliability of direct writing tests
    • Shohamy, E., Gordon, C. M., & Kraemer, R. (1992). The effect of raters' background and training on the reliability of direct writing tests. The Modern Language Journal, 76, 27-33.
    • (1992) The Modern Language Journal , vol.76 , pp. 27-33
    • Shohamy, E.1    Gordon, C.M.2    Kraemer, R.3
  • 31
    • 0001893233
    • A constant error in psychological ratings
    • Thorndike, E. L. (1920). A constant error in psychological ratings. Journal of Applied Psychology, 4, 25-29.
    • (1920) Journal of Applied Psychology , vol.4 , pp. 25-29
    • Thorndike, E.L.1
  • 32
    • 0043206862
    • Investigating rater/prompt interactions in writing assessment: Quantitative and qualitative approaches
    • Weigle, S. C. (1999). Investigating rater/prompt interactions in writing assessment: Quantitative and qualitative approaches. Assessing Writing, 6(2), 145-178.
    • (1999) Assessing Writing , vol.6 , Issue.2 , pp. 145-178
    • Weigle, S.C.1
  • 33
    • 33646364669
    • Identifying rater effects using latent trait models
    • Wolfe, E. W. (2004). Identifying rater effects using latent trait models. Psychology Science, 46, 35-51.
    • (2004) Psychology Science , vol.46 , pp. 35-51
    • Wolfe, E.W.1
  • 34
    • 33749847596
    • Cognitive differences in proficient and nonproficient essay scorers
    • Wolfe, E. W., Kao, C. W., & Ranney, M. (1998). Cognitive differences in proficient and nonproficient essay scorers. Written Communication, 15, 465-492.
    • (1998) Written Communication , vol.15 , pp. 465-492
    • Wolfe, E.W.1    Kao, C.W.2    Ranney, M.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.