-
1
-
-
84965402107
-
The effect of rater variables in the development of an occupation-specific language performance test
-
Brown, A. (1995). The effect of rater variables in the development of an occupation-specific language performance test. Language Testing, 12, 1-15.
-
(1995)
Language Testing
, vol.12
, pp. 1-15
-
-
Brown, A.1
-
2
-
-
0034195156
-
The stability of rater severity in large-scale assessment programs
-
Congdon, P. J., & McQueen, J. (2000). The stability of rater severity in large-scale assessment programs. Journal of Educational Measurement, 37, 163-178.
-
(2000)
Journal of Educational Measurement
, vol.37
, pp. 163-178
-
-
Congdon, P.J.1
McQueen, J.2
-
3
-
-
84930559584
-
Expertise in evaluating second language compositions
-
Cumming, A. (1990). Expertise in evaluating second language compositions. Language Testing, 7, 31-51.
-
(1990)
Language Testing
, vol.7
, pp. 31-51
-
-
Cumming, A.1
-
4
-
-
33745756490
-
Examining rater effects in TestDaF writing and speaking performance assessments: A many-facet Rasch analysis
-
Eckes, T. (2005). Examining rater effects in TestDaF writing and speaking performance assessments: A many-facet Rasch analysis. Language Assessment Quarterly, 2, 197-221.
-
(2005)
Language Assessment Quarterly
, vol.2
, pp. 197-221
-
-
Eckes, T.1
-
5
-
-
55249090887
-
Rater types in writing performance assessments: A classification approach to rater variability
-
Eckes, T. (2008). Rater types in writing performance assessments: A classification approach to rater variability. Language Testing, 25, 155-185.
-
(2008)
Language Testing
, vol.25
, pp. 155-185
-
-
Eckes, T.1
-
6
-
-
84988122960
-
Examining rater errors in the assessment of written composition with a many-faceted Rasch model
-
Engelhard, G. (1994). Examining rater errors in the assessment of written composition with a many-faceted Rasch model. Journal of Educational Measurement, 31, 93-112.
-
(1994)
Journal of Educational Measurement
, vol.31
, pp. 93-112
-
-
Engelhard, G.1
-
8
-
-
61349170677
-
An examination of rater drift within a generalizability theory framework
-
Harik, P., Clauser, B. E., Grabovsky, I., Nungester, R. J., Swanson, D., & Nandakumar, R. (2009). An examination of rater drift within a generalizability theory framework. Journal of Educational Measurement, 46, 43-58.
-
(2009)
Journal of Educational Measurement
, vol.46
, pp. 43-58
-
-
Harik, P.1
Clauser, B.E.2
Grabovsky, I.3
Nungester, R.J.4
Swanson, D.5
Nandakumar, R.6
-
10
-
-
0035536108
-
Real-time feedback on rater drift in constructed-response items: An example from the Golden State examination
-
Hoskens, M., & Wilson, M. (2001). Real-time feedback on rater drift in constructed-response items: An example from the Golden State examination. Journal of Educational Measurement, 38, 121-145.
-
(2001)
Journal of Educational Measurement
, vol.38
, pp. 121-145
-
-
Hoskens, M.1
Wilson, M.2
-
11
-
-
0347672323
-
Analyzing ratings and training raters
-
Kingsbury, F. A. (1922). Analyzing ratings and training raters. Journal of Personnel Research, 1, 377-383.
-
(1922)
Journal of Personnel Research
, vol.1
, pp. 377-383
-
-
Kingsbury, F.A.1
-
12
-
-
34447098449
-
Re-training writing raters online: How does it compare with face-to-face training?
-
Knoch, U., Read, J., & von Randow, J. (2007). Re-training writing raters online: How does it compare with face-to-face training? Assessing Writing, 12, 26-43.
-
(2007)
Assessing Writing
, vol.12
, pp. 26-43
-
-
Knoch, U.1
Read, J.2
von Randow, J.3
-
13
-
-
33646351041
-
The stability of marker characteristics across tests of the same subject and across subjects
-
Lamprianou, I. (2006). The stability of marker characteristics across tests of the same subject and across subjects. Journal of Applied Measurement, 7, 192-205.
-
(2006)
Journal of Applied Measurement
, vol.7
, pp. 192-205
-
-
Lamprianou, I.1
-
14
-
-
84055201201
-
-
runmlwin: Stata module for fitting multilevel models in the MLwiN software package [Computer software]. Bristol, England: Centre for Multilevel Modelling, University of Bristol
-
Leckie, G., & Charlton, C. (2011). runmlwin: Stata module for fitting multilevel models in the MLwiN software package [Computer software]. Bristol, England: Centre for Multilevel Modelling, University of Bristol.
-
(2011)
-
-
Leckie, G.1
Charlton, C.2
-
16
-
-
80051652744
-
Understanding uncertainty in school league tables
-
Leckie, G., & Goldstein, H. (2011). Understanding uncertainty in school league tables. Fiscal Studies, 32, 207-224.
-
(2011)
Fiscal Studies
, vol.32
, pp. 207-224
-
-
Leckie, G.1
Goldstein, H.2
-
17
-
-
0002616035
-
A new approach to standard-setting in language assessment
-
Lumley, T., Lynch, B. K., & McNamara, T. F. (1994). A new approach to standard-setting in language assessment. Melbourne Papers in Language Testing, 3, 19-40.
-
(1994)
Melbourne Papers in Language Testing
, vol.3
, pp. 19-40
-
-
Lumley, T.1
Lynch, B.K.2
McNamara, T.F.3
-
18
-
-
0025203168
-
Judge consistency and severity across grading periods
-
Lunz, M. E., & Stahl, J. A. (1990). Judge consistency and severity across grading periods. Evaluation & the Health Professions, 13, 425-444.
-
(1990)
Evaluation & the Health Professions
, vol.13
, pp. 425-444
-
-
Lunz, M.E.1
Stahl, J.A.2
-
19
-
-
11944265594
-
Validity of psychological assessment: Validation of inferences from persons' responses and performances as scientific inquiry into score meaning
-
Messick, S. (1995). Validity of psychological assessment: Validation of inferences from persons' responses and performances as scientific inquiry into score meaning. American Psychologist, 50, 741-749.
-
(1995)
American Psychologist
, vol.50
, pp. 741-749
-
-
Messick, S.1
-
20
-
-
71549124344
-
Monitoring rater performance over time: A framework for detecting differential accuracy and differential scale category use
-
Myford, C. M., & Wolfe, E. W. (2009). Monitoring rater performance over time: A framework for detecting differential accuracy and differential scale category use. Journal of Educational Measurement, 46, 371-389.
-
(2009)
Journal of Educational Measurement
, vol.46
, pp. 371-389
-
-
Myford, C.M.1
Wolfe, E.W.2
-
21
-
-
79551558634
-
Marking consistency over time
-
Pinot de Moira, A., Massey, C., Baird, J., & Morrissey, M. (2002). Marking consistency over time. Research in Education, 67, 79-87.
-
(2002)
Research in Education
, vol.67
, pp. 79-87
-
-
Pinot de Moira, A.1
Massey, C.2
Baird, J.3
Morrissey, M.4
-
22
-
-
84055181717
-
-
Qualifying essay readers for an online scoring network (OSN) (Research Report). Princeton, NJ: Educational Testing Service
-
Powers, D., & Kubota, M. (1998). Qualifying essay readers for an online scoring network (OSN) (Research Report). Princeton, NJ: Educational Testing Service.
-
(1998)
-
-
Powers, D.1
Kubota, M.2
-
23
-
-
84055201200
-
-
MLwiN Version 2.1 [Computer software]. Bristol, England: Centre for Multilevel Modelling, University of Bristol
-
Rasbash, J., Charlton, C., Browne, W. J., Healy, M., & Cameron, B. (2009). MLwiN Version 2.1 [Computer software]. Bristol, England: Centre for Multilevel Modelling, University of Bristol.
-
(2009)
-
-
Rasbash, J.1
Charlton, C.2
Browne, W.J.3
Healy, M.4
Cameron, B.5
-
24
-
-
0001605592
-
Efficient analysis of mixed hierarchical and cross-classified random structures using a multilevel model
-
Rasbash, J., & Goldstein, H. (1994). Efficient analysis of mixed hierarchical and cross-classified random structures using a multilevel model. Journal of Educational and Behavioral Statistics, 19, 337-350.
-
(1994)
Journal of Educational and Behavioral Statistics
, vol.19
, pp. 337-350
-
-
Rasbash, J.1
Goldstein, H.2
-
25
-
-
0001585474
-
A crossed random effects model for unbalanced data with applications in cross-sectional and longitudinal research
-
Raudenbush, S. W. (1993). A crossed random effects model for unbalanced data with applications in cross-sectional and longitudinal research. Journal of Educational and Behavioral Statistics, 18, 321-349.
-
(1993)
Journal of Educational and Behavioral Statistics
, vol.18
, pp. 321-349
-
-
Raudenbush, S.W.1
-
27
-
-
66749159684
-
Is teaching experience necessary for reliable scoring of extended English questions?
-
Royal-Dawson, L., & Baird, J. (2009). Is teaching experience necessary for reliable scoring of extended English questions? Educational Measurement: Issues and Practice, 28(2), 2-8.
-
(2009)
Educational Measurement: Issues and Practice
, vol.28
, Issue.2
, pp. 2-8
-
-
Royal-Dawson, L.1
Baird, J.2
-
28
-
-
0001155553
-
Rating the ratings: Assessing the psychometric quality of rating data
-
Saal, F. E., Downey, R. G., & Lahey, M. A. (1980). Rating the ratings: Assessing the psychometric quality of rating data. Psychological Bulletin, 88, 413-428.
-
(1980)
Psychological Bulletin
, vol.88
, pp. 413-428
-
-
Saal, F.E.1
Downey, R.G.2
Lahey, M.A.3
-
29
-
-
0002475119
-
The effect of raters' background and training on the reliability of direct writing tests
-
Shohamy, E., Gordon, C. M., & Kraemer, R. (1992). The effect of raters' background and training on the reliability of direct writing tests. The Modern Language Journal, 76, 27-33.
-
(1992)
The Modern Language Journal
, vol.76
, pp. 27-33
-
-
Shohamy, E.1
Gordon, C.M.2
Kraemer, R.3
-
31
-
-
0001893233
-
A constant error in psychological ratings
-
Thorndike, E. L. (1920). A constant error in psychological ratings. Journal of Applied Psychology, 4, 25-29.
-
(1920)
Journal of Applied Psychology
, vol.4
, pp. 25-29
-
-
Thorndike, E.L.1
-
32
-
-
0043206862
-
Investigating rater/prompt interactions in writing assessment: Quantitative and qualitative approaches
-
Weigle, S. C. (1999). Investigating rater/prompt interactions in writing assessment: Quantitative and qualitative approaches. Assessing Writing, 6(2), 145-178.
-
(1999)
Assessing Writing
, vol.6
, Issue.2
, pp. 145-178
-
-
Weigle, S.C.1
-
33
-
-
33646364669
-
Identifying rater effects using latent trait models
-
Wolfe, E. W. (2004). Identifying rater effects using latent trait models. Psychology Science, 46, 35-51.
-
(2004)
Psychology Science
, vol.46
, pp. 35-51
-
-
Wolfe, E.W.1
-
34
-
-
33749847596
-
Cognitive differences in proficient and nonproficient essay scorers
-
Wolfe, E. W., Kao, C. W., & Ranney, M. (1998). Cognitive differences in proficient and nonproficient essay scorers. Written Communication, 15, 465-492.
-
(1998)
Written Communication
, vol.15
, pp. 465-492
-
-
Wolfe, E.W.1
Kao, C.W.2
Ranney, M.3