-
1
-
-
0001506437
-
Open-ended exercises in large-scale educational assessment
-
Bock, R. D. (1995). Open-ended exercises in large-scale educational assessment. In L. B. Resnick & J. G. Wirt (Eds.), Linking school and work: Roles for standards and assessment (pp. 305-338). San Francisco: Jossey-Bass.
-
(1995)
Linking School and Work: Roles for Standards and Assessment
, pp. 305-338
-
-
Bock, R.D.1
-
2
-
-
84988099050
-
Item pool maintenance in the presence of item parameter drift
-
Bock, R. D., Muraki, E., & Pfeiffenberger, W. (1988). Item pool maintenance in the presence of item parameter drift. Journal of Educational Measurement, 25, 275-285.
-
(1988)
Journal of Educational Measurement
, vol.25
, pp. 275-285
-
-
Bock, R.D.1
Muraki, E.2
Pfeiffenberger, W.3
-
3
-
-
3342912106
-
Assessing the written communication skills of medical school graduates
-
Boulet, J. R., Rebbecchi, T., Denton, E. C., McKinley, D. W., & Whalen, G. P. (2007). Assessing the written communication skills of medical school graduates. Advances in Health Sciences Education, 9, 47-60.
-
(2007)
Advances in Health Sciences Education
, vol.9
, pp. 47-60
-
-
Boulet, J.R.1
Rebbecchi, T.2
Denton, E.C.3
McKinley, D.W.4
Whalen, G.P.5
-
4
-
-
0001841268
-
Understanding scoring reliability: Experiments in calibrating essay readers
-
Braun, H. I. (1988). Understanding scoring reliability: Experiments in calibrating essay readers. Journal of Educational Statistics, 13, 1-18.
-
(1988)
Journal of Educational Statistics
, vol.13
, pp. 1-18
-
-
Braun, H.I.1
-
5
-
-
0038561568
-
Making essay test scores fairer with statistics
-
Braun, H. I., & Wainer, H. (1989). Making essay test scores fairer with statistics. In J. Tanur, F. Mosteller, W. H. Kruskal, E. L. Lehmann, R. F. Link, R. S. Pieters, & G. S. Rising (Eds.), Statistics: A guide to the unknown (3rd ed., pp. 178-188). Pacific Grove, CA: Wadsworth.
-
(1989)
Statistics: A Guide to the Unknown
, pp. 178-188
-
-
Braun, H.I.1
Wainer, H.2
-
7
-
-
33747277764
-
A multivariate generalizability analysis of data from a performance assessment of physicians' clinical skills
-
Clauser, B. E., Harik, P., & Margolis, M. J. (2006). A multivariate generalizability analysis of data from a performance assessment of physicians' clinical skills. Journal of Educational Measurement, 43, 173-191.
-
(2006)
Journal of Educational Measurement
, vol.43
, pp. 173-191
-
-
Clauser, B.E.1
Harik, P.2
Margolis, M.J.3
-
8
-
-
0034195156
-
The stability of rater severity in large-scale assessment programs
-
Congdon, P. J., & McQueen, J. (2000). The stability of rater severity in large-scale assessment programs. Journal of Educational Measurement, 37, 163-178.
-
(2000)
Journal of Educational Measurement
, vol.37
, pp. 163-178
-
-
Congdon, P.J.1
McQueen, J.2
-
10
-
-
84988122960
-
Examining rater errors in the assessment of written composition with a many-faceted Rasch model
-
Engelhard, G., Jr. (1994). Examining rater errors in the assessment of written composition with a many-faceted Rasch model. Journal of Educational Measurement, 31, 93-112.
-
(1994)
Journal of Educational Measurement
, vol.31
, pp. 93-112
-
-
Engelhard Jr., G.1
-
11
-
-
0030306428
-
Evaluating rater accuracy in performance assessments
-
Engelhard, G., Jr. (1996). Evaluating rater accuracy in performance assessments. Journal of Educational Measurement, 33, 56-70.
-
(1996)
Journal of Educational Measurement
, vol.33
, pp. 56-70
-
-
Engelhard Jr., G.1
-
12
-
-
84970178351
-
Balanced incomplete block designs for inter-rater reliability studies
-
Fleiss, J. L. (1981). Balanced incomplete block designs for inter-rater reliability studies. Applied Psychological Measurement, 5, 105-112.
-
(1981)
Applied Psychological Measurement
, vol.5
, pp. 105-112
-
-
Fleiss, J.L.1
-
16
-
-
0036053914
-
Understanding Rasch measurement: Construction of measures from many-facet data
-
Linacre, J. M., & Wright, B. D. (2002). Understanding Rasch measurement: Construction of measures from many-facet data. Journal of Applied Measurement, 3, 486-512.
-
(2002)
Journal of Applied Measurement
, vol.3
, pp. 486-512
-
-
Linacre, J.M.1
Wright, B.D.2
-
18
-
-
0141993684
-
Analysis of the relationship between score components on a standardized patient clinical skills examination
-
Margolis, M. J., Clauser, B. E., Swanson, D. B., & Boulet, J. R. (2003). Analysis of the relationship between score components on a standardized patient clinical skills examination. Academic Medicine, 78, S68-S71.
-
(2003)
Academic Medicine
, vol.78
-
-
Margolis, M.J.1
Clauser, B.E.2
Swanson, D.B.3
Boulet, J.R.4
-
19
-
-
3342964203
-
Detecting score drift in a high-stakes performance-based assessment
-
McKinley, D., & Boulet, J. R. (2004). Detecting score drift in a high-stakes performance-based assessment. Advances in Health Sciences Education, 9, 29-38.
-
(2004)
Advances in Health Sciences Education
, vol.9
, pp. 29-38
-
-
McKinley, D.1
Boulet, J.R.2
-
20
-
-
84988101449
-
Least squares models to correct for rater effects in performance assessment
-
Raymond, M. R., & Viswesvaran, C. (1993). Least squares models to correct for rater effects in performance assessment. Journal of Educational Measurement, 30, 253-268.
-
(1993)
Journal of Educational Measurement
, vol.30
, pp. 253-268
-
-
Raymond, M.R.1
Viswesvaran, C.2
-
21
-
-
61349160977
-
Analysis-of-variance principles applied to the grading of essay tests
-
Stanley, J. C. (1962). Analysis-of-variance principles applied to the grading of essay tests. Journal of Experimental Education, 30, 279-283.
-
(1962)
Journal of Experimental Education
, vol.30
, pp. 279-283
-
-
Stanley, J.C.1
-
23
-
-
0035755690
-
Detecting differential rater functioning over time (DRIFT) using a Rasch multi-faceted rating scale model
-
POLINA HARIK is a Senior Measurement Scientist, National Board of Medical Examiners, 3750 Market Street, Philadelphia, PA 19104. Her primary research interests include test modeling and differential item functioning.
-
Wolfe, E. W., Moulder, B. C., & Myford, C. M. (2001). Detecting differential rater functioning over time (DRIFT) using a Rasch multi-faceted rating scale model. Journal of Applied Measurement, 2, 256-280.
-
(2001)
Journal of Applied Measurement
, vol.2
, pp. 256-280
-
-
Wolfe, E.W.1
Moulder, B.C.2
Myford, C.M.3