Volume 2, Issue 2, 2007, Pages 130-144

The use of scoring rubrics: Reliability, validity and educational consequences

Author keywords

Alternative assessment; Performance assessment; Reliability; Scoring rubrics; Validity

EID: 36049027816     PISSN: 1747938X     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.edurev.2007.05.002     Document Type: Review
Times cited: 796

References (76)
  • 2
    • 36048934562
    • Baartman, L. K. J., Bastiaens, T. J., Kirschner, P. A., & Van der Vleuten, C. P. M. (submitted for publication). Assessment in competence-based education: How can assessment quality be evaluated? Educational Research Review.
  • 7
    • 0003340032
    • Generalizability of performance assessments
    • Phillips G. (Ed), National Center for Education Statistics, Washington, DC
    • Brennan R. Generalizability of performance assessments. In: Phillips G. (Ed). Technical issues in large-scale performance assessment (1996), National Center for Education Statistics, Washington, DC 19-58
    • (1996) Technical issues in large-scale performance assessment , pp. 19-58
    • Brennan, R.1
  • 11
    • 0039608544
    • Effects of ethnicity and violent content on rubric scores in writing samples
    • Davidson M., Howell K.W., and Hoekema P. Effects of ethnicity and violent content on rubric scores in writing samples. Journal of Educational Research 93 (2000) 367-373
    • (2000) Journal of Educational Research , vol.93 , pp. 367-373
    • Davidson, M.1    Howell, K.W.2    Hoekema, P.3
  • 12
    • 52949147261
    • Learning and the emerging new assessment culture
    • Verschaffel L., Dochy F., Boekaerts M., and Vosniadou S. (Eds), Elsevier, Oxford, Amsterdam
    • Dochy F., Gijbels D., and Segers M. Learning and the emerging new assessment culture. In: Verschaffel L., Dochy F., Boekaerts M., and Vosniadou S. (Eds). Instructional psychology: Past, present and future trends (2006), Elsevier, Oxford, Amsterdam
    • (2006) Instructional psychology: Past, present and future trends
    • Dochy, F.1    Gijbels, D.2    Segers, M.3
  • 13
    • 0346728653
    • The use of self-, peer and co-assessment in higher education: A review
    • Dochy F., Segers M., and Sluijsmans D. The use of self-, peer and co-assessment in higher education: A review. Studies in Higher Education 24 (1999) 331-350
    • (1999) Studies in Higher Education , vol.24 , pp. 331-350
    • Dochy, F.1    Segers, M.2    Sluijsmans, D.3
  • 14
    • 84965584733
    • Student self-assessment in higher education: A meta-analysis
    • Falchikov N., and Boud D. Student self-assessment in higher education: A meta-analysis. Review of Educational Research 59 (1989) 395-430
    • (1989) Review of Educational Research , vol.59 , pp. 395-430
    • Falchikov, N.1    Boud, D.2
  • 15
    • 0034562333
    • Student peer assessment in higher education: A meta-analysis comparing peer and teacher marks
    • Falchikov N., and Goldfinch J. Student peer assessment in higher education: A meta-analysis comparing peer and teacher marks. Review of Educational Research 70 (2000) 287-322
    • (2000) Review of Educational Research , vol.70 , pp. 287-322
    • Falchikov, N.1    Goldfinch, J.2
  • 17
    • 15244356096
    • Evaluating the consequential validity of new modes of assessment: The influence of assessment on learning, including pre-, post-, and true assessment effects
    • Segers M., Dochy F., and Cascallar E. (Eds), Kluwer Academic Publishers, Dordrecht
    • Gielen S., Dochy F., and Dierick S. Evaluating the consequential validity of new modes of assessment: The influence of assessment on learning, including pre-, post-, and true assessment effects. In: Segers M., Dochy F., and Cascallar E. (Eds). Optimizing new modes of assessment: In search of qualities and standards (2003), Kluwer Academic Publishers, Dordrecht
    • (2003) Optimizing new modes of assessment: In search of qualities and standards
    • Gielen, S.1    Dochy, F.2    Dierick, S.3
  • 18
    • 0000737569
    • Writing to the rubric: Lingering effects of traditional standardized testing on direct writing assessment
    • Mabry L. Writing to the rubric: Lingering effects of traditional standardized testing on direct writing assessment. Phi Delta Kappan 80 (1999) 673-679
    • (1999) Phi Delta Kappan , vol.80 , pp. 673-679
    • Mabry, L.1
  • 20
    • 0001840881
    • Validity of performance assessments
    • Phillips G. (Ed), National Center for Education Statistics, Washington, DC
    • Messick S. Validity of performance assessments. In: Phillips G. (Ed). Technical issues in large-scale performance assessment (1996), National Center for Education Statistics, Washington, DC 1-18
    • (1996) Technical issues in large-scale performance assessment , pp. 1-18
    • Messick, S.1
  • 23
    • 79955527123
    • The importance of marking criteria in the use of peer assessment
    • Orsmond P., and Merry S. The importance of marking criteria in the use of peer assessment. Assessment & Evaluation in Higher Education 21 (1996) 239-250
    • (1996) Assessment & Evaluation in Higher Education , vol.21 , pp. 239-250
    • Orsmond, P.1    Merry, S.2
  • 24
    • 36048988950
    • Perlman, C.C. (2003). Performance assessment: Designing appropriate performance tasks and scoring rubrics. North Carolina, USA.
  • 25
    • 0002556185
    • On the content validity of performance assessments: Centrality of domain-specifications
    • Birenbaum M., and Dochy F. (Eds), Kluwer Academic Publishers, Boston
    • Shavelson R.J., Gao X., and Baxter G. On the content validity of performance assessments: Centrality of domain-specifications. In: Birenbaum M., and Dochy F. (Eds). Alternatives in assessment of achievements, learning processes and prior knowledge (1996), Kluwer Academic Publishers, Boston
    • (1996) Alternatives in assessment of achievements, learning processes and prior knowledge
    • Shavelson, R.J.1    Gao, X.2    Baxter, G.3
  • 26
    • 84874593076
    • A comparison of consensus, consistency, and measurement approaches to estimating interrater reliability
    • Stemler S.E. A comparison of consensus, consistency, and measurement approaches to estimating interrater reliability. Practical Assessment, Research & Evaluation 9 (2004)
    • (2004) Practical Assessment, Research & Evaluation , vol.9
    • Stemler, S.E.1
  • 27
    • 26244466014
    • Self and peer assessment in school and university: Reliability, validity and utility
    • Segers M., Dochy F., and Cascallar E. (Eds), Kluwer Academic Publishers, Dordrecht
    • Topping K. Self and peer assessment in school and university: Reliability, validity and utility. In: Segers M., Dochy F., and Cascallar E. (Eds). Optimizing new modes of assessment: In search of qualities and standards (2003), Kluwer Academic Publishers, Dordrecht
    • (2003) Optimizing new modes of assessment: In search of qualities and standards
    • Topping, K.1
  • 28
    • 33749603973
    • Teachers' and students' perceptions of assessments: A review and a study into the ability and accuracy of estimating the difficulty levels of assessment items
    • Van de Watering G., and van der Rijt J. Teachers' and students' perceptions of assessments: A review and a study into the ability and accuracy of estimating the difficulty levels of assessment items. Educational Research Review 1 (2006) 133-147
    • (2006) Educational Research Review , vol.1 , pp. 133-147
    • Van de Watering, G.1    van der Rijt, J.2
  • 32
    • 21344488404
    • Learning-based assessments of history understanding
    • Baker E.L. Learning-based assessments of history understanding. Educational Psychologist 29 (1994) 97-106
    • (1994) Educational Psychologist , vol.29 , pp. 97-106
    • Baker, E.L.1
  • 33
    • 0013292154
    • Dimensionality and generalizability of domain-independent performance assessments
    • Baker E.L., Abedi J., Linn R.L., and Niemi D. Dimensionality and generalizability of domain-independent performance assessments. Journal of Educational Research 89 (1995) 197-205
    • (1995) Journal of Educational Research , vol.89 , pp. 197-205
    • Baker, E.L.1    Abedi, J.2    Linn, R.L.3    Niemi, D.4
  • 35
    • 30544445473
    • A new method for assessing critical thinking in the classroom
    • Bissell A.N., and Lemons P.R. A new method for assessing critical thinking in the classroom. BioScience 56 (2006) 66-72
    • (2006) BioScience , vol.56 , pp. 66-72
    • Bissell, A.N.1    Lemons, P.R.2
  • 36
    • 4644314465
    • Accuracy in the scoring of writing: Studies of reliability and validity using a New Zealand writing assessment system
    • Brown G.T.L., Glasswell K., and Harland D. Accuracy in the scoring of writing: Studies of reliability and validity using a New Zealand writing assessment system. Assessing Writing 9 (2004) 105-121
    • (2004) Assessing Writing , vol.9 , pp. 105-121
    • Brown, G.T.L.1    Glasswell, K.2    Harland, D.3
  • 37
    • 0035753889
    • Comparing holistic and analytic scoring for performance assessment with many-facet Rasch model
    • Chi E. Comparing holistic and analytic scoring for performance assessment with many-facet Rasch model. Journal of Applied Measurement 2 (2001) 379-388
    • (2001) Journal of Applied Measurement , vol.2 , pp. 379-388
    • Chi, E.1
  • 38
    • 33751015138
    • Validity and reliability of scaffolded peer assessment of writing from instructor and student perspectives
    • Cho K., Schunn C.D., and Wilson R.W. Validity and reliability of scaffolded peer assessment of writing from instructor and student perspectives. Journal of Educational Psychology 98 (2006) 891-901
    • (2006) Journal of Educational Psychology , vol.98 , pp. 891-901
    • Cho, K.1    Schunn, C.D.2    Wilson, R.W.3
  • 40
    • 0043265942
    • Writing assessment: Raters' elaboration of the rating task
    • DeRemer M.L. Writing assessment: Raters' elaboration of the rating task. Assessing Writing 5 (1998) 7-29
    • (1998) Assessing Writing , vol.5 , pp. 7-29
    • DeRemer, M.L.1
  • 41
    • 36048969079
    • Duke, B. L. (2003). The influence of using cognitive strategy instruction through writing rubrics on high school students' writing self-efficacy, achievement goal orientation, perceptions of classroom goal structures, self-regulation, and writing achievement. Unpublished doctoral dissertation. USA: University of Oklahoma.
  • 43
    • 0010779177
    • Toward the instructional utility of large-scale writing assessment: Validation of a new narrative rubric
    • Gearhart M., Herman J.L., Novak J.R., and Wolf S.A. Toward the instructional utility of large-scale writing assessment: Validation of a new narrative rubric. Assessing Writing 2 (1995) 207-242
    • (1995) Assessing Writing , vol.2 , pp. 207-242
    • Gearhart, M.1    Herman, J.L.2    Novak, J.R.3    Wolf, S.A.4
  • 44
    • 33845801344
    • Observations from the field: Sharing a literature review rubric
    • Green R., and Bowser M. Observations from the field: Sharing a literature review rubric. Journal of Library Administration 45 (2006) 185-202
    • (2006) Journal of Library Administration , vol.45 , pp. 185-202
    • Green, R.1    Bowser, M.2
  • 45
    • 0347447439
    • Quantitative analysis of the rubric as an assessment tool: An empirical study of student peer-group rating
    • Hafner J.C., and Hafner P.M. Quantitative analysis of the rubric as an assessment tool: An empirical study of student peer-group rating. International Journal of Science Education 25 (2003) 1509-1528
    • (2003) International Journal of Science Education , vol.25 , pp. 1509-1528
    • Hafner, J.C.1    Hafner, P.M.2
  • 46
    • 0034386497
    • The relation between score resolution methods and interrater reliability: An empirical study of an analytic scoring rubric
    • Johnson R.L., Penny J., and Gordon B. The relation between score resolution methods and interrater reliability: An empirical study of an analytic scoring rubric. Applied Measurement in Education 13 (2000) 121-138
    • (2000) Applied Measurement in Education , vol.13 , pp. 121-138
    • Johnson, R.L.1    Penny, J.2    Gordon, B.3
  • 47
    • 85001195529
    • Score resolution and the interrater reliability of holistic scores in rating essays
    • Johnson R.L., Penny J., and Gordon B. Score resolution and the interrater reliability of holistic scores in rating essays. Written Communication 18 (2001) 229-249
    • (2001) Written Communication , vol.18 , pp. 229-249
    • Johnson, R.L.1    Penny, J.2    Gordon, B.3
  • 50
    • 36048950967
    • Lunsford, B. E. (2002). Inquiry and inscription as keys to authentic science instruction and assessment for preservice secondary science teachers. Unpublished doctoral dissertation. USA: University of Tennessee.
  • 51
    • 0036039272
    • A comparison of selected methods of scoring classroom assessments
    • Marzano R.J. A comparison of selected methods of scoring classroom assessments. Applied Measurement in Education 15 (2002) 249-267
    • (2002) Applied Measurement in Education , vol.15 , pp. 249-267
    • Marzano, R.J.1
  • 52
    • 0034187230
    • Scenario assignments as assessment tools for undergraduate engineering education
    • McMartin F., McKenna A., and Youssefi K. Scenario assignments as assessment tools for undergraduate engineering education. IEEE Transactions on Education 43 (2000) 111-120
    • (2000) IEEE Transactions on Education , vol.43 , pp. 111-120
    • McMartin, F.1    McKenna, A.2    Youssefi, K.3
  • 54
    • 36049037522
    • Mullen, Y. K. (2003). Student improvement in middle school science. Unpublished master's thesis. USA: University of Wisconsin.
  • 56
    • 36048955328
    • Critical thinking in preservice teachers: A rubric for evaluating argumentation and statistical reasoning
    • Osana H.P., and Seymour J.R. Critical thinking in preservice teachers: A rubric for evaluating argumentation and statistical reasoning. Educational Research and Evaluation 10 (2004) 473-498
    • (2004) Educational Research and Evaluation , vol.10 , pp. 473-498
    • Osana, H.P.1    Seymour, J.R.2
  • 57
    • 21844486359
    • Assessing literacy: Establishing common standards in portfolio assessment
    • Paratore J.R. Assessing literacy: Establishing common standards in portfolio assessment. Topics in Language Disorders 16 (1995) 67-83
    • (1995) Topics in Language Disorders , vol.16 , pp. 67-83
    • Paratore, J.R.1
  • 58
    • 0141603477
    • The effect of rating augmentation on inter-rater reliability: An empirical study of a holistic rubric
    • Penny J., Johnson R.L., and Gordon B. The effect of rating augmentation on inter-rater reliability: An empirical study of a holistic rubric. Assessing Writing 7 (2000) 143-164
    • (2000) Assessing Writing , vol.7 , pp. 143-164
    • Penny, J.1    Johnson, R.L.2    Gordon, B.3
  • 59
    • 0034145314
    • Using rating augmentation to expand the scale of an analytic rubric
    • Penny J., Johnson R.L., and Gordon B. Using rating augmentation to expand the scale of an analytic rubric. Journal of Experimental Education 68 (2000) 269-287
    • (2000) Journal of Experimental Education , vol.68 , pp. 269-287
    • Penny, J.1    Johnson, R.L.2    Gordon, B.3
  • 60
    • 36049032687
    • Piscitello, M. E. (2001). Using rubrics for assessment and evaluation in art. Unpublished master's thesis. USA: Saint Xavier University.
  • 62
    • 0037425790
    • Validation of the Fresno test of competence in evidence based medicine
    • Ramos K.D., Schafer S., and Tracz S.M. Validation of the Fresno test of competence in evidence based medicine. British Medical Journal 326 (2003) 319-321
    • (2003) British Medical Journal , vol.326 , pp. 319-321
    • Ramos, K.D.1    Schafer, S.2    Tracz, S.M.3
  • 63
    • 6344249348
    • Design and use of a rubric to assess and encourage interactive qualities in distance courses
    • Roblyer M.D., and Wiencke W.R. Design and use of a rubric to assess and encourage interactive qualities in distance courses. American Journal of Distance Education 17 (2003) 77-99
    • (2003) American Journal of Distance Education , vol.17 , pp. 77-99
    • Roblyer, M.D.1    Wiencke, W.R.2
  • 64
    • 33645065584
    • The impact of self- and peer-grading on student learning
    • Sadler P.M., and Good E. The impact of self- and peer-grading on student learning. Educational Assessment 11 (2006) 1-31
    • (2006) Educational Assessment , vol.11 , pp. 1-31
    • Sadler, P.M.1    Good, E.2
  • 65
    • 0035634464
    • Effects of teacher knowledge of rubrics on student achievement in four content areas
    • Schafer W.D., Swanson G., Bené N., and Newberry G. Effects of teacher knowledge of rubrics on student achievement in four content areas. Applied Measurement in Education 14 (2001) 151-170
    • (2001) Applied Measurement in Education , vol.14 , pp. 151-170
    • Schafer, W.D.1    Swanson, G.2    Bené, N.3    Newberry, G.4
  • 66
    • 36049017603
    • Assessing and improving the quality of group critical thinking exhibited in the final projects of collaborative learning groups
    • Schamber J.F., and Mahoney S.L. Assessing and improving the quality of group critical thinking exhibited in the final projects of collaborative learning groups. Journal of General Education 55 (2006) 103-137
    • (2006) Journal of General Education , vol.55 , pp. 103-137
    • Schamber, J.F.1    Mahoney, S.L.2
  • 67
    • 0039621495
    • Using a writing assessment rubric for writing development of children who are deaf
    • Schirmer B.R., Bailey J., and Fitzgerald S.M. Using a writing assessment rubric for writing development of children who are deaf. Exceptional Children 65 (1999) 383-397
    • (1999) Exceptional Children , vol.65 , pp. 383-397
    • Schirmer, B.R.1    Bailey, J.2    Fitzgerald, S.M.3
  • 68
    • 36048982394
    • Demystifying the evaluation process for parents: Rubrics for marking student research projects
    • Shaw J. Demystifying the evaluation process for parents: Rubrics for marking student research projects. Teacher Librarian 32 (2004) 16-19
    • (2004) Teacher Librarian , vol.32 , pp. 16-19
    • Shaw, J.1
  • 69
    • 0009917470
    • Using rubrics for documentation of clinical work supervision
    • Smith J., and Hanna M.A. Using rubrics for documentation of clinical work supervision. Counselor Education and Supervision 37 (1998) 269-278
    • (1998) Counselor Education and Supervision , vol.37 , pp. 269-278
    • Smith, J.1    Hanna, M.A.2
  • 72
    • 85011187077
    • A generalizability study of the effects of training on teachers' abilities to rate children's writing using a rubric
    • Stuhlmann J., Daniel C., Dellinger A., Denny R.K., and Powers T. A generalizability study of the effects of training on teachers' abilities to rate children's writing using a rubric. Journal of Reading Psychology 20 (1999) 107-127
    • (1999) Journal of Reading Psychology , vol.20 , pp. 107-127
    • Stuhlmann, J.1    Daniel, C.2    Dellinger, A.3    Denny, R.K.4    Powers, T.5
  • 73
    • 0036107595
    • Mapping to know: The effects of representational guidance and reflective assessment on scientific inquiry
    • Toth E.E., Suthers D.D., and Lesgold A.M. Mapping to know: The effects of representational guidance and reflective assessment on scientific inquiry. Science Education 86 (2002) 264-286
    • (2002) Science Education , vol.86 , pp. 264-286
    • Toth, E.E.1    Suthers, D.D.2    Lesgold, A.M.3
  • 74
    • 36049008684
    • Waltman, K., Kahn, A., & Koency, G. (1998). Alternative approaches to scoring: The effects of using different scoring methods on the validity of scores from a performance assessment. CSE Technical Report 488. Los Angeles.
  • 75
    • 0043206862
    • Investigating rater/prompt interactions in writing assessment: Quantitative and qualitative approaches
    • Weigle S.C. Investigating rater/prompt interactions in writing assessment: Quantitative and qualitative approaches. Assessing Writing 6 (1999) 145-178
    • (1999) Assessing Writing , vol.6 , pp. 145-178
    • Weigle, S.C.1


* This information was extracted and analyzed by KISTI from Elsevier's SCOPUS database.