



Volume 69, Issue 6, 2009, Pages 887-912

Magnitude of Task-Sampling Variability in Performance Assessment: A Meta-Analysis

Author keywords

Generalizability theory; Meta analysis; Performance assessment; Task sampling variation; Variance component


EID: 70449969432     PISSN: 00131644     EISSN: 15523888     Source Type: Journal    
DOI: 10.1177/0013164409344550     Document Type: Article
Times cited : (14)

References (67)
  • 1
    • 0030613897
    • Resampling tests for meta-analysis of ecological data
    • Adams, D.C., Gurevitch, J., & Rosenberg, M.S. (1997). Resampling tests for meta-analysis of ecological data. Ecology, 78, 1277-1283.
    • (1997) Ecology , vol.78 , pp. 1277-1283
    • Adams, D.C.1    Gurevitch, J.2    Rosenberg, M.S.3
  • 3
    • 0013292154
    • Dimensionality and generalizability of domain-independent performance assessments
    • Baker, E.L., Abedi, J., Linn, R.L., & Niemi, D. (1995). Dimensionality and generalizability of domain-independent performance assessments. Journal of Educational Research, 89, 197-205.
    • (1995) Journal of Educational Research , vol.89 , pp. 197-205
    • Baker, E.L.1    Abedi, J.2    Linn, R.L.3    Niemi, D.4
  • 4
    • 70449947986
    • Generalizability of writing tasks at fourth grade in the Riverside Unified School District
    • Barrett, T.J. (1994, November 17-18). Generalizability of writing tasks at fourth grade in the Riverside Unified School District. Paper presented at the 73rd Annual Meeting of the California Educational Research Association, San Diego, CA. (ERIC Document Reproduction Service No. 377498).
    • (1994) Generalizability of Writing Tasks at Fourth Grade in the Riverside Unified School District
    • Barrett, T.J.1
  • 6
    • 84988099631
    • Generalizability, validity, and examinee perceptions of a computer-delivered Formulating Hypotheses Test
    • Bennett, R.E., & Rock, D.L. (1995). Generalizability, validity, and examinee perceptions of a computer-delivered Formulating Hypotheses Test. Journal of Educational Measurement, 32, 19-36.
    • (1995) Journal of Educational Measurement , vol.32 , pp. 19-36
    • Bennett, R.E.1    Rock, D.L.2
  • 7
    • 85032068735
    • Use of the essay examination to investigate the writing skills of undergraduate education majors
    • Boodoo, G.M., & Garlinghouse, P. (1983). Use of the essay examination to investigate the writing skills of undergraduate education majors. Educational and Psychological Measurement, 43, 1005-1014.
    • (1983) Educational and Psychological Measurement , vol.43 , pp. 1005-1014
    • Boodoo, G.M.1    Garlinghouse, P.2
  • 11
    • 70449947984
    • Total score reliability in large-scale writing assessment
    • Bunch, M.B., & Littlefair, W. (1988, June). Total score reliability in large-scale writing assessment. Paper presented at the Conference of the Education Commission of the States/ Colorado Department of Education Assessment, Boulder, CO. (ERIC Document Reproduction Service No. 310149).
    • (1988) Total Score Reliability in Large-scale Writing Assessment
    • Bunch, M.B.1    Littlefair, W.2
  • 12
    • 18644385268
    • Generalizability theory: A new approach to analyze non-crossed performance assessment data
    • Chiu, C.W.T., & Wolfe, E.W. (1997, March 24-28). Generalizability theory: A new approach to analyze non-crossed performance assessment data. Paper presented at the Annual Meeting of the American Educational Research Association, Chicago. (ERIC Document Reproduction Service No. 408326).
    • (1997) Generalizability Theory: A New Approach to Analyze Non-crossed Performance Assessment Data
    • Chiu, C.W.T.1    Wolfe, E.W.2
  • 13
    • 0034257275
    • The generalizability of scores for a performance assessment scored with a computer-automated scoring system
    • Clauser, B.E., Harik, P., & Clyman, S.G. (2000). The generalizability of scores for a performance assessment scored with a computer-automated scoring system. Journal of Educational Measurement, 37, 245-261.
    • (2000) Journal of Educational Measurement , vol.37 , pp. 245-261
    • Clauser, B.E.1    Harik, P.2    Clyman, S.G.3
  • 14
    • 33747277764
    • A multivariate analysis of assessment of physicians' clinical skills
    • Clauser, B.E., Harik, P., & Margolis, M.J. (2006). A multivariate analysis of assessment of physicians' clinical skills. Journal of Educational Measurement, 43, 173-191.
    • (2006) Journal of Educational Measurement , vol.43 , pp. 173-191
    • Clauser, B.E.1    Harik, P.2    Margolis, M.J.3
  • 15
    • 0036795388
    • An examination of the contribution of computer-based case simulations to the USMLE Step 3 examination
    • Clauser, B.E., Margolis, M.J., & Swanson, D.B. (2002). An examination of the contribution of computer-based case simulations to the USMLE Step 3 examination. Academic Medicine, 77, S80-S82.
    • (2002) Academic Medicine , vol.77
    • Clauser, B.E.1    Margolis, M.J.2    Swanson, D.B.3
  • 17
    • 0001059534
    • Integration and generalization of kappas for multiple raters
    • Conger, A.J. (1980). Integration and generalization of kappas for multiple raters. Psychological Bulletin, 88, 322-328.
    • (1980) Psychological Bulletin , vol.88 , pp. 322-328
    • Conger, A.J.1
  • 18
    • 83455161021
    • A discussion of analytic scoring for writing performance assessment
    • Crehan, K.D. (1997, October). A discussion of analytic scoring for writing performance assessment. Paper presented at the Annual Meeting of the Arizona Educational Research Association, Phoenix, AZ. (ERIC Document Reproduction Service No. 414336).
    • (1997) A Discussion of Analytic Scoring for Writing Performance Assessment
    • Crehan, K.D.1
  • 19
    • 0031476178
    • Generalizability analysis for performance assessments of student achievement or school effectiveness
    • Cronbach, L.J., Linn, R.L., Brennan, R.L., & Haertel, E.H. (1997). Generalizability analysis for performance assessments of student achievement or school effectiveness. Educational and Psychological Measurement, 57, 373-399.
    • (1997) Educational and Psychological Measurement , vol.57 , pp. 373-399
    • Cronbach, L.J.1    Linn, R.L.2    Brennan, R.L.3    Haertel, E.H.4
  • 20
    • 0035630874
    • Variability of estimated variance components and related statistics in a performance assessment
    • Gao, X., & Brennan, R.L. (2001). Variability of estimated variance components and related statistics in a performance assessment. Applied Measurement in Education, 14, 191-203.
    • (2001) Applied Measurement in Education , vol.14 , pp. 191-203
    • Gao, X.1    Brennan, R.L.2
  • 21
    • 21844495653
    • Generalizability of large-scale performance assessments in science: Promises and problems
    • Gao, X., Shavelson, R.J., & Baxter, G.P. (1994). Generalizability of large-scale performance assessments in science: Promises and problems. Applied Measurement in Education, 7, 323-342.
    • (1994) Applied Measurement in Education , vol.7 , pp. 323-342
    • Gao, X.1    Shavelson, R.J.2    Baxter, G.P.3
  • 22
    • 0041029763
    • Magnitude and moderators of bias in observer ratings: A meta-analysis
    • Hoyt, W.T., & Kerns, M.-D. (1999). Magnitude and moderators of bias in observer ratings: A meta-analysis. Psychological Methods, 4, 403-424.
    • (1999) Psychological Methods , vol.4 , pp. 403-424
    • Hoyt, W.T.1    Kerns, M.-D.2
  • 24
    • 70450095242
    • Investigating the generalizability of scores from different rating systems in performance assessment
    • Kim, S.C. (2000, April 24-28). Investigating the generalizability of scores from different rating systems in performance assessment. Paper presented at the Annual Meeting of the American Educational Research Association, New Orleans, LA. (ERIC Document Reproduction Service No. 449209).
    • (2000) Investigating the Generalizability of Scores from Different Rating Systems in Performance Assessment
    • Kim, S.C.1
  • 28
    • 34248704279
    • Using GENOVA and FACETS to set multiple standards on performance assessment for certification in medical translation from Japanese into English
    • Kozaki, Y. (2004). Using GENOVA and FACETS to set multiple standards on performance assessment for certification in medical translation from Japanese into English. Language Testing, 21, 1-27.
    • (2004) Language Testing , vol.21 , pp. 1-27
    • Kozaki, Y.1
  • 30
    • 0345757913
    • A prelude to modeling the expert: A generalizability study of expert ratings of performance on computerized clinical simulations
    • Kreiter, C.D., Gordon, J.A., Elloit, S.T., & Ferguson, K.J. (1999). A prelude to modeling the expert: A generalizability study of expert ratings of performance on computerized clinical simulations. Advances in Health Sciences Education, 4, 261-270.
    • (1999) Advances in Health Sciences Education , vol.4 , pp. 261-270
    • Kreiter, C.D.1    Gordon, J.A.2    Elloit, S.T.3    Ferguson, K.J.4
  • 35
    • 0000544370
    • Reliability and generalizability of ratings of compositions
    • Lehmann, R.H. (1990). Reliability and generalizability of ratings of compositions. Studies in Educational Evaluation, 16, 501-512.
    • (1990) Studies in Educational Evaluation , vol.16 , pp. 501-512
    • Lehmann, R.H.1
  • 37
    • 85005349154
    • Performance-based assessment: Implications of task specificity
    • Linn, R.L., & Burton, E. (1994). Performance-based assessment: Implications of task specificity. Educational Measurement: Issues and Practice, 13(1), 5-8.
    • (1994) Educational Measurement: Issues and Practice , vol.13 , Issue.1 , pp. 5-8
    • Linn, R.L.1    Burton, E.2
  • 38
    • 0031311356
    • Scoring and analysis of performance examinations: A comparison of methods and interpretations
    • Lunz, M.E., & Schumacker, R.E. (1997). Scoring and analysis of performance examinations: A comparison of methods and interpretations. Journal of Outcome Measurement, 1, 219-238.
    • (1997) Journal of Outcome Measurement , vol.1 , pp. 219-238
    • Lunz, M.E.1    Schumacker, R.E.2
  • 39
    • 0002533616
    • Using G-theory and many-facet Rasch measurement in the development of performance assessments of the ESL speaking skills of immigrants
    • Lynch, B.K., & McNamara, T.F. (1998). Using G-theory and many-facet Rasch measurement in the development of performance assessments of the ESL speaking skills of immigrants. Language Testing, 15, 158-180.
    • (1998) Language Testing , vol.15 , pp. 158-180
    • Lynch, B.K.1    McNamara, T.F.2
  • 42
    • 0032400259
    • The generalizability of a performance assessment measuring achievement in eighth grade mathematics
    • McBee, M.M., & Barnes, L.L.B. (1998). The generalizability of a performance assessment measuring achievement in eighth grade mathematics. Applied Measurement in Education, 11, 179-194.
    • (1998) Applied Measurement in Education , vol.11 , pp. 179-194
    • McBee, M.M.1    Barnes, L.L.B.2
  • 44
    • 0033483781
    • Prophesying the reliability of cognitively complex assessments
    • Nichols, P., & Kuehl, B.J. (1999). Prophesying the reliability of cognitively complex assessments. Applied Measurement in Education, 12, 73-94.
    • (1999) Applied Measurement in Education , vol.12 , pp. 73-94
    • Nichols, P.1    Kuehl, B.J.2
  • 45
    • 84965668074
    • Multivariate generalizability theory in educational measurement: An empirical study
    • Nußbaum, A. (1984). Multivariate generalizability theory in educational measurement: An empirical study. Applied Psychological Measurement, 8, 219-230.
    • (1984) Applied Psychological Measurement , vol.8 , pp. 219-230
    • Nußbaum, A.1
  • 46
    • 0002510810
    • Evaluating coding decisions
    • In H. Cooper & L. V. Hedges (Eds.), New York: Russell Sage Foundation
    • Orwin, R.G. (1994). Evaluating coding decisions. In H. Cooper & L. V. Hedges (Eds.), Handbook of research synthesis (pp. 139-162). New York: Russell Sage Foundation.
    • (1994) Handbook of Research Synthesis , pp. 139-162
    • Orwin, R.G.1
  • 47
    • 10444287377
    • The role of transfer in the variability of performance assessment scores
    • Parkes, J. (2001). The role of transfer in the variability of performance assessment scores. Educational Assessment, 7, 143-164.
    • (2001) Educational Assessment , vol.7 , pp. 143-164
    • Parkes, J.1
  • 55
    • 0040029161
    • Generalizability of job performance measurements: Marine Corps rifleman
    • Shavelson, R.J., Mayberry, P.W., Li, W., & Webb, N.M. (1990). Generalizability of job performance measurements: Marine Corps rifleman. Military Psychology, 2, 129-144.
    • (1990) Military Psychology , vol.2 , pp. 129-144
    • Shavelson, R.J.1    Mayberry, P.W.2    Li, W.3    Webb, N.M.4
  • 57
    • 3242700805
    • An application of generalizability theory and many-facet Rasch measurement using a complex problem-solving skills assessment
    • Smith, E.V., Jr., & Kulikowich, J.M. (2004). An application of generalizability theory and many-facet Rasch measurement using a complex problem-solving skills assessment. Educational and Psychological Measurement, 64, 617-639.
    • (2004) Educational and Psychological Measurement , vol.64 , pp. 617-639
    • Smith Jr., E.V.1    Kulikowich, J.M.2
  • 60
    • 70450118579
    • An investigation on the generalizability of performance-based assessment in mathematics
    • Suzuki, K., & Harnisch, D.L. (1996, April 8-12). An investigation on the generalizability of performance-based assessment in mathematics. Paper presented at the Annual Meeting of the American Educational Research Association, New York. (ERIC Document Reproduction Service No. 435626).
    • (1996) An Investigation on the Generalizability of Performance-based Assessment in Mathematics
    • Suzuki, K.1    Harnisch, D.L.2
  • 62
    • 0034414898
    • The dependability and interchangeability of assessment methods in science
    • Webb, N.M., Schlackman, J., & Sugrue, B. (2000). The dependability and interchangeability of assessment methods in science. Applied Measurement in Education, 13, 277-301.
    • (2000) Applied Measurement in Education , vol.13 , pp. 277-301
    • Webb, N.M.1    Schlackman, J.2    Sugrue, B.3
  • 63
    • 0012878967
    • Reliability (generalizability) of job performance measurements: Navy machinist mates
    • Webb, N.M., Shavelson, R.J., Kim, K.-S., & Chen, Z. (1989). Reliability (generalizability) of job performance measurements: Navy machinist mates. Military Psychology, 1, 91-110.
    • (1989) Military Psychology , vol.1 , pp. 91-110
    • Webb, N.M.1    Shavelson, R.J.2    Kim, K.-S.3    Chen, Z.4
  • 64
    • 84979406003
    • Testing pronunciation: An application of generalizability theory
    • van Weeren, J., & Theunissen, T.J.J.M. (1987). Testing pronunciation: An application of generalizability theory. Language Learning, 37, 109-122.
    • (1987) Language Learning , vol.37 , pp. 109-122
    • van Weeren, J.1    Theunissen, T.J.J.M.2
  • 66
    • 42649146120
    • Application of generalizability theory to concept-map assessment research
    • Yin, Y., & Shavelson, R.J. (2004). Application of generalizability theory to concept-map assessment research. Los Angeles: Center for the Study of Evaluation/ National Center for Research on Evaluation, Standards, and Student Testing. (ERIC Document Reproduction Service No. 483407).
    • (2004) Application of Generalizability Theory to Concept-Map Assessment Research
    • Yin, Y.1    Shavelson, R.J.2
  • 67
    • 70450005601
    • Validating standards-referenced science assessments
    • Yoon, B., & Young, M.J. (2000). Validating standards-referenced science assessments. Los Angeles: Center for the Study of Evaluation/ National Center for Research on Evaluation, Standards, and Student Testing. (ERIC Document Reproduction Service No. 450138).
    • (2000) Validating Standards-Referenced Science Assessments
    • Yoon, B.1    Young, M.J.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.