Volume 24, Issue 4, 2000, Pages 325-337

Equating and linking of performance assessments

Author keywords

Item response theory; Large scale assessment; National assessment of educational progress; Performance assessment; Test equating; Test linking

EID: 0034337148     PISSN: 01466216     EISSN: None     Source Type: Journal    
DOI: 10.1177/01466210022031787     Document Type: Article
Times cited : (45)

References (68)
  • 1
    • 0041111521
    • Graphical representation of multidimensional item response theory analyses
    • Ackerman, T. (1996). Graphical representation of multidimensional item response theory analyses. Applied Psychological Measurement, 20, 311-329.
    • (1996) Applied Psychological Measurement , vol.20 , pp. 311-329
    • Ackerman, T.1
  • 2
    • 84859638307
    • Equating under the graded response model
    • Baker, F. B. (1992). Equating under the graded response model. Applied Psychological Measurement, 16, 87-96.
    • (1992) Applied Psychological Measurement , vol.16 , pp. 87-96
    • Baker, F.B.1
  • 3
    • 84976942741
    • Equating tests under the nominal response model
    • Baker, F. B. (1993). Equating tests under the nominal response model. Applied Psychological Measurement, 17, 239-251.
    • (1993) Applied Psychological Measurement , vol.17 , pp. 239-251
    • Baker, F.B.1
  • 4
    • 0003009098
    • Estimating item parameters and latent ability when responses are scored in two or more nominal categories
    • Bock, R. D. (1972). Estimating item parameters and latent ability when responses are scored in two or more nominal categories. Psychometrika, 37, 29-51.
    • (1972) Psychometrika , vol.37 , pp. 29-51
    • Bock, R.D.1
  • 8
    • 0002734470
    • Multiple group IRT
    • W. J. van der Linden and R. K. Hambleton (Eds.), New York: Springer-Verlag
    • Bock, R. D., & Zimowski, M. F. (1997). Multiple group IRT. In W. J. van der Linden and R. K. Hambleton (Eds.), Handbook of modern item response theory (pp. 433-448). New York: Springer-Verlag.
    • (1997) Handbook of Modern Item Response Theory , pp. 433-448
    • Bock, R.D.1    Zimowski, M.F.2
  • 9
    • 0001732404
    • Fairness in large-scale performance assessment
    • G. W. Phillips (Ed.), Washington DC: National Center for Education Statistics
    • Bond, L., Moss, P., & Carr, P. (1996). Fairness in large-scale performance assessment. In G. W. Phillips (Ed.), Technical issues in large-scale performance assessment (pp. 117-140). Washington DC: National Center for Education Statistics.
    • (1996) Technical Issues in Large-Scale Performance Assessment , pp. 117-140
    • Bond, L.1    Moss, P.2    Carr, P.3
  • 15
    • 0001959627
    • Data analysis for the reading assessment
    • N. L. Allen, D. L. Kline, & C. A. Zelenak (Eds.), Report No. NCES 97-897, Washington DC: National Center for Education Statistics
    • Donoghue, J. R., Isham, S. P., & Worthington, L. H. (1996). Data analysis for the reading assessment. In N. L. Allen, D. L. Kline, & C. A. Zelenak (Eds.), The NAEP 1994 technical report (Report No. NCES 97-897, pp. 267-308). Washington DC: National Center for Education Statistics.
    • (1996) The NAEP 1994 Technical Report , pp. 267-308
    • Donoghue, J.R.1    Isham, S.P.2    Worthington, L.H.3
  • 16
    • 0001965776
    • Comparing IRT-based equating procedures for trend measurement in a complex test design
    • April. San Francisco
    • Donoghue, J. R., & Mazzeo, J. (1992, April). Comparing IRT-based equating procedures for trend measurement in a complex test design. Paper presented at the annual meeting of the National Council on Measurement in Education, San Francisco.
    • (1992) Annual Meeting of the National Council on Measurement in Education
    • Donoghue, J.R.1    Mazzeo, J.2
  • 17
    • 84952404505
    • Quality control in the development and use of performance assessments
    • Dunbar, S. B., Koretz, D. M., & Hoover, H. D. (1991). Quality control in the development and use of performance assessments. Applied Measurement in Education, 4, 289-303.
    • (1991) Applied Measurement in Education , vol.4 , pp. 289-303
    • Dunbar, S.B.1    Koretz, D.M.2    Hoover, H.D.3
  • 19
    • 0001587566
    • Measuring changes in educational attainment over time: Problems and possibilities
    • Goldstein, H. (1983). Measuring changes in educational attainment over time: Problems and possibilities. Journal of Educational Measurement, 20, 369-377.
    • (1983) Journal of Educational Measurement , vol.20 , pp. 369-377
    • Goldstein, H.1
  • 21
    • 0001712807
    • Developments in reading research and their implications for computer-adaptive reading assessment
    • March. Orlando FL
    • Grabe, W. (1997, March). Developments in reading research and their implications for computer-adaptive reading assessment. Paper presented at the 19th Language Testing Research Colloquium Conference, Orlando FL.
    • (1997) 19th Language Testing Research Colloquium Conference
    • Grabe, W.1
  • 22
    • 85005273035
    • Comparability of scores from performance assessments
    • Green, B. F. (1995). Comparability of scores from performance assessments. Educational Measurement: Issues and Practice, 14(4), 13-15.
    • (1995) Educational Measurement: Issues and Practice , vol.14 , Issue.4 , pp. 13-15
    • Green, B.F.1
  • 23
    • 0001270536
    • Equating logistic ability scales by a weighted least squares method
    • Haebara, T. (1980). Equating logistic ability scales by a weighted least squares method. Japanese Psychological Research, 22, 144-149.
    • (1980) Japanese Psychological Research , vol.22 , pp. 144-149
    • Haebara, T.1
  • 24
    • 0001802799
    • Comparability
    • G. W. Phillips (Ed.), Washington DC: National Center for Education Statistics
    • Haertel, E. H., & Linn, R. L. (1996). Comparability. In G. W. Phillips (Ed.), Technical issues in large-scale performance assessment (pp. 59-78). Washington DC: National Center for Education Statistics.
    • (1996) Technical Issues in Large-Scale Performance Assessment , pp. 59-78
    • Haertel, E.H.1    Linn, R.L.2
  • 25
    • 0000170789
    • Principles and selected applications of item response theory
    • R. L. Linn (Ed.), New York: Macmillan
    • Hambleton, R. K. (1989). Principles and selected applications of item response theory. In R. L. Linn (Ed.), Educational measurement (3rd ed., pp. 147-200). New York: Macmillan.
    • (1989) Educational Measurement 3rd Ed. , pp. 147-200
    • Hambleton, R.K.1
  • 27
    • 0001941639
    • Palo Alto CA: NAEP Validity Studies Panel, American Institutes for Research
    • Hedges, L. V., & Vevea, J. L. (1997). A study of equating in NAEP. Palo Alto CA: NAEP Validity Studies Panel, American Institutes for Research.
    • (1997) A Study of Equating in NAEP
    • Hedges, L.V.1    Vevea, J.L.2
  • 28
    • 0031506674
    • Stochastic ordering using the latent trait and the sum score in polytomous IRT models
    • Hemker, B. T., Sijtsma, K., Molenaar, I. W., & Junker, B. W. (1997). Stochastic ordering using the latent trait and the sum score in polytomous IRT models. Psychometrika, 62, 331-347.
    • (1997) Psychometrika , vol.62 , pp. 331-347
    • Hemker, B.T.1    Sijtsma, K.2    Molenaar, I.W.3    Junker, B.W.4
  • 30
    • 84988122259
    • A comparison of equal percentile and partial credit equatings for performance-based assessments composed of free-response items
    • Huynh, H., & Ferrara, S. (1994). A comparison of equal percentile and partial credit equatings for performance-based assessments composed of free-response items. Journal of Educational Measurement, 31, 125-141.
    • (1994) Journal of Educational Measurement , vol.31 , pp. 125-141
    • Huynh, H.1    Ferrara, S.2
  • 31
    • 0001732408
    • Comparison of traditional and item response theory methods for equating tests
    • Kolen, M. J. (1981). Comparison of traditional and item response theory methods for equating tests. Journal of Educational Measurement, 18, 1-11.
    • (1981) Journal of Educational Measurement , vol.18 , pp. 1-11
    • Kolen, M.J.1
  • 36
    • 21144460499
    • Linking results of distinct assessments
    • Linn, R. L. (1993). Linking results of distinct assessments. Applied Measurement in Education, 6, 83-102.
    • (1993) Applied Measurement in Education , vol.6 , pp. 83-102
    • Linn, R.L.1
  • 37
    • 84965800682
    • Comparison of IRT true-score and equipercentile observed-score "equatings."
    • Lord, F. M., & Wingersky, M. S. (1984). Comparison of IRT true-score and equipercentile observed-score "equatings." Applied Psychological Measurement, 8, 453-461.
    • (1984) Applied Psychological Measurement , vol.8 , pp. 453-461
    • Lord, F.M.1    Wingersky, M.S.2
  • 38
    • 0001942273
    • Achieving form-to-form comparability: Fundamental issues and proposed strategies for equating performance assessments for teachers
    • Loyd, B., Engelhard, G., & Crocker, L. (1996). Achieving form-to-form comparability: Fundamental issues and proposed strategies for equating performance assessments for teachers. Educational Assessment, 3, 99-110.
    • (1996) Educational Assessment , vol.3 , pp. 99-110
    • Loyd, B.1    Engelhard, G.2    Crocker, L.3
  • 39
    • 0001840881
    • Validity of performance assessment
    • G. W. Phillips (Ed.), Washington DC: National Center for Education Statistics
    • Messick, S. (1996). Validity of performance assessment. In G. W. Phillips (Ed.), Technical issues in large-scale performance assessment (pp. 1-18). Washington DC: National Center for Education Statistics.
    • (1996) Technical Issues in Large-Scale Performance Assessment , pp. 1-18
    • Messick, S.1
  • 42
    • 0004221985
    • Scaling procedures
    • E. G. Johnson & N. L. Allen (Eds.), Report No. 21-TR-20, Washington DC: National Center for Education Statistics
    • Mislevy, R. J. (1992b). Scaling procedures. In E. G. Johnson & N. L. Allen (Eds.), The NAEP 1990 technical report (Report No. 21-TR-20, pp. 199-213). Washington DC: National Center for Education Statistics.
    • (1992) The NAEP 1990 Technical Report , pp. 199-213
    • Mislevy, R.J.1
  • 43
    • 84970515244
    • Test theory and language learning assessment
    • Mislevy, R. J. (1995). Test theory and language learning assessment. Language Testing, 12, 341-369.
    • (1995) Language Testing , vol.12 , pp. 341-369
    • Mislevy, R.J.1
  • 44
    • 84976923244
    • A generalized partial credit model: Application of an EM algorithm
    • Muraki, E. (1992). A generalized partial credit model: Application of an EM algorithm. Applied Psychological Measurement, 16, 159-176.
    • (1992) Applied Psychological Measurement , vol.16 , pp. 159-176
    • Muraki, E.1
  • 45
    • 0003322220
    • Variations of polytomous item response models: Raters' effect model, IF model, and trend model
    • April. Atlanta GA
    • Muraki, E. (1993, April). Variations of polytomous item response models: Raters' effect model, IF model, and trend model. Paper presented at the annual meeting of the American Educational Research Association, Atlanta GA.
    • (1993) Annual Meeting of the American Educational Research Association
    • Muraki, E.1
  • 46
    • 0033246636
    • Stepwise analysis of differential item functioning based on multiple-group partial credit model
    • Muraki, E. (1999). Stepwise analysis of differential item functioning based on multiple-group partial credit model. Journal of Educational Measurement, 36, 217-232.
    • (1999) Journal of Educational Measurement , vol.36 , pp. 217-232
    • Muraki, E.1
  • 48
    • 84976969011
    • Full-information factor analysis for polytomous item responses
    • Muraki, E., & Carlson, J. E. (1995). Full-information factor analysis for polytomous item responses. Applied Psychological Measurement, 19, 73-90.
    • (1995) Applied Psychological Measurement , vol.19 , pp. 73-90
    • Muraki, E.1    Carlson, J.E.2
  • 56
    • 84965400546
    • The difficulty of items that measure more than one ability
    • Reckase, M. D. (1985). The difficulty of items that measure more than one ability. Applied Psychological Measurement, 9, 401-412.
    • (1985) Applied Psychological Measurement , vol.9 , pp. 401-412
    • Reckase, M.D.1
  • 57
    • 0002723397
    • Estimation of latent ability using a response pattern of graded scores
    • Samejima, F. (1969). Estimation of latent ability using a response pattern of graded scores. Psychometrika Monograph Supplement, 17.
    • (1969) Psychometrika Monograph Supplement , vol.17
    • Samejima, F.1
  • 60
  • 61
    • 0001935002
    • Detection of differential item functioning using the parameters of item response models
    • P. W. Holland & H. Wainer (Eds.), Hillsdale NJ: Erlbaum
    • Thissen, D., Steinberg, L., & Wainer, H. (1993). Detection of differential item functioning using the parameters of item response models. In P. W. Holland & H. Wainer (Eds.), Differential item functioning (pp. 67-113). Hillsdale NJ: Erlbaum.
    • (1993) Differential Item Functioning , pp. 67-113
    • Thissen, D.1    Steinberg, L.2    Wainer, H.3
  • 63
    • 0000917089
    • Assessment: Authenticity, context, and validity
    • Wiggins, G. (1993). Assessment: Authenticity, context, and validity. Phi Delta Kappan, 75, 200-214.
    • (1993) Phi Delta Kappan , vol.75 , pp. 200-214
    • Wiggins, G.1
  • 65
    • 84988115553
    • Scaling performance assessments: Strategies for managing local dependence
    • Yen, W. M. (1993). Scaling performance assessments: Strategies for managing local dependence. Journal of Educational Measurement, 30, 187-213.
    • (1993) Journal of Educational Measurement , vol.30 , pp. 187-213
    • Yen, W.M.1
  • 66
    • 0031539913
    • The Maryland school performance assessment program: Performance assessment with psychometric quality suitable for high-stakes usage
    • Yen, W. M., & Ferrara, S. (1997). The Maryland school performance assessment program: Performance assessment with psychometric quality suitable for high-stakes usage. Educational and Psychological Measurement, 57, 60-84.
    • (1997) Educational and Psychological Measurement , vol.57 , pp. 60-84
    • Yen, W.M.1    Ferrara, S.2
  • 68
    • 84984506091
    • Effects of item order and context on estimation of NAEP Reading Proficiency
    • Zwick, R. (1991). Effects of item order and context on estimation of NAEP Reading Proficiency. Educational Measurement: Issues and Practice, 10(3), 10-16.
    • (1991) Educational Measurement: Issues and Practice , vol.10 , Issue.3 , pp. 10-16
    • Zwick, R.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.