



Volume 72, Issue 3, 2004, Pages 221-261

Effects of anchor item methods on the detection of differential item functioning within the family of Rasch models

Author keywords

Estimation bias; Item response theory; Partial credit model; Power; Type I error



EID: 2442624864     PISSN: 00220973     EISSN: 19400683     Source Type: Journal    
DOI: 10.3200/JEXE.72.3.221-261     Document Type: Article
Times cited : (78)

References (74)
  • 3
    • 0002686036
    • The numerical solution of a set of conditional estimation equations
    • Andersen, E. B. (1972). The numerical solution of a set of conditional estimation equations. Journal of the Royal Statistical Society, Series B, 34, 42-54.
    • (1972) Journal of the Royal Statistical Society, Series B , vol.34 , pp. 42-54
    • Andersen, E.B.1
  • 4
    • 34250274848
    • A rating formulation for ordered response categories
    • Andrich, D. (1978). A rating formulation for ordered response categories. Psychometrika, 43, 561-573.
    • (1978) Psychometrika , vol.43 , pp. 561-573
    • Andrich, D.1
  • 6
    • 0033475133
    • An investigation of the power of the likelihood ratio goodness-of-fit statistic in detecting differential item functioning
    • Ankenmann, R. D., Witt, E. A., & Dunbar, S. B. (1999). An investigation of the power of the likelihood ratio goodness-of-fit statistic in detecting differential item functioning. Journal of Educational Measurement, 36, 277-300.
    • (1999) Journal of Educational Measurement , vol.36 , pp. 277-300
    • Ankenmann, R.D.1    Witt, E.A.2    Dunbar, S.B.3
  • 7
    • 0001670658
    • Some latent trait models and their use in inferring an examinee's ability
    • F. M. Lord & M. R. Novick, Reading, MA: Addison-Wesley
    • Birnbaum, A. (1968). Some latent trait models and their use in inferring an examinee's ability. In F. M. Lord & M. R. Novick, Statistical theories of mental test scores (pp. 397-479). Reading, MA: Addison-Wesley.
    • (1968) Statistical Theories of Mental Test Scores , pp. 397-479
    • Birnbaum, A.1
  • 8
    • 0000433590
    • Marginal maximum likelihood estimation of item parameters: An application of the EM algorithm
    • Bock, R. D., & Aitkin, M. (1981). Marginal maximum likelihood estimation of item parameters: An application of the EM algorithm. Psychometrika, 46, 443-459.
    • (1981) Psychometrika , vol.46 , pp. 443-459
    • Bock, R.D.1    Aitkin, M.2
  • 9
    • 27644511026
    • Fitting a response model for n dichotomously scored items
    • Bock, R. D., & Lieberman, M. (1970). Fitting a response model for n dichotomously scored items. Psychometrika, 35, 179-197.
    • (1970) Psychometrika , vol.35 , pp. 179-197
    • Bock, R.D.1    Lieberman, M.2
  • 12
    • 84977052435
    • Analysis of differential item functioning in translated assessment instruments
    • Budgell, G. R., Raju, N. S., & Quartetti, D. A. (1995). Analysis of differential item functioning in translated assessment instruments. Applied Psychological Measurement, 19, 309-321.
    • (1995) Applied Psychological Measurement , vol.19 , pp. 309-321
    • Budgell, G.R.1    Raju, N.S.2    Quartetti, D.A.3
  • 13
    • 84965853871
    • An iterative procedure for linking metrics and assessing item bias in item response theory
    • Candell, G. L., & Drasgow, F. (1988). An iterative procedure for linking metrics and assessing item bias in item response theory. Applied Psychological Measurement, 12, 253-260.
    • (1988) Applied Psychological Measurement , vol.12 , pp. 253-260
    • Candell, G.L.1    Drasgow, F.2
  • 15
    • 0030496756
    • An investigation of the likelihood ratio test for detection of differential item functioning
    • Cohen, A. S., Kim, S., & Wollack, J. A. (1996). An investigation of the likelihood ratio test for detection of differential item functioning. Applied Psychological Measurement, 20, 15-26.
    • (1996) Applied Psychological Measurement , vol.20 , pp. 15-26
    • Cohen, A.S.1    Kim, S.2    Wollack, J.A.3
  • 17
    • 0002619289
    • Effects of amount of DIF, test length, and purification type on robustness and power of Mantel-Haenszel procedures
    • Fidalgo, A. M., Mellenbergh, G. J., & Muniz, J. (2000). Effects of amount of DIF, test length, and purification type on robustness and power of Mantel-Haenszel procedures. Methods of Psychological Research Online, 5, 43-53.
    • (2000) Methods of Psychological Research Online , vol.5 , pp. 43-53
    • Fidalgo, A.M.1    Mellenbergh, G.J.2    Muniz, J.3
  • 18
    • 0015738609
    • The linear logistic model as an instrument in educational research
    • Fischer, G. H. (1973). The linear logistic model as an instrument in educational research. Acta Psychologica, 37, 359-374.
    • (1973) Acta Psychologica , vol.37 , pp. 359-374
    • Fischer, G.H.1
  • 19
    • 0009408874
    • Notes on the Mantel-Haenszel procedure and another chi-square test for the assessment of DIF
    • Fischer, G. H. (1993). Notes on the Mantel-Haenszel procedure and another chi-square test for the assessment of DIF. Methodika, 7, 88-100.
    • (1993) Methodika , vol.7 , pp. 88-100
    • Fischer, G.H.1
  • 21
    • 21344484925
    • An extension of the partial credit model with an application to the measurement of change
    • Fischer, G. H., & Ponocny, I. (1994). An extension of the partial credit model with an application to the measurement of change. Psychometrika, 59, 177-192.
    • (1994) Psychometrika , vol.59 , pp. 177-192
    • Fischer, G.H.1    Ponocny, I.2
  • 22
    • 0032360963
    • Detection of differential item functioning using Lagrange multiplier tests
    • Glas, C. A. W. (1998). Detection of differential item functioning using Lagrange multiplier tests. Statistica Sinica, 8, 641-661.
    • (1998) Statistica Sinica , vol.8 , pp. 641-661
    • Glas, C.A.W.1
  • 23
    • 0002333711
    • Tests of fit for polytomous Rasch models
    • G. H. Fischer & I. W. Molenaar (Eds.), New York: Springer-Verlag
    • Glas, C. A. W., & Verhelst, N. D. (1995). Tests of fit for polytomous Rasch models. In G. H. Fischer & I. W. Molenaar (Eds.), Rasch models: Foundations, recent developments, and applications (pp. 325-352). New York: Springer-Verlag.
    • (1995) Rasch Models: Foundations, Recent Developments, and Applications , pp. 325-352
    • Glas, C.A.W.1    Verhelst, N.D.2
  • 24
    • 0001270536
    • Equating logistic ability scales by a weighted least squares method
    • Haebara, T. (1980). Equating logistic ability scales by a weighted least squares method. Japanese Psychological Research, 22, 144-149.
    • (1980) Japanese Psychological Research , vol.22 , pp. 144-149
    • Haebara, T.1
  • 26
    • 0036106581
    • Two-stage equating differential item functioning detection under the graded response model with the Raju area measures and the Lord statistic
    • Hidalgo-Montesinos, M. D., & Lopez-Pina, J. A. (2002). Two-stage equating differential item functioning detection under the graded response model with the Raju area measures and the Lord statistic. Educational and Psychological Measurement, 62, 32-44.
    • (2002) Educational and Psychological Measurement , vol.62 , pp. 32-44
    • Hidalgo-Montesinos, M.D.1    Lopez-Pina, J.A.2
  • 27
    • 84976959244
    • Differential item performance and the Mantel-Haenszel procedure
    • H. Wainer & H. Braun (Eds.), Hillsdale, NJ: Erlbaum
    • Holland, P. W., & Thayer, D. T. (1988). Differential item performance and the Mantel-Haenszel procedure. In H. Wainer & H. Braun (Eds.), Test validity (pp. 129-145). Hillsdale, NJ: Erlbaum.
    • (1988) Test Validity , pp. 129-145
    • Holland, P.W.1    Thayer, D.T.2
  • 28
    • 0031502248
    • Identifying cultural differences in items and traits: Differential item functioning in the NEO personality inventory
    • Huang, C. D., Church, A. T., & Katigbak, M. S. (1997). Identifying cultural differences in items and traits: Differential item functioning in the NEO personality inventory. Journal of Cross-Cultural Psychology, 28, 192-218.
    • (1997) Journal of Cross-Cultural Psychology , vol.28 , pp. 192-218
    • Huang, C.D.1    Church, A.T.2    Katigbak, M.S.3
  • 29
    • 0040620288
    • Evaluating Type I error and power rates using an effect size measure with the logistic regression procedure for DIF detection
    • Jodoin, M. G., & Gierl, M. J. (2001). Evaluating Type I error and power rates using an effect size measure with the logistic regression procedure for DIF detection. Applied Measurement in Education, 14, 329-349.
    • (2001) Applied Measurement in Education , vol.14 , pp. 329-349
    • Jodoin, M.G.1    Gierl, M.J.2
  • 30
    • 84990393867
    • Detecting DIF across the different language groups in a speaking test
    • Kim, M. (2001). Detecting DIF across the different language groups in a speaking test. Language Testing, 18, 89-114.
    • (2001) Language Testing , vol.18 , pp. 89-114
    • Kim, M.1
  • 31
    • 21844511144
    • A comparison of Lord's chi-square, Raju's area measures, and the likelihood ratio test on detection of differential item functioning
    • Kim, S.-H., & Cohen, A. S. (1995). A comparison of Lord's chi-square, Raju's area measures, and the likelihood ratio test on detection of differential item functioning. Applied Measurement in Education, 8, 291-312.
    • (1995) Applied Measurement in Education , vol.8 , pp. 291-312
    • Kim, S.-H.1    Cohen, A.S.2
  • 32
    • 0032348542
    • Detection of differential item functioning under the graded response model with the likelihood ratio test
    • Kim, S.-H., & Cohen, A. S. (1998). Detection of differential item functioning under the graded response model with the likelihood ratio test. Applied Psychological Measurement, 22, 345-355.
    • (1998) Applied Psychological Measurement , vol.22 , pp. 345-355
    • Kim, S.-H.1    Cohen, A.S.2
  • 33
    • 0036102805
    • A comparison of linking and concurrent calibration under the graded response model
    • Kim, S.-H., & Cohen, A. S. (2002). A comparison of linking and concurrent calibration under the graded response model. Applied Psychological Measurement, 26, 25-41.
    • (2002) Applied Psychological Measurement , vol.26 , pp. 25-41
    • Kim, S.-H.1    Cohen, A.S.2
  • 34
    • 0041618113
    • A note on the value of including the studied item in the test score when analyzing test items for DIF
    • P. W. Holland & H. Wainer (Eds.), Hillsdale, NJ: Erlbaum
    • Lewis, C. (1993). A note on the value of including the studied item in the test score when analyzing test items for DIF. In P. W. Holland & H. Wainer (Eds.), Differential item functioning (pp. 317-319). Hillsdale, NJ: Erlbaum.
    • (1993) Differential Item Functioning , pp. 317-319
    • Lewis, C.1
  • 35
    • 0003686557
    • Chicago: Measurement, Evaluation, Statistics, and Assessment Press
    • Linacre, J. M. (1989). Many-facet Rasch measurement. Chicago: Measurement, Evaluation, Statistics, and Assessment Press.
    • (1989) Many-Facet Rasch Measurement
    • Linacre, J.M.1
  • 38
    • 0035537309
    • Differential item functioning in the WISC-III: Item parameters for boys and girls in the national standardization sample
    • Maller, S. J. (2001). Differential item functioning in the WISC-III: Item parameters for boys and girls in the national standardization sample. Educational and Psychological Measurement, 61, 793-817.
    • (2001) Educational and Psychological Measurement , vol.61 , pp. 793-817
    • Maller, S.J.1
  • 39
    • 84959801619
    • Statistical aspects of the analysis of data from retrospective studies of disease
    • Mantel, N., & Haenszel, W. (1959). Statistical aspects of the analysis of data from retrospective studies of disease. Journal of the National Cancer Institute, 22, 719-748.
    • (1959) Journal of the National Cancer Institute , vol.22 , pp. 719-748
    • Mantel, N.1    Haenszel, W.2
  • 40
    • 0000208012
    • Item characteristic curve solutions to three intractable testing problems
    • Marco, G. L. (1977). Item characteristic curve solutions to three intractable testing problems. Journal of Educational Measurement, 14, 139-160.
    • (1977) Journal of Educational Measurement , vol.14 , pp. 139-160
    • Marco, G.L.1
  • 41
    • 0000261668
    • A Rasch model for partial credit scoring
    • Masters, G. N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47, 149-174.
    • (1982) Psychometrika , vol.47 , pp. 149-174
    • Masters, G.N.1
  • 42
    • 0001765999
    • On the misuse of manifest variables in the detection of measurement bias
    • Meredith, W., & Millsap, R. E. (1992). On the misuse of manifest variables in the detection of measurement bias. Psychometrika, 57, 289-311.
    • (1992) Psychometrika , vol.57 , pp. 289-311
    • Meredith, W.1    Millsap, R.E.2
  • 43
    • 84976919215
    • Effect of sample size, number of biased items and magnitude of bias on a two-stage item bias estimation method
    • Miller, M. D., & Oshima, T. C. (1992). Effect of sample size, number of biased items and magnitude of bias on a two-stage item bias estimation method. Applied Psychological Measurement, 16, 381-388.
    • (1992) Applied Psychological Measurement , vol.16 , pp. 381-388
    • Miller, M.D.1    Oshima, T.C.2
  • 44
    • 84977005381
    • Inferential conditions in the statistical detection of measurement bias
    • Millsap, R. E., & Meredith, W. (1992). Inferential conditions in the statistical detection of measurement bias. Applied Psychological Measurement, 16, 389-402.
    • (1992) Applied Psychological Measurement , vol.16 , pp. 389-402
    • Millsap, R.E.1    Meredith, W.2
  • 46
    • 84976919712
    • Performance of the Mantel-Haenszel and simultaneous item bias procedures for detecting differential item functioning
    • Narayanan, P., & Swaminathan, H. (1994). Performance of the Mantel-Haenszel and simultaneous item bias procedures for detecting differential item functioning. Applied Psychological Measurement, 18, 315-328.
    • (1994) Applied Psychological Measurement , vol.18 , pp. 315-328
    • Narayanan, P.1    Swaminathan, H.2
  • 48
    • 0000992663
    • On the use and interpretation of certain test criteria for purposes of statistical inference
    • Neyman, J., & Pearson, E. S. (1928). On the use and interpretation of certain test criteria for purposes of statistical inference. Biometrika, 20A, 174-240, 263-294.
    • (1928) Biometrika , vol.20A , pp. 174-240, 263-294
    • Neyman, J.1    Pearson, E.S.2
  • 49
    • 0000003370
    • The area between two item characteristic curves
    • Raju, N. S. (1988). The area between two item characteristic curves. Psychometrika, 53, 495-502.
    • (1988) Psychometrika , vol.53 , pp. 495-502
    • Raju, N.S.1
  • 51
    • 84976934177
    • A comparison of logistic regression and Mantel-Haenszel procedures for detecting differential item functioning
    • Rogers, H. J., & Swaminathan, H. (1993). A comparison of logistic regression and Mantel-Haenszel procedures for detecting differential item functioning. Applied Psychological Measurement, 17, 105-116.
    • (1993) Applied Psychological Measurement , vol.17 , pp. 105-116
    • Rogers, H.J.1    Swaminathan, H.2
  • 52
    • 0030170597
    • Simulation studies of the effects of small sample size and studied item parameters on SIBTEST and Mantel-Haenszel Type I error performance
    • Roussos, L. A., & Stout, W. (1996). Simulation studies of the effects of small sample size and studied item parameters on SIBTEST and Mantel-Haenszel Type I error performance. Journal of Educational Measurement, 33, 215-230.
    • (1996) Journal of Educational Measurement , vol.33 , pp. 215-230
    • Roussos, L.A.1    Stout, W.2
  • 53
    • 0002723397
    • Estimation of latent ability using a response pattern of graded scores
    • Samejima, F. (1969). Estimation of latent ability using a response pattern of graded scores. Psychometric Monograph Supplement, 17, 1-100.
    • (1969) Psychometric Monograph Supplement , vol.17 , pp. 1-100
    • Samejima, F.1
  • 54
    • 21144479363
    • Model-based standardization approach that separates true bias/DIF from group ability differences and detects test bias/DTF as well as item bias/DIF
    • Shealy, R., & Stout, W. (1993). Model-based standardization approach that separates true bias/DIF from group ability differences and detects test bias/DTF as well as item bias/DIF. Psychometrika, 58, 159-194.
    • (1993) Psychometrika , vol.58 , pp. 159-194
    • Shealy, R.1    Stout, W.2
  • 57
    • 84988120749
    • Detecting differential item functioning using logistic regression procedures
    • Swaminathan, H., & Rogers, H. J. (1990). Detecting differential item functioning using logistic regression procedures. Journal of Educational Measurement, 27, 361-370.
    • (1990) Journal of Educational Measurement , vol.27 , pp. 361-370
    • Swaminathan, H.1    Rogers, H.J.2
  • 59
    • 85079745565
    • Use of item response theory in the study of group differences in trace lines
    • H. Wainer & H. Braun (Eds.), Hillsdale, NJ: Erlbaum
    • Thissen, D., Steinberg, L., & Wainer, H. (1988). Use of item response theory in the study of group differences in trace lines. In H. Wainer & H. Braun (Eds.), Test validity (pp. 147-169). Hillsdale, NJ: Erlbaum.
    • (1988) Test Validity , pp. 147-169
    • Thissen, D.1    Steinberg, L.2    Wainer, H.3
  • 62
    • 0031519723
    • Differential item functioning and male-female differences on multiple-choice tests in economics
    • Walstad, W. B., & Robson, D. (1997). Differential item functioning and male-female differences on multiple-choice tests in economics. The Journal of Economic Education, 28, 155-171.
    • (1997) The Journal of Economic Education , vol.28 , pp. 155-171
    • Walstad, W.B.1    Robson, D.2
  • 63
    • 0034585654
    • Modeling effects of differential item functioning in polytomous items
    • Wang, W.-C. (2000a). Modeling effects of differential item functioning in polytomous items. Journal of Applied Measurement, 1, 63-82.
    • (2000) Journal of Applied Measurement , vol.1 , pp. 63-82
    • Wang, W.-C.1
  • 64
    • 2442529653
    • The simultaneous factorial analysis of differential item functioning
    • Wang, W.-C. (2000b). The simultaneous factorial analysis of differential item functioning. Methods of Psychological Research Online, 5, 51-76.
    • (2000) Methods of Psychological Research Online , vol.5 , pp. 51-76
    • Wang, W.-C.1
  • 65
    • 2442576822
    • Effects of average signed area between two item characteristic curves and test purification procedures on the DIF detection via the Mantel-Haenszel method
    • (in press)
    • Wang, W.-C., & Su, Y.-H. (in press). Effects of average signed area between two item characteristic curves and test purification procedures on the DIF detection via the Mantel-Haenszel method. Applied Measurement in Education.
    • Applied Measurement in Education
    • Wang, W.-C.1    Su, Y.-H.2
  • 66
    • 0345581718
    • Effects of anchor item methods on differential item functioning detection with the likelihood ratio test
    • Wang, W.-C., & Yeh, Y.-L. (2003). Effects of anchor item methods on differential item functioning detection with the likelihood ratio test. Applied Psychological Measurement, 27, 479-498.
    • (2003) Applied Psychological Measurement , vol.27 , pp. 479-498
    • Wang, W.-C.1    Yeh, Y.-L.2
  • 67
    • 84976919649
    • The partial order model: An extension of the partial credit model
    • Wilson, M. R. (1992). The partial order model: An extension of the partial credit model. Applied Psychological Measurement, 16, 309-325.
    • (1992) Applied Psychological Measurement , vol.16 , pp. 309-325
    • Wilson, M.R.1
  • 68
    • 0003633018
    • Chicago: Measurement, Evaluation, Statistics, and Assessment Press
    • Wright, B. D., & Masters, G. N. (1982). Rating scale analysis. Chicago: Measurement, Evaluation, Statistics, and Assessment Press.
    • (1982) Rating Scale Analysis
    • Wright, B.D.1    Masters, G.N.2
  • 69
    • 0003760335
    • Chicago: Measurement, Evaluation, Statistics, and Assessment Press
    • Wright, B. D., & Stone, M. H. (1979). Best test design. Chicago: Measurement, Evaluation, Statistics, and Assessment Press.
    • (1979) Best Test Design
    • Wright, B.D.1    Stone, M.H.2
  • 73
    • 0000172650
    • When do item response function and Mantel-Haenszel definitions of differential item functioning coincide?
    • Zwick, R. (1990). When do item response function and Mantel-Haenszel definitions of differential item functioning coincide? Journal of Educational Statistics, 15, 185-197.
    • (1990) Journal of Educational Statistics , vol.15 , pp. 185-197
    • Zwick, R.1
  • 74
    • 84988106388
    • Assessment of differential item functioning for performance tasks
    • Zwick, R., Donoghue, J. R., & Grima, A. (1993). Assessment of differential item functioning for performance tasks. Journal of Educational Measurement, 30, 233-251.
    • (1993) Journal of Educational Measurement , vol.30 , pp. 233-251
    • Zwick, R.1    Donoghue, J.R.2    Grima, A.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.