



Volume 53, Issue 2, 2016, Pages 215-233

Validation of automated scoring of science assessments

Author keywords

automated scoring; c-rater-ML; science assessment

Indexed keywords

EDUCATION

EID: 84954103223     PISSN: 00224308     EISSN: 10982736     Source Type: Journal    
DOI: 10.1002/tea.21299     Document Type: Article
Times cited: 113

References (45)
  • 2
    • 84895910194
    • Assessing scientific practices using machine-learning methods: How closely do they match clinical interview performance?
    • Beggrow, E. P., Ha, M., Nehm, R. H., Pearl, D., & Boone, W. J. (2014). Assessing scientific practices using machine-learning methods: How closely do they match clinical interview performance? Journal of Science Education and Technology, 23, 160-182.
    • (2014) Journal of Science Education and Technology, vol. 23, pp. 160-182
    • Beggrow, E.P.; Ha, M.; Nehm, R.H.; Pearl, D.; Boone, W.J.
  • 3
    • 0030555167
    • The accuracy of expert-system diagnoses of mathematical problem solutions
    • Bennett, R. E., & Sebrechts, M. M. (1996). The accuracy of expert-system diagnoses of mathematical problem solutions. Applied Measurement in Education, 9, 133-150.
    • (1996) Applied Measurement in Education, vol. 9, pp. 133-150
    • Bennett, R.E.; Sebrechts, M.M.
  • 4
    • 84855958640
    • Comparison of human and machine scoring of essays: Differences by gender, ethnicity, and country
    • Bridgeman, B., Trapani, C., & Attali, Y. (2012). Comparison of human and machine scoring of essays: Differences by gender, ethnicity, and country. Applied Measurement in Education, 25, 27-40.
    • (2012) Applied Measurement in Education, vol. 25, pp. 27-40
    • Bridgeman, B.; Trapani, C.; Attali, Y.
  • 6
    • 85142545640
    • Automated evaluation of discourse structure in student essays
    • M. D. Shermis & J. Burstein (Eds.) Mahwah, NJ: Lawrence Erlbaum
    • Burstein, J., & Marcu, D. (2002). Automated evaluation of discourse structure in student essays. In M. D. Shermis & J. Burstein (Eds.), Automated essay scoring: A cross-disciplinary perspective (pp. 200-219). Mahwah, NJ: Lawrence Erlbaum.
    • (2002) Automated Essay Scoring: A Cross-disciplinary Perspective, pp. 200-219
    • Burstein, J.; Marcu, D.
  • 9
    • 58149412516
    • Weighted kappa: Nominal scale agreement with provision for scaled disagreement or partial credit
    • Cohen, J. (1968). Weighted kappa: Nominal scale agreement with provision for scaled disagreement or partial credit. Psychological Bulletin, 70, 213-220.
    • (1968) Psychological Bulletin, vol. 70, pp. 213-220
    • Cohen, J.
  • 13
    • 84965886444
    • The equivalence of weighted kappa and the intraclass correlation coefficient as measures of reliability
    • Fleiss, J. L., & Cohen, J. (1973). The equivalence of weighted kappa and the intraclass correlation coefficient as measures of reliability. Educational and Psychological Measurement, 33, 613-619.
    • (1973) Educational and Psychological Measurement, vol. 33, pp. 613-619
    • Fleiss, J.L.; Cohen, J.
  • 15
    • 33645214429
    • Conditions under which assessment supports students' learning
    • Gibbs, G., & Simpson, C. (2004). Conditions under which assessment supports students' learning. Learning and Teaching in Higher Education, 1, 3-31.
    • (2004) Learning and Teaching in Higher Education, vol. 1, pp. 3-31
    • Gibbs, G.; Simpson, C.
  • 16
    • 82755186234
    • Applying computerized-scoring models of written biological explanations across courses and colleges: Prospects and limitations
    • Ha, M., Nehm, R. H., Urban-Lurain, M., & Merrill, J. E. (2011). Applying computerized-scoring models of written biological explanations across courses and colleges: Prospects and limitations. CBE-Life Sciences Education, 10, 379-393.
    • (2011) CBE-Life Sciences Education, vol. 10, pp. 379-393
    • Ha, M.; Nehm, R.H.; Urban-Lurain, M.; Merrill, J.E.
  • 17
    • 84866000686
    • What are they thinking? Automated analysis of student writing about acid-base chemistry in introductory biology
    • Haudek, K. C., Prevost, L. B., Moscarella, R. A., Merrill, J., & Urban-Lurain, M. (2012). What are they thinking? Automated analysis of student writing about acid-base chemistry in introductory biology. CBE-Life Sciences Education, 11(3), 283-293.
    • (2012) CBE-Life Sciences Education, vol. 11, issue 3, pp. 283-293
    • Haudek, K.C.; Prevost, L.B.; Moscarella, R.A.; Merrill, J.; Urban-Lurain, M.
  • 19
    • 78049531039
    • A three-stage approach to the automated scoring of spontaneous spoken responses
    • Higgins, D., Zechner, K., Xi, X., & Williamson, D. (2011). A three-stage approach to the automated scoring of spontaneous spoken responses. Computer Speech and Language, 25, 282-306.
    • (2011) Computer Speech and Language, vol. 25, pp. 282-306
    • Higgins, D.; Zechner, K.; Xi, X.; Williamson, D.
  • 20
    • 84879960065
    • Toward an integrated model for designing assessment systems: An analysis of the current status of computer-based assessments in science
    • Kuo, C. Y., & Wu, H. K. (2013). Toward an integrated model for designing assessment systems: An analysis of the current status of computer-based assessments in science. Computers & Education, 68, 388-403.
    • (2013) Computers & Education, vol. 68, pp. 388-403
    • Kuo, C.Y.; Wu, H.K.
  • 21
    • 0017360990
    • The measurement of observer agreement for categorical data
    • Landis, J. R., & Koch, G. G. (1977). The measurement of observer agreement for categorical data. Biometrics, 33, 159-174.
    • (1977) Biometrics, vol. 33, pp. 159-174
    • Landis, J.R.; Koch, G.G.
  • 22
    • 33646866698
    • C-rater: Automated scoring of short-answer questions
    • Leacock, C., & Chodorow, M. (2003). C-rater: Automated scoring of short-answer questions. Computers and the Humanities, 37, 389-405.
    • (2003) Computers and the Humanities, vol. 37, pp. 389-405
    • Leacock, C.; Chodorow, M.
  • 23
    • 79953136183
    • Validating measurement of knowledge integration in science using multiple-choice and explanation items
    • Lee, H. S., Liu, O. L., & Linn, M. C. (2011). Validating measurement of knowledge integration in science using multiple-choice and explanation items. Applied Measurement in Education, 24, 115-136.
    • (2011) Applied Measurement in Education, vol. 24, pp. 115-136
    • Lee, H.S.; Liu, O.L.; Linn, M.C.
  • 24
    • 34147152012
    • Science education: Integrating views of learning and instruction
    • P. A. Alexander & P. H. Winne (Eds.) Mahwah, NJ: Lawrence Erlbaum Associates
    • Linn, M. C., & Eylon, B.-S. (2006). Science education: Integrating views of learning and instruction. In P. A. Alexander & P. H. Winne (Eds.), Handbook of educational psychology (pp. 511-544). Mahwah, NJ: Lawrence Erlbaum Associates.
    • (2006) Handbook of Educational Psychology, pp. 511-544
    • Linn, M.C.; Eylon, B.-S.
  • 25
    • 84898002032
    • Computer-guided inquiry to improve science learning
    • Linn, M. C., Gerard, L., Ryoo, K., McElhaney, K., Liu, O. L., & Rafferty, A. N. (2014). Computer-guided inquiry to improve science learning. Science, 344(6180), 155-156.
    • (2014) Science, vol. 344, issue 6180, pp. 155-156
    • Linn, M.C.; Gerard, L.; Ryoo, K.; McElhaney, K.; Liu, O.L.; Rafferty, A.N.
  • 27
    • 42549100860
    • Assessing knowledge integration in science: Construct, measures, and evidence
    • Liu, O. L., Lee, H. S., Hofstetter, C., & Linn, M. C. (2008). Assessing knowledge integration in science: Construct, measures, and evidence. Educational Assessment, 13, 33-55.
    • (2008) Educational Assessment, vol. 13, pp. 33-55
    • Liu, O.L.; Lee, H.S.; Hofstetter, C.; Linn, M.C.
  • 28
    • 80054069721
    • Measuring knowledge integration: Validation of four-year assessments
    • Liu, O. L., Lee, H. S., & Linn, M. C. (2011). Measuring knowledge integration: Validation of four-year assessments. Journal of Research in Science Teaching, 48, 1079-1107.
    • (2011) Journal of Research in Science Teaching, vol. 48, pp. 1079-1107
    • Liu, O.L.; Lee, H.S.; Linn, M.C.
  • 32
    • 84954089639
    • EvoGrader: An online formative assessment tool for automatically evaluating written evolutionary explanations
    • Moharreri, K., Ha, M., & Nehm, R. (2014). EvoGrader: An online formative assessment tool for automatically evaluating written evolutionary explanations. Evolution: Education and Outreach, 7(1), 1-15. Retrieved from http://www.evolution-outreach.com/content/7/1/15
    • (2014) Evolution: Education and Outreach, vol. 7, issue 1, pp. 1-15
    • Moharreri, K.; Ha, M.; Nehm, R.
  • 34
    • 84903267109
    • National Council of Teachers of English
    • National Council of Teachers of English. (2008). Statement on class size and teacher workload: Secondary. Retrieved from http://www.ncte.org/positions/statements/classsizesecondary
    • (2008) Statement on Class Size and Teacher Workload: Secondary
  • 35
    • 84856612191
    • Transforming biology assessment with machine learning: Automated scoring of written evolutionary explanations
    • Nehm, R. H., Ha, M., & Mayfield, E. (2011). Transforming biology assessment with machine learning: Automated scoring of written evolutionary explanations. Journal of Science Education and Technology, 21(1), 183-196. doi: 10.1007/s10956-011-9300-9
    • (2011) Journal of Science Education and Technology, vol. 21, issue 1, pp. 183-196
    • Nehm, R.H.; Ha, M.; Mayfield, E.
  • 36
    • 57249116250
    • Measuring knowledge of natural selection: A comparison of the CINS, an open-response instrument, and an oral interview
    • Nehm, R. H., & Schonfeld, I. S. (2008). Measuring knowledge of natural selection: A comparison of the CINS, an open-response instrument, and an oral interview. Journal of Research in Science Teaching, 45, 1131-1160.
    • (2008) Journal of Research in Science Teaching, vol. 45, pp. 1131-1160
    • Nehm, R.H.; Schonfeld, I.S.
  • 37
    • 84856591629
    • Human vs. computer diagnosis of students' natural selection knowledge: Testing the efficacy of text analytic software
    • Nehm, R. H., & Haertig, H. (2012). Human vs. computer diagnosis of students' natural selection knowledge: Testing the efficacy of text analytic software. Journal of Science Education and Technology, 21(1), 56-73.
    • (2012) Journal of Science Education and Technology, vol. 21, issue 1, pp. 56-73
    • Nehm, R.H.; Haertig, H.
  • 41
    • 84874593076
    • A comparison of consensus, consistency, and measurement approaches to estimating interrater reliability
    • Stemler, S. E. (2004). A comparison of consensus, consistency, and measurement approaches to estimating interrater reliability. Practical Assessment, Research & Evaluation, 9(4), 66-78.
    • (2004) Practical Assessment, Research & Evaluation, vol. 9, issue 4, pp. 66-78
    • Stemler, S.E.
  • 43
    • 70350518126
    • c-Rater: Automatic content scoring for short constructed responses
    • H. C. Lane & H. W. Guesgen (Eds.) Menlo Park, CA: Association for the Advancement of Artificial Intelligence Press
    • Sukkarieh, J. Z., & Blackmore, J. (2009). c-Rater: Automatic content scoring for short constructed responses. In H. C. Lane & H. W. Guesgen (Eds.), Proceedings of the twenty-second international Florida artificial intelligence research society conference (pp. 290-295). Menlo Park, CA: Association for the Advancement of Artificial Intelligence Press.
    • (2009) Proceedings of the Twenty-second International Florida Artificial Intelligence Research Society Conference, pp. 290-295
    • Sukkarieh, J.Z.; Blackmore, J.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.