Volume 16, Issue 2, 2013, Pages 138-178

An analysis of human factors and label accuracy in crowdsourcing relevance judgments

Author keywords

Crowdsourcing; Relevance judgments; Study of human factors

EID: 84875650055     PISSN: 13864564     EISSN: 15737659     Source Type: Journal    
DOI: 10.1007/s10791-012-9205-0     Document Type: Article
Times cited : (113)

References (58)
  • 3
    • 65249129950
    • Crowdsourcing for relevance evaluation
    • Alonso, O., Rose, D. E., & Stewart, B. (2008). Crowdsourcing for relevance evaluation. SIGIR Forum, 42(2), 9-15.
    • (2008) SIGIR Forum, vol. 42, issue 2, pp. 9-15
    • Alonso, O.; Rose, D.E.; Stewart, B.
  • 9
    • 65549085067
    • Power-law distributions in empirical data
    • Clauset, A., Shalizi, C. R., & Newman, M. E. J. (2009). Power-law distributions in empirical data. SIAM Review, 51(4), 661-703.
    • (2009) SIAM Review, vol. 51, issue 4, pp. 661-703
    • Clauset, A.; Shalizi, C.R.; Newman, M.E.J.
  • 10
    • 84946536292
    • The Cranfield tests on index language devices
    • Cleverdon, C. W. (1967). The Cranfield tests on index language devices. Aslib, 19, 173-192.
    • (1967) Aslib, vol. 19, pp. 173-192
    • Cleverdon, C.W.
  • 16
    • 33845895457
    • Cognitive consequences of forced compliance
    • Festinger, L., & Carlsmith, J. M. (1959). Cognitive consequences of forced compliance. Journal of Abnormal and Social Psychology, 58(2), 203-210. http://psychclassics.yorku.ca/Festinger/.
    • (1959) Journal of Abnormal and Social Psychology, vol. 58, issue 2, pp. 203-210
    • Festinger, L.; Carlsmith, J.M.
  • 17
    • 3343019470
    • Measuring nominal scale agreement among many raters
    • Fleiss, J. L. (1971). Measuring nominal scale agreement among many raters. Psychological Bulletin, 76, 378-382.
    • (1971) Psychological Bulletin, vol. 76, pp. 378-382
    • Fleiss, J.L.
  • 22
    • 84875653914
    • Mechanical Turk: The demographics
    • Ipeirotis, P. (2008). Mechanical Turk: The demographics. Blog post. http://behind-the-enemy-lines.blogspot.com/2008/03/mechanical-turk-demographics.html.
    • (2008) Blog post
    • Ipeirotis, P.
  • 23
    • 84875657089
    • The new demographics of Mechanical Turk
    • Ipeirotis, P. (2010a). The new demographics of Mechanical Turk. Blog post. http://behind-the-enemy-lines.blogspot.com/2010/03/new-demographics-of-mechanical-turk.html.
    • (2010) Blog post
    • Ipeirotis, P.
  • 24
    • 79958122721
    • Analyzing the Amazon Mechanical Turk marketplace
    • Ipeirotis, P. G. (2010b). Analyzing the Amazon Mechanical Turk marketplace. XRDS, 17, 16-21.
    • (2010) XRDS, vol. 17, pp. 16-21
    • Ipeirotis, P.G.
  • 31
    • 70350484988
    • Overview of the INEX 2008 book track
    • Kazai, G., Doucet, A., & Landoni, M. (2008). Overview of the INEX 2008 book track. In INEX (pp. 106-123).
    • (2008) INEX, pp. 106-123
    • Kazai, G.; Doucet, A.; Landoni, M.
  • 36
    • 0017360990
    • The measurement of observer agreement for categorical data
    • Landis, J. R., & Koch, G. G. (1977). The measurement of observer agreement for categorical data. Biometrics, 33(1), 159-174.
    • (1977) Biometrics, vol. 33, issue 1, pp. 159-174
    • Landis, J.R.; Koch, G.G.
  • 37
    • 80052123756
    • Ensuring quality in crowdsourced search relevance evaluation
    • In V. Carvalho, M. Lease, & E. Yilmaz (Eds.) New York, NY: ACM
    • Le, J., Edmonds, A., Hester, V., & Biewald, L. (2010). Ensuring quality in crowdsourced search relevance evaluation. In V. Carvalho, M. Lease, & E. Yilmaz (Eds.), SIGIR Workshop on crowdsourcing for search evaluation (pp. 17-20). New York, NY: ACM.
    • (2010) SIGIR Workshop on crowdsourcing for search evaluation, pp. 17-20
    • Le, J.; Edmonds, A.; Hester, V.; Biewald, L.
  • 41
    • 79952688606
    • Conducting behavioral research on Amazon's Mechanical Turk
    • Mason, W., & Suri, S. (2011). Conducting behavioral research on Amazon's Mechanical Turk. Behavior Research Methods.
    • (2011) Behavior Research Methods
    • Mason, W.; Suri, S.
  • 43
    • 77952357661
    • How reliable are annotations via crowdsourcing: A study about inter-annotator agreement for multi-label image annotation
    • New York, NY: ACM
    • Nowak, S., & Rüger, S. (2010). How reliable are annotations via crowdsourcing: A study about inter-annotator agreement for multi-label image annotation. In MIR '10: Proceedings of the international conference on Multimedia information retrieval (pp. 557-566). New York, NY: ACM.
    • (2010) MIR '10: Proceedings of the international conference on Multimedia information retrieval, pp. 557-566
    • Nowak, S.; Rüger, S.
  • 45
    • 77955600502
    • A taxonomy of distributed human computation
    • University of Maryland
    • Quinn, A. J., & Bederson, B. B. (2009). A taxonomy of distributed human computation. Technical Report HCIL-2009-23. University of Maryland.
    • (2009) Technical Report HCIL-2009-23
    • Quinn, A.J.; Bederson, B.B.
  • 46
    • 79958083139
    • Human computation: A survey and taxonomy of a growing field
    • Quinn, A. J., & Bederson, B. B. (2011). Human computation: A survey and taxonomy of a growing field. In Proceedings of CHI 2011.
    • (2011) Proceedings of CHI 2011
    • Quinn, A.J.; Bederson, B.B.
  • 47
    • 67650085898
    • How does clickthrough data reflect retrieval quality?
    • In J. G. Shanahan, S. Amer-Yahia, I. Manolescu, Y. Zhang, D. A. Evans, A. Kolcz, K. S. Choi, & A. Chowdhury (Eds.) ACM
    • Radlinski, F., Kurup, M., & Joachims, T. (2008). How does clickthrough data reflect retrieval quality? In J. G. Shanahan, S. Amer-Yahia, I. Manolescu, Y. Zhang, D. A. Evans, A. Kolcz, K. S. Choi, & A. Chowdhury (Eds.), CIKM (pp. 43-52). ACM.
    • (2008) CIKM, pp. 43-52
    • Radlinski, F.; Kurup, M.; Joachims, T.
  • 49
    • 80755168394
    • Instrumenting the crowd: using implicit behavioral measures to predict task performance
    • New York, NY: ACM. doi: 10.1145/2047196.2047199
    • Rzeszotarski, J. M., & Kittur, A. (2011). Instrumenting the crowd: using implicit behavioral measures to predict task performance. In Proceedings of the 24th annual ACM symposium on User interface software and technology, UIST '11 (pp. 13-22). New York, NY: ACM. doi: 10.1145/2047196.2047199. url: http://doi.acm.org/10.1145/2047196.2047199.
    • (2011) Proceedings of the 24th annual ACM symposium on User interface software and technology, UIST '11, pp. 13-22
    • Rzeszotarski, J.M.; Kittur, A.
  • 54
    • 0033733783
    • Variations in relevance judgments and the measurement of retrieval effectiveness
    • Voorhees, E. M. (2000). Variations in relevance judgments and the measurement of retrieval effectiveness. Information Processing & Management, 36(5), 697-716.
    • (2000) Information Processing & Management, vol. 36, issue 5, pp. 697-716
    • Voorhees, E.M.
  • 57


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.