메뉴 건너뛰기




Volumn , Issue , 2013, Pages 659-668

Users versus models: What observation tells us about effectiveness metrics

Author keywords

Evaluation; Retrieval experiment; System measurement

Indexed keywords

EFFECTIVENESS METRICS; EVALUATION; INFORMATION SEEKING; RELEVANCE JUDGMENT; RELEVANT DOCUMENTS; RETRIEVAL SYSTEMS; SYSTEM MEASUREMENT; TASK PERFORMANCE;

EID: 84889574389     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2505515.2507665     Document Type: Conference Paper
Times cited : (85)

References (27)
  • 1
    • 0016355478 scopus 로고
    • A new look at the statistical model identification
    • H. Akaike. A new look at the statistical model identification. IEEE Trans. Automatic Control, 19(6):716-723, 1974.
    • (1974) IEEE Trans. Automatic Control , vol.19 , Issue.6 , pp. 716-723
    • Akaike, H.1
  • 2
    • 36448971554 scopus 로고    scopus 로고
    • The relationship between IR effectiveness measures and user satisfaction
    • Amsterdam, The Netherlands
    • A. Al-Maskari, M. Sanderson, and P. Clough. The relationship between IR effectiveness measures and user satisfaction. In Proc. SIGIR, pages 773-774, Amsterdam, The Netherlands, 2007.
    • (2007) Proc. SIGIR , pp. 773-774
    • Al-Maskari, A.1    Sanderson, M.2    Clough, P.3
  • 3
    • 33747880883 scopus 로고    scopus 로고
    • Retrieval system evaluation
    • E. M. Voorhees and D. K. Harman, editors chapter 3 MIT Press, Cambridge, Massachusetts
    • C. Buckley and E. M. Voorhees. Retrieval system evaluation. In E. M. Voorhees and D. K. Harman, editors, TREC: Experiment and Evaluation in Information Retrieval, chapter 3, pages 53-75. MIT Press, Cambridge, Massachusetts, 2005.
    • (2005) TREC: Experiment and Evaluation in Information Retrieval , pp. 53-75
    • Buckley, C.1    Voorhees, E.M.2
  • 5
    • 80052123590 scopus 로고    scopus 로고
    • System effectiveness, user models, and user utility: A conceptual framework for investigation
    • Beijing, China
    • B. Carterette. System effectiveness, user models, and user utility: A conceptual framework for investigation. In Proc. SIGIR, pages 903-912, Beijing, China, 2011.
    • (2011) Proc. SIGIR , pp. 903-912
    • Carterette, B.1
  • 6
    • 83055191831 scopus 로고    scopus 로고
    • Simulating simple user behavior for system effectiveness evaluation
    • Glasgow, Scotland
    • B. Carterette, E. Kanoulas, and E. Yilmaz. Simulating simple user behavior for system effectiveness evaluation. In Proc. CIKM, pages 611-620, Glasgow, Scotland, 2011.
    • (2011) Proc. CIKM , pp. 611-620
    • Carterette, B.1    Kanoulas, E.2    Yilmaz, E.3
  • 7
    • 74549208546 scopus 로고    scopus 로고
    • Expected reciprocal rank for graded relevance
    • Hong Kong, China
    • O. Chapelle, D. Metzler, Y. Zhang, and P. Grinspan. Expected reciprocal rank for graded relevance. In Proc. CIKM, pages 621-630, Hong Kong, China, 2009.
    • (2009) Proc. CIKM , pp. 621-630
    • Chapelle, O.1    Metzler, D.2    Zhang, Y.3    Grinspan, P.4
  • 8
    • 84865624822 scopus 로고    scopus 로고
    • A dynamic Bayesian network click model for web search ranking
    • Madrid, Spain
    • O. Chapelle and Y. Zhang. A dynamic Bayesian network click model for web search ranking. In Proc. WWW, pages 1-10, Madrid, Spain, 2009.
    • (2009) Proc. WWW , pp. 1-10
    • Chapelle, O.1    Zhang, Y.2
  • 9
    • 42549140738 scopus 로고    scopus 로고
    • An experimental comparison of click position-bias models
    • Palo Alto, CA
    • N. Craswell, O. Zoeter, M. J. Taylor, and B. Ramsey. An experimental comparison of click position-bias models. In Proc. WSDM, pages 87-94, Palo Alto, CA, 2008.
    • (2008) Proc. WSDM , pp. 87-94
    • Craswell, N.1    Zoeter, O.2    Taylor, M.J.3    Ramsey, B.4
  • 10
    • 80053968060 scopus 로고    scopus 로고
    • Discounted cumulative gain and user decision models
    • Pisa, Italy
    • G. Dupret. Discounted cumulative gain and user decision models. In Proc. SPIRE, pages 2-13, Pisa, Italy, 2011.
    • (2011) Proc. SPIRE , pp. 2-13
    • Dupret, G.1
  • 11
    • 77956046532 scopus 로고    scopus 로고
    • A user behavior model for average precision and its generalization to graded judgments
    • Geneva, Switzerland
    • G. Dupret and B. Piwowarski. A user behavior model for average precision and its generalization to graded judgments. In Proc. SIGIR, pages 531-538, Geneva, Switzerland, 2010.
    • (2010) Proc. SIGIR , pp. 531-538
    • Dupret, G.1    Piwowarski, B.2
  • 12
    • 36448951157 scopus 로고    scopus 로고
    • How well does result relevance predict session satisfaction?
    • Amsterdam, The Netherlands
    • S. B. Huffman and M. Hochster. How well does result relevance predict session satisfaction? In Proc. SIGIR, pages 567-574, Amsterdam, The Netherlands, 2007.
    • (2007) Proc. SIGIR , pp. 567-574
    • Huffman, S.B.1    Hochster, M.2
  • 13
  • 14
    • 84885665252 scopus 로고    scopus 로고
    • Accurately interpreting clickthrough data as implicit feedback
    • Salvador, Brazil
    • T. Joachims, L. Granka, B. Pan, H. Hembrooke, and G. Gay. Accurately interpreting clickthrough data as implicit feedback. In Proc. SIGIR, pages 154-161, Salvador, Brazil, 2005.
    • (2005) Proc. SIGIR , pp. 154-161
    • Joachims, T.1    Granka, L.2    Pan, B.3    Hembrooke, H.4    Gay, G.5
  • 15
    • 83055179261 scopus 로고    scopus 로고
    • Relative effect of spam and irrelevant documents on user interaction with search engines
    • Glasgow
    • T. Jones, D. Hawking, P. Thomas, and R. Sankaranarayana. Relative effect of spam and irrelevant documents on user interaction with search engines. In Proc. CIKM, pages 2113-2116, Glasgow, 2011.
    • (2011) Proc. CIKM , pp. 2113-2116
    • Jones, T.1    Hawking, D.2    Thomas, P.3    Sankaranarayana, R.4
  • 16
    • 84889592613 scopus 로고    scopus 로고
    • Seven numeric properties of effectiveness metrics
    • To appear
    • A. Moffat. Seven numeric properties of effectiveness metrics. In Proc. AIRS, 2013. To appear.
    • (2013) Proc. AIRS
    • Moffat, A.1
  • 18
    • 66949147248 scopus 로고    scopus 로고
    • Rank-biased precision for measurement of retrieval effectiveness
    • A. Moffat and J. Zobel. Rank-biased precision for measurement of retrieval effectiveness. ACM Trans. Information Systems, 27(1):2:1-2:27, 2008.
    • (2008) ACM Trans. Information Systems , vol.27 , Issue.1 , pp. 212-227
    • Moffat, A.1    Zobel, J.2
  • 19
    • 57349087085 scopus 로고    scopus 로고
    • A new interpretation of average precision
    • Singapore
    • S. Robertson. A new interpretation of average precision. In Proc. SIGIR, pages 689-690, Singapore, 2008.
    • (2008) Proc. SIGIR , pp. 689-690
    • Robertson, S.1
  • 20
    • 84871057318 scopus 로고    scopus 로고
    • Stochastic simulation of time-biased gain
    • Maui, Hawaii
    • M. D. Smucker and C. L. A. Clarke. Stochastic simulation of time-biased gain. In Proc. CIKM, pages 2040-2044, Maui, Hawaii, 2012.
    • (2012) Proc. CIKM , pp. 2040-2044
    • Smucker, M.D.1    Clarke, C.L.A.2
  • 21
    • 84866603223 scopus 로고    scopus 로고
    • Time-based calibration of effectiveness measures
    • Portland, Oregon
    • M. D. Smucker and C. L. A. Clarke. Time-based calibration of effectiveness measures. In Proc. SIGIR, pages 95-104, Portland, Oregon, 2012.
    • (2012) Proc. SIGIR , pp. 95-104
    • Smucker, M.D.1    Clarke, C.L.A.2
  • 22
    • 80052129492 scopus 로고    scopus 로고
    • What deliberately degrading search quality tells us about discount functions
    • Beijing
    • P. Thomas, T. Jones, and D. Hawking. What deliberately degrading search quality tells us about discount functions. In Proc. SIGIR, pages 1107-1108, Beijing, 2011.
    • (2011) Proc. SIGIR , pp. 1107-1108
    • Thomas, P.1    Jones, T.2    Hawking, D.3
  • 23
  • 24
    • 33750351305 scopus 로고    scopus 로고
    • User performance versus precision measures for simple search tasks
    • Seattle, Washington
    • A. Turpin and F. Scholer. User performance versus precision measures for simple search tasks. In Proc. SIGIR, pages 11-18, Seattle, Washington, 2006.
    • (2006) Proc. SIGIR , pp. 11-18
    • Turpin, A.1    Scholer, F.2
  • 25
    • 84867460358 scopus 로고    scopus 로고
    • Grannies, tanning beds, tattoos and NASCAR: Evaluation of search tasks with varying levels of cognitive complexity
    • Nijmegen, The Netherlands
    • W.-C. Wu, D. Kelly, A. Edwards, and J. Arguello. Grannies, tanning beds, tattoos and NASCAR: Evaluation of search tasks with varying levels of cognitive complexity. In Proc. 4th Information Interaction in Context Symp., pages 254-257, Nijmegen, The Netherlands, 2012.
    • (2012) Proc. 4th Information Interaction in Context Symp , pp. 254-257
    • Wu, W.-C.1    Kelly, D.2    Edwards, A.3    Arguello, J.4
  • 26
    • 78651268113 scopus 로고    scopus 로고
    • Expected browsing utility for web search evaluation
    • Toronto, Canada
    • E. Yilmaz, M. Shokouhi, N. Craswell, and S. Robertson. Expected browsing utility for web search evaluation. In Proc. CIKM, pages 1561-1564, Toronto, Canada, 2010.
    • (2010) Proc. CIKM , pp. 1561-1564
    • Yilmaz, E.1    Shokouhi, M.2    Craswell, N.3    Robertson, S.4
  • 27
    • 76349123386 scopus 로고    scopus 로고
    • Click-based evidence for decaying weight distributions in search effectiveness metrics
    • Y. Zhang, L. A. F. Park, and A. Moffat. Click-based evidence for decaying weight distributions in search effectiveness metrics. Information Retrieval, 13(1):46-69, 2010.
    • (2010) Information Retrieval , vol.13 , Issue.1 , pp. 46-69
    • Zhang, Y.1    Park, L.A.F.2    Moffat, A.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.