메뉴 건너뛰기




Volumn 8173 LNCS, Issue , 2014, Pages 116-163

Metrics, statistics, tests

Author keywords

[No Author keywords available]

Indexed keywords

INFORMATION RETRIEVAL;

EID: 84901306933     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-54798-0_6     Document Type: Conference Paper
Times cited : (50)

References (121)
  • 2
    • 34547647257 scopus 로고    scopus 로고
    • Retrieval evaluation with incomplete relevance data: A comparative study of three measures
    • Ahlgren, P., Grönqvist, L.: Retrieval evaluation with incomplete relevance data: A comparative study of three measures. In: Proceedings of ACM CIKM 2006, pp. 872-873 (2006)
    • (2006) Proceedings of ACM CIKM 2006 , pp. 872-873
    • Ahlgren, P.1    Grönqvist, L.2
  • 5
    • 85015342057 scopus 로고    scopus 로고
    • A methodology for evaluating aggregated search results
    • Clough, P., Foley, C., Gurrin, C., Jones, G.J.F., Kraaij, W., Lee, H., Mudoch, V. (eds.) ECIR 2011. Springer, Heidelberg
    • Arguello, J., Diaz, F., Callan, J., Carterette, B.: A methodology for evaluating aggregated search results. In: Clough, P., Foley, C., Gurrin, C., Jones, G.J.F., Kraaij, W., Lee, H., Mudoch, V. (eds.) ECIR 2011. LNCS, vol. 6611, pp. 141- 152. Springer, Heidelberg (2011)
    • (2011) LNCS , vol.6611 , pp. 141-152
    • Arguello, J.1    Diaz, F.2    Callan, J.3    Carterette, B.4
  • 6
    • 77957131892 scopus 로고    scopus 로고
    • Expected reading effort in focused retrieval evaluation
    • Arvola, P., Kekäläinen, J., Junkkari, M.: Expected reading effort in focused retrieval evaluation. Information Retrieval 13(5), 460-484 (2010)
    • (2010) Information Retrieval , vol.13 , Issue.5 , pp. 460-484
    • Arvola, P.1    Kekäläinen, J.2    Junkkari, M.3
  • 7
    • 84871631227 scopus 로고    scopus 로고
    • On the informativeness of cascade and intent-aware effectiveness measures
    • Ashkan, A., Clarke, C.L.: On the informativeness of cascade and intent-aware effectiveness measures. In: Proceedings of ACM SIGIR 2013, pp. 407-416 (2011)
    • (2011) Proceedings of ACM SIGIR 2013 , pp. 407-416
    • Ashkan, A.1    Clarke, C.L.2
  • 8
    • 33746264710 scopus 로고    scopus 로고
    • The maximum entropy method for analyzing retrieval measures
    • Aslam, J.A., Yilmaz, E., Pavlu, V.: The maximum entropy method for analyzing retrieval measures. In: Proceedings of ACM SIGIR 2005, pp. 27-34 (2005)
    • (2005) Proceedings of ACM SIGIR 2005 , pp. 27-34
    • Aslam, J.A.1    Yilmaz, E.2    Pavlu, V.3
  • 9
    • 74549121417 scopus 로고    scopus 로고
    • Usage based effectiveness measures
    • Azzopardi, L.: Usage based effectiveness measures. In: Proceedings of ACM CIKM 2009, pp. 631-640 (2009)
    • (2009) Proceedings of ACM CIKM 2009 , pp. 631-640
    • Azzopardi, L.1
  • 10
    • 84866615871 scopus 로고    scopus 로고
    • Time drives interaction: Simulating sessions in diverse searching environments
    • Baskaya, F., Keskustalo, H., Järvelin, K.: Time drives interaction: Simulating sessions in diverse searching environments. In: Proceedings of ACM SIGIR 2012, pp. 105-114 (2012)
    • (2012) Proceedings of ACM SIGIR 2012 , pp. 105-114
    • Baskaya, F.1    Keskustalo, H.2    Järvelin, K.3
  • 11
    • 36448947171 scopus 로고    scopus 로고
    • Test theory for assessing IR test collections
    • Bodoff, D., Li, P.: Test theory for assessing IR test collections. In: Proceedings of ACM SIGIR 2007, pp. 367-374 (2007)
    • (2007) Proceedings of ACM SIGIR 2007 , pp. 367-374
    • Bodoff, D.1    Li, P.2
  • 12
  • 14
    • 0142030258 scopus 로고    scopus 로고
    • A taxonomy of web search
    • Broder, A.: A taxonomy of web search. SIGIR Forum 36(2) (2002)
    • (2002) SIGIR Forum , vol.36 , Issue.2
    • Broder, A.1
  • 19
    • 72449194290 scopus 로고    scopus 로고
    • On rank correlation and the distance between rankings
    • Carterette, B.: On rank correlation and the distance between rankings. In: Proceedings of ACM SIGIR 2009, pp. 436-443 (2009)
    • (2009) Proceedings of ACM SIGIR 2009 , pp. 436-443
    • Carterette, B.1
  • 20
    • 80052123590 scopus 로고    scopus 로고
    • System effectiveness, user models, and user utility: A conceptual framework for investigation
    • Carterette, B.: System effectiveness, user models, and user utility: A conceptual framework for investigation. In: Proceedings of ACM SIGIR 2011, pp. 903-912 (2011)
    • (2011) Proceedings of ACM SIGIR 2011 , pp. 903-912
    • Carterette, B.1
  • 21
    • 84859011471 scopus 로고    scopus 로고
    • Multiple testing in statistical analysis of systems-based information retrieval experiments
    • Carterette, B.: Multiple testing in statistical analysis of systems-based information retrieval experiments. ACM TOIS 30(1) (2012)
    • (2012) ACM TOIS , vol.30 , Issue.1
    • Carterette, B.1
  • 24
    • 84880779561 scopus 로고    scopus 로고
    • Analysis of various evaluation measures for diversity
    • Chandar, P., Carterette, B.: Analysis of various evaluation measures for diversity. In: Proceedings of DDR 2011, pp. 21-28 (2011)
    • (2011) Proceedings of DDR 2011 , pp. 21-28
    • Chandar, P.1    Carterette, B.2
  • 26
    • 80255123851 scopus 로고    scopus 로고
    • Intent-based diversification of web search results: Metrics and algorithms
    • Chapelle, O., Ji, S., Liao, C., Velipasaoglu, E., Lai, L., Wu, S.L.: Intent-based diversification of web search results: Metrics and algorithms. Information Retrieval 14(6), 572-592 (2011)
    • (2011) Information Retrieval , vol.14 , Issue.6 , pp. 572-592
    • Chapelle, O.1    Ji, S.2    Liao, C.3    Velipasaoglu, E.4    Lai, L.5    Wu, S.L.6
  • 28
    • 85107916112 scopus 로고
    • MUC-4 evaluation metrics
    • Chinchor, N.: MUC-4 evaluation metrics. In: Proceedings of MUC-4, pp. 22-29 (1992)
    • (1992) Proceedings of MUC-4 , pp. 22-29
    • Chinchor, N.1
  • 33
    • 70350576772 scopus 로고    scopus 로고
    • An effectiveness measure for ambiguous and underspecified queries
    • Azzopardi, L., Kazai, G., Robertson, S., Rüger, S., Shokouhi, M., Song, D., Yilmaz, E. (eds.) ICTIR 2009. Springer, Heidelberg
    • Clarke, C.L.A., Kolla, M., Vechtomova, O.: An effectiveness measure for ambiguous and underspecified queries. In: Azzopardi, L., Kazai, G., Robertson, S., Rüger, S., Shokouhi, M., Song, D., Yilmaz, E. (eds.) ICTIR 2009. LNCS, vol. 5766, pp. 188-199. Springer, Heidelberg (2009)
    • (2009) LNCS , vol.5766 , pp. 188-199
    • Clarke, C.L.A.1    Kolla, M.2    Vechtomova, O.3
  • 35
    • 0002064245 scopus 로고
    • Expected search length: A single measure of retrieval effectiveness based on the weak ordering action of retrieval systems
    • Cooper, W.S.: Expected search length: A single measure of retrieval effectiveness based on the weak ordering action of retrieval systems. JASIS 19(1), 30-41 (1968)
    • (1968) JASIS , vol.19 , Issue.1 , pp. 30-41
    • Cooper, W.S.1
  • 36
    • 0015604498 scopus 로고
    • On selecting a measure of retrieval effectiveness
    • Cooper, W.S.: On selecting a measure of retrieval effectiveness. JASIS 24(2), 87-100 (1973)
    • (1973) JASIS , vol.24 , Issue.2 , pp. 87-100
    • Cooper, W.S.1
  • 37
    • 0015681463 scopus 로고
    • On selecting a measure of retrieval effectiveness: Part II. Implementation of the philosophy
    • Cooper, W.S.: On selecting a measure of retrieval effectiveness: Part II. Implementation of the philosophy. JASIS 24(6), 413-424 (1973)
    • (1973) JASIS , vol.24 , Issue.6 , pp. 413-424
    • Cooper, W.S.1
  • 39
    • 44649166130 scopus 로고    scopus 로고
    • Different structures for evaluating answers to complex questions: Pyramids won't topple, and neither will human assessors
    • Dang, H., Lin, J.: Different structures for evaluating answers to complex questions: Pyramids won't topple, and neither will human assessors. In: Proceedings of ACL 2007, pp. 768-775 (2007)
    • (2007) Proceedings of ACL 2007 , pp. 768-775
    • Dang, H.1    Lin, J.2
  • 40
    • 33750363214 scopus 로고    scopus 로고
    • Rpref: A generalization of bpref towards graded relevance judgments
    • De Beer, J., Moens, M.F.: Rpref: A generalization of bpref towards graded relevance judgments. In: Proceedings of ACM SIGIR 2006, pp. 637-638 (2006)
    • (2006) Proceedings of ACM SIGIR 2006 , pp. 637-638
    • De Beer, J.1    Moens, M.F.2
  • 41
    • 1842680293 scopus 로고    scopus 로고
    • Measuring retrieval effectiveness: A new proposal and a first experimental validation
    • Della Mea, V., Mizzaro, S.: Measuring retrieval effectiveness: A new proposal and a first experimental validation. JASIST 55(6), 503-543 (2004)
    • (2004) JASIST , vol.55 , Issue.6 , pp. 503-543
    • Della Mea, V.1    Mizzaro, S.2
  • 42
    • 0030650239 scopus 로고    scopus 로고
    • Time, relevance and interaction modelling for information retrieval
    • Dunlop,M.D.: Time, relevance and interaction modelling for information retrieval. In: Proceedings of ACM SIGIR 1997, pp. 206-213 (1997)
    • (1997) Proceedings of ACM SIGIR 1997 , pp. 206-213
    • Dunlop, M.D.1
  • 45
    • 84886070538 scopus 로고    scopus 로고
    • NTCIR9-GeoTime overview - Evaluating geographic and temporal search: Round 2
    • Gey, F., Larson, R., Machado, J., Yoshioka, M.: NTCIR9-GeoTime overview - evaluating geographic and temporal search: Round 2. In: Proceedings of NTCIR-9, pp. 9-17 (2011)
    • (2011) Proceedings of NTCIR-9 , pp. 9-17
    • Gey, F.1    Larson, R.2    Machado, J.3    Yoshioka, M.4
  • 47
    • 0027725490 scopus 로고
    • Using statistical testing in the evaluation of retrieval experiments
    • Hull, D.: Using statistical testing in the evaluation of retrieval experiments. In: Proceedings of ACM SIGIR 1993. pp. 329-338 (1993)
    • (1993) Proceedings of ACM SIGIR 1993 , pp. 329-338
    • Hull, D.1
  • 48
    • 33846563409 scopus 로고    scopus 로고
    • Why most published research findings are false
    • Ioannidis, J.P.: Why most published research findings are false. PLoS Med. 2(8) (2005)
    • (2005) PLoS Med , vol.2 , Issue.8
    • Ioannidis, J.P.1
  • 51
    • 0032808670 scopus 로고    scopus 로고
    • The insignificance of statistical significance testing
    • Johnson, D.H.: The insignificance of statistical significance testing. The Journal of Wildlife Management 63(3), 763-772 (1999)
    • (1999) The Journal of Wildlife Management , vol.63 , Issue.3 , pp. 763-772
    • Johnson, D.H.1
  • 52
    • 51849123809 scopus 로고    scopus 로고
    • INEX 2007 evaluation measures
    • Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. Springer, Heidelberg
    • Kamps, J., Pehcevski, J., Kazai, G., Lalmas, M., Robertson, S.: INEX 2007 evaluation measures. In: Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. LNCS, vol. 4862, pp. 24-33. Springer, Heidelberg (2008)
    • (2008) LNCS , vol.4862 , pp. 24-33
    • Kamps, J.1    Pehcevski, J.2    Kazai, G.3    Lalmas, M.4    Robertson, S.5
  • 53
    • 74549195035 scopus 로고    scopus 로고
    • Empirical justification of the gain and discount function for nDCG
    • Kanoulas, E., Aslam, J.A.: Empirical justification of the gain and discount function for nDCG. In: ACM CIKM 2009, pp. 611-620 (2009)
    • (2009) ACM CIKM 2009 , pp. 611-620
    • Kanoulas, E.1    Aslam, J.A.2
  • 55
    • 84883070185 scopus 로고    scopus 로고
    • Report from the NTCIR-10 1CLICK-2 Japanese subtask: Baselines, upperbounds and evaluation robustness
    • Kato, M.P., Sakai, T., Yamamoto, T., Iwata, M.: Report from the NTCIR-10 1CLICK-2 Japanese subtask: Baselines, upperbounds and evaluation robustness. In: Proceedings of ACM SIGIR 2013 (2013)
    • (2013) Proceedings of ACM SIGIR 2013
    • Kato, M.P.1    Sakai, T.2    Yamamoto, T.3    Iwata, M.4
  • 56
    • 0036851939 scopus 로고    scopus 로고
    • Using graded relevance assessments in IR evaluation
    • Kekäläinen, J., Järvelin, K.: Using graded relevance assessments in IR evaluation. JASIST 53(13), 1120-1129 (2002)
    • (2002) JASIST , vol.53 , Issue.13 , pp. 1120-1129
    • Kekäläinen, J.1    Järvelin, K.2
  • 57
    • 33749530226 scopus 로고    scopus 로고
    • Property of average precision and its generalization: An examination of evaluation indicator for information retrieval
    • Kishida, K.: Property of average precision and its generalization: An examination of evaluation indicator for information retrieval. NII Technical Reports NII-2005-014E (2005)
    • (2005) NII Technical Reports NII-2005-014E
    • Kishida, K.1
  • 59
    • 84871050880 scopus 로고    scopus 로고
    • A comprehensive analysis of parameter settings for novelty-biased cumulative gain
    • Leenanupab, T., Zuccon, G., Jose, J.M.: A comprehensive analysis of parameter settings for novelty-biased cumulative gain. In: Proceedings of ACM CIKM 2012, pp. 1950-1954 (2012)
    • (2012) Proceedings of ACM CIKM 2012 , pp. 1950-1954
    • Leenanupab, T.1    Zuccon, G.2    Jose, J.M.3
  • 61
    • 33748772836 scopus 로고    scopus 로고
    • Methods for automatically evaluating answers to complex questions
    • Lin, J., Demner-Fushman, D.: Methods for automatically evaluating answers to complex questions. Information Retrieval 9(5), 565-587 (2006)
    • (2006) Information Retrieval , vol.9 , Issue.5 , pp. 565-587
    • Lin, J.1    Demner-Fushman, D.2
  • 62
    • 77956042082 scopus 로고    scopus 로고
    • PRES: A score metric for evaluating recall-oriented information retrieval applications
    • Magdy, W., Jones, G.J.: PRES: A score metric for evaluating recall-oriented information retrieval applications. In: Proceedings of ACM SIGIR 2010, pp. 611-618 (2010)
    • (2010) Proceedings of ACM SIGIR 2010 , pp. 611-618
    • Magdy, W.1    Jones, G.J.2
  • 63
    • 66949147248 scopus 로고    scopus 로고
    • Rank-biased precision for measurement of retrieval effectiveness
    • Moffat, A., Zobel, J.: Rank-biased precision for measurement of retrieval effectiveness. ACM TOIS 27(1) (2008)
    • (2008) ACM TOIS , vol.27 , Issue.1
    • Moffat, A.1    Zobel, J.2
  • 65
    • 34249275304 scopus 로고    scopus 로고
    • The pyramid method: Incorporating human content selection variation in summarization evaluation
    • Nenkova, A., Passonneau, R., McKeown, K.: The pyramid method: Incorporating human content selection variation in summarization evaluation. ACM Transactions on Speech and Language Processing 4(2), Article 4 (2007)
    • (2007) ACM Transactions on Speech and Language Processing , vol.4 , Issue.2 , pp. 4
    • Nenkova, A.1    Passonneau, R.2    McKeown, K.3
  • 67
    • 16244364485 scopus 로고
    • Measures for the comparison of information retrieval systems
    • Pollock, S.M.: Measures for the comparison of information retrieval systems. American Documentation 19(4), 387-397 (1968)
    • (1968) American Documentation , vol.19 , Issue.4 , pp. 387-397
    • Pollock, S.M.1
  • 69
    • 0017630891 scopus 로고
    • The probability ranking principle in IR
    • Robertson, S.E.: The probability ranking principle in IR. Journal of Documentation 33, 130-137 (1977)
    • (1977) Journal of Documentation , vol.33 , pp. 130-137
    • Robertson, S.E.1
  • 71
    • 57349087085 scopus 로고    scopus 로고
    • A new interpretation of average precision
    • Robertson, S.E.: A new interpretation of average precision. In: Proceedings of ACM SIGIR 2008, pp. 689-690 (2008)
    • (2008) Proceedings of ACM SIGIR 2008 , pp. 689-690
    • Robertson, S.E.1
  • 74
    • 84901381577 scopus 로고    scopus 로고
    • New performance metrics based on multigrade relevance: Their application to question answering
    • Sakai, T.: New performance metrics based on multigrade relevance: Their application to question answering. In: Proceedings of NTCIR-4 (Open Submission Session) (2004)
    • Proceedings of NTCIR-4 (Open Submission Session) (2004)
    • Sakai, T.1
  • 75
    • 24344471313 scopus 로고    scopus 로고
    • Ranking the NTCIR systems based on multigrade relevance
    • Myaeng, S.-H., Zhou, M., Wong, K.-F., Zhang, H.-J. (eds.) AIRS 2004. Springer, Heidelberg
    • Sakai, T.: Ranking the NTCIR systems based on multigrade relevance. In: Myaeng, S.-H., Zhou, M., Wong, K.-F., Zhang, H.-J. (eds.) AIRS 2004. LNCS, vol. 3411, pp. 251-262. Springer, Heidelberg (2005)
    • (2005) LNCS , vol.3411 , pp. 251-262
    • Sakai, T.1
  • 76
    • 33751354079 scopus 로고    scopus 로고
    • Bootstrap-based comparisons of IR metrics for finding one relevant document
    • Ng, H.T., Leong, M.-K., Kan, M.-Y., Ji, D. (eds.) AIRS 2006. Springer, Heidelberg
    • Sakai, T.: Bootstrap-based comparisons of IR metrics for finding one relevant document. In: Ng, H.T., Leong, M.-K., Kan, M.-Y., Ji, D. (eds.) AIRS 2006. LNCS, vol. 4182, pp. 374-389. Springer, Heidelberg (2006)
    • (2006) LNCS , vol.4182 , pp. 374-389
    • Sakai, T.1
  • 77
    • 33750340100 scopus 로고    scopus 로고
    • Evaluating evaluation metrics based on the bootstrap
    • Sakai, T.: Evaluating evaluation metrics based on the bootstrap. In: Proceedings of ACM SIGIR 2006, pp. 525-532 (2006)
    • (2006) Proceedings of ACM SIGIR 2006 , pp. 525-532
    • Sakai, T.1
  • 78
    • 38149100610 scopus 로고    scopus 로고
    • For building better retrieval systems: Trends in information retrieval evaluation based on graded relevance
    • in Japanese
    • Sakai, T.: For building better retrieval systems: Trends in information retrieval evaluation based on graded relevance (in Japanese). IPSJ Magazine 47(2), 147-158 (2006)
    • (2006) IPSJ Magazine , vol.47 , Issue.2 , pp. 147-158
    • Sakai, T.1
  • 80
    • 57349094546 scopus 로고    scopus 로고
    • On penalising late arrival of relevant documents in information retrieval evaluation with graded relevance
    • Sakai, T.: On penalising late arrival of relevant documents in information retrieval evaluation with graded relevance. In: Proceedings of EVIA 2007, pp. 32-43 (2007)
    • (2007) Proceedings of EVIA 2007 , pp. 32-43
    • Sakai, T.1
  • 81
    • 70349242289 scopus 로고    scopus 로고
    • Comparing metrics across TREC and NTCIR: The robustness to system bias
    • Sakai, T.: Comparing metrics across TREC and NTCIR: The robustness to system bias. In: Proceedings of ACM CIKM 2008, pp. 581-590 (2008)
    • (2008) Proceedings of ACM CIKM 2008 , pp. 581-590
    • Sakai, T.1
  • 82
    • 84860858401 scopus 로고    scopus 로고
    • Evaluation with informational and navigational intents
    • Sakai, T.: Evaluation with informational and navigational intents. In: Proceedings of WWW 2012, pp. 499-508 (2012)
    • (2012) Proceedings of WWW 2012 , pp. 499-508
    • Sakai, T.1
  • 83
    • 84901374927 scopus 로고    scopus 로고
    • How intuitive are diversified search metrics? Concordance test results for the diversity U-measures
    • Sakai, T.: How intuitive are diversified search metrics? Concordance test results for the diversity U-measures. IPSJ SIG Technical Report 2013-IFAT-111 (2013)
    • (2013) IPSJ SIG Technical Report 2013-IFAT-111
    • Sakai, T.1
  • 84
    • 84883084005 scopus 로고    scopus 로고
    • The unreusability of diversified test collections
    • Sakai, T.: The unreusability of diversified test collections. In: Proceedings of EVIA 2013 (2013)
    • (2013) Proceedings of EVIA 2013
    • Sakai, T.1
  • 85
    • 84883092343 scopus 로고    scopus 로고
    • Summaries, ranked retrieval and sessions: A unified framework for information access evaluation
    • Sakai, T., Dou, Z.: Summaries, ranked retrieval and sessions: A unified framework for information access evaluation. In: Proceedings of ACM SIGIR 2013, pp. 473-482 (2013)
    • (2013) Proceedings of ACM SIGIR 2013 , pp. 473-482
    • Sakai, T.1    Dou, Z.2
  • 87
    • 84871603799 scopus 로고    scopus 로고
    • The reusability of a diversified search test collection
    • Hou, Y., Nie, J.-Y., Sun, L., Wang, B., Zhang, P. (eds.) AIRS 2012. Springer, Heidelberg
    • Sakai, T., Dou, Z., Song, R., Kando, N.: The reusability of a diversified search test collection. In: Hou, Y., Nie, J.-Y., Sun, L., Wang, B., Zhang, P. (eds.) AIRS 2012. LNCS, vol. 7675, pp. 26-38. Springer, Heidelberg (2012)
    • (2012) LNCS , vol.7675 , pp. 26-38
    • Sakai, T.1    Dou, Z.2    Song, R.3    Kando, N.4
  • 89
    • 50849122035 scopus 로고    scopus 로고
    • On information retrieval metrics designed for evaluation with incomplete relevance assessments
    • Sakai, T., Kando, N.: On information retrieval metrics designed for evaluation with incomplete relevance assessments. Information Retrieval 11, 447-470 (2008)
    • (2008) Information Retrieval , vol.11 , pp. 447-470
    • Sakai, T.1    Kando, N.2
  • 90
    • 84871582413 scopus 로고    scopus 로고
    • One click one revisited: Enhancing evaluation based on information units
    • Hou, Y., Nie, J.-Y., Sun, L., Wang, B., Zhang, P. (eds.) AIRS 2012. Springer, Heidelberg
    • Sakai, T., Kato, M.P.: One click one revisited: Enhancing evaluation based on information units. In: Hou, Y., Nie, J.-Y., Sun, L., Wang, B., Zhang, P. (eds.) AIRS 2012. LNCS, vol. 7675, pp. 39-51. Springer, Heidelberg (2012)
    • (2012) LNCS , vol.7675 , pp. 39-51
    • Sakai, T.1    Kato, M.P.2
  • 91
    • 83055168077 scopus 로고    scopus 로고
    • Click the search button and be happy: Evaluating direct and immediate information access
    • Sakai, T., Kato, M.P., Song, Y.I.: Click the search button and be happy: Evaluating direct and immediate information access. In: Proceedings of ACM CIKM 2011, pp. 621-630 (2011)
    • (2011) Proceedings of ACM CIKM 2011 , pp. 621-630
    • Sakai, T.1    Kato, M.P.2    Song, Y.I.3
  • 92
    • 76349094521 scopus 로고    scopus 로고
    • Modelling a user population for designing information retrieval metrics
    • Sakai, T., Robertson, S.: Modelling a user population for designing information retrieval metrics. In: Proceedings of EVIA 2008, pp. 30-41 (2008)
    • (2008) Proceedings of EVIA 2008 , pp. 30-41
    • Sakai, T.1    Robertson, S.2
  • 94
    • 80052111133 scopus 로고    scopus 로고
    • Evaluating diversified search results using per-intent graded relevance
    • Sakai, T., Song, R.: Evaluating diversified search results using per-intent graded relevance. In: Proceedings of ACM SIGIR 2011 (2011)
    • (2011) Proceedings of ACM SIGIR 2011
    • Sakai, T.1    Song, R.2
  • 95
    • 84880838418 scopus 로고    scopus 로고
    • Diversified search evaluation: Lessons from the NTCIR-9 INTENT task
    • Sakai, T., Song, R.: Diversified search evaluation: Lessons from the NTCIR-9 INTENT task. Information Retrieval (2013)
    • (2013) Information Retrieval
    • Sakai, T.1    Song, R.2
  • 96
  • 97
    • 77954220071 scopus 로고    scopus 로고
    • Test collection based evaluation of information retrieval systems
    • Sanderson, M.: Test collection based evaluation of information retrieval systems. Foundations and Trends in Information Retrieval 4, 247-375 (2010)
    • (2010) Foundations and Trends in Information Retrieval , vol.4 , pp. 247-375
    • Sanderson, M.1
  • 99
    • 84885608872 scopus 로고    scopus 로고
    • Information retrieval system evaluation: Effort, sensitivity, and reliability
    • Sanderson, M., Zobel, J.: Information retrieval system evaluation: Effort, sensitivity, and reliability. In: Proceedings of ACM SIGIR 2005, pp. 162-169 (2005)
    • (2005) Proceedings of ACM SIGIR 2005 , pp. 162-169
    • Sanderson, M.1    Zobel, J.2
  • 100
    • 0031193029 scopus 로고    scopus 로고
    • Statistical inference in retrieval effectiveness evaluation
    • Savoy, J.: Statistical inference in retrieval effectiveness evaluation. Information Processing and Management 33(4), 495-512 (1997)
    • (1997) Information Processing and Management , vol.33 , Issue.4 , pp. 495-512
    • Savoy, J.1
  • 101
    • 63449088172 scopus 로고    scopus 로고
    • A comparison of statistical significance tests for information retrieval evaluation
    • Smucker, M.D., Allan, J., Carterette, B.: A comparison of statistical significance tests for information retrieval evaluation. In: Proceedings of ACM CIKM 2007, pp. 623-632 (2007)
    • (2007) Proceedings of ACM CIKM 2007 , pp. 623-632
    • Smucker, M.D.1    Allan, J.2    Carterette, B.3
  • 106
    • 8644262918 scopus 로고    scopus 로고
    • The philosophy of information retrieval evaluation
    • Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.) CLEF 2001. Springer, Heidelberg
    • Voorhees, E.M.: The philosophy of information retrieval evaluation. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.) CLEF 2001. LNCS, vol. 2406, pp. 355-370. Springer, Heidelberg (2002)
    • (2002) LNCS , vol.2406 , pp. 355-370
    • Voorhees, E.M.1
  • 107
    • 0036993119 scopus 로고    scopus 로고
    • The effect of topic set size on retrieval experiment error
    • Voorhees, E.M., Buckley, C.: The effect of topic set size on retrieval experiment error. In: Proceedings of ACM SIGIR 2002, pp. 316-323 (2002)
    • (2002) Proceedings of ACM SIGIR 2002 , pp. 316-323
    • Voorhees, E.M.1    Buckley, C.2
  • 109
    • 57349160444 scopus 로고    scopus 로고
    • Score standardization for inter-collection comparison of retrieval systems
    • Webber, W., Moffat, A., Zobel, J.: Score standardization for inter-collection comparison of retrieval systems. In: Proceedings of ACM SIGIR 2008, pp. 51-58 (2008)
    • (2008) Proceedings of ACM SIGIR 2008 , pp. 51-58
    • Webber, W.1    Moffat, A.2    Zobel, J.3
  • 111
    • 80052117619 scopus 로고    scopus 로고
    • The effect of pooling and evaluation depth on metric stability
    • Webber, W., Moffat, A., Zobel, J.: The effect of pooling and evaluation depth on metric stability. In: Proceedings of EVIA 2010, pp. 7-15 (2010)
    • (2010) Proceedings of EVIA 2010 , pp. 7-15
    • Webber, W.1    Moffat, A.2    Zobel, J.3
  • 112
    • 80051482719 scopus 로고    scopus 로고
    • A similarity measure for indefinite rankings
    • Webber, W., Moffat, A., Zobel, J.: A similarity measure for indefinite rankings. ACM TOIS 28(4) (2010)
    • (2010) ACM TOIS , vol.28 , Issue.4
    • Webber, W.1    Moffat, A.2    Zobel, J.3
  • 113
    • 72449157957 scopus 로고    scopus 로고
    • Score adjustment for correction of pooling bias
    • Webber, W., Park, L.A.: Score adjustment for correction of pooling bias. In: Proceedings of ACM SIGIR 2009, pp. 444-451 (2009)
    • (2009) Proceedings of ACM SIGIR 2009 , pp. 444-451
    • Webber, W.1    Park, L.A.2
  • 114
    • 70449488015 scopus 로고    scopus 로고
    • Modeling expected utility of multi-session information distillation
    • Azzopardi, L., Kazai, G., Robertson, S., Rüger, S., Shokouhi, M., Song, D., Yilmaz, E. (eds.) ICTIR 2009. Springer, Heidelberg
    • Yang, Y., Lad, A.: Modeling expected utility of multi-session information distillation. In: Azzopardi, L., Kazai, G., Robertson, S., Rüger, S., Shokouhi, M., Song, D., Yilmaz, E. (eds.) ICTIR 2009. LNCS, vol. 5766, pp. 164-175. Springer, Heidelberg (2009)
    • (2009) LNCS , vol.5766 , pp. 164-175
    • Yang, Y.1    Lad, A.2
  • 115
    • 57349152359 scopus 로고    scopus 로고
    • A new rank correlation coefficient for information retrieval
    • Yilmaz, E., Aslam, J., Robertson, S.: A new rank correlation coefficient for information retrieval. In: Proceedings of ACM SIGIR 2008, pp. 587-594 (2008)
    • (2008) Proceedings of ACM SIGIR 2008 , pp. 587-594
    • Yilmaz, E.1    Aslam, J.2    Robertson, S.3
  • 116
    • 34547632535 scopus 로고    scopus 로고
    • Estimating average precision with incomplete and imperfect judgments
    • Yilmaz, E., Aslam, J.A.: Estimating average precision with incomplete and imperfect judgments. In: ACM CIKM 2006 Proceedings, pp. 102-111 (2006)
    • (2006) ACM CIKM 2006 Proceedings , pp. 102-111
    • Yilmaz, E.1    Aslam, J.A.2
  • 118
    • 1542347826 scopus 로고    scopus 로고
    • Beyond independent relevance: Methods and evaluation metrics for subtopic retrieval
    • Zhai, C., Cohen, W.W., Lafferty, J.: Beyond independent relevance: Methods and evaluation metrics for subtopic retrieval. In: Proceedings of ACM SIGIR 2003, pp. 10-17 (2003)
    • (2003) Proceedings of ACM SIGIR 2003 , pp. 10-17
    • Zhai, C.1    Cohen, W.W.2    Lafferty, J.3
  • 119
    • 76349123386 scopus 로고    scopus 로고
    • Click-based evidence for decaying weight distributions in search effectiveness metrics
    • Zhang, Y., Park, L.A.F., Moffat, A.: Click-based evidence for decaying weight distributions in search effectiveness metrics. Information Retrieval 13(1), 46-69 (2010)
    • (2010) Information Retrieval , vol.13 , Issue.1 , pp. 46-69
    • Zhang, Y.1    Park, L.A.F.2    Moffat, A.3
  • 121
    • 0032272626 scopus 로고    scopus 로고
    • How reliable are the results of large-scale information retrieval experiments?
    • Zobel, J.: How reliable are the results of large-scale information retrieval experiments? In: Proceedings of ACM SIGIR 1998, pp. 307-314 (1998)
    • (1998) Proceedings of ACM SIGIR 1998 , pp. 307-314
    • Zobel, J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.