메뉴 건너뛰기




Volumn 13, Issue 5, 2010, Pages 568-600

Entity ranking in Wikipedia: Utilising categories, links and topic difficulty prediction

Author keywords

Entity ranking; INEX; Wikipedia; XML Retrieval

Indexed keywords


EID: 77957147134     PISSN: 13864564     EISSN: 15737659     Source Type: Journal    
DOI: 10.1007/s10791-009-9125-9     Document Type: Article
Times cited : (27)

References (51)
  • 1
    • 84865724706 scopus 로고    scopus 로고
    • Adelberg, B., & Denny, M. (1999). Nodose version 2. 0. In Proceedings of the 1999 ACM SIGMOD international conference on management of data (SIGMOD'99), Philadelphia, Pennsylvania, pp. 559-561.
  • 2
    • 38049125336 scopus 로고    scopus 로고
    • Awang Iskandar, D., Pehcevski, J., Thom, J. A., & Tahaghoghi, S. M. M. (2007). Social media retrieval using image features and structured text. In Comparative evaluation of XML information retrieval systems: Fifth workshop of the INitiative for the evaluation of XML retrieval, INEX 2006, Lecture notes in computer science, Vol. 4518, pp. 358-372.
  • 3
    • 36448932681 scopus 로고    scopus 로고
    • Bast, H., Chitea, A., Suchanek, F., & Weber, I. (2007). ESTER: Efficient search on text, entities, and relations. In Proceedings of the 30th ACM international conference on research and development in information retrieval (SIGIR'07), Amsterdam, The Netherlands, pp. 671-678.
  • 4
    • 77957105687 scopus 로고    scopus 로고
    • Blanchard, E., Harzallah, M., & Henri Briand, P. K. (2005). A typology of ontology-based semantic measures. In Proceedings of the open interop workshop on enterprise modelling and ontologies for interoperability (EMOI-INTEROP'05), Porto, Portugal. http://www. sunsite. informatik. rwth-aachen. de/Publications/CEUR-WS/Vol-160/paper26. pdf.
  • 5
    • 0035478854 scopus 로고    scopus 로고
    • Random forests
    • Breiman, L. (2001). Random forests. Machine Learning 45(1), 5-32.
    • (2001) Machine Learning , vol.45 , Issue.1 , pp. 5-32
    • Breiman, L.1
  • 6
    • 0038589165 scopus 로고    scopus 로고
    • Brin, S., & Page, L. (1998). The anatomy of a large-scale hypertextual Web search engine. In Proceedings of the 7th international conference on world wide web, Brisbane, Australia, pp. 107-117.
  • 7
    • 8644241107 scopus 로고    scopus 로고
    • Cai, D., He, X., Wen, J. R., & Ma, W. Y. (2004). Block-level link analysis. In Proceedings of the 27th ACM international conference on research and development in information retrieval (SIGIR'04), Sheffield, UK, pp. 440-447.
  • 8
    • 0038156232 scopus 로고    scopus 로고
    • Callan, J., & Mitamura, T. (2002). Knowledge-based extraction of named entities. In Proceedings of the 11th ACM conference on information and knowledge management (CIKM'02), McLean, Virginia, pp. 532-537.
  • 9
    • 33751375485 scopus 로고    scopus 로고
    • Predicting query difficulty-methods and applications
    • Carmel, D., Yom-Tov, E., & Soboroff, I. (2005). Predicting query difficulty-methods and applications. SIGIR Forum 39(2), 25-28.
    • (2005) SIGIR Forum , vol.39 , Issue.2 , pp. 25-28
    • Carmel, D.1    Yom-Tov, E.2    Soboroff, I.3
  • 10
    • 0036989577 scopus 로고    scopus 로고
    • Cronen-Townsend, S., Zhou, Y., & Croft, W. B. (2002). Predicting query performance. In Proceedings of the 25th ACM SIGIR conference on research and development in information retrieval (SIGIR'02), Tampere, Finland, pp. 299-306.
  • 11
    • 80053379324 scopus 로고    scopus 로고
    • Cucerzan, S. (2007). Large-scale named entity disambiguation based on Wikipedia data. In Proceedings of the 2007 joint conference on EMNLP and CoNLL, Prague, The Czech Republic, pp. 708-716.
  • 12
    • 77957159077 scopus 로고    scopus 로고
    • Cucerzan, S., & Yarowsky, D. (1999). Language independent named entity recognition combining morphological and contextual evidence. In Proceedings of the 1999 joint SIGDAT conference on EMNLP and VLC, Maryland, MD, pp. 90-99.
  • 13
    • 51849085310 scopus 로고    scopus 로고
    • de Vries A. P., Vercoustre A. M., Thom J. A., Craswell N., & Lalmas M. (2008). Overview of the INEX 2007 entity ranking track. In Focused access to XML documents: Sixth international workshop of the initiative for the evaluation of XML retrieval, INEX 2007, Lecture notes in computer science, Vol. 4862, pp. 1-23.
  • 14
    • 70350485025 scopus 로고    scopus 로고
    • Demartini, G., de Vries, A. P., Iofciu, T., & Zhu, J. (2009). Overview of the INEX 2008 entity ranking track. In Advances in focused retrieval: Seventh international workshop of the initiative for the evaluation of XML retrieval, INEX 2008, Lecture notes in computer science, Vol. 5631.
  • 15
    • 34547439207 scopus 로고    scopus 로고
    • The Wikipedia XML corpus
    • Denoyer, L., & Gallinari, P. (2006). The Wikipedia XML corpus. SIGIR Forum 40(1), 64-69.
    • (2006) SIGIR Forum , vol.40 , Issue.1 , pp. 64-69
    • Denoyer, L.1    Gallinari, P.2
  • 16
    • 84871006508 scopus 로고    scopus 로고
    • Ehrig, M., Haase, P., Stojanovic, N., & Hefke, M. (2005). Similarity for ontologies-a comprehensive framework. In Proceedings of the 13th European conference on information systems.
  • 17
    • 85130930851 scopus 로고    scopus 로고
    • Fissaha Adafre, S., de Rijke, M., & Sang, E. T. K. (2007). Entity retrieval. In Proceedings of international conference on recent advances in natural language processing (RANLP-2007), September 27-29, Borovets, Bulgaria.
  • 18
    • 77957163875 scopus 로고    scopus 로고
    • Grivolla, J., Jourlin, P., & de Mori, R. (2005). Automatic classification of queries by expected retrieval performance. In Proceedings of the SIGIR workshop on predicting query difficulty, Salvador, Brazil.
  • 19
    • 33845445794 scopus 로고    scopus 로고
    • Hassell, J., Aleman-Meza, B., & Arpinar, I. B. (2006). Ontology-driven automatic entity disambiguation in unstructured text. In Proceedings of the 5th international semantic web conference (ISWC), Athens, GA, Lecture notes in computer science, Vol. 4273, pp. 44-57.
  • 20
    • 33747187264 scopus 로고    scopus 로고
    • Query performance prediction
    • He, B., & Ounis, I. (2006). Query performance prediction. Information Systems 31(7), 585-594.
    • (2006) Information Systems , vol.31 , Issue.7 , pp. 585-594
    • He, B.1    Ounis, I.2
  • 21
    • 33751382309 scopus 로고    scopus 로고
    • Hu, G., Liu, J., Li, H., Cao, Y., Nie, J. Y., & Gao, J. (2006). A supervised learning approach to entity search. In Proceedings of the Asia information retrieval symposium (AIRS 2006). Lecture notes in computer science, Vol. 4182, pp. 54-66.
  • 22
    • 77957147219 scopus 로고    scopus 로고
    • Kamps, J., & Larsen, B. (2006). Understanding differences between search requests in XML element retrieval. In Proceedings of the SIGIR 2006 workshop on XML element retrieval methodology, Seattle, Washington, pp. 13-19.
  • 23
    • 77957145729 scopus 로고    scopus 로고
    • Kaptein, R., & Kamps, J. (2009). Finding entities or information using annotations. In ECIR workshop on information retrieval over social networks, pp. 71-78.
  • 24
    • 80053362929 scopus 로고    scopus 로고
    • Kazama, J., & Torisawa, K. (2007). Exploiting Wikipedia as external knowledge for named entity recognition. In Proceedings of the 2007 joint conference on EMNLP and CoNLL, Prague, The Czech Republic, pp. 698-707.
  • 25
    • 4243148480 scopus 로고    scopus 로고
    • Authoritative sources in hyperlinked environment
    • Kleinberg, J. M. (1999). Authoritative sources in hyperlinked environment. Journal of the ACM 46(5), 604-632.
    • (1999) Journal of the ACM , vol.46 , Issue.5 , pp. 604-632
    • Kleinberg, J.M.1
  • 26
    • 0034172374 scopus 로고    scopus 로고
    • Wrapper induction: Efficiency and expressiveness
    • Kushmerick, N. (2000). Wrapper induction: Efficiency and expressiveness. Artificial Intelligence 118(1-2), 15-68.
    • (2000) Artificial Intelligence , vol.118 , Issue.1-2 , pp. 15-68
    • Kushmerick, N.1
  • 27
    • 77957154751 scopus 로고    scopus 로고
    • Kwok, K. (2005). An attempt to identify weakest and strongest queries. In Proceedings of the SIGIR workshop on predicting query difficulty, Salvador, Brazil.
  • 30
    • 77957134892 scopus 로고    scopus 로고
    • Loper, E., & Bird, S. (2002). NLTK: The natural language toolkit. In Proceedings of the ACL-02 workshop on effective tools and methodologies for teaching natural language processing and computational linguistics, Philadelphia, Pennsylvania, pp. 63-70.
  • 31
    • 41849135039 scopus 로고    scopus 로고
    • Mizzaro, S. (2008). The good, the bad, the difficult, and the easy: Something wrong with information retrieval evaluation? In Proceedings of the 30th European conference on information retrieval (ECIR'08), Lecture Notes in Computer Science, Vol. 4956, pp. 642-646.
  • 32
    • 36448959373 scopus 로고    scopus 로고
    • Mizzaro, S., & Robertson, S. (2007). HITS hits TREC: Exploring IR evaluation results with network analysis. In Proceedings of the 30th ACM SIGIR conference on research and development in information retrieval (SIGIR'07), Amsterdam, The Netherlands, pp. 479-486.
  • 33
    • 77957140731 scopus 로고    scopus 로고
    • Mothe, J., & Tanguy, L. (2005). Linguistic features to predict query difficulty. In Proceedings of the SIGIR workshop on predicting query difficulty, Salvador, Brazil.
  • 34
    • 33750405448 scopus 로고    scopus 로고
    • Nie, L., Davison, B. D., & Qi, X. (2006). Topical link analysis for web search. In Proceedings of the 29th ACM international conference on research and development in information retrieval (SIGIR'06), Seattle, Washington, pp. 91-98.
  • 35
    • 17444380836 scopus 로고    scopus 로고
    • Hybrid XML retrieval: Combining information retrieval and a native XML database
    • Pehcevski, J., Thom, J. A., & Vercoustre, A. M. (2005). Hybrid XML retrieval: Combining information retrieval and a native XML database. Information Retrieval 8(4), 571-600.
    • (2005) Information Retrieval , vol.8 , Issue.4 , pp. 571-600
    • Pehcevski, J.1    Thom, J.A.2    Vercoustre, A.M.3
  • 36
    • 41849145332 scopus 로고    scopus 로고
    • Pehcevski, J., Vercoustre, A. M., & Thom, J. A. (2008). Exploiting locality of Wikipedia links in entity ranking. In Proceedings of the 30th European conference on information retrieval (ECIR'08), Lecture notes in computer science, Vol. 4956, pp. 258-269.
  • 37
    • 77957108994 scopus 로고    scopus 로고
    • Quinlan, J. R. (1993). C4. 5: Programs for machine learning. Morgan Kaufmann Publishers, Inc.
  • 38
    • 77957114016 scopus 로고    scopus 로고
    • Sahuguet, A., & Azavant, F. (1999). Building light-weight wrappers for legacy web data-sources using W4F. In Proceedings of 25th international conference on very large data bases (VLDB'99), Edinburgh, Scotland, UK, pp. 738-741.
  • 39
    • 84873532856 scopus 로고    scopus 로고
    • Soboroff, I., de Vries, A. P., & Craswell, N. (2006). Overview of the TREC 2006 Enterprise track. In Proceedings of the fifteenth text retrieval conference (TREC 2006), pp. 32-51.
  • 40
    • 84876713855 scopus 로고    scopus 로고
    • Thom, J. A., Pehcevski, J., & Vercoustre, A. M. (2007). Use of Wikipedia categories in entity ranking. In Proceedings of 12th Australasian document computing symposium (ADCS'07), Melbourne, Australia, pp. 56-63.
  • 41
    • 51849114763 scopus 로고    scopus 로고
    • Tsikrika, T., Serdyukov, P., Rode, H., Westerveld, T., Aly, R., Hiemstra, D., et al. (2008). Structured document retrieval, multimedia retrieval, and entity ranking using PF/Tijah. In Focused access to XML documents: Sixth international workshop of the initiative for the evaluation of XML retrieval, INEX 2007, Lecture notes in computer science, Vol. 4862, pp. 306-320.
  • 42
    • 77957135666 scopus 로고    scopus 로고
    • Vercoustre, A. M., & Paradis, F. (1997). A descriptive language for information object reuse through virtual documents. In 4th International conference on object-oriented information systems (OOIS'97), Brisbane, Australia, pp. 299-311.
  • 43
    • 51849093123 scopus 로고    scopus 로고
    • Vercoustre, A. M., Pehcevski, J., & Thom, J. A. (2008a). Using Wikipedia categories and links in entity ranking. In Focused access to XML documents: Sixth international workshop of the initiative for the evaluation of XML retrieval, INEX 2007, Lecture notes in computer science, vol. 4862, pp. 321-335.
  • 44
    • 51849092997 scopus 로고    scopus 로고
    • Vercoustre, A. M., Thom, J. A., & Pehcevski, J. (2008b). Entity ranking in Wikipedia. In Proceedings of the 23rd ACM symposium on applied computing, Fortaleza, Ceará, Brazil, pp. 1101-1106.
  • 45
    • 70350498878 scopus 로고    scopus 로고
    • Vercoustre, A. M., Pehcevski, J., & Naumovski, V. (2009). Topic difficulty prediction in entity ranking. In Advances in focused retrieval: Seventh international workshop of the initiative for the evaluation of XML retrieval, INEX 2008, Lecture notes in computer science, Vol. 5631.
  • 46
    • 77957138213 scopus 로고    scopus 로고
    • Voorhees, E. M. (2004). The TREC robust retrieval track. In Proceedings of the thirteenth text retrieval conference (TREC 2004).
  • 47
    • 57349160444 scopus 로고    scopus 로고
    • Webber, W., Moffat, A., & Zobel, J. (2008). Score standardization for inter-collection comparison of retrieval systems. In Proceedings of the 31st ACM SIGIR conference on research and development in information retrieval (SIGIR'08), Singapore, pp. 51-58.
  • 48
    • 77957167086 scopus 로고    scopus 로고
    • Witten, I. H., & Frank, E. (2005). Data mining: Practical machine learning tools and techniques, second edition. Morgan Kaufmann Publishers, Inc.
  • 49
    • 77957116533 scopus 로고    scopus 로고
    • Yom-Tov, E., Fine, S., Carmel, D., Darlow, A., & Amitay, E. (2004). Juru at TREC 2004: Experiments with prediction of query difficulty. In Proceedings of the thirteenth text retrieval conference (TREC 2004).
  • 50
    • 58049111201 scopus 로고    scopus 로고
    • Yu, J., Thom, J. A., & Tam, A. (2007). Ontology evaluation using Wikipedia categories for browsing. In Proceedings of the 16th ACM conference on information and knowledge management (CIKM'07), Lisboa, Portugal, pp. 223-232.
  • 51
    • 36448977901 scopus 로고    scopus 로고
    • Zhou, Y., & Croft, W. B. (2007). Query performance prediction in web search environments. In Proceedings of the 30th ACM SIGIR conference on research and development in information retrieval (SIGIR'07), Amsterdam, The Netherlands, pp. 543-550.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.