



Volume 38, Issue 2, 2008, Pages 189-225

A personalized search engine based on Web-snippet hierarchical clustering

Author keywords

Personalized Web ranking; Search engines; Web snippet clustering

Indexed keywords

CLUSTERING ALGORITHMS; DATA MINING; INFORMATION RETRIEVAL; WEB SERVICES;

EID: 39449131909     PISSN: 00380644     EISSN: 1097024X     Source Type: Journal    
DOI: 10.1002/spe.829     Document Type: Article
Times cited : (60)

References (80)
  • 1
    • 15844394231
    • What's new on the Web? The evolution of the Web from a search engine perspective
    • New York, ACM Press: New York
    • Ntoulas A, Cho J, Olston C. What's new on the Web? The evolution of the Web from a search engine perspective. Proceedings of the 13th International World Wide Web Conference, New York, 2004. ACM Press: New York, 2004; 1-12.
    • (2004) Proceedings of the 13th International World Wide Web Conference , pp. 1-12
    • Ntoulas, A.1    Cho, J.2    Olston, C.3
  • 3
    • 33748988604
    • The popularity and importance of search engines
    • Technical Report, The Pew Internet & American Life Project
    • Fallows D, Rainie L, Mudd G. The popularity and importance of search engines. Technical Report, The Pew Internet & American Life Project, 2004.
    • (2004)
    • Fallows, D.1    Rainie, L.2    Mudd, G.3
  • 4
    • 0142030258
    • A taxonomy of Web search
    • Broder AZ. A taxonomy of Web search. SIGIR Forum 2002; 36(2):3-10.
    • (2002) SIGIR Forum , vol.36 , Issue.2 , pp. 3-10
    • Broder, A.Z.1
  • 5
    • 39449120856
    • Searchenginewatch. http://searchenginewatch.com/showPage.html?page=2156241 [22 April 2007].
  • 6
    • 39449127770
    • 22 April 2007
    • Searchenginelowdown. http://www.searchenginelowdown.com/2004/10/Web-20-exclusive-demonstration-of.html [22 April 2007].
    • Searchenginelowdown
  • 7
    • 39449107548
    • 22 April 2007
    • Microsoft clustering. http://www.betanews.com/article/Microsoft-Tests_Search_Clustering/1106319504 [22 April 2007].
    • Microsoft clustering
  • 8
    • 13244295841
    • Seeking better Web searches
    • February
    • Mostafa J. Seeking better Web searches. Scientific American, February 2005.
    • (2005) Scientific American
    • Mostafa, J.1
  • 11
    • 0004140078
    • An analysis of recent work on clustering algorithms
    • Technical Report 01-03-02, Department of Computer Science and Engineering, University of Washington
    • Fasulo D. An analysis of recent work on clustering algorithms. Technical Report 01-03-02, Department of Computer Science and Engineering, University of Washington, 1999.
    • (1999)
    • Fasulo, D.1
  • 16
    • 0242701229
    • Ephemeral document clustering for Web applications
    • Technical Report RJ 10186, IBM Research
    • Maarek YS, Fagin R, Ben-Shaul IZ, Pelleg D. Ephemeral document clustering for Web applications. Technical Report RJ 10186, IBM Research, 2000.
    • (2000)
    • Maarek, Y.S.1    Fagin, R.2    Ben-Shaul, I.Z.3    Pelleg, D.4
  • 18
    • 39449087408
    • 22 April 2007
    • Google Labs, http://labs.google.com/personalized/ [22 April 2007].
    • Google Labs
  • 19
    • 39449113709
    • My Yahoo! http://mysearch.yahoo.com/ [22 April 2007].
  • 20
    • 39449130873
    • 22 April 2007
    • My Jeeves, http://myjeeves.ask.com/ [22 April 2007].
    • My Jeeves
  • 21
    • 39449118981
    • 22 April 2007
    • Eurekster. http://www.eurekster.com/ [22 April 2007].
    • Eurekster
  • 24
    • 39449114606
    • 22 April 2007
    • SnakeT's queries dataset. http://roquefort.di.unipi.it/~gulli/listAllowed/testSnakeT/ [22 April 2007].
    • SnakeT's queries dataset
  • 25
    • 39449088280
    • Better search results than Google? Next-generation sites help narrow internet searches
    • January, 22 April
    • CNN.com. Better search results than Google? Next-generation sites help narrow internet searches. Associated Press, January 2004. http://edition.cnn.com/2004/TECH/internet/01/05/seeing.searchl.ap/index.html [22 April 2007].
    • (2004) Associated Press
  • 27
    • 39449098215
    • 22 April 2007
    • Time Warner press release. http://media.timewarner.com/media/newmedia/cb_press_view.cfm?release_num=55254348 [22 April 2007].
    • Time Warner press release
  • 28
    • 39449087977
    • Searchenginewatch. http://searchenginewatch.com/showPage.html?page=2226841 [22 April 2007].
  • 30
    • 39449135687
    • Jiang Z, Joshi A, Krishnapuram R, Yi L. Managing business with Electronic Commerce 02. Retriever: Improving Web Search Engine Results Using Clustering, ch. 4, Gangopadhyay A (ed.). Idea Group Publishing: Hershey, PA, 2002.
  • 31
    • 84949743789
    • On combining link and contents information for Web page clustering
    • Aix en Provence, France, Springer: Berlin
    • Wang Y, Kitsuregawa M. On combining link and contents information for Web page clustering. Proceedings of Database and Expert Systems Applications, Aix en Provence, France, 2002. Springer: Berlin, 2002; 902-913.
    • (2002) Proceedings of Database and Expert Systems Applications , pp. 902-913
    • Wang, Y.1    Kitsuregawa, M.2
  • 32
    • 0033294891
    • Grouper: A dynamic clustering interface to Web search results
    • Toronto, Canada, ACM Press: New York
    • Zamir O, Etzioni O. Grouper: A dynamic clustering interface to Web search results. Proceedings of the 8th International World Wide Web Conference, Toronto, Canada, 1999. ACM Press: New York, 1999; 1361-1374.
    • (1999) Proceedings of the 8th International World Wide Web Conference , pp. 1361-1374
    • Zamir, O.1    Etzioni, O.2
  • 33
    • 20844451461
    • Conceptual clustering using lingo algorithm: Evaluation on open directory project data
    • Zakopane, Poland, Springer: Berlin
    • Osinski S, Weiss D. Conceptual clustering using lingo algorithm: Evaluation on open directory project data. Proceedings of New Trends Intelligent Information Processing and Web Mining, Zakopane, Poland, 2004. Springer: Berlin, 2004; 369-377.
    • (2004) Proceedings of New Trends Intelligent Information Processing and Web Mining , pp. 369-377
    • Osinski, S.1    Weiss, D.2
  • 34
    • 8644273327
    • Learning to cluster Web search results
    • Sheffield, U.K, ACM Press: New York
    • Zeng H, He Q, Chen Z, Ma W. Learning to cluster Web search results. Proceedings of SIGIR04, Sheffield, U.K., 2004. ACM Press: New York, 2004; 210-217.
    • (2004) Proceedings of SIGIR04 , pp. 210-217
    • Zeng, H.1    He, Q.2    Chen, Z.3    Ma, W.4
  • 35
    • 35048892005
    • Web search results clustering in Polish: Experimental evaluation of carrot
    • Zakopane, Poland, Springer: Berlin
    • Weiss D, Stefanowski J. Web search results clustering in Polish: Experimental evaluation of carrot. Proceedings of Intelligent Information Processing and Web Mining, Zakopane, Poland, 2003. Springer: Berlin, 2003; 209-218.
    • (2003) Proceedings of Intelligent Information Processing and Web Mining , pp. 209-218
    • Weiss, D.1    Stefanowski, J.2
  • 36
    • 26944452609
    • Large hierarchical document clustering using frequent itemsets
    • San Francisco, CA, SIAM: Philadelphia, PA
    • Fung B, Wang K, Ester M. Large hierarchical document clustering using frequent itemsets. Proceedings of SIAM International Conference on Data Mining, San Francisco, CA, 2003. SIAM: Philadelphia, PA, 2003; 209-218.
    • (2003) Proceedings of SIAM International Conference on Data Mining , pp. 209-218
    • Fung, B.1    Wang, K.2    Ester, M.3
  • 40
  • 41
    • 24144447265
    • Di Giacomo E, Didimo W, Grilli L, Liotta G. A topology-driven approach to the design of Web meta-search clustering engines. Proceedings of the 31st Annual Conference on Current Trends in Theory and Practice of Informatics, Liptovský Ján, Slovakia, 2005. Springer: Berlin, 2005; 106-116.
  • 42
    • 1542287431
    • Generating hierarchical summaries for Web searches
    • Toronto, Canada, ACM Press: New York
    • Lawrie DJ, Croft WB. Generating hierarchical summaries for Web searches. Proceedings of SIGIR03, Toronto, Canada, 2003. ACM Press: New York, 2003; 457-458.
    • (2003) Proceedings of SIGIR03 , pp. 457-458
    • Lawrie, D.J.1    Croft, W.B.2
  • 47
    • 48149096972
    • 22 April 2007
    • Google search history, http://www.google.com/searchhistory/ [22 April 2007].
    • Google search history
  • 48
    • 39449087123
    • 22 April 2007
    • A9 search engine, http://www.a9.com/ [22 April 2007].
    • A9 search engine
  • 49
    • 4544296848
    • The perfect search engine is not enough: A study of orienteering behavior in directed search
    • Vienna, Austria, Springer: Berlin
    • Teevan J, Alvarado C, Ackerman MS, Karger DR. The perfect search engine is not enough: A study of orienteering behavior in directed search. Proceedings of Computer-Human Interaction Conference, Vienna, Austria, 2004. Springer: Berlin, 2004; 415-422.
    • (2004) Proceedings of Computer-Human Interaction Conference , pp. 415-422
    • Teevan, J.1    Alvarado, C.2    Ackerman, M.S.3    Karger, D.R.4
  • 51
    • 0038589165
    • The anatomy of a large-scale hypertextual Web search engine
    • Brisbane, Australia, ACM Press: New York
    • Brin S, Page L. The anatomy of a large-scale hypertextual Web search engine. Proceedings of the 7th International World Wide Web Conference, Brisbane, Australia, 1998. ACM Press: New York, 1998; 107-117.
    • (1998) Proceedings of the 7th International World Wide Web Conference , pp. 107-117
    • Brin, S.1    Page, L.2
  • 56
    • 34250689178
    • Automatic identification of user interest for personalized search
    • Edinburgh, Scotland, U.K, ACM Press: New York
    • Qiu F, Cho J. Automatic identification of user interest for personalized search. Proceedings of the 15th International World Wide Web Conference, Edinburgh, Scotland, U.K., 2006. ACM Press: New York, 2006.
    • (2006) Proceedings of the 15th International World Wide Web Conference
    • Qiu, F.1    Cho, J.2
  • 57
    • 1542317651
    • Exploiting query history for document ranking in interactive information retrieval
    • Toronto, Canada, ACM Press: New York
    • Shen X, Zhai CX. Exploiting query history for document ranking in interactive information retrieval. Proceedings of the 26th Annual International ACM SIGIR (Poster), Toronto, Canada, 2003. ACM Press: New York, 2003; 377-378.
    • (2003) Proceedings of the 26th Annual International ACM SIGIR (Poster) , pp. 377-378
    • Shen, X.1    Zhai, C.X.2
  • 58
    • 84941155576
    • Information filtering based on user behavior analysis and best match text retrieval
    • Dublin, Ireland, ACM Press: New York
    • Morita M, Shinoda Y. Information filtering based on user behavior analysis and best match text retrieval. Proceedings of the 17th Annual International SIGIR, Dublin, Ireland, 1994. ACM Press: New York, 1994; 272-281.
    • (1994) Proceedings of the 17th Annual International SIGIR , pp. 272-281
    • Morita, M.1    Shinoda, Y.2
  • 60
    • 33746034080
    • Communities, collaboration and cooperation in personalized Web search
    • Edinburgh, Scotland, IJCAI Secretary Treasurer: Rochester Hills, MI
    • Freyne J, Smyth B. Communities, collaboration and cooperation in personalized Web search. Proceedings of the 3rd Workshop on Intelligent Techniques for Web Personalization, Edinburgh, Scotland, 2005. IJCAI Secretary Treasurer: Rochester Hills, MI, 2005.
    • (2005) Proceedings of the 3rd Workshop on Intelligent Techniques for Web Personalization
    • Freyne, J.1    Smyth, B.2
  • 61
    • 0033699978
    • Searchpad: Explicit capture of search context to support Web search
    • Amsterdam, The Netherlands, ACM Press: New York
    • Bharat K. Searchpad: Explicit capture of search context to support Web search. Proceedings of the 10th International World Wide Web Conference, Amsterdam, The Netherlands, 2000. ACM Press: New York, 2000; 493-501.
    • (2000) Proceedings of the 10th International World Wide Web Conference , pp. 493-501
    • Bharat, K.1
  • 63
    • 1542299386
    • Comparing clusterings
    • Technical Report 418, University of Washington
    • Meila M. Comparing clusterings. Technical Report 418, University of Washington, 2002.
    • (2002)
    • Meila, M.1
  • 65
    • 39449138530
    • 22 April 2007
    • Nutch. http://www.nutch.org/ [22 April 2007].
    • Nutch
  • 69
    • 39449135993
    • DMOZ, 22 April 2007
    • DMOZ. http://www.dmoz.com/ [22 April 2007].
  • 71
    • 77953048404
    • A personalized search engine based on Web snippet hierarchical clustering
    • Chiba, Japan, ACM Press: New York
    • Ferragina P, Gulli A. A personalized search engine based on Web snippet hierarchical clustering. Proceedings of the 14th International World Wide Web Conference, Chiba, Japan, 2005. ACM Press: New York, 2005; 801-810.
    • (2005) Proceedings of the 14th International World Wide Web Conference , pp. 801-810
    • Ferragina, P.1    Gulli, A.2
  • 72
    • 39449084483
    • 22 April 2007
    • RankComparison Engine, http://rankcomparison.di.unipi.it/ [22 April 2007].
    • RankComparison Engine
  • 73
    • 39449102515
    • 22 April 2007
    • Snowball stemmer. http://snowball.tartarus.org/ [22 April 2007].
    • Snowball stemmer
  • 74
    • 39449101924
    • CPAN, 22 April 2007
    • CPAN. http://www.cpan.org/ [22 April 2007].
  • 75
    • 39449136894
    • 22 April 2007
    • Vivisimo. http://vivisimo.com/docs/personalization.pdf [22 April 2007].
    • Vivisimo
  • 77
    • 39449134064
    • 22 April 2007
    • SnakeT's evaluation results, http://roquefort.di.unipi.it/~gulli/listAllowed/testing/ [22 April 2007].
    • SnakeT's evaluation results
  • 78
    • 0032256758
    • Authoritative sources in a hyper-linked environment
    • San Francisco, CA, SIAM: Philadelphia, PA
    • Kleinberg J. Authoritative sources in a hyper-linked environment. Proceedings of the 9th ACM-SIAM Symposium on Discrete Algorithms, San Francisco, CA, 1998. SIAM: Philadelphia, PA, 1998; 668-677.
    • (1998) Proceedings of the 9th ACM-SIAM Symposium on Discrete Algorithms , pp. 668-677
    • Kleinberg, J.1
  • 79
    • 20444387298
    • A technique for measuring the relative size and overlap of public Web search engines
    • Brisbane, Australia, ACM Press: New York
    • Bharat K, Broder AZ. A technique for measuring the relative size and overlap of public Web search engines. Proceedings of the 7th International World Wide Web Conference, Brisbane, Australia, 1998. ACM Press: New York, 1998; 379-388.
    • (1998) Proceedings of the 7th International World Wide Web Conference , pp. 379-388
    • Bharat, K.1    Broder, A.Z.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.