Volume 31, Issue 11, 1999, Pages 1623-1640

Focused crawling: A new approach to topic-specific Web resource discovery

Author keywords

[No Author keywords available]

Indexed keywords

DATA REDUCTION; DATA STRUCTURES; HYPERTEXT SYSTEMS; SEARCH ENGINES;

EID: 0033294474     PISSN: 13891286     EISSN: None     Source Type: Journal    
DOI: 10.1016/S1389-1286(99)00052-3     Document Type: Article
Times cited : (783)

References (31)
  • 1
    • 17144416189
    • Learning probabilistic user profiles: applications to finding interesting web sites, notifying users of relevant changes to web pages, and locating grant opportunities
    • online at
    • M. Ackerman, D. Billsus, S. Gaffney, S. Hettich, G. Khoo, D. Kim, R. Klefstad, C. Lowe, A. Ludeman, J. Muramatsu, K. Omori, M. Pazzani, D. Semler, B. Starr and P. Yap, Learning probabilistic user profiles: applications to finding interesting web sites, notifying users of relevant changes to web pages, and locating grant opportunities, AI Magazine 18(2): 47-56, 1997, online at http://www.ics.uci.edu/~pazzani/Publications/AI-MAG.pdf.
    • (1997) AI Magazine , vol.18 , Issue.2 , pp. 47-56
    • Ackerman, M.1    Billsus, D.2    Gaffney, S.3    Hettich, S.4    Khoo, G.5    Kim, D.6    Klefstad, R.7    Lowe, C.8    Ludeman, A.9    Muramatsu, J.10    Omori, K.11    Pazzani, M.12    Semler, D.13    Starr, B.14    Yap, P.15
  • 2
    • 0342938822
    • A technique for measuring the relative size and overlap of public web search engines
    • online at also see an update at http://www.research.digital.com/SRC/whatsnew/sem.html
    • K. Bharat and A. Broder, A technique for measuring the relative size and overlap of public web search engines, in: Proc. of the 7th World-Wide Web Conference (WWW7), 1998, online at http://www7.scu.edu.au/programme/fullpapers/1937/com1937.htm; also see an update at http://www.research.digital.com/SRC/whatsnew/sem.html.
    • (1998) In: Proc. of the 7th World-Wide Web Conference (WWW7)
    • Bharat, K.1    Broder, A.2
  • 3
    • 0032283569
    • Improved algorithms for topic distillation in a hyperlinked environment
    • ACM, online at
    • K. Bharat and M. Henzinger, Improved algorithms for topic distillation in a hyperlinked environment, in: SIGIR Conference on Research and Development in Information Retrieval, vol. 21. ACM, 1998, online at ftp://ftp.digital.com/pub/DEC/SRC/publications/monika/sigir98.pdf.
    • (1998) In: SIGIR Conference on Research and Development in Information Retrieval , vol.21
    • Bharat, K.1    Henzinger, M.2
  • 4
    • 0012349158
    • The anatomy of a large-scale hypertextual web search engine
    • online at
    • S. Brin and L. Page, The anatomy of a large-scale hypertextual web search engine, in: Proc. of the 7th World-Wide Web WWW Conference, 1998, online at http://google.stanford.edu/~backrub/google.html.
    • (1998) In: Proc. of the 7th World-Wide Web WWW Conference
    • Brin, S.1    Page, L.2
  • 5
    • 0000776545
    • Scalable feature selection, classification and signature generation for organizing large text databases into hierarchical topic taxonomies
    • S. Chakrabarti, B. Dom, R. Agrawal and P. Raghavan, Scalable feature selection, classification and signature generation for organizing large text databases into hierarchical topic taxonomies, VLDB Journal 7(3): 163-178, 1998.
    • (1998) VLDB Journal , vol.7 , Issue.3 , pp. 163-178
    • Chakrabarti, S.1    Dom, B.2    Agrawal, R.3    Raghavan, P.4
  • 6
    • 0002044911
    • Automatic resource compilation by analyzing hyperlink structure and associated text
    • online at and at http://www.almaden.ibm.com/cs/people/pragh/www98/438.html
    • S. Chakrabarti, B. Dom, D. Gibson, J. Kleinberg, P. Raghavan and S. Rajagopalan, Automatic resource compilation by analyzing hyperlink structure and associated text, in: Proc. of the 7th World-Wide Web Conference (WWW7), 1998, online at http://www7.scu.edu.au/programme/fullpapers/1898/com1898.html and at http://www.almaden.ibm.com/cs/people/pragh/www98/438.html.
    • (1998) In: Proc. of the 7th World-Wide Web Conference (WWW7)
    • Chakrabarti, S.1    Dom, B.2    Gibson, D.3    Kleinberg, J.4    Raghavan, P.5    Rajagopalan, S.6
  • 7
    • 0032090684
    • Enhanced hypertext categorization using hyperlinks
    • online at
    • S. Chakrabarti, B. Dom and P. Indyk, Enhanced hypertext categorization using hyperlinks, in: SIGMOD. ACM, 1998, online at http://www.cs.berkeley.edu/~soumen/sigmod98.ps.
    • (1998) In: SIGMOD. ACM
    • Chakrabarti, S.1    Dom, B.2    Indyk, P.3
  • 12
    • 0023809273
    • Efficient crawling through URL ordering
    • Brisbane, Australia, Apr. online at
    • J. Cho, H. Garcia-Molina and L. Page, Efficient crawling through URL ordering, in: 7th World Wide Web Conference, Brisbane, Australia, Apr. 1998, online at http://www7.scu.edu.au/programme/fullpapers/1919/com1919.htm.
    • (1998) In: 7th World Wide Web Conference
    • Cho, J.1    Garcia-Molina, H.2    Page, L.3
  • 14
    • 0346826859
    • Information retrieval in the world-wide web: making client-based searching feasible
    • Geneva, Switzerland
    • P. DeBra and R. Post, Information retrieval in the world-wide web: making client-based searching feasible, in: Proc. of the 1st International World Wide Web Conference, Geneva, Switzerland, 1994.
    • (1994) In: Proc. of the 1st International World Wide Web Conference
    • Debra, P.1    Post, R.2
  • 15
    • 84884804279
    • Moving up the information food chain: deploying softbots on the world wide web
    • O. Etzioni, Moving up the information food chain: deploying softbots on the world wide web, in: Proc. of AAAI-96, 1996.
    • (1996) In: Proc. of AAAI-96
    • Etzioni, O.1
  • 16
    • 85024269793
    • Small portals prove that size matters
    • December online at and http://www.cs.berkeley.edu/~soumen/focus/DanGillmor19981206.htm.
    • D. Gillmor, Small portals prove that size matters, Tech column in San Jose Mercury News, December 1998, online at http://www.sjmercury.com/columnists/gillmor/docs/dg120698.htm and http://www.cs.berkeley.edu/~soumen/focus/DanGillmor19981206.htm.
    • (1998) Tech Column in San Jose Mercury News
    • Gillmor, D.1
  • 17
    • 0342707412
    • The Shark Search algorithm - An application: Tailored web site mapping
    • Brisbane, Australia, April, online at:
    • M. Hersovici, M. Jacovi, Y.S. Maarek, D. Pelleg, M. Shtalheim, S. Ur, The Shark Search algorithm - An application: Tailored web site mapping, 7th World-Wide Web Conference, 1998, Brisbane, Australia, April, online at: http://www7.scu.edu.au/programme/fullpapers/1849/com1849.htm.
    • (1998) 7th World-Wide Web Conference
    • Hersovici, M.1    Jacovi, M.2    Maarek, Y.S.3    Pelleg, D.4    Shtalheim, M.5    Ur, S.6
  • 18
    • 0000169986
    • WebWatcher: A tour guide for the web
    • August online at
    • T. Joachims, D. Freitag and T. Mitchell, WebWatcher: a tour guide for the web, in: IJCAI, August 1997, online at http://www.cs.cmu.edu/~webwatcher/ijcai97.ps.
    • (1997) In: IJCAI
    • Joachims, T.1    Freitag, D.2    Mitchell, T.3
  • 19
    • 0004347260
    • Scientific American, March and http://www.alexa.com/~brewster/essays/sciam_article.html
    • B. Kahle, Preserving the Internet, Scientific American, March 1997, online at http://www.sciam.com/0397issue/0397kahle.html and http://www.alexa.com/~brewster/essays/sciam_article.html.
    • (1997) Preserving the Internet
    • Kahle, B.1
  • 20
    • 0002827622
    • A new status index derived from sociometric analysis
    • March
    • L. Katz, A new status index derived from sociometric analysis, Psychometrika 18(1): 39-43, March 1953.
    • (1953) Psychometrika , vol.18 , Issue.1 , pp. 39-43
    • Katz, L.1
  • 21
    • 0032256758
    • Authoritative sources in a hyperlinked environment
    • also appears as IBM Research Report RJ 10076(91892) and online at
    • J. Kleinberg, Authoritative sources in a hyperlinked environment, in: Proc. ACM-SIAM Symposium on Discrete Algorithms, 1998, also appears as IBM Research Report RJ 10076(91892) and online at http://www.cs.cornell.edu/home/kleinber/auth.ps.
    • (1998) In: Proc. ACM-SIAM Symposium on Discrete Algorithms
    • Kleinberg, J.1
  • 22
    • 0032478628
    • Searching the world wide web
    • April
    • S. Lawrence and C.L. Giles, Searching the world wide web, Science 280: 98-100, April 1998.
    • (1998) Science , vol.280 , pp. 98-100
    • Lawrence, S.1    Giles, C.L.2
  • 25
    • 84928445598
    • Techniques for disaggregating centrality scores in social networks
    • in: N.B. Tuma (Ed.), Jossey-Bass, San Francisco
    • M.S. Mizruchi, P. Mariolis, M. Schwartz and B. Mintz, Techniques for disaggregating centrality scores in social networks, in: N.B. Tuma (Ed.), Sociological Methodology, pp. 26-48, Jossey-Bass, San Francisco, 1986.
    • (1986) Sociological Methodology , pp. 26-48
    • Mizruchi, M.S.1    Mariolis, P.2    Schwartz, M.3    Mintz, B.4
  • 26
    • 0003309997
    • Text classification from labeled and unlabeled documents using EM
    • online at
    • K. Nigam, A. McCallum, S. Thrun and T. Mitchell, Text classification from labeled and unlabeled documents using EM, Machine Learning, 1999, online at http://www.cs.cmu.edu/~knigam/papers/emcat-mlj99.ps.gz.
    • (1999) Machine Learning
    • Nigam, K.1    McCallum, A.2    Thrun, S.3    Mitchell, T.4
  • 29
    • 0003457176
    • Analysis of a very large AltaVista query log
    • COMPAQ System Research Center, October online at
    • C. Silverstein, M. Henzinger, H. Marais and M. Moricz, Analysis of a very large AltaVista query log, Technical Report 1998-014, COMPAQ System Research Center, October 1998, online at http://gatekeeper.dec.com/pub/DEC/SRC/technical-notes/abstracts/src-tn-1998-014.html.
    • (1998) Technical Report 1998-014
    • Silverstein, C.1    Henzinger, M.2    Marais, H.3    Moricz, M.4
  • 30
    • 0031619255
    • Finding and visualizing inter-site clan graphs
    • Los Angeles, CA, April ACM SIGCHI, online at and http://www.acm.org/pubs/articles/proceedings/chi/274644/p448-terveen/p448-terveen.pdf
    • L. Terveen and W. Hill, Finding and visualizing inter-site clan graphs, in: Computer Human Interaction (CHI), pp. 448-455, Los Angeles, CA, April 1998, ACM SIGCHI, online at http://www.research.att.com/~terveen/chi98.htm and http://www.acm.org/pubs/articles/proceedings/chi/274644/p448-terveen/p448-terveen.pdf.
    • (1998) In: Computer Human Interaction (CHI) , pp. 448-455
    • Terveen, L.1    Hill, W.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.