메뉴 건너뛰기




Volumn , Issue , 2002, Pages 148-159

Accelerated focused crawling through online relevance feedback

Author keywords

Document object model; Focused crawling; Reinforcement learning

Indexed keywords

AUTOMATIC PROGRAMS; DOCUMENT OBJECT MODEL; FALSE POSITIVE; FOCUSED CRAWLER; FOCUSED CRAWLING; HUMAN BEHAVIORS; INFORMATION NEED; OPEN DIRECTORY; RECTANGULAR REGIONS; RELEVANCE FEEDBACK; RESOURCE DISCOVERY; TREE STRUCTURES;

EID: 77953064623     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/511446.511466     Document Type: Conference Paper
Times cited : (180)

References (34)
  • 1
    • 84874371227 scopus 로고    scopus 로고
    • Intelligent crawling on the World Wide Web with arbitrary predicates
    • ACM. Online at
    • C. C. Aggarwal, F. Al-Garawi, and P. S. Yu. Intelligent crawling on the World Wide Web with arbitrary predicates. In WWW2001, Hong Kong, May 2001. ACM. Online at http: //www10.org/cdrom/papers/110/.
    • WWW2001, Hong Kong, May 2001
    • Aggarwal, C.C.1    Al-Garawi, F.2    Yu, P.S.3
  • 2
    • 0028461417 scopus 로고
    • Automated learning of decision rules for text categorization
    • IBM Research Report RC18879
    • C. Apte, F. Damerau, and S. M. Weiss. Automated learning of decision rules for text categorization. ACM Transactions on Information Systems, 1994. IBM Research Report RC18879.
    • (1994) ACM Transactions on Information Systems
    • Apte, C.1    Damerau, F.2    Weiss, S.M.3
  • 3
    • 0031620208 scopus 로고    scopus 로고
    • Combining labeled and unlabeled data with co-training
    • A. Blum and T. M. Mitchell. Combining labeled and unlabeled data with co-training. In Computational Learning Theory, pages 92-100, 1998.
    • (1998) Computational Learning Theory , pp. 92-100
    • Blum, A.1    Mitchell, T.M.2
  • 4
    • 84944134642 scopus 로고    scopus 로고
    • Integrating the document object model with hyperlinks for enhanced topic distillation and information extraction
    • Online at
    • S. Chakrabarti. Integrating the document object model with hyperlinks for enhanced topic distillation and information extraction. In WWW10, Hong Kong, May 2001. Online at http://www10.org/cdrom/papers/489.
    • WWW10, Hong Kong, May 2001
    • Chakrabarti, S.1
  • 5
    • 0000776545 scopus 로고    scopus 로고
    • Scalable feature selection, classification and signature generation for organizing large text databases into hierarchical topic taxonomies
    • Aug. Online at
    • S. Chakrabarti, B. Dom, R. Agrawal, and P. Raghavan. Scalable feature selection, classification and signature generation for organizing large text databases into hierarchical topic taxonomies. VLDB Journal, Aug. 1998. Online at http: //www.cs.berkeley.edu/~soumen/VLDB54-3.PDF.
    • (1998) VLDB Journal
    • Chakrabarti, S.1    Dom, B.2    Agrawal, R.3    Raghavan, P.4
  • 7
    • 0032090684 scopus 로고    scopus 로고
    • Enhanced hypertext categorization using hyperlinks
    • ACM, Online at
    • S. Chakrabarti, B. Dom, and P. Indyk. Enhanced hypertext categorization using hyperlinks. In SIGMOD Conference. ACM, 1998. Online at http://www.cs.berkeley.edu/~soumen/sigmod98.ps.
    • (1998) SIGMOD Conference
    • Chakrabarti, S.1    Dom, B.2    Indyk, P.3
  • 9
    • 0033294474 scopus 로고    scopus 로고
    • Focused crawling: A new approach to topic-specific web resource discovery
    • S. Chakrabarti, M. van den Berg, and B. Dom. Focused crawling: a new approach to topic-specific web resource discovery. Computer Networks, 31:1623-1640, 1999.
    • (1999) Computer Networks , vol.31 , pp. 1623-1640
    • Chakrabarti, S.1    Van Den Berg, M.2    Dom, B.3
  • 14
    • 77953049489 scopus 로고    scopus 로고
    • Searching for arbitrary information in the WWW: The fish search for Mosaic
    • Online at
    • P. M. E. De Bra and R. D. J. Post. Searching for arbitrary information in the WWW: The fish search for Mosaic. In Second World Wide Web Conference '94: Mosaic and the Web, Chicago, Oct. 1994. Online at http://archive.ncsa.uiuc.edu/ SDG/IT94/Proceedings/Searching/debra/article.html and http: //citeseer.nj.nec. com/172936.html.
    • Second World Wide Web Conference '94: Mosaic and the Web, Chicago, Oct. 1994
    • De Bra, P.M.E.1    Post, R.D.J.2
  • 16
    • 34248847962 scopus 로고
    • A method for disambiguating word senses in a large corpus
    • W. A. Gale, K. W. Church, and D. Yarowsky. A method for disambiguating word senses in a large corpus. Computer and the Humanities, 26:415-439, 1993.
    • (1993) Computer and the Humanities , vol.26 , pp. 415-439
    • Gale, W.A.1    Church, K.W.2    Yarowsky, D.3
  • 17
    • 33745765905 scopus 로고    scopus 로고
    • The shark-search algorithm|an application: Tailored Web site mapping
    • Online at
    • M. Hersovici, M. Jacovi, Y. S. Maarek, D. Pelleg, M. Shtalhaim, and S. Ur. The shark-search algorithm|an application: Tailored Web site mapping. In WWW7, 1998. Online at http://www7.scu.edu.au/programme/fullpapers/1849/com1849. htm.
    • (1998) WWW7
    • Hersovici, M.1    Jacovi, M.2    Maarek, Y.S.3    Pelleg, D.4    Shtalhaim, M.5    Ur, S.6
  • 18
    • 0000169986 scopus 로고    scopus 로고
    • WebWatcher: A tour guide for the web
    • Aug. Online at
    • T. Joachims, D. Freitag, and T. Mitchell. WebWatcher: A tour guide for the web. In IJCAI, Aug. 1997. Online at http://www.cs.cmu.edu/~webwatcher/ ijcai97.ps.
    • (1997) IJCAI
    • Joachims, T.1    Freitag, D.2    Mitchell, T.3
  • 20
    • 0001783522 scopus 로고    scopus 로고
    • Exploring the Web with reconnaissance agents
    • Aug
    • H. Leiberman, C. Fry, and L. Weitzman. Exploring the Web with reconnaissance agents. CACM, 44(8):69-75, Aug. 2001. http://www.acm.org/cacm.
    • (2001) CACM , vol.44 , Issue.8 , pp. 69-75
    • Leiberman, H.1    Fry, C.2    Weitzman, L.3
  • 22
    • 0001673996 scopus 로고    scopus 로고
    • A comparison of event models for naive Bayes text classification
    • AAAI Press, Online at
    • A. McCallum and K. Nigam. A comparison of event models for naive Bayes text classification. In AAAI/ICML-98 Workshop on Learning for Text Categorization, pages 41-48. AAAI Press, 1998. Online at http://www.cs.cmu.edu/ ~knigam/.
    • (1998) AAAI/ICML-98 Workshop on Learning for Text Categorization , pp. 41-48
    • McCallum, A.1    Nigam, K.2
  • 23
    • 0001673996 scopus 로고    scopus 로고
    • A comparison of event models for naive Bayes text classification
    • AAAI Press, Also technical report WS-98-05, CMU; online at
    • A. McCallum and K. Nigam. A comparison of event models for naive Bayes text classification. In AAAI/ICML-98 Workshop on Learning for Text Categorization, pages 41-48. AAAI Press, 1998. Also technical report WS-98-05, CMU; online at http: //www.cs.cmu.edu/~knigam/papers/multinomial-aaaiws98.pdf.
    • (1998) AAAI/ICML-98 Workshop on Learning for Text Categorization , pp. 41-48
    • McCallum, A.1    Nigam, K.2
  • 24
    • 1142294893 scopus 로고    scopus 로고
    • Links tell us about lexical and semantic Web content
    • Aug. Online at
    • F. Menczer. Links tell us about lexical and semantic Web content. Technical Report Computer Science Abstract CS.IR/0108004, arXiv.org, Aug. 2001. Online at http://arxiv.org/abs/cs.IR/0108004.
    • (2001) Technical Report Computer Science Abstract CS.IR/0108004
    • Menczer, F.1
  • 25
    • 0033904351 scopus 로고    scopus 로고
    • Adaptive retrieval agents: Internalizing local context and scaling up to the Web
    • Longer version available as Technical Report CS98-579, University of California, San Diego
    • F. Menczer and R. K. Belew. Adaptive retrieval agents: Internalizing local context and scaling up to the Web. Machine Learning, 39(2/3):203-242, 2000. Longer version available as Technical Report CS98-579, http://dollar.biz.uiowa.edu/~fil/Papers/MLJ.ps, University of California, San Diego.
    • (2000) Machine Learning , vol.39 , Issue.2-3 , pp. 203-242
    • Menczer, F.1    Belew, R.K.2
  • 28
    • 77953070455 scopus 로고    scopus 로고
    • Mining theWeb
    • Sept. Invited talk
    • T. Mitchell. Mining theWeb. In SIGIR 2001, Sept. 2001. Invited talk.
    • (2001) SIGIR 2001
    • Mitchell, T.1
  • 29
    • 0033707265 scopus 로고    scopus 로고
    • WTMS: A system for collecting and analyzing topic-specific Web information
    • Online at
    • S. Mukherjea. WTMS: a system for collecting and analyzing topic-specific Web information. WWW9/Computer Networks, 33(1-6):457-471, 2000. Online at http://www9.org/w9cdrom/293/293.html.
    • (2000) WWW9/Computer Networks , vol.33 , Issue.1-6 , pp. 457-471
    • Mukherjea, S.1
  • 30
    • 0034497785 scopus 로고    scopus 로고
    • Stochastic models for the Web graph
    • IEEE, nov Online at
    • S. RaviKumar, P. Raghavan, S. Rajagopalan, D. Sivakumar, A. Tomkins, and E. Upfal. Stochastic models for the Web graph. In FOCS, volume 41, pages 57-65. IEEE, nov 2000. Online at http://www.cs.brown.edu/people/eli/papers/focs00.ps.
    • (2000) FOCS , vol.41 , pp. 57-65
    • RaviKumar, S.1    Raghavan, P.2    Rajagopalan, S.3    Sivakumar, D.4    Tomkins, A.5    Upfal, E.6
  • 31
    • 0000133751 scopus 로고    scopus 로고
    • Using reinforcement learning to spider the web efficiently
    • Online at
    • J. Rennie and A. McCallum. Using reinforcement learning to spider the web efficiently. In ICML, 1999. Online at http:// www.cs.cmu.edu/~mccallum/papers/ rlspider-icml99s.ps.gz.
    • (1999) ICML
    • Rennie, J.1    McCallum, A.2
  • 33
    • 31344431872 scopus 로고    scopus 로고
    • Focused crawling using TFIDF centroid
    • (CS610) class project, Apr. Details available from manyam@cs.utexas.edu
    • M. Subramanyam, G. V. R. Phanindra, M. Tiwari, and M. Jain. Focused crawling using TFIDF centroid. Hypertext Retrieval and Mining (CS610) class project, Apr. 2001. Details available from manyam@cs.utexas.edu.
    • (2001) Hypertext Retrieval and Mining
    • Subramanyam, M.1    Phanindra, G.V.R.2    Tiwari, M.3    Jain, M.4
  • 34
    • 84948955018 scopus 로고    scopus 로고
    • Regression by classification
    • D. Borgesand C. Kaestner, editors, Brasilian AI Symposium, Curitiba, Brazil, Springer-Verlag. Online at
    • L. Torgo and J. Gama. Regression by classification. In D. Borgesand C. Kaestner, editors, Brasilian AI Symposium, volume 1159 of Lecture Notes in Artificial Intelligence, Curitiba, Brazil, 1996. Springer-Verlag. Online at http://www.ncc.up.pt/~ltorgo/Papers/list-pub.html.
    • (1996) Lecture Notes in Artificial Intelligence , vol.1159
    • Torgo, L.1    Gama, J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.