2008, Pages 447-456

iRobot: An intelligent crawler for web forums

Author keywords

Forum crawler; Sitemap construction; Traversal path selection

Indexed keywords

DATA MINING; DECISION SUPPORT SYSTEMS; INFORMATION MANAGEMENT; INTERNET; KNOWLEDGE BASED SYSTEMS; SEARCH ENGINES

EID: 57349112859     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1367497.1367558     Document Type: Conference Paper
Times cited: 76

References (28)
  • 1
    • 57349114760
    • Internet Forum Software
    • Internet Forum Software. http://en.wikipedia.org/wiki/category:internet-forum-software.
  • 6
    • 35348921241
    • Do not crawl in the DUST: Different URLs with similar text
    • WWW, pages 111-120, Banff, Alberta, Canada, May 2007.
    • (2007) WWW, pp. 111-120
    • Bar-Yossef, Z.; Keidar, I.; Schonfeld, U.
  • 8
    • 0038589165
    • The anatomy of a large-scale hypertextual Web search engine
    • S. Brin and L. Page. The anatomy of a large-scale hypertextual Web search engine. Computer Networks, 30(1-7):107-117, 1998.
    • (1998) Computer Networks, vol. 30, issue 1-7, pp. 107-117
    • Brin, S.; Page, L.
  • 11
    • 0033294474
    • Focused crawling: A new approach to topic-specific Web resource discovery
    • S. Chakrabarti, M. van den Berg, and B. Dom. Focused crawling: A new approach to topic-specific Web resource discovery. Computer Networks, 31(11-16):1623-1640, 1999.
    • (1999) Computer Networks, vol. 31, issue 11-16, pp. 1623-1640
    • Chakrabarti, S.; van den Berg, M.; Dom, B.
  • 17
    • 33750296887
    • Finding near-duplicate Web pages: A large-scale evaluation of algorithms
    • SIGIR, pages 284-291, Seattle, Washington, USA, Aug. 2006.
    • (2006) SIGIR, pp. 284-291
    • Henzinger, M.
  • 18
    • 1842832183
    • Automatic generation of agents for collecting hidden Web pages for data extraction
    • J. P. Lage, A. S. da Silva, P. B. Golgher, and A. H. F. Laender. Automatic generation of agents for collecting hidden Web pages for data extraction. Data & Knowledge Engineering, 49(2):177-196, May 2004.
    • (2004) Data & Knowledge Engineering, vol. 49, issue 2, pp. 177-196
    • Lage, J.P.; da Silva, A.S.; Golgher, P.B.; Laender, A.H.F.
  • 19
    • 35348911985
    • Detecting near-duplicates for Web crawling
    • WWW, pages 141-150, Banff, Alberta, Canada, May 2007.
    • (2007) WWW, pp. 141-150
    • Manku, G.S.; Jain, A.; Sarma, A.D.
  • 20
    • 33745753308
    • User-centric Web crawling
    • WWW, pages 401-411, Chiba, Japan, May 2005.
    • (2005) WWW, pp. 401-411
    • Pandey, S.; Olston, C.
  • 24
    • 26444532019
    • Learning important models for Web page blocks based on layout and content analysis
    • R. Song, H. Liu, J.-R. Wen, and W.-Y. Ma. Learning important models for Web page blocks based on layout and content analysis. ACM SIGKDD Explorations Newsletter, 6(2):14-23, Dec. 2004.
    • (2004) ACM SIGKDD Explorations Newsletter, vol. 6, issue 2, pp. 14-23
    • Song, R.; Liu, H.; Wen, J.-R.; Ma, W.-Y.
  • 25
    • 33750333928
    • SIGIR, pages 292-299, Seattle, USA, Aug. 2006.
  • 26
    • 33750797710
    • Structured data extraction from the Web based on partial tree alignment
    • Y. Zhai and B. Liu. Structured data extraction from the Web based on partial tree alignment. IEEE Trans. Knowl. Data Eng., 18(12):1614-1628, Dec. 2006.
    • (2006) IEEE Trans. Knowl. Data Eng., vol. 18, issue 12, pp. 1614-1628
    • Zhai, Y.; Liu, B.
  • 27
    • 35348926088
    • Expertise networks in online communities: Structure and algorithms
    • WWW, pages 221-230, Banff, Canada, May 2007.
    • (2007) WWW, pp. 221-230
    • Zhang, J.; Ackerman, M.S.; Adamic, L.
  • 28
    • 36849062139
    • Joint optimization of wrapper generation and template detection
    • KDD, pages 894-902, San Jose, CA, USA, Aug. 2007.
    • (2007) KDD, pp. 894-902
    • Zheng, S.; Song, R.; Wen, J.-R.; Wu, D.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.