메뉴 건너뛰기




Volumn 59, Issue 2, 2006, Pages 270-291

Using HMM to learn user browsing patterns for focused Web crawling

Author keywords

Focused crawling; Hidden Markov models; Pattern learning; Relevance modelling; User modelling; Web Graph; Web searching; World Wide Web

Indexed keywords

COMPUTER AIDED SOFTWARE ENGINEERING; DATA ACQUISITION; MATHEMATICAL MODELS; ONLINE SYSTEMS; PORTALS; WEB BROWSERS;

EID: 33748132951     PISSN: 0169023X     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.datak.2006.01.012     Document Type: Article
Times cited : (50)

References (29)
  • 1
    • 33748157553 scopus 로고    scopus 로고
    • P. Lyman, H. Varian, J. Dunn, A. Strygin, K. Swearingen, How much information? 2003. Available from: . Link checked on March 10, 2006.
  • 2
    • 33748204717 scopus 로고    scopus 로고
    • R. Zakon, Hobbes' internet timeline v7.0. Available from: . Link checked on March 10, 2006.
  • 3
    • 33748182037 scopus 로고    scopus 로고
    • Search engine sizes. Available from: . Link checked on March 10, 2006.
  • 4
    • 33748172967 scopus 로고    scopus 로고
    • Site position and coverage. Available from: . Link checked on March 10, 2006.
  • 5
    • 33748142139 scopus 로고    scopus 로고
    • S. Chakrabarti, M. van den Berg, B. Dom, Focused crawling: a new approach to topic-specific Web resource discovery, in: Proceedings of the 8th International WWW Conference, Toronto, Canada, 1999.
  • 6
    • 33748189874 scopus 로고    scopus 로고
    • D. Bergmark, C. Lagoze, A. Sbityakov, Focused crawls, tunneling, and digital libraries, in: Proceedings of the 6th European Conference on Digital Libraries, Rome, Italy, 2002.
  • 7
    • 33748162744 scopus 로고    scopus 로고
    • P.D. Bra, R. Post, Information retrieval in the World Wide Web: making client-base searching feasible, in: Proceedings of the 1st International WWW Conference, Geneva, Switzerland, 1994.
  • 8
    • 33748138286 scopus 로고    scopus 로고
    • M. Hersovici, M. Jacovi, Y. Maarek, D. Pelleg, M. Shtalhaim, S. Ur, The Shark-search algorithm-an application: tailored Web site mapping, in: Proceedings of the 7th International WWW Conference, Brisbane, Australia, 1998.
  • 9
    • 84874371227 scopus 로고    scopus 로고
    • C. Aggarwal, F. Al-Garawi, P. Yu, Intelligent crawling on the World Wide Web with arbitrary predicates, in: Proceedings of the 10th International WWW Conference, Hong Kong, 2001.
  • 10
    • 33748174705 scopus 로고    scopus 로고
    • S. Chakrabarti, K. Punera, M. Subramanyam, Accelerated focused crawling through online relevance feedback, in: Proceedings of the 11th International WWW Conference, Hawaii, USA, 1999.
  • 11
    • 33748130953 scopus 로고    scopus 로고
    • J. Cho, H. Garcia-Molina, L. Page, Efficient crawling through URL ordering, in: Proceedings of the 7th World Wide Web Conference, Brisbane, Australia, 1998.
  • 12
    • 33748206438 scopus 로고    scopus 로고
    • K. Stamatakis, V. Karkaletsis, G. Paliouras, J. Horlock, et al., Domain-specific Web site identification: the CROSSMARC focused Web crawler, in: Proceedings of the 2nd International Workshop on Web Document Analysis (WDA2003), Edinburgh, UK, 2003.
  • 13
    • 33748178502 scopus 로고    scopus 로고
    • J. Rennie, A. McCallum, Using reinforcement learning to spider the Web efficiently, in: Proceedings of the 16th International Conference on Machine Learning (ICML-99), Bled, Slovenia, 1999.
  • 14
    • 1942484949 scopus 로고    scopus 로고
    • J. Johnson, K. Tsioutsiouliklis, C.L. Giles, Evolving strategies for focused Web crawling, in: Proceedings of the 20th International Conference on Machine Learning (ICML-2003), Washington, DC, USA, 2003.
  • 15
    • 70350672544 scopus 로고    scopus 로고
    • M. Diligenti, F. Coetzee, S. Lawrence, C. Giles, M. Gori, Focused crawling using context graphs, in: Proceedings of the 26th International Conference on Very Large Databases (VLDB 2000), Cairo, Egypt, 2000.
  • 16
    • 0034794539 scopus 로고    scopus 로고
    • F. Menczer, G. Pant, P. Srinivasan, M. Ruiz, Evaluating topic-driven Web crawlers, in: Proceedings of the 24th Annual International ACM/SIGIR Conference, New Orleans, USA, 2001.
  • 17
    • 9744257884 scopus 로고    scopus 로고
    • Topical Web crawlers: evaluating adaptive algorithms
    • Menczer F., Pant G., and Srinivasan P. Topical Web crawlers: evaluating adaptive algorithms. ACM TOIT 4 4 (2004) 378-419
    • (2004) ACM TOIT , vol.4 , Issue.4 , pp. 378-419
    • Menczer, F.1    Pant, G.2    Srinivasan, P.3
  • 18
    • 17444365825 scopus 로고    scopus 로고
    • A general evaluation framework for topical crawlers
    • Srinivasan P., Menczer F., and Pant G. A general evaluation framework for topical crawlers. Information Retrieval 8 3 (2005) 417-447
    • (2005) Information Retrieval , vol.8 , Issue.3 , pp. 417-447
    • Srinivasan, P.1    Menczer, F.2    Pant, G.3
  • 19
    • 4944227235 scopus 로고    scopus 로고
    • G. Pant, K. Tsioutsiouliklis, J. Johnson, C. Giles, Panorama: extending digital libraries with topical crawlers, in: Proceedings of ACM/IEEE Joint Conference on Digital Libraries (JCDL 2004), Tucson, Arizona, June 2004, pp. 142-150.
  • 20
    • 33748136124 scopus 로고    scopus 로고
    • Algorithmic Solutions. Available from: . Link checked on March 10, 2006.
  • 21
    • 33748147026 scopus 로고    scopus 로고
    • M.W. Berry, LSI: Latent Semantic Indexing Web Site. Available from: . Link checked on March 10, 2006.
  • 24
    • 33748154973 scopus 로고    scopus 로고
    • M.W. Berry et al., SVDPACKC: Version 1.0 User's Guide, Technical Report CS-93-194, University of Tennessee, Knoxville, TN, October 1993.
  • 26
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov model and selected applications in speech recognition
    • Rabiner L.R. A tutorial on hidden Markov model and selected applications in speech recognition. Proceedings of the IEEE 77 2 (1989) 257-285
    • (1989) Proceedings of the IEEE , vol.77 , Issue.2 , pp. 257-285
    • Rabiner, L.R.1
  • 27
    • 33748199239 scopus 로고    scopus 로고
    • Google. Available from: . Link checked on March 10, 2006.
  • 29
    • 1542287488 scopus 로고    scopus 로고
    • D. Pinto, A. McCallum, X. Wei, W.B. Croft, Table extraction using conditional random fields, in: Proceedings of the 26th Annual International ACM SIGIR Conference, Toronto, Canada, 2003.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.