Volume , Issue , 2010, Pages 381-390

Learning URL patterns for webpage de-duplication

Author keywords

Decision trees; Generalization; MapReduce; Page importance; Search engines; Site specific delimiters; Webpage de-duplication

Indexed keywords

BUILDING BLOCKS; DELIMITERS; MACHINE LEARNING TECHNIQUES; MINE RULES; RULE EXTRACTION; SET OF RULES; SITE-SPECIFIC; TRANSFORMATION RULES; WEB SEARCHES; WEB-PAGE

EID: 77950949494     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1718487.1718535     Document Type: Conference Paper
Times cited: 41

References (19)
  • 1
    • 77950927610
    • Hadoop: Open source implementation of MapReduce. http://lucene.apache.org/hadoop/.
  • 19
    • 33744584654
    • Induction of decision trees
    • J. R. Quinlan. Induction of decision trees. Mach. Learn., 1(1):81-106, March 1986.
    • (1986) Mach. Learn. , vol.1 , Issue.1 , pp. 81-106
    • Quinlan, J.R.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS DB.