|
Volumn , Issue , 2008, Pages 3489-3493
|
A lightweight and efficient tool for cleaning web pages
|
Author keywords
[No Author keywords available]
|
Indexed keywords
OPEN SOURCE SOFTWARE;
OPEN SYSTEMS;
SOFTWARE ENGINEERING;
MINOR LOSS;
N-GRAM LANGUAGE MODELS;
PLAIN TEXT;
STATE OF THE ART;
TRAINING DATA;
WEB CORPORA;
WEBSITES;
|
EID: 84869471345
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper |
Times cited : (25)
|
References (5)
|