|
Volumn 38, Issue 4, 2002, Pages 509-527
|
Integrated multi-strategic Web document pre-processing for sentence and word boundary detection
|
Author keywords
Sentence boundary disambiguation; Spacing word correction; Text normalization; Word boundary disambiguation
|
Indexed keywords
DECISION THEORY;
HEURISTIC METHODS;
HTML;
HYPERTEXT SYSTEMS;
LEARNING SYSTEMS;
STATISTICAL METHODS;
INDUCTIVE LEARNING;
TEXT NORMALIZATION;
WORLD WIDE WEB;
|
EID: 0036643016
PISSN: 03064573
EISSN: None
Source Type: Journal
DOI: 10.1016/S0306-4573(01)00044-9 Document Type: Article |
Times cited : (4)
|
References (14)
|