|
Volume 6273 LNCS, 2010, Pages 413-416
|
SciPlore Xtract: Extracting titles from scientific PDF documents by analyzing style information (Font Size)
|
Author keywords
document analysis; header extraction; style information; title extraction
|
Indexed keywords
ACADEMIC SEARCH ENGINES;
CITESEER;
CONDITIONAL RANDOM FIELD;
DOCUMENT ANALYSIS;
FONT SIZE;
HEADER EXTRACTION;
MACHINE LEARNING ALGORITHMS;
PDF DOCUMENT;
SIMPLE RULES;
STYLE INFORMATION;
TITLE EXTRACTION;
DIGITAL LIBRARIES;
GEARS;
INFORMATION RETRIEVAL;
PROBABILITY DISTRIBUTIONS;
SEARCH ENGINES;
SUPPORT VECTOR MACHINES;
LEARNING ALGORITHMS;
|
EID: 78049405786
PISSN: 03029743
EISSN: 16113349
Source Type: Book Series
DOI: 10.1007/978-3-642-15464-5_45
Document Type: Conference Paper
|
Times cited: 19
|
References (3)
|