|
Volumn 1, Issue 3, 2002, Pages 269-278
|
A Language and Character Set Determination Method Based on N-gram Statistics
|
Author keywords
Algorithms; character set; corpus based analysis; Languages; local language site; N gram; natural languages; Text categorization; Unicode
|
Indexed keywords
|
EID: 80155181779
PISSN: 15300226
EISSN: 15583430
Source Type: Journal
DOI: 10.1145/772755.772759 Document Type: Article |
Times cited : (33)
|
References (6)
|