SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

ACM Transactions on Asian Language Information Processing

Volumn 1, Issue 3, 2002, Pages 269-278

A Language and Character Set Determination Method Based on N-gram Statistics

(4) Suzuki, Izumi a Mikami, Yoshiki a Ohsato, Ario a Chubachi, Yoshihide b

a NAGAOKA UNIVERSITY OF TECHNOLOGY (Japan)

b Numeric Co Ltd (Japan)

Author keywords

Algorithms; character set; corpus based analysis; Languages; local language site; N gram; natural languages; Text categorization; Unicode

Indexed keywords

EID: 80155181779 PISSN: 15300226 EISSN: 15583430 Source Type: Journal
DOI: 10.1145/772755.772759 Document Type: Article

Times cited : (33)

References (6)

1
- 0002636321
- N-gram based text categorization
- (Las Vegas, NV,
- Cavnar, W. and Trenkle, J. 1994. N-gram based text categorization. In Proceedings of the 3rd Annual Symposium on Document Analysis and Information Retrieval (Las Vegas, NV, 1994), 161-175
- (1994) Proceedings of the 3rd Annual Symposium on Document Analysis and Information Retrieval , pp. 161-175
- Cavnar, W.¹ Trenkle, J.²

2
- 0033894701
- Text categorization using compression models
- (Snowbird, UT, March 2000). IEEE Computer Society
- Frank, E., Chui, C. and Witten, I. H. 2000. Text categorization using compression models. In Proceedings of the IEEE Data Compression Conference (Snowbird, UT, March 2000). IEEE Computer Society, 276-288
- (2000) Proceedings of the IEEE Data Compression Conference , pp. 276-288
- Frank, E.¹ Chui, C.² Witten, I.H.³

3
- 0003612818
- Foundations of Statistical Natural Language Processing
- The MIT Press
- Manning, C. D. and Schulze, H. 1999. Foundations of Statistical Natural Language Processing. The MIT Press
- (1999)
- Manning, C.D.¹ Schulze, H.²

4
- 85025379084
- Natural language determination using correlation between common words
- U.S. Patent No. 6,023,670
- Martino, M. J. et al. 2000. Natural language determination using correlation between common words. U.S. Patent No. 6,023,670
- Martino, M.J.¹

5
- 84874005368
- Natural language determination using partial words
- U.S. Patent No. 6
- Martino, M. J. et al. 2001. Natural language determination using partial words. U.S. Patent No. 6, 216, 102
- Martino, M.J.¹

6
- 85025377870
- A language and character set recognition method based on the immune system
- (Korea, Aug. 2001)
- Suzuki, I., Ohsato, A., and Mikami, Y. 2001. A language and character set recognition method based on the immune system. In Proceedings of the 2nd International Symposium on Advanced Intelligent Systems (Korea, Aug. 2001), 292-296
- (2001) Proceedings of the 2nd International Symposium on Advanced Intelligent Systems , pp. 292-296
- Suzuki, I.¹ Ohsato, A.² Mikami, Y.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.