-
1
-
-
0032677683
-
An efficient, probabilistically sound algorithm for segmentation and word discovery
-
February
-
M. R. Brent. An efficient, probabilistically sound algorithm for segmentation and word discovery. Machine Learning, 34:71-105, February 1999.
-
(1999)
Machine Learning
, vol.34
, pp. 71-105
-
-
Brent, M.R.1
-
2
-
-
85119974890
-
Character language models for Chinese word segmentation and named entity recognition
-
Sydney, Australia, July Association for Computational Linguistics
-
B. Carpenter. Character language models for Chinese word segmentation and named entity recognition. In Proceedings of the Fifth SIGH AN Workshop on Chinese Language Processing, pages 169-172. Sydney, Australia, July 2006. Association for Computational Linguistics.
-
(2006)
Proceedings of the Fifth SIGH an Workshop on Chinese Language Processing
, pp. 169-172
-
-
Carpenter, B.1
-
3
-
-
0030717280
-
PAT-tree-based keyword extraction for Chinese information retrieval
-
L.-F. Chien. PAT-trec-based keyword extraction for Chinese information retrieval. In Proceedings of the 20th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 50-58, Philadelphia, 1997. (Pubitemid 127720304)
-
(1997)
SIGIR Forum (ACM Special Interest Group on Information Retrieval)
, vol.31
, Issue.1 SPEC. ISS.
, pp. 50-58
-
-
Chien, L.-F.1
-
5
-
-
84958571083
-
Discovering Chinese words from unsegmented text
-
Berkeley, CA, USA, August 15-19 ACM
-
X. Ge, W. Pratt, and P. Smyth. Discovering Chinese words from unsegmented text. In SIGIR '99: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 271-272, Berkeley, CA, USA, August 15-19, 1999. ACM.
-
(1999)
SIGIR '99: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
, pp. 271-272
-
-
Ge, X.1
Pratt, W.2
Smyth, P.3
-
6
-
-
78650869754
-
Contextual dependencies in unsupervised word segmentation
-
Sidney, Australia
-
S. Goldwater, T. L. Griffiths, and M. Johnson. Contextual dependencies in unsupervised word segmentation. In COLING-ACL 2006, pages 673-670, Sidney, Australia, 2006.
-
(2006)
COLING-ACL 2006
, pp. 673-670
-
-
Goldwater, S.1
Griffiths, T.L.2
Johnson, M.3
-
9
-
-
85090807987
-
Unsupervised learning of word boundary with description length gain
-
M. Osborne and E. T. K. Sang, editors Bergen, Norway
-
C. Kit and Y. Wilks. Unsupervised learning of word boundary with description length gain. In M. Osborne and E. T. K. Sang, editors, CoNLL-99, pages 1-6, Bergen, Norway, 1999.
-
(1999)
CoNLL-99
, pp. 1-6
-
-
Kit, C.1
Wilks, Y.2
-
10
-
-
0142192295
-
Conditional random fields: Probabilistic models for segmenting and labeling sequence data
-
San Francisco, CA, USA Morgan Kaufmann Publishers Inc
-
J. D. Lafferty, A. McCallum, and F. C. N. Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In IGML'Ol: Proceedings of the 18th International Conference on Machine Learning, pages 282-289, San Francisco, CA, USA, 2001. Morgan Kaufmann Publishers Inc.
-
(2001)
IGML'Ol: Proceedings of the 18th International Conference on Machine Learning
, pp. 282-289
-
-
Lafferty, J.D.1
McCallum, A.2
Pereira, F.C.N.3
-
11
-
-
85119995698
-
The third international Chinese language processing bakeoff: Word segmentation and named entity recognition
-
Sidney, Australia
-
G.-A. Levow. The third international Chinese language processing bakeoff: Word segmentation and named entity recognition. In Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pages 108-117, Sidney, Australia, 2006.
-
(2006)
Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing
, pp. 108-117
-
-
Levow, G.-A.1
-
12
-
-
85097824915
-
A maximum entropy approach to Chinese word segmentation
-
Jeju Island, Korea
-
J. K. Low, H. T. Ng, and W. Guo. A maximum entropy approach to Chinese word segmentation. In Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing, pages 161-164, Jeju Island, Korea, 2005.
-
(2005)
Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing
, pp. 161-164
-
-
Low, J.K.1
Ng, H.T.2
Guo, W.3
-
13
-
-
85116342676
-
Chinese segmentation and new word detection using conditional random fields
-
Geneva, Switzerland
-
F. Peng, F. Feng, and A. McCallum. Chinese segmentation and new word detection using conditional random fields. In COLING 2004, pages 562-568, Geneva, Switzerland, 2004.
-
(2004)
COLING 2004
, pp. 562-568
-
-
Peng, F.1
Feng, F.2
McCallum, A.3
-
14
-
-
84958533967
-
Self-Supervised Chinese Word Segmentation
-
Advances in Intelligent Data Analysis
-
F. Peng and D. Schuurmans. Self-supervised Chinese word segmentation. In The Forth International Symposium on Intelligent Data Analysis, pages 238-247, Lisbon, Portugal, September 2001. (Pubitemid 33348503)
-
(2001)
Lecture Notes in Computer Science
, Issue.2189
, pp. 238-247
-
-
Peng, F.1
Schuurmans, D.2
-
15
-
-
0018015137
-
Modelling by shortest data description
-
J. Rissanen. Modelling by shortest data description. Automatica, 14:465-471, 1978.
-
(1978)
Automatica
, vol.14
, pp. 465-471
-
-
Rissanen, J.1
-
17
-
-
33745440902
-
A systematic cross-comparison of sequence classifiers
-
Bethesda, Maryland
-
B. Rosenfeld, R. Feldman, and M. Fresko. A systematic cross-comparison of sequence classifiers. In Proceedings of the Sixth SIAM International Conference on Data Mining (SDM06), pages 563-567, Bethesda, Maryland, 2006.
-
(2006)
Proceedings of the Sixth SIAM International Conference on Data Mining (SDM06)
, pp. 563-567
-
-
Rosenfeld, B.1
Feldman, R.2
Fresko, M.3
-
18
-
-
84856043672
-
A mathematical theory of communication
-
623-656 July, October
-
C. E. Shannon. A mathematical theory of communication. The Bell System Technical Journal, 27:379-423, 623-656, July, October 1948.
-
(1948)
The Bell System Technical Journal
, vol.27
, pp. 379-423
-
-
Shannon, C.E.1
-
21
-
-
84872841506
-
Chinese word segmentation without using lexicon and hand-crafted training data
-
Montreal, Quebec, Canada
-
M. Sun, D. Shen, and B. K. Tsou. Chinese word segmentation without using lexicon and hand-crafted training data. In COLING-ACL'98, volume 2, pages 1265-1271, Montreal, Quebec, Canada, 1998.
-
(1998)
COLING-ACL'98
, vol.2
, pp. 1265-1271
-
-
Sun, M.1
Shen, D.2
Tsou, B.K.3
-
22
-
-
85120006548
-
On closed task of Chinese word segmentation: An improved CRF model coupled with character clustering and automatically generated template matching
-
Sidney, Australia
-
R. T.-H. Tsai, H.-C. Hung, C.-L. Sung, H.-J. Dai, and W.-L. Hsu. On closed task of Chinese word segmentation: An improved CRF model coupled with character clustering and automatically generated template matching. In Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pages 108-117, Sidney, Australia, 2006.
-
(2006)
Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing
, pp. 108-117
-
-
Tsai, R.T.-H.1
Hung, H.-C.2
Sung, C.-L.3
Dai, H.-J.4
Hsu, W.-L.5
-
23
-
-
85093043295
-
A conditional random field word seg-menter for SIGHAN bakeoff 2005
-
Jeju Island, Korea
-
H. Tseng, P. Chang, G. Andrew, D. Jurafsky, and C. Manning. A conditional random field word seg-menter for SIGHAN bakeoff 2005. In Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing, pages 168-171, Jeju Island, Korea, 2005.
-
(2005)
Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing
, pp. 168-171
-
-
Tseng, H.1
Chang, P.2
Andrew, G.3
Jurafsky, D.4
Manning, C.5
-
24
-
-
85119979405
-
Chinese word segmentation with maximum entropy and n-gram language model
-
Sidney, Australia
-
X. Wang, X. Lin, D. Yu, H. Tian, and X. Wu. Chinese word segmentation with maximum entropy and n-gram language model. In Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pages 138-141, Sidney, Australia, 2006.
-
(2006)
Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing
, pp. 138-141
-
-
Wang, X.1
Lin, X.2
Yu, D.3
Tian, H.4
Wu, X.5
-
26
-
-
0038632285
-
Using suffix arrays to compute term frequency and document frequency for all substrings in a corpus
-
M. Yamamoto and K. W. Church. Using suffix arrays to compute term frequency and document frequency for all substrings in a corpus. Computational Linguistics, 27(1):1-30, 2001.
-
(2001)
Computational Linguistics
, vol.27
, Issue.1
, pp. 1-30
-
-
Yamamoto, M.1
Church, K.W.2
-
27
-
-
2142726570
-
Extraction of Chinese compound words - An experimental study on a very large corpus
-
Hong Kong, China
-
J. Zhang, J. Gao, and M. Zhou. Extraction of Chinese compound words - an experimental study on a very large corpus. In Proceedings of the Second Chinese Language Processing Workshop, pages 132-139, Hong Kong, China, 2000.
-
(2000)
Proceedings of the Second Chinese Language Processing Workshop
, pp. 132-139
-
-
Zhang, J.1
Gao, J.2
Zhou, M.3
-
28
-
-
77953769061
-
Chinese word segmentation and named entity recognition based on a context-dependent mutual information independence model
-
Sidney, Australia
-
M. Zhang, G.-D. Zhou, L.-P. Yang, and D.-H. Ji. Chinese word segmentation and named entity recognition based on a context-dependent mutual information independence model. In Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pages 154-157, Sidney, Australia, 2006.
-
(2006)
Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing
, pp. 154-157
-
-
Zhang, M.1
Zhou, G.-D.2
Yang, L.-P.3
Ji, D.-H.4
-
29
-
-
85097831731
-
An improved Chinese word segmentation system with conditional random field
-
Sidney, Australia
-
H. Zhao, C.-N. Huang, and M. Li. An improved Chinese word segmentation system with conditional random field. In Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pages 162-165, Sidney, Australia, 2006.
-
(2006)
Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing
, pp. 162-165
-
-
Zhao, H.1
Huang, C.-N.2
Li, M.3
-
30
-
-
38049001432
-
Effective tag set selection in Chinese word segmentation via conditional random field modeling
-
Wuhan, China
-
H. Zhao, C.-N. Huang, M. Li, and B.-L. Lu. Effective tag set selection in Chinese word segmentation via conditional random field modeling. In Proceedings of PACLIC-20, pages 87-94, Wuhan, China, 2006.
-
(2006)
Proceedings of PACLIC-20
, pp. 87-94
-
-
Zhao, H.1
Huang, C.-N.2
Li, M.3
Lu, B.-L.4
-
31
-
-
77953784160
-
Designing special post-processing rules for SVM-based Chinese word segmentation
-
Sidney, Australia
-
M.-H. Zhu, Y.-L. Wang, Z.-X. Wang, H.-Z. Wang, and J.-B. Zhu. Designing special post-processing rules for SVM-based Chinese word segmentation. In Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pages 217-220, Sidney, Australia, 2006.
-
(2006)
Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing
, pp. 217-220
-
-
Zhu, M.-H.1
Wang, Y.-L.2
Wang, Z.-X.3
Wang, H.-Z.4
Zhu, J.-B.5
|