-
1
-
-
26444473118
-
Mostly-unsupervised statistical segmentation of Japanese: Applications to Kanji
-
Seattle, Washington
-
R.K. Ando, L. Lee, Mostly-unsupervised statistical segmentation of Japanese: Applications to Kanji, in: Proceedings of the First Meeting of the North American Chapter of the Association for Computational Linguistics (NAACL 2000), Seattle, Washington, 2000, pp. 241-248.
-
(2000)
Proceedings of the First Meeting of the North American Chapter of the Association for Computational Linguistics (NAACL 2000)
, pp. 241-248
-
-
Ando, R.K.1
Lee, L.2
-
6
-
-
2142787936
-
Accessor variety criteria for Chinese word extraction
-
H. Feng, K. Chen, X. Deng, and W. Zheng Accessor variety criteria for Chinese word extraction Computational Linguistics 30 1 2004 75 93
-
(2004)
Computational Linguistics
, vol.30
, Issue.1
, pp. 75-93
-
-
Feng, H.1
Chen, K.2
Deng, X.3
Zheng, W.4
-
7
-
-
26444614686
-
Unsupervised segmentation of Chinese corpus using accessor variety
-
Natural Language Processing - IJCNLP 2004
-
H. Feng, K. Chen, C. Kit, and X. Deng Unsupervised segmentation of Chinese corpus using accessor variety K.-Y. Su, J. Tsujii, J.H. Lee, O.Y. Kwong, Natural Language Processing - IJCNLP 2004 LNAI vol. 3248 2005 Springer 694 703
-
(2005)
LNAI
, vol.3248
, pp. 694-703
-
-
Feng, H.1
Chen, K.2
Kit, C.3
Deng, X.4
-
8
-
-
39149127887
-
Unsupervised Chinese word segmentation and unknown word identification
-
Closing the Millennium, Beijing, China
-
G.-H. Fu, X.-L. Wang, Unsupervised Chinese word segmentation and unknown word identification, in: The Fifth Natural Language Processing Pacific Rim Symposium 1999 (NLPRS'99), Closing the Millennium, Beijing, China, 1999, pp. 32-37.
-
(1999)
The Fifth Natural Language Processing Pacific Rim Symposium 1999 (NLPRS'99)
, pp. 32-37
-
-
Fu, G.-H.1
Wang, X.-L.2
-
9
-
-
39749156980
-
Chinese word segmentation as morpheme-based lexical chunking
-
G.-H. Fu, C. Kit, and J.J. Webster Chinese word segmentation as morpheme-based lexical chunking Information Sciences 178 9 2008 2282 2296
-
(2008)
Information Sciences
, vol.178
, Issue.9
, pp. 2282-2296
-
-
Fu, G.-H.1
Kit, C.2
Webster, J.J.3
-
13
-
-
0001074490
-
From phoneme to morpheme
-
Z.S. Harris From phoneme to morpheme Language 31 2 1955 90 222
-
(1955)
Language
, vol.31
, Issue.2
, pp. 90-222
-
-
Harris, Z.S.1
-
17
-
-
85119983675
-
Maximum entropy word segmentation of Chinese text
-
Sydney, Australia
-
A.J. Jacobs, Y.W. Wong, Maximum entropy word segmentation of Chinese text, in: Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing (SIGHAN-5), Sydney, Australia, 2006, pp. 108-117.
-
(2006)
Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing (SIGHAN-5)
, pp. 108-117
-
-
Jacobs, A.J.1
Wong, Y.W.2
-
18
-
-
84860537772
-
Semi-supervised conditional random fields for improved sequence segmentation and labeling
-
Sydney, Australia
-
F. Jiao, S. Wang, C.-H. Lee, R. Greiner, D. Schuurmans, Semi-supervised conditional random fields for improved sequence segmentation and labeling, in: COLING/ACL-2006, Sydney, Australia, 2006, pp. 209-216.
-
(2006)
COLING/ACL-2006
, pp. 209-216
-
-
Jiao, F.1
Wang, S.2
Lee, C.-H.3
Greiner, R.4
Schuurmans, D.5
-
19
-
-
85092217204
-
Unsupervised segmentation of Chinese text by use of branching entropy
-
Sidney, Australia
-
Z. Jin, K. Tanaka-Ishii, Unsupervised segmentation of Chinese text by use of branching entropy, in: COLING/ACL 2006, Sidney, Australia, 2006, pp. 428-435.
-
(2006)
COLING/ACL 2006
, pp. 428-435
-
-
Jin, Z.1
Tanaka-Ishii, K.2
-
21
-
-
85090807987
-
Unsupervised learning of word boundary with description length gain
-
Osborne, M., Sang, E.T.K. (Eds.) Bergen, Norway
-
C. Kit, Y. Wilks, Unsupervised learning of word boundary with description length gain, in: Osborne, M., Sang, E.T.K. (Eds.), Computational Natural Language Learning (CoNLL-99), Bergen, Norway, 1999, pp. 1-6.
-
(1999)
Computational Natural Language Learning (CoNLL-99)
, pp. 1-6
-
-
Kit, C.1
Wilks, Y.2
-
22
-
-
77958111978
-
Improving Chinese word segmentation with description length gain
-
Las Vegas, Nevada, USA
-
C. Kit, H. Zhao, Improving Chinese word segmentation with description length gain, in: The 2007 International Conference on Artificial Intelligence (ICAI-2007), Las Vegas, Nevada, USA, 2007, pp. 846-851.
-
(2007)
The 2007 International Conference on Artificial Intelligence (ICAI-2007)
, pp. 846-851
-
-
Kit, C.1
Zhao, H.2
-
24
-
-
85119995698
-
The third international Chinese language processing bakeoff: Word segmentation and named entity recognition
-
Sydney, Australia
-
G.-A. Levow, The third international Chinese language processing bakeoff: Word segmentation and named entity recognition, in: Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing (SIGHAN-5), Sydney, Australia, 2006, pp. 108-117.
-
(2006)
Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing (SIGHAN-5)
, pp. 108-117
-
-
Levow, G.-A.1
-
25
-
-
77958098383
-
France Telecom R& D Beijing word segmenter for SIGHAN bakeoff 2006
-
Sydney, Australia
-
W. Liu, H. Li, Y. Dong, N. He, H. Luo, H. Wang, France Telecom R& D Beijing word segmenter for SIGHAN bakeoff 2006, in: Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing (SIGHAN-5), Sydney, Australia, 2006, pp. 108-117.
-
(2006)
Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing (SIGHAN-5)
, pp. 108-117
-
-
Liu, W.1
Li, H.2
Dong, Y.3
He, N.4
Luo, H.5
Wang, H.6
-
26
-
-
85097824915
-
A maximum entropy approach to Chinese word segmentation
-
Jeju Island, Korea
-
J.K. Low, H.T. Ng, W. Guo, A maximum entropy approach to Chinese word segmentation, in: Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing (SIGHAN-5), Jeju Island, Korea, 2005, pp. 161-164.
-
(2005)
Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing (SIGHAN-5)
, pp. 161-164
-
-
Low, J.K.1
Ng, H.T.2
Guo, W.3
-
27
-
-
26444582893
-
Statistical substring reduction in linear time
-
Natural Language Processing - IJCNLP 2004
-
X. Lü, L. Zhang, and J. Hu Statistical substring reduction in linear time K.-Y. Su, J. Tsujii, J.H. Lee, O.Y. Kwong, Natural Language Processing - IJCNLP 2004 LNAI vol. 3248 2005 Springer 320 327
-
(2005)
LNAI
, vol.3248
, pp. 320-327
-
-
Lü, X.1
Zhang, L.2
Hu, J.3
-
30
-
-
0345191309
-
Tokenisation and sentence segmentation
-
D.D. Palmer Tokenisation and sentence segmentation R. Dale, H. Moisl, H. Somers, Handbook of Natural Language Processing 2000 Marcel Dekker New York 11 36
-
(2000)
Handbook of Natural Language Processing
, pp. 11-36
-
-
Palmer, D.D.1
-
31
-
-
85116342676
-
Chinese segmentation and new word detection using conditional random fields
-
Geneva, Switzerland
-
F. Peng, F. Feng, A. McCallum, Chinese segmentation and new word detection using conditional random fields, in: Proceedings of the 20th International Conference on Computational Linguistics (COLING 2004), Geneva, Switzerland, pp. 562-568.
-
Proceedings of the 20th International Conference on Computational Linguistics (COLING 2004)
, pp. 562-568
-
-
Peng, F.1
Feng, F.2
McCallum, A.3
-
32
-
-
0036993134
-
Using self-supervised word segmentation in Chinese information retrieval
-
Tampere, Finland
-
F. Peng, X. Huang, D. Schuurmans, N. Cercone, S. Robertson, Using self-supervised word segmentation in Chinese information retrieval, in: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'01), Tampere, Finland, 2001, pp. 349-350.
-
(2001)
Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'01)
, pp. 349-350
-
-
Peng, F.1
Huang, X.2
Schuurmans, D.3
Cercone, N.4
Robertson, S.5
-
33
-
-
84958533967
-
Self-supervised Chinese word segmentation
-
Lisbon, Portugal
-
F. Peng, D. Schuurmans, Self-supervised Chinese word segmentation, in: The Fourth International Symposium on Intelligent Data Analysis (IDA-2001), Lisbon, Portugal, 2001, pp. 238-247.
-
(2001)
The Fourth International Symposium on Intelligent Data Analysis (IDA-2001)
, pp. 238-247
-
-
Peng, F.1
Schuurmans, D.2
-
34
-
-
0040261510
-
USeg: A retargetable word segmentation procedure for information retrieval
-
Technical Report TR96-2, University of Massachusetts, Amherst, MA
-
J.M. Ponte, W.B. Croft, USeg: A retargetable word segmentation procedure for information retrieval, Presented at the Symposium on Document Analysis and Information Retrieval'96 (SDAIR),Technical Report TR96-2, University of Massachusetts, Amherst, MA, 1996.
-
(1996)
Symposium on Document Analysis and Information Retrieval'96 (SDAIR)
-
-
Ponte, J.M.1
Croft, W.B.2
-
35
-
-
38049043319
-
A systematic cross-comparison of sequence classifiers
-
Bethesda, Maryland
-
B. Rosenfeld, R. Feldman, M. Fresko, A systematic cross-comparison of sequence classifiers, in: SDM 2006, Bethesda, Maryland, pp. 563-567.
-
SDM 2006
, pp. 563-567
-
-
Rosenfeld, B.1
Feldman, R.2
Fresko, M.3
-
36
-
-
84856043672
-
A mathematical theory of communication
-
C.E. Shannon A mathematical theory of communication The Bell System Technical Journal 27 1948 379 423 623-656
-
(1948)
The Bell System Technical Journal
, vol.27
, pp. 379-423
-
-
Shannon, C.E.1
-
37
-
-
6344253989
-
The first international Chinese word segmentation bakeoff
-
Sapporo, Japan
-
R. Sproat, T. Emerson, The first international Chinese word segmentation bakeoff, in: The Second SIGHAN Workshop on Chinese Language Processing (SIGHAN-2), Sapporo, Japan, 2003, pp. 133-143.
-
(2003)
The Second SIGHAN Workshop on Chinese Language Processing (SIGHAN-2)
, pp. 133-143
-
-
Sproat, R.1
Emerson, T.2
-
39
-
-
84872841506
-
Chinese word segmentation without using lexicon and hand-crafted training data
-
Montreal, Quebec, Canada
-
M. Sun, D. Shen, B.K. Tsou, Chinese word segmentation without using lexicon and hand-crafted training data, in: COLING-ACL'98, vol. 2, Montreal, Quebec, Canada, 1998, pp. 1265-1271.
-
(1998)
COLING-ACL'98
, vol.2
, pp. 1265-1271
-
-
Sun, M.1
Shen, D.2
Tsou, B.K.3
-
40
-
-
3142751938
-
Chinese word segmentation without using dictionary based on unsupervised learning strategy
-
M. Sun, M. Xiao, and B.K. Tsou Chinese word segmentation without using dictionary based on unsupervised learning strategy Chinese Journal of Computers 27 6 2004 736 742
-
(2004)
Chinese Journal of Computers
, vol.27
, Issue.6
, pp. 736-742
-
-
Sun, M.1
Xiao, M.2
Tsou, B.K.3
-
41
-
-
85071661854
-
Semi-supervised structured output learning based on a hybrid generative and discriminative approach
-
Prague, Czech
-
J. Suzuki, A. Fujino, H. Isozaki, Semi-supervised structured output learning based on a hybrid generative and discriminative approach, in: Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL'07), Prague, Czech, 2007, pp. 791-800.
-
(2007)
Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL'07)
, pp. 791-800
-
-
Suzuki, J.1
Fujino, A.2
Isozaki, H.3
-
42
-
-
0001277731
-
A compression-based algorithm for Chinese word segmentation
-
W.J. Teahan, Y. Wen, R. McNab, and I.H. Witten A compression-based algorithm for Chinese word segmentation Computational Linguistics 26 3 2000 375 393
-
(2000)
Computational Linguistics
, vol.26
, Issue.3
, pp. 375-393
-
-
Teahan, W.J.1
Wen, Y.2
McNab, R.3
Witten, I.H.4
-
43
-
-
85120006548
-
On closed task of Chinese word segmentation: An improved CRF model coupled with character clustering and automatically generated template matching
-
Sydney, Australia
-
R.T.-H. Tsai, H.-C. Hung, C.-L. Sung, H.-J.Dai, W.-L. Hsu, On closed task of Chinese word segmentation: An improved CRF model coupled with character clustering and automatically generated template matching, in: Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing (SIGHAN-5), Sydney, Australia, 2006, pp. 108-117.
-
(2006)
Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing (SIGHAN-5)
, pp. 108-117
-
-
Tsai, R.T.-H.1
Hung, H.-C.2
Sung, C.-L.3
Dai, H.-J.4
Hsu, W.-L.5
-
45
-
-
85119979405
-
Chinese word segmentation with maximum entropy and N-gram language model
-
Sydney, Australia
-
X. Wang, X. Lin, D. Yu, H. Tian, X. Wu, Chinese word segmentation with maximum entropy and N-gram language model, in: Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing (SIGHAN-5), Sydney, Australia, 2006, pp. 138-141.
-
(2006)
Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing (SIGHAN-5)
, pp. 138-141
-
-
Wang, X.1
Lin, X.2
Yu, D.3
Tian, H.4
Wu, X.5
-
46
-
-
70350717618
-
The character-based CRF segmenter of MSRA & NEU for the 4th Bakeoff
-
Hyderabad, India
-
Z. Wang, C. Huang, J. Zhu, The character-based CRF segmenter of MSRA & NEU for the 4th Bakeoff, in: Proceedings of the Sixth SIGHAN Workshop on Chinese Language Processing (SIGHAN-6), Hyderabad, India, 2008, pp.98-101.
-
(2008)
Proceedings of the Sixth SIGHAN Workshop on Chinese Language Processing (SIGHAN-6)
, pp. 98-101
-
-
Wang, Z.1
Huang, C.2
Zhu, J.3
-
47
-
-
0347109463
-
Tokenization as the initial phase in nlp
-
Nantes, France
-
J.J. Webster, C. Kit, Tokenization as the initial phase in nlp, in: Proceedings of the 14th International Conference on Computational Linguistics (COLING-92), vol. IV, Nantes, France, 1992, pp. 1106-1110.
-
(1992)
Proceedings of the 14th International Conference on Computational Linguistics (COLING-92)
, vol.4
, pp. 1106-1110
-
-
Webster, J.J.1
Kit, C.2
-
48
-
-
55549127511
-
Minimum tag error for discriminative training of conditional random fields
-
Y. Xiong, J. Zhu, H. Huang, and H. Xu Minimum tag error for discriminative training of conditional random fields Information Sciences 179 1-2 2009 169 179
-
(2009)
Information Sciences
, vol.179
, Issue.12
, pp. 169-179
-
-
Xiong, Y.1
Zhu, J.2
Huang, H.3
Xu, H.4
-
50
-
-
2142726570
-
Extraction of Chinese compound words - An experimental study on a very large corpus
-
Hong Kong, China
-
J. Zhang, J. Gao, M. Zhou, Extraction of Chinese compound words - An experimental study on a very large corpus, in: Proceedings of the Second Chinese Language Processing Workshop, Hong Kong, China, 2000, pp. 132-139.
-
(2000)
Proceedings of the Second Chinese Language Processing Workshop
, pp. 132-139
-
-
Zhang, J.1
Gao, J.2
Zhou, M.3
-
51
-
-
77953769061
-
Chinese word segmentation and named entity recognition based on a context-dependent mutual information independence model
-
Sydney, Australia
-
M., Zhang, G.-D. Zhou, L.-P. Yang, D.-H. Ji, Chinese word segmentation and named entity recognition based on a context-dependent mutual information independence model, in: Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing (SIGHAN-5), Sydney, Australia, 2006, pp. 154-157.
-
(2006)
Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing (SIGHAN-5)
, pp. 154-157
-
-
Zhang, M.1
Zhou, G.-D.2
Yang, L.-P.3
Ji, D.-H.4
-
52
-
-
85097831731
-
An improved Chinese word segmentation system with conditional random field
-
Sydney, Australia
-
H. Zhao, Huang, C.-N., M. Li, An improved Chinese word segmentation system with conditional random field, in: Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing (SIGHAN-5), Sydney, Australia, 2006, pp. 162-165.
-
(2006)
Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing (SIGHAN-5)
, pp. 162-165
-
-
Zhao, H.1
Huang, C.-N.2
Li, M.3
-
53
-
-
38049001432
-
Effective tag set selection in Chinese word segmentation via conditional random field modeling
-
Wuhan, China
-
H. Zhao, C.-N. Huang, Li, M., Lu, B.-L., Effective tag set selection in Chinese word segmentation via conditional random field modeling, in: Proceedings of the 20th Pacific Asian Conference on Language, Information and Computation (PACLIC 20), Wuhan, China, 2006, pp. 87-94.
-
(2006)
Proceedings of the 20th Pacific Asian Conference on Language, Information and Computation (PACLIC 20)
, pp. 87-94
-
-
Zhao, H.1
Huang, C.-N.2
Li, M.3
Lu, B.-L.4
-
54
-
-
57649198363
-
Incorporating global information into supervised learning for Chinese word segmentation
-
Melbourne, Australia
-
H. Zhao, C. Kit, Incorporating global information into supervised learning for Chinese word segmentation, in: Proceedings of the 10th Conference of the Pacific Association for Computational Linguistics (PACLING 2007), Melbourne, Australia, 2007, pp. 66-74.
-
(2007)
Proceedings of the 10th Conference of the Pacific Association for Computational Linguistics (PACLING 2007)
, pp. 66-74
-
-
Zhao, H.1
Kit, C.2
-
55
-
-
84871054273
-
Unsupervised segmentation helps supervised learning of character tagging for word segmentation and named entity recognition
-
Hyderabad, India
-
H. Zhao, C. Kit, Unsupervised segmentation helps supervised learning of character tagging for word segmentation and named entity recognition, in: The Sixth SIGHAN Workshop on Chinese Language Processing (SIGHAN-6), Hyderabad, India, 2008, pp. 106-111.
-
(2008)
The Sixth SIGHAN Workshop on Chinese Language Processing (SIGHAN-6)
, pp. 106-111
-
-
Zhao, H.1
Kit, C.2
-
56
-
-
70450183849
-
An empirical comparison of goodness measures for unsupervised Chinese word segmentation with a unified framework
-
Hyderabad, India
-
H. Zhao, C. Kit, An empirical comparison of goodness measures for unsupervised Chinese word segmentation with a unified framework, in: The Third International Joint Conference on Natural Language Processing (IJCNLP-2008), vol. 1, Hyderabad, India, 2008, pp. 9-16.
-
(2008)
The Third International Joint Conference on Natural Language Processing (IJCNLP-2008)
, vol.1
, pp. 9-16
-
-
Zhao, H.1
Kit, C.2
-
57
-
-
80052200139
-
Exploiting unlabeled text with different unsupervised segmentation criteria for Chinese word segmentation
-
Research in Computing Science
-
H. Zhao, and C. Kit Exploiting unlabeled text with different unsupervised segmentation criteria for Chinese word segmentation The 9th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2008), Haifa, Israel, 2008 Research in Computing Science 33 2008 93 104
-
(2008)
The 9th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2008), Haifa, Israel, 2008
, vol.33
, pp. 93-104
-
-
Zhao, H.1
Kit, C.2
-
58
-
-
49349140728
-
Scaling conditional random fields by one-against-the-other decomposition
-
H. Zhao, and C. Kit Scaling conditional random fields by one-against-the-other decomposition Journal of Computer Science and Technology 23 4 2008 612 619
-
(2008)
Journal of Computer Science and Technology
, vol.23
, Issue.4
, pp. 612-619
-
-
Zhao, H.1
Kit, C.2
-
59
-
-
77957833002
-
A simple and efficient model pruning method for conditional random fields
-
Hong Kong, China
-
H. Zhao, C. Kit, A simple and efficient model pruning method for conditional random fields, in: Proceedings of the 22nd International Conference on the Computer Processing of Oriental Languages (ICCPOL-2009), Hong Kong, China, 2009, pp. 149-159.
-
(2009)
Proceedings of the 22nd International Conference on the Computer Processing of Oriental Languages (ICCPOL-2009)
, pp. 149-159
-
-
Zhao, H.1
Kit, C.2
-
60
-
-
77953784160
-
Designing special post-processing rules for SVM-based Chinese word segmentation
-
Sydney, Australia
-
M.-H. Zhu, Y.-L. Wang, Z.-X. Wang, H.-Z. Wang, J.-B. Zhu, Designing special post-processing rules for SVM-based Chinese word segmentation, in: Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing (SIGHAN-5), Sydney, Australia, 2006, pp. 217-220.
-
(2006)
Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing (SIGHAN-5)
, pp. 217-220
-
-
Zhu, M.-H.1
Wang, Y.-L.2
Wang, Z.-X.3
Wang, H.-Z.4
Zhu, J.-B.5
|