-
1
-
-
0028461417
-
Automated learning of decision rules for text categorization
-
C. Apté, F. Damerau, and S. Weiss. Automated learning of decision rules for text categorization. ACM Transactions on Information Systems, 12(3):233-251, 1994.
-
(1994)
ACM Transactions on Information Systems
, vol.12
, Issue.3
, pp. 233-251
-
-
Apté, C.1
Damerau, F.2
Weiss, S.3
-
2
-
-
12244295756
-
An objective evaluation criterion for clustering
-
ACM Press, New York, NY
-
A. Banerjee and J. Langford. An objective evaluation criterion for clustering. In Proceedings of KDD-2004, ACM Press, New York, NY, 2004.
-
(2004)
Proceedings of KDD-2004
-
-
Banerjee, A.1
Langford, J.2
-
3
-
-
77951430107
-
Distributional word clusters vs. words for text categorization
-
R. Bekkerman, R. El-Yaniv, N. Tishby, and Y. Winter. Distributional word clusters vs. words for text categorization. Journal of Machine Learning Research, 3:1183-1208, 2003.
-
(2003)
Journal of Machine Learning Research
, vol.3
, pp. 1183-1208
-
-
Bekkerman, R.1
El-Yaniv, R.2
Tishby, N.3
Winter, Y.4
-
4
-
-
84938213121
-
Nymble: A high-performance learning name finder
-
ACM Press, New York, NY
-
D. Bikel, S. Miller, R. Schwartz, and R. Weischedel. Nymble: A high-performance learning name finder. In The Fifth Conference on Applied Natural Language Processing, pages 194-201, ACM Press, New York, NY, 1997.
-
(1997)
The Fifth Conference on Applied Natural Language Processing
, pp. 194-201
-
-
Bikel, D.1
Miller, S.2
Schwartz, R.3
Weischedel, R.4
-
5
-
-
0032632354
-
An algorithm that learns what's in a name
-
D. Bikel, R. Schwartz, and R. Weischedel. An algorithm that learns what's in a name. Machine Learning, 34(1-3):211-231, 1999.
-
(1999)
Machine Learning
, vol.34
, Issue.1-3
, pp. 211-231
-
-
Bikel, D.1
Schwartz, R.2
Weischedel, R.3
-
6
-
-
0031620208
-
Combining labeled and unlabeled data with co-training
-
ACM Press, New York, NY
-
A. Blum and T. Mitchell. Combining labeled and unlabeled data with co-training. In Proceedings of the Eleventh Annual Conference on Computational Learning Theory, pages 92-100, ACM Press, New York, NY, 1998.
-
(1998)
Proceedings of the Eleventh Annual Conference on Computational Learning Theory
, pp. 92-100
-
-
Blum, A.1
Mitchell, T.2
-
8
-
-
0000275022
-
Prediction games and arcing algorithms
-
L. Breiman. Prediction games and arcing algorithms. Neural Computation, 11:1493-1517, 1999.
-
(1999)
Neural Computation
, vol.11
, pp. 1493-1517
-
-
Breiman, L.1
-
9
-
-
84867919822
-
Transformation-based error-driven learning and natural language processing: A case study in part-of-speech tagging
-
E. Brill. Transformation-based error-driven learning and natural language processing: A case study in part-of-speech tagging. Computational Linguistics, 21(4):543-565, 1995. http://www.cis.upenn.edu/~adwait/penntools.html.
-
(1995)
Computational Linguistics
, vol.21
, Issue.4
, pp. 543-565
-
-
Brill, E.1
-
10
-
-
3042540888
-
Relational learning of pattern-match rules for information extraction
-
AAAI Press, Menlo Park, CA
-
M. Califf and R. Mooney. Relational learning of pattern-match rules for information extraction. In Working Notes of AAAI Spring Symposium on Applying Machine Learning to Discourse Processing, pages 6-11, AAAI Press, Menlo Park, CA, 1998.
-
(1998)
Working Notes of AAAI Spring Symposium on Applying Machine Learning to Discourse Processing
, pp. 6-11
-
-
Califf, M.1
Mooney, R.2
-
11
-
-
85094927271
-
Noun phrase coreference as clustering
-
ACL, East Stroudsburg, PA
-
C. Cardie and K. Wagstaff. Noun phrase coreference as clustering. In Proceedings of the Joint SIGDAT Conference on Empirical Methods in NLP and Very Large Corpora, pages 82-89, ACL, East Stroudsburg, PA, 1999.
-
(1999)
Proceedings of the Joint SIGDAT Conference on Empirical Methods in NLP and Very Large Corpora
, pp. 82-89
-
-
Cardie, C.1
Wagstaff, K.2
-
12
-
-
0031343557
-
Statistical techniques for natural language parsing
-
E. Charniak. Statistical techniques for natural language parsing. AI Magazine, 18(4):33-43, 1997.
-
(1997)
AI Magazine
, vol.18
, Issue.4
, pp. 33-43
-
-
Charniak, E.1
-
13
-
-
84907334477
-
Statistical parsing with an automatically-extracted tree adjoining grammar
-
ACL, East Stroudsburg, PA
-
D. Chiang. Statistical parsing with an automatically-extracted tree adjoining grammar. In Proceedings of the ACL 2000, pages 456-463, ACL, East Stroudsburg, PA, 2000.
-
(2000)
Proceedings of the ACL 2000
, pp. 456-463
-
-
Chiang, D.1
-
15
-
-
85127836544
-
Discriminative training methods for hidden Markov models: Theory and experiments with perceptron algorithms
-
ACL, East Stroudsburg, PA
-
M. Collins. Discriminative training methods for hidden Markov models: Theory and experiments with perceptron algorithms. In Proceedings of EMNLP'02, ACL, East Stroudsburg, PA, 2002.
-
(2002)
Proceedings of EMNLP'02
-
-
Collins, M.1
-
16
-
-
0035312953
-
Relational learning with statistical predicate invention: Better models for hypertext
-
M. Craven and S. Slattery. Relational learning with statistical predicate invention: Better models for hypertext. Machine Learning, 43:97-119, 2001.
-
(2001)
Machine Learning
, vol.43
, pp. 97-119
-
-
Craven, M.1
Slattery, S.2
-
17
-
-
0027029929
-
Scatter/Gather: A cluster-based approach to browsing large document collections
-
ACM Press, New York, NY
-
D. Cutting, D. Karger, J. Pedersen, and J. Tukey. Scatter/Gather: A cluster-based approach to browsing large document collections. In Proceedings of SIGIR-92, pages 1-12, ACM Press, New York, NY, 1992.
-
(1992)
Proceedings of SIGIR-92
, pp. 1-12
-
-
Cutting, D.1
Karger, D.2
Pedersen, J.3
Tukey, J.4
-
18
-
-
0347527558
-
Text categorization for a comprehensive time-dependent benchmark
-
F. Damerau, T. Zhang, S. Weiss, and N. Indurkhya. Text categorization for a comprehensive time-dependent benchmark. Information Processing and Management, 40(2):209-221, 2004.
-
(2004)
Information Processing and Management
, vol.40
, Issue.2
, pp. 209-221
-
-
Damerau, F.1
Zhang, T.2
Weiss, S.3
Indurkhya, N.4
-
19
-
-
84976699870
-
Problems and some solutions in customization of natural language database front ends
-
F. Damerau. Problems and some solutions in customization of natural language database front ends. ACM Transactions on Information Systems, 3(2):165-184, 1985.
-
(1985)
ACM Transactions on Information Systems
, vol.3
, Issue.2
, pp. 165-184
-
-
Damerau, F.1
-
21
-
-
0002629270
-
Maximum likelihood from incomplete data via the EM algorithm
-
With discussion
-
A. Dempster, N. Laird, and D. Rubin. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society Series B, 39(1):1-38, 1977. With discussion.
-
(1977)
Journal of the Royal Statistical Society Series B
, vol.39
, Issue.1
, pp. 1-38
-
-
Dempster, A.1
Laird, N.2
Rubin, D.3
-
22
-
-
0034824884
-
Concept decompositions for large sparse text data using clustering
-
I. Dhillon and D. Modha. Concept decompositions for large sparse text data using clustering. Machine Learning, 42(1):143-175, 2001.
-
(2001)
Machine Learning
, vol.42
, Issue.1
, pp. 143-175
-
-
Dhillon, I.1
Modha, D.2
-
23
-
-
0033656184
-
Hierarchical classification of web content
-
ACM Press, New York, NY
-
S. Dumais and H. Chen. Hierarchical classification of web content. In Proceedings of the 23rd ACM International Conference on Research and Development in Information Retrieval, pages 256-263, ACM Press, New York, NY, 2000.
-
(2000)
Proceedings of the 23rd ACM International Conference on Research and Development in Information Retrieval
, pp. 256-263
-
-
Dumais, S.1
Chen, H.2
-
24
-
-
0014732304
-
An efficient context-free parsing algorithm
-
J. Earley. An efficient context-free parsing algorithm. Communications of the ACM, 13(2):94-102, 1970.
-
(1970)
Communications of the ACM
, vol.13
, Issue.2
, pp. 94-102
-
-
Earley, J.1
-
25
-
-
84976733117
-
A tree algorithm for nearest neighbor searching in document retrieval systems
-
ACM Press, New York, NY
-
C. Eastman and S. Weiss. A tree algorithm for nearest neighbor searching in document retrieval systems. In Proceedings of the ACM-SIGIR International Conference on Information Storage and Retrieval, pages 131-149, ACM Press, New York, NY, 1978.
-
(1978)
Proceedings of the ACM-SIGIR International Conference on Information Storage and Retrieval
, pp. 131-149
-
-
Eastman, C.1
Weiss, S.2
-
28
-
-
85122279702
-
Named entity recognition through classifier combination
-
ACL, East Stroudsburg, PA
-
R. Florian, A. Ittycheriah, H. Jing, and T. Zhang. Named entity recognition through classifier combination. In Proceedings of CoNLL-2003, pages 168-171, ACL, East Stroudsburg, PA, 2003.
-
(2003)
Proceedings of CoNLL-2003
, pp. 168-171
-
-
Florian, R.1
Ittycheriah, A.2
Jing, H.3
Zhang, T.4
-
29
-
-
2942731012
-
An extensive empirical study of feature selection metrics for text classification
-
G. Forman. An extensive empirical study of feature selection metrics for text classification. Journal of Machine Learning Research, 3:1289-1305, 2003.
-
(2003)
Journal of Machine Learning Research
, vol.3
, pp. 1289-1305
-
-
Forman, G.1
-
30
-
-
0031643563
-
Information extraction from HTML: Application of a general machine learning approach
-
AAAI Press, Menlo Park, CA
-
D. Freitag. Information extraction from HTML: Application of a general machine learning approach. In Proceedings of the 15th National Conference on Artificial Intelligence, pages 517-523, AAAI Press, Menlo Park, CA, 1998.
-
(1998)
Proceedings of the 15th National Conference on Artificial Intelligence
, pp. 517-523
-
-
Freitag, D.1
-
31
-
-
0031211090
-
A decision-theoretic generalization of on-line learning and an application to boosting
-
Y. Freund and R. Schapire. A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1):119-139, 1997.
-
(1997)
Journal of Computer and System Sciences
, vol.55
, Issue.1
, pp. 119-139
-
-
Freund, Y.1
Schapire, R.2
-
32
-
-
0034164230
-
Additive logistic regression: A statistical view of boosting
-
With discussion
-
J. Friedman, T. Hastie, and R. Tibshirani. Additive logistic regression: A statistical view of boosting. The Annals of Statistics, 28(2):337-407, 2000. With discussion.
-
(2000)
The Annals of Statistics
, vol.28
, Issue.2
, pp. 337-407
-
-
Friedman, J.1
Hastie, T.2
Tibshirani, R.3
-
33
-
-
0015493305
-
Citation analysis as a tool in journal evaluation
-
E. Garfield. Citation analysis as a tool in journal evaluation. Science, 178:471-479, 1972.
-
(1972)
Science
, vol.178
, pp. 471-479
-
-
Garfield, E.1
-
35
-
-
0003684449
-
-
Springer Series in Statistics, Springer-Verlag, New York, NY
-
T. Hastie, R. Tibshirani, and J. Friedman. The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Springer Series in Statistics, Springer-Verlag, New York, NY, 2001.
-
(2001)
The Elements of Statistical Learning: Data Mining, Inference, and Prediction
-
-
Hastie, T.1
Tibshirani, R.2
Friedman, J.3
-
36
-
-
84976669602
-
CONSTRUE/TIS: A system for content-based indexing of a database of news stories
-
AAAI Press, Menlo Park, CA
-
P. Hayes and S. Weinstein. CONSTRUE/TIS: A system for content-based indexing of a database of news stories. In Proceedings of the 2nd Conference on Innovative Applications of Artificial Intelligence, pages 49-66, AAAI Press, Menlo Park, CA, 1990.
-
(1990)
Proceedings of the 2nd Conference on Innovative Applications of Artificial Intelligence
, pp. 49-66
-
-
Hayes, P.1
Weinstein, S.2
-
38
-
-
0002487107
-
Word sense disambiguation: The state of the art
-
N. Ide and J. Véronis. Word sense disambiguation: The state of the art. Computational Linguistics, 24(1):1-40, 1998.
-
(1998)
Computational Linguistics
, vol.24
, Issue.1
, pp. 1-40
-
-
Ide, N.1
Véronis, J.2
-
39
-
-
0034592915
-
Active learning using adaptive resampling
-
ACM Press, New York, NY
-
V. Iyengar, C. Apté, and T. Zhang. Active learning using adaptive resampling. In The Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 91-98, ACM Press, New York, NY, 2000.
-
(2000)
The Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
, pp. 91-98
-
-
Iyengar, V.1
Apté, C.2
Zhang, T.3
-
41
-
-
11944266539
-
Information theory and statistical mechanics
-
E. Jaynes. Information theory and statistical mechanics. Physical Review, 106:620-630, 1957.
-
(1957)
Physical Review
, vol.106
, pp. 620-630
-
-
Jaynes, E.1
-
42
-
-
0000636553
-
Text categorization with support vector machines: Learning with many relevant features
-
Springer-Verlag, New York, NY
-
T. Joachims. Text categorization with support vector machines: Learning with many relevant features. In Proceedings of the 10th European Conference on Machine Learning, Springer-Verlag, New York, NY, 1998.
-
(1998)
Proceedings of the 10th European Conference on Machine Learning
-
-
Joachims, T.1
-
43
-
-
0012183808
-
-
Prentice-Hall, Englewood Cliffs, NJ
-
D. Jurafsky and J. Martin. An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, Prentice-Hall, Englewood Cliffs, NJ, 2000.
-
(2000)
An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition
-
-
Jurafsky, D.1
Martin, J.2
-
44
-
-
0002499892
-
An information-theoretic analysis of hard and soft assignment methods for clustering
-
Morgan Kaufmann, San Francisco, CA
-
M. Kearns, Y. Mansour, and A-Y. Ng. An information-theoretic analysis of hard and soft assignment methods for clustering. In Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence, pages 282-293, Morgan Kaufmann, San Francisco, CA, 1997.
-
(1997)
Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence
, pp. 282-293
-
-
Kearns, M.1
Mansour, Y.2
Ng, A.-Y.3
-
45
-
-
0029389728
-
Acquisition of linguistic patterns for knowledge-based information extraction
-
J. Kim and D. Moldovan. Acquisition of linguistic patterns for knowledge-based information extraction. IEEE Transactions on Knowledge and Data Engineering, 7(5):713-724, 1995.
-
(1995)
IEEE Transactions on Knowledge and Data Engineering
, vol.7
, Issue.5
, pp. 713-724
-
-
Kim, J.1
Moldovan, D.2
-
46
-
-
4243148480
-
Authoritative sources in a hyperlinked environment
-
J. Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM, 46(5):604-632, 1999.
-
(1999)
Journal of the ACM
, vol.46
, Issue.5
, pp. 604-632
-
-
Kleinberg, J.1
-
47
-
-
28344432282
-
SVM-based filtering of e-mail spam with content-specific misclassification costs
-
IEEE Press, Piscataway, NJ
-
A. Kolcz and J. Alspector. SVM-based filtering of e-mail spam with content-specific misclassification costs. In Proceedings of Workshop on Text Mining, IEEE ICDM-2001, IEEE Press, Piscataway, NJ, 2001.
-
(2001)
Proceedings of Workshop on Text Mining, IEEE ICDM-2001
-
-
Kolcz, A.1
Alspector, J.2
-
49
-
-
84945520073
-
Use of support vector learning for chunk identification
-
ACL, East Stroudsburg, PA
-
T. Kudoh and Y. Matsumoto. Use of support vector learning for chunk identification. In Proceedings of CoNLL-2000 and LLL-2000, pages 142-144, ACL, East Stroudsburg, PA, 2000.
-
(2000)
Proceedings of CoNLL-2000 and LLL-2000
, pp. 142-144
-
-
Kudoh, T.1
Matsumoto, Y.2
-
50
-
-
0142192295
-
Conditional random fields: Probabilistic models for segmenting and labeling sequence data
-
Morgan Kaufmann, San Francisco, CA
-
J. Lafferty, A. McCallum, and F. Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of ICML-01, pages 282-289, Morgan Kaufmann, San Francisco, CA, 2001.
-
(2001)
Proceedings of ICML-01
, pp. 282-289
-
-
Lafferty, J.1
McCallum, A.2
Pereira, F.3
-
51
-
-
30844441070
-
An algorithm for pronominal anaphora resolution
-
S. Lappin and H. Leass. An algorithm for pronominal anaphora resolution. Computational Linguistics, 20(4):535-561, 1994.
-
(1994)
Computational Linguistics
, vol.20
, Issue.4
, pp. 535-561
-
-
Lappin, S.1
Leass, H.2
-
52
-
-
85124125604
-
Heterogeneous uncertainty sampling for supervised learning
-
Morgan Kaufmann, San Francisco, CA
-
D. Lewis and J. Catlett. Heterogeneous uncertainty sampling for supervised learning. In Proceedings of the Eleventh International Conference on Machine Learning, pages 148-156, Morgan Kaufmann, San Francisco, CA, 1994.
-
(1994)
Proceedings of the Eleventh International Conference on Machine Learning
, pp. 148-156
-
-
Lewis, D.1
Catlett, J.2
-
53
-
-
84876811202
-
RCV1: A new benchmark collection for text categorization research
-
D. Lewis, Y. Yang, T. Rose, and F. Li. RCV1: A new benchmark collection for text categorization research. Journal of Machine Learning Research, 5:361-397, 2004.
-
(2004)
Journal of Machine Learning Research
, vol.5
, pp. 361-397
-
-
Lewis, D.1
Yang, Y.2
Rose, T.3
Li, F.4
-
54
-
-
0002312061
-
Feature selection and feature extraction for text categorization
-
Morgan Kaufmann, San Francisco, CA
-
D. Lewis. Feature selection and feature extraction for text categorization. In Proceedings of the Speech and Natural Language Workshop, pages 212-217, Morgan Kaufmann, San Francisco, CA, 1992.
-
(1992)
Proceedings of the Speech and Natural Language Workshop
, pp. 212-217
-
-
Lewis, D.1
-
55
-
-
1942516915
-
A loss function analysis for classification methods in text categorization
-
AAAI Press, Menlo Park, CA
-
F. Li and Y. Yang. A loss function analysis for classification methods in text categorization. In Proceedings of the Twentieth International Conference on Machine Learning, pages 472-479, AAAI Press, Menlo Park, CA, 2003.
-
(2003)
Proceedings of the Twentieth International Conference on Machine Learning
, pp. 472-479
-
-
Li, F.1
Yang, Y.2
-
56
-
-
0031369631
-
Active learning with committees for text categorization
-
AAAI Press, Menlo Park, CA
-
R. Liere and P. Tadepalli. Active learning with committees for text categorization. In Proceedings of the 14th National Conference on Artificial Intelligence, pages 591-596, AAAI Press, Menlo Park, CA, 1997.
-
(1997)
Proceedings of the 14th National Conference on Artificial Intelligence
, pp. 591-596
-
-
Liere, R.1
Tadepalli, P.2
-
57
-
-
34250091945
-
Learning quickly when irrelevant attributes abound: A new linear-threshold algorithm
-
N. Littlestone. Learning quickly when irrelevant attributes abound: A new linear-threshold algorithm. Machine Learning, 2:285-318, 1988.
-
(1988)
Machine Learning
, vol.2
, pp. 285-318
-
-
Littlestone, N.1
-
58
-
-
0000880768
-
The automatic creation of literature abstracts
-
H. Luhn. The automatic creation of literature abstracts. IBM Journal of Research and Development, 2(2):159-165, 1958.
-
(1958)
IBM Journal of Research and Development
, vol.2
, Issue.2
, pp. 159-165
-
-
Luhn, H.1
-
59
-
-
33244462107
-
Auto-encoding of documents for information retrieval systems
-
M. Boaz, editor, Pergamon Press, London
-
H. Luhn. Auto-encoding of documents for information retrieval systems. In M. Boaz, editor, Modern Trends in Documentation, pages 45-58, Pergamon Press, London, 1959.
-
(1959)
Modern Trends in Documentation
, pp. 45-58
-
-
Luhn, H.1
-
60
-
-
0001457509
-
Some methods for classification and analysis of multivariate observations
-
University of California Press, Berkeley, CA
-
J. MacQueen. Some methods for classification and analysis of multivariate observations. In Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, pages 281-297, University of California Press, Berkeley, CA, 1967.
-
(1967)
Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability
, pp. 281-297
-
-
MacQueen, J.1
-
61
-
-
0141592441
-
Incremental rule learning with partial instance memory for changing concepts
-
IEEE Press, Piscataway, NJ
-
M. Maloof. Incremental rule learning with partial instance memory for changing concepts. In Proceedings of the International Joint Conference on Neural Networks (IJCNN '03), pages 2764-2769, IEEE Press, Piscataway, NJ, 2003.
-
(2003)
Proceedings of the International Joint Conference on Neural Networks (IJCNN '03)
, pp. 2764-2769
-
-
Maloof, M.1
-
63
-
-
84945708697
-
On relevance, probabilistic indexing and information retrieval
-
M. Maron and J. Kuhns. On relevance, probabilistic indexing and information retrieval. Journal of the ACM, 7:216-244, 1960.
-
(1960)
Journal of the ACM
, vol.7
, pp. 216-244
-
-
Maron, M.1
Kuhns, J.2
-
64
-
-
0026986166
-
Classifying news stories using memory based reasoning
-
ACM Press, New York, NY
-
B. Masand, G. Linoff, and D. Waltz. Classifying news stories using memory based reasoning. In Proceedings of the 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 59-65, ACM Press, New York, NY, 1992.
-
(1992)
Proceedings of the 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
, pp. 59-65
-
-
Masand, B.1
Linoff, G.2
Waltz, D.3
-
65
-
-
0001673996
-
A comparison of event models for naive Bayes text classification
-
AAAI Press, Menlo Park, CA
-
A. McCallum and K. Nigam. A comparison of event models for naive Bayes text classification. In AAAI/ICML-98 Workshop on Learning for Text Categorization, pages 41-48, AAAI Press, Menlo Park, CA, 1998.
-
(1998)
AAAI/ICML-98 Workshop on Learning for Text Categorization
, pp. 41-48
-
-
McCallum, A.1
Nigam, K.2
-
66
-
-
85018097470
-
Using decision trees for coreference resolution
-
Morgan Kaufmann, San Francisco, CA
-
J. McCarthy and W. Lehnert. Using decision trees for coreference resolution. In Proceedings of the 14th International Joint Conference on Artificial Intelligence, pages 1050-1055, Morgan Kaufmann, San Francisco, CA, 1995.
-
(1995)
Proceedings of the 14th International Joint Conference on Artificial Intelligence
, pp. 1050-1055
-
-
McCarthy, J.1
Lehnert, W.2
-
67
-
-
0002967043
-
Slot grammar: A system for simple construction of practical natural language grammars
-
Springer-Verlag, New York, NY
-
M. McCord. Slot grammar: A system for simple construction of practical natural language grammars. In Proceedings of the International Symposium on Natural Language and Logic, pages 118-145, Springer-Verlag, New York, NY, 1989.
-
(1989)
Proceedings of the International Symposium on Natural Language and Logic
, pp. 118-145
-
-
McCord, M.1
-
68
-
-
2942661100
-
Tracking and summarizing news on a daily basis with Columbia's Newsblaster
-
ACL, East Stroudsburg, PA
-
K. McKeown, R. Barzilay, D. Evans, V. Hatzivassiloglou, J. Klavans, A. Nenkova, C. Sable, B. Schiffman, and S. Sigelman. Tracking and summarizing news on a daily basis with Columbia's Newsblaster. In Proceedings of the Human Languages Technology Conference, ACL, East Stroudsburg, PA, 2002.
-
(2002)
Proceedings of the Human Languages Technology Conference
-
-
McKeown, K.1
Barzilay, R.2
Evans, D.3
Hatzivassiloglou, V.4
Klavans, J.5
Nenkova, A.6
Sable, C.7
Schiffman, B.8
Sigelman, S.9
-
69
-
-
79952573047
-
Description of the LTG system used for MUC-7
-
NIST, Washington, DC
-
A. Mikheev, C. Grover, and M. Moens. Description of the LTG system used for MUC-7. In Proceedings of the Seventh Message Understanding Conference (MUC-7), NIST, Washington, DC, 1998.
-
(1998)
Proceedings of the Seventh Message Understanding Conference (MUC-7)
-
-
Mikheev, A.1
Grover, C.2
Moens, M.3
-
70
-
-
0345120050
-
BBN: Description of the SIFT system as used for MUC-7
-
NIST, Washington, DC
-
S. Miller, M. Crystal, H. Fox, L. Ramshaw, R. Schwartz, R. Stone, and R. Weischedel. BBN: Description of the SIFT system as used for MUC-7. In Proceedings of the Seventh Message Understanding Conference (MUC-7), NIST, Washington, DC, 1998.
-
(1998)
Proceedings of the Seventh Message Understanding Conference (MUC-7)
-
-
Miller, S.1
Crystal, M.2
Fox, H.3
Ramshaw, L.4
Schwartz, R.5
Stone, R.6
Weischedel, R.7
-
71
-
-
0033886806
-
Text classification from labeled and unlabeled documents using EM
-
K. Nigam, A. McCallum, S. Thrun, and T. Mitchell. Text classification from labeled and unlabeled documents using EM. Machine Learning, 39(2/3):1-32, 2000.
-
(2000)
Machine Learning
, vol.39
, Issue.2-3
, pp. 1-32
-
-
Nigam, K.1
McCallum, A.2
Thrun, S.3
Mitchell, T.4
-
75
-
-
0031120321
-
Inducing features of random fields
-
S. Pietra, V. Pietra, and J. Lafferty. Inducing features of random fields. IEEE Transactions on Pattern Analysis and Machine Intelligence, 19(4):380-393, 1997.
-
(1997)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.19
, Issue.4
, pp. 380-393
-
-
Pietra, S.1
Pietra, V.2
Lafferty, J.3
-
76
-
-
84948481845
-
An algorithm for suffix stripping
-
M. Porter. An algorithm for suffix stripping. Program, 14(3):130-137, 1980.
-
(1980)
Program
, vol.14
, Issue.3
, pp. 130-137
-
-
Porter, M.1
-
78
-
-
80053431468
-
Multidocument centroid-based text summarization
-
ACL, East Stroudsburg, PA
-
D. Radev, M. Topper, and A. Winkel. Multidocument centroid-based text summarization. In Proceedings of ACL-02 Demo Session, pages 112-113, ACL, East Stroudsburg, PA, 2002.
-
(2002)
Proceedings of ACL-02 Demo Session
, pp. 112-113
-
-
Radev, D.1
Topper, M.2
Winkel, A.3
-
79
-
-
0002711083
-
Text chunking using transformation-based learning
-
ACL, East Stroudsburg, PA
-
L. Ramshaw and M. Marcus. Text chunking using transformation-based learning. In Proceedings of the Third Workshop on Very Large Corpora, pages 82-94, ACL, East Stroudsburg, PA, 1995.
-
(1995)
Proceedings of the Third Workshop on Very Large Corpora
, pp. 82-94
-
-
Ramshaw, L.1
Marcus, M.2
-
80
-
-
84891448883
-
A maximum entropy part-of-speech tagger
-
A. Ratnaparkhi. A maximum entropy part-of-speech tagger. Computational Linguistics, 21(4):543-565, 1995. http://www.cis.upenn.edu/~adwait/penntools.html.
-
(1995)
Computational Linguistics
, vol.21
, Issue.4
, pp. 543-565
-
-
Ratnaparkhi, A.1
-
81
-
-
0032660854
-
Learning to parse natural language with maximum entropy models
-
A. Ratnaparkhi. Learning to parse natural language with maximum entropy models. Machine Learning, 34:151-178, 1999.
-
(1999)
Machine Learning
, vol.34
, pp. 151-178
-
-
Ratnaparkhi, A.1
-
82
-
-
0142154760
-
-
O'Reilly & Associates, Sebastopol, CA
-
E. Ray. Learning XML, O'Reilly & Associates, Sebastopol, CA, 2001.
-
(2001)
Learning XML
-
-
Ray, E.1
-
83
-
-
0027709268
-
Automatically constructing a dictionary for information extraction tasks
-
AAAI Press, Menlo Park, CA
-
E. Riloff. Automatically constructing a dictionary for information extraction tasks. In Proceedings of the 11th National Conference on Artificial Intelligence, pages 811-816, AAAI Press, Menlo Park, CA, 1993.
-
(1993)
Proceedings of the 11th National Conference on Artificial Intelligence
, pp. 811-816
-
-
Riloff, E.1
-
84
-
-
0001319911
-
Okapi at TREC-3
-
NIST, Washington, DC
-
S. Robertson, S. Walker, S. Jones, M. Hancock-Beaulieu, and M. Gatford. Okapi at TREC-3. In Proceedings of the Third Text Retrieval Conference, pages 109-126, NIST, Washington, DC, 1994. http://trec.nist.gov/pubs/trec3/papers/city.ps.gz.
-
(1994)
Proceedings of the Third Text Retrieval Conference
, pp. 109-126
-
-
Robertson, S.1
Walker, S.2
Jones, S.3
Hancock-Beaulieu, M.4
Gatford, M.5
-
86
-
-
84880912564
-
Relational learning via propositional algorithms: An information extraction case study
-
Morgan Kaufmann, San Francisco, CA
-
D. Roth and W. Yih. Relational learning via propositional algorithms: An information extraction case study. In Proceedings of the 17th International Joint Conference on Artificial Intelligence, pages 1257-1263, Morgan Kaufmann, San Francisco, CA, 2001.
-
(2001)
Proceedings of the 17th International Joint Conference on Artificial Intelligence
, pp. 1257-1263
-
-
Roth, D.1
Yih, W.2
-
87
-
-
0010046240
-
The SMART automatic document retrieval system - An illustration
-
G. Salton and M. Lesk. The SMART automatic document retrieval system - An illustration. Communications of the ACM, 8(6):391-398, 1965.
-
(1965)
Communications of the ACM
, vol.8
, Issue.6
, pp. 391-398
-
-
Salton, G.1
Lesk, M.2
-
89
-
-
0019650203
-
A term weighting model based on utility theory
-
ACM Press, New York, NY
-
G. Salton and H. Wu. A term weighting model based on utility theory. In Proceedings of SIGIR, pages 9-22, ACM Press, New York, NY, 1980.
-
(1980)
Proceedings of SIGIR
, pp. 9-22
-
-
Salton, G.1
Wu, H.2
-
93
-
-
85109864082
-
Introduction to the CoNLL-2000 shared task: Chunking
-
ACL, East Stroudsburg, PA
-
E. Sang and S. Buchholz. Introduction to the CoNLL-2000 shared task: Chunking. In Proceedings of the CoNLL-2000 and LLL-2000, pages 127-132, ACL, East Stroudsburg, PA, 2000.
-
(2000)
Proceedings of the CoNLL-2000 and LLL-2000
, pp. 127-132
-
-
Sang, E.1
Buchholz, S.2
-
94
-
-
85099019865
-
Introduction to the CoNLL-2003 shared task: Language independent named entity recognition
-
W. Daelemans and M. Osborne, editors, ACL, East Stroudsburg, PA
-
E. Sang and F. De Meulder. Introduction to the CoNLL-2003 shared task: Language independent named entity recognition. In W. Daelemans and M. Osborne, editors, Proceedings of CoNLL-2003, pages 142-147, ACL, East Stroudsburg, PA, 2003.
-
(2003)
Proceedings of CoNLL-2003
, pp. 142-147
-
-
Sang, E.1
De Meulder, F.2
-
95
-
-
0033281701
-
Improved boosting algorithms using confidence-rated predictions
-
R. Schapire and Y. Singer. Improved boosting algorithms using confidence-rated predictions. Machine Learning, 37:297-336, 1999.
-
(1999)
Machine Learning
, vol.37
, pp. 297-336
-
-
Schapire, R.1
Singer, Y.2
-
96
-
-
0033905095
-
BoosTexter: A boosting-based system for text categorization
-
R. Schapire and Y. Singer. BoosTexter: A boosting-based system for text categorization. Machine Learning, 39(2/3):135-168, 2000.
-
(2000)
Machine Learning
, vol.39
, Issue.2-3
, pp. 135-168
-
-
Schapire, R.1
Singer, Y.2
-
97
-
-
84891451127
-
Winston, Katz sue Ask Jeeves: AI lab researchers attempt to enforce natural language patent
-
S. Seshasai. Winston, Katz sue Ask Jeeves: AI lab researchers attempt to enforce natural language patent. The Tech (MIT), 2000. http://www-tech.mit.edu/V119/N66/.
-
(2000)
The Tech (MIT)
-
-
Seshasai, S.1
-
98
-
-
85168119478
-
CRYSTAL: Inducing a conceptual dictionary
-
Morgan Kaufmann, San Francisco, CA
-
S. Soderland, D. Fisher, J. Aseltine, and W. Lehnert. CRYSTAL: Inducing a conceptual dictionary. In Proceedings of the 14th International Joint Conference on Artificial Intelligence, pages 1314-1319, Morgan Kaufmann, San Francisco, CA, 1995.
-
(1995)
Proceedings of the 14th International Joint Conference on Artificial Intelligence
, pp. 1314-1319
-
-
Soderland, S.1
Fisher, D.2
Aseltine, J.3
Lehnert, W.4
-
99
-
-
0032624184
-
Learning information extraction rules for semi-structured and free text
-
S. Soderland. Learning information extraction rules for semi-structured and free text. Machine Learning, 34(1-3):233-272, 1999.
-
(1999)
Machine Learning
, vol.34
, Issue.1-3
, pp. 233-272
-
-
Soderland, S.1
-
100
-
-
0039891959
-
A machine learning approach to coreference resolution of noun phrases
-
W-M. Soon, H-T. Ng, and C-Y. Lim. A machine learning approach to coreference resolution of noun phrases. Computational Linguistics, 27(4):521-544, 2001.
-
(2001)
Computational Linguistics
, vol.27
, Issue.4
, pp. 521-544
-
-
Soon, W.-M.1
Ng, H.-T.2
Lim, C.-Y.3
-
101
-
-
0003192559
-
Multi-document summarization: Methodologies and evaluations
-
ATALA Press, Paris, France
-
G. Stein, A. Bagga, and G. Wise. Multi-document summarization: Methodologies and evaluations. In Proceedings of the 7th Conference on Automatic Natural Language Processing (TALN'00), pages 337-346, ATALA Press, Paris, France, 2000.
-
(2000)
Proceedings of the 7th Conference on Automatic Natural Language Processing (TALN'00)
, pp. 337-346
-
-
Stein, G.1
Bagga, A.2
Wise, G.3
-
102
-
-
0034592743
-
Textual data mining of service center call records
-
ACM Press, New York, NY
-
P. Tan, H. Blau, S. Harp, and R. Goldman. Textual data mining of service center call records. In Proceedings of KDD-2000, pages 417-423, ACM Press, New York, NY, 2000.
-
(2000)
Proceedings of KDD-2000
, pp. 417-423
-
-
Tan, P.1
Blau, H.2
Harp, S.3
Goldman, R.4
-
103
-
-
0036643010
-
The use of bigrams to enhance text categorization
-
C-M. Tan, Y-F. Wang, and C-D. Lee. The use of bigrams to enhance text categorization. Information Processing and Management, 38(4):529-546, 2002.
-
(2002)
Information Processing and Management
, vol.38
, Issue.4
, pp. 529-546
-
-
Tan, C.-M.1
Wang, Y.-F.2
Lee, C.-D.3
-
106
-
-
84891473308
-
-
Gaithersburg, Maryland, November 19-22, 2002, NIST Press, Washington, DC, Co-sponsored by DARPA and ARDA
-
E. Voorhees and L. Buckland, editors. NIST Special Publication 500-251: The Eleventh Text Retrieval Conference (TREC 2002), Gaithersburg, Maryland, November 19-22, 2002, NIST Press, Washington, DC, 2002. Co-sponsored by DARPA and ARDA.
-
(2002)
NIST Special Publication 500-251: The Eleventh Text Retrieval Conference (TREC 2002)
-
-
Voorhees, E.1
Buckland, L.2
-
107
-
-
84989599138
-
The cluster hypothesis revisited
-
ACM Press, New York, NY
-
E. Voorhees. The cluster hypothesis revisited. In Proceedings of SIGIR-85, pages 188-196, ACM Press, New York, NY, 1985.
-
(1985)
Proceedings of SIGIR-85
, pp. 188-196
-
-
Voorhees, E.1
-
108
-
-
33845485865
-
Sentence boundary detection: A comparison of paradigms for improving MT quality
-
ACL, East Stroudsburg, PA
-
D. Walker, D. Clements, M. Darwin, and J. Amtrup. Sentence boundary detection: A comparison of paradigms for improving MT quality. In Proceedings of the Eighth Machine Translation Summit, ACL, East Stroudsburg, PA, 2001.
-
(2001)
Proceedings of the Eighth Machine Translation Summit
-
-
Walker, D.1
Clements, D.2
Darwin, M.3
Amtrup, J.4
-
109
-
-
0242540450
-
A system for real-time competitive market intelligence
-
ACM Press, New York, NY
-
S. Weiss and N. Verma. A system for real-time competitive market intelligence. In Proceedings of SIGKDD-2002, ACM Press, New York, NY, 2002.
-
(2002)
Proceedings of SIGKDD-2002
-
-
Weiss, S.1
Verma, N.2
-
110
-
-
0033366097
-
Maximizing text-mining performance
-
S. Weiss, C. Apté, F. Damerau, et al. Maximizing text-mining performance. IEEE Intelligent Systems, 14(4):63-69, 1999.
-
(1999)
IEEE Intelligent Systems
, vol.14
, Issue.4
, pp. 63-69
-
-
Weiss, S.1
Apté, C.2
Damerau, F.3
-
111
-
-
84974659886
-
Lightweight document clustering
-
Springer-Verlag, New York, NY
-
S. Weiss, B. White, and C. Apté. Lightweight document clustering. In Proceedings of PKDD-2000, pages 665-672, Springer-Verlag, New York, NY, 2000.
-
(2000)
Proceedings of PKDD-2000
, pp. 665-672
-
-
Weiss, S.1
White, B.2
Apté, C.3
-
112
-
-
0033890498
-
Lightweight document matching for help-desk applications
-
S. Weiss, B. White, C. Apté, and F. Damerau. Lightweight document matching for help-desk applications. IEEE Intelligent Systems, 15(2):57-61, 2000.
-
(2000)
IEEE Intelligent Systems
, vol.15
, Issue.2
, pp. 57-61
-
-
Weiss, S.1
White, B.2
Apté, C.3
Damerau, F.4
-
113
-
-
3543147086
-
Recent trends in hierarchic document clustering
-
P. Willett. Recent trends in hierarchic document clustering. Information Processing and Management, 24:577-597, 1988.
-
(1988)
Information Processing and Management
, vol.24
, pp. 577-597
-
-
Willett, P.1
-
114
-
-
0031599183
-
Corpus-based stemming using cooccurrence of word variants
-
J. Xu and B. Croft. Corpus-based stemming using cooccurrence of word variants. ACM Transactions on Information Systems, 16(1):61-81, 1998.
-
(1998)
ACM Transactions on Information Systems
, vol.16
, Issue.1
, pp. 61-81
-
-
Xu, J.1
Croft, B.2
-
115
-
-
0003141935
-
A comparative study of feature selection in text categorization
-
Morgan Kaufmann, San Francisco, CA
-
Y. Yang and J. Pedersen. A comparative study of feature selection in text categorization. In Proceedings of the Fourteenth International Conference on Machine Learning, pages 412-420, Morgan Kaufmann, San Francisco, CA, 1997.
-
(1997)
Proceedings of the Fourteenth International Conference on Machine Learning
, pp. 412-420
-
-
Yang, Y.1
Pedersen, J.2
-
117
-
-
84919495233
-
A robust risk minimization based named entity recognition system
-
ACL, East Stroudsburg, PA
-
T. Zhang and D. Johnson. A robust risk minimization based named entity recognition system. In Proceedings of CoNLL-2003, pages 204-207, ACL, East Stroudsburg, PA, 2003.
-
(2003)
Proceedings of CoNLL-2003
, pp. 204-207
-
-
Zhang, T.1
Johnson, D.2
-
118
-
-
0005004572
-
A probability analysis on the value of unlabeled data for classification problems
-
Morgan Kaufmann, San Francisco, CA
-
T. Zhang and F. Oles. A probability analysis on the value of unlabeled data for classification problems. In Proceedings of ICML-00, pages 1191-1198, Morgan Kaufmann, San Francisco, CA, 2000.
-
(2000)
Proceedings of ICML-00
, pp. 1191-1198
-
-
Zhang, T.1
Oles, F.2
-
119
-
-
0001868572
-
Text categorization based on regularized linear classification methods
-
T. Zhang and F. Oles. Text categorization based on regularized linear classification methods. Information Retrieval, 4:5-31, 2001.
-
(2001)
Information Retrieval
, vol.4
, pp. 5-31
-
-
Zhang, T.1
Oles, F.2
-
120
-
-
0347617360
-
Text chunking based on a generalization of Winnow
-
T. Zhang, F. Damerau, and D. Johnson. Text chunking based on a generalization of Winnow. Journal of Machine Learning Research, 2(5):615-637, 2002.
-
(2002)
Journal of Machine Learning Research
, vol.2
, Issue.5
, pp. 615-637
-
-
Zhang, T.1
Damerau, F.2
Johnson, D.3
-
121
-
-
84891407142
-
Updating an NLP system to fit new domains: An empirical study on the sentence segmentation problem
-
ACL, East Stroudsburg, PA
-
T. Zhang, F. Damerau, and D. Johnson. Updating an NLP system to fit new domains: An empirical study on the sentence segmentation problem. In Proceedings of the Seventh Conference on Natural Language Learning, CoNLL-2003, pages 56-62, ACL, East Stroudsburg, PA, 2003.
-
(2003)
Proceedings of the Seventh Conference on Natural Language Learning, CoNLL-2003
, pp. 56-62
-
-
Zhang, T.1
Damerau, F.2
Johnson, D.3
-
122
-
-
0036158505
-
On the dual formulation of regularized linear systems
-
T. Zhang. On the dual formulation of regularized linear systems. Machine Learning, 46:91-129, 2002.
-
(2002)
Machine Learning
, vol.46
, pp. 91-129
-
-
Zhang, T.1
-
123
-
-
4644257995
-
Statistical behavior and consistency of classification methods based on convex risk minimization
-
With discussion
-
T. Zhang. Statistical behavior and consistency of classification methods based on convex risk minimization. The Annals of Statistics, 32(1):56-134, 2004. With discussion.
-
(2004)
The Annals of Statistics
, vol.32
, Issue.1
, pp. 56-134
-
-
Zhang, T.1