-
2
-
-
0040748315
-
Automatic text segmentation for extracting structured records
-
V.R. Borkar, K. Deshmukh, and S. Sarawagi. Automatic text segmentation for extracting structured records. In Proc. ACM SIGMOD International Conf. on Management of Data, Santa Barbara, USA, 2001.
-
Proc. ACM SIGMOD International Conf. on Management of Data, Santa Barbara, USA, 2001
-
-
Borkar, V.R.1
Deshmukh, K.2
Sarawagi, S.3
-
3
-
-
84944949595
-
The effect of adding relevance information in a relevance feedback environment
-
C. Buckley, G. Salton, and J. Allan. The effect of adding relevance information in a relevance feedback environment. In Proc. of SIGIR, 292-300, 1994.
-
(1994)
Proc. of SIGIR
, pp. 292-300
-
-
Buckley, C.1
Salton, G.2
Allan, J.3
-
4
-
-
27144489164
-
A tutorial on support vector machines for pattern recognition
-
C.J.C. Burges. A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery, 2(2):121-167, 1998.
-
(1998)
Data Mining and Knowledge Discovery
, vol.2
, Issue.2
, pp. 121-167
-
-
Burges, C.J.C.1
-
5
-
-
0036204730
-
Efficient evaluation of queries with mining predicates
-
S. Chaudhuri, V. Narasayya, and S. Sarawagi. Efficient evaluation of queries with mining predicates. In Proc. of the 18th Int'l Conference on Data Engineering (ICDE), San Jose, USA, April 2002.
-
Proc. of the 18th Int'l Conference on Data Engineering (ICDE), San Jose, USA, April 2002
-
-
Chaudhuri, S.1
Narasayya, V.2
Sarawagi, S.3
-
6
-
-
0028424239
-
Improving generalization with active learning
-
D. Cohn, L. Atlas, and R. Ladner. Improving generalization with active learning. Machine Learning, 15(2):201-221, 1994.
-
(1994)
Machine Learning
, vol.15
, Issue.2
, pp. 201-221
-
-
Cohn, D.1
Atlas, L.2
Ladner, R.3
-
7
-
-
0000913324
-
SVMTorch: Support vector machines for large-scale regression problems
-
R. Collobert and S. Bengio. SVMTorch: Support vector machines for large-scale regression problems. Journal of Machine Learning Research, 1:143-160, 2001. Software available from "http://www.idiap.ch/learning/SVMTorch.html".
-
(2001)
Journal of Machine Learning Research
, vol.1
, pp. 143-160
-
-
Collobert, R.1
Bengio, S.2
-
8
-
-
0031209604
-
Selective sampling using the query by committee algorithm
-
Y. Freund, H.S. Seung, E. Shamir, and N. Tishby. Selective sampling using the query by committee algorithm. Machine Learning, 28(2-3):133-168, 1997.
-
(1997)
Machine Learning
, vol.28
, Issue.2-3
, pp. 133-168
-
-
Freund, Y.1
Seung, H.S.2
Shamir, E.3
Tishby, N.4
-
9
-
-
0344756845
-
Declarative data cleaning: Language, model and algorithms
-
Rome, Italy
-
H. Galhardas, D. Florescu, D. Shasha, E. Simon, and C. Saita. Declarative data cleaning: Language, model and algorithms. In Proc. of the 27th Int'l Conference on Very Large Databases (VLDB), 307-316, Rome, Italy, 2001.
-
(2001)
Proc. of the 27th Int'l Conference on Very Large Databases (VLDB)
, pp. 307-316
-
-
Galhardas, H.1
Florescu, D.2
Shasha, D.3
Simon, E.4
Saita, C.5
-
10
-
-
84944318804
-
Approximate string joins in a database (almost) for free
-
L. Gravano, P.G. Ipeirotis, and H.V. Jagadish. Approximate string joins in a database (almost) for free. In Proc. of the 27th Int'l Conference on Very Large Databases (VLDB), Rome, Italy, 2001.
-
Proc. of the 27th Int'l Conference on Very Large Databases (VLDB), Rome, Italy, 2001
-
-
Gravano, L.1
Ipeirotis, P.G.2
Jagadish, H.V.3
-
11
-
-
0013331361
-
Real-world data is dirty: Data cleansing and the merge/purge problem
-
M.A. Hernandez and S.J. Stolfo. Real-world data is dirty: Data cleansing and the merge/purge problem. Data Mining and Knowledge Discovery, 2(1):9-37, 1998.
-
(1998)
Data Mining and Knowledge Discovery
, vol.2
, Issue.1
, pp. 9-37
-
-
Hernandez, M.A.1
Stolfo, S.J.2
-
12
-
-
0003897956
-
Identifying and merging related bibliographic records
-
Master's thesis, MIT
-
J. Hylton. Identifying and merging related bibliographic records. Master's thesis, MIT, 1996.
-
(1996)
-
-
Hylton, J.1
-
13
-
-
0034592915
-
Active learning using adaptive resampling
-
In R. Ramakrishnan, S. Stolfo, R. Bayardo, and I. Parsa, editors; N. Y., Aug. 20-23; ACM Press
-
V.S. Iyengar, C. Apte, and T. Zhang. Active learning using adaptive resampling. In R. Ramakrishnan, S. Stolfo, R. Bayardo, and I. Parsa, editors, Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-00), 91-98, N. Y., Aug. 20-23 2000. ACM Press.
-
(2000)
Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-00)
, pp. 91-98
-
-
Iyengar, V.S.1
Apte, C.2
Zhang, T.3
-
15
-
-
0030422272
-
Data mining using MLC++: A machine learning library in C++
-
IEEE Computer Society Press
-
R. Kohavi, D. Sommerfield, and J. Dougherty. Data mining using MLC++: A machine learning library in C++. In Tools with Artificial Intelligence, 234-245. IEEE Computer Society Press, available from http://www.sgi.com/tech/mlc/, 1996.
-
(1996)
Tools with Artificial Intelligence
, pp. 234-245
-
-
Kohavi, R.1
Sommerfield, D.2
Dougherty, J.3
-
16
-
-
0032640910
-
Digital libraries and autonomous citation indexing
-
S. Lawrence, C.L. Giles, and K. Bollacker. Digital libraries and autonomous citation indexing. IEEE Computer, 32(6):67-71, 1999.
-
(1999)
IEEE Computer
, vol.32
, Issue.6
, pp. 67-71
-
-
Lawrence, S.1
Giles, C.L.2
Bollacker, K.3
-
17
-
-
0031369631
-
Active learning with committees for text categorization
-
Providence, US; AAAI Press, Menlo Park, US
-
R. Liere and P. Tadepalli. Active learning with committees for text categorization. In Proceedings of AAAI-97, 14th Conference of the American Association for Artificial Intelligence, 591-596, Providence, US, 1997. AAAI Press, Menlo Park, US.
-
(1997)
Proceedings of AAAI-97, 14th Conference of the American Association for Artificial Intelligence
, pp. 591-596
-
-
Liere, R.1
Tadepalli, P.2
-
18
-
-
0242529371
-
Cora: Computer science research paper search engine
-
A. McCallum, K. Nigam, J. Reed, J. Rennie, and K. Seymour. Cora: Computer science research paper search engine. http://cora.whizbang.com/, 2000.
-
(2000)
-
-
McCallum, A.1
Nigam, K.2
Reed, J.3
Rennie, J.4
Seymour, K.5
-
19
-
-
0034592784
-
Efficient clustering of high-dimensional data sets with application to reference matching
-
A. McCallum, K. Nigam, and L.H. Ungar. Efficient clustering of high-dimensional data sets with application to reference matching. In Knowledge Discovery and Data Mining, 169-178, 2000.
-
(2000)
Knowledge Discovery and Data Mining
, pp. 169-178
-
-
McCallum, A.1
Nigam, K.2
Ungar, L.H.3
-
20
-
-
0000314722
-
Employing EM in pool-based active learning for text classification
-
In J. W. Shavlik, editor; Madison, US; Morgan Kaufmann Publishers, San Francisco, US
-
A.K. McCallum and K. Nigam. Employing EM in pool-based active learning for text classification. In J. W. Shavlik, editor, Proceedings of ICML-98, 15th International Conference on Machine Learning, 350-358, Madison, US, 1998. Morgan Kaufmann Publishers, San Francisco, US.
-
(1998)
Proceedings of ICML-98, 15th International Conference on Machine Learning
, pp. 350-358
-
-
McCallum, A.K.1
Nigam, K.2
-
23
-
-
0345566149
-
A guided tour to approximate string matching
-
G. Navarro. A guided tour to approximate string matching. ACM Computing Surveys, 33(1):31-88, 2001.
-
(2001)
ACM Computing Surveys
, vol.33
, Issue.1
, pp. 31-88
-
-
Navarro, G.1
-
26
-
-
0242614048
-
-
S. Sarawagi, editor; December
-
S. Sarawagi, editor. IEEE Data Engineering special issue on Data Cleaning. http://www.research.microsoft.com/research/db/debull/A00dec/issue.htm, December 2000.
-
(2000)
IEEE Data Engineering Special Issue on Data Cleaning
-
-
-
27
-
-
0007696417
-
Less is more: Active learning with support vector machines
-
Morgan Kaufmann, San Francisco, CA
-
G. Schohn and D. Cohn. Less is more: Active learning with support vector machines. In Proc. 17th International Conf. on Machine Learning, 839-846. Morgan Kaufmann, San Francisco, CA, 2000.
-
(2000)
Proc. 17th International Conf. on Machine Learning
, pp. 839-846
-
-
Schohn, G.1
Cohn, D.2
-
29
-
-
0242445793
-
Cleanup and deduplication of an international bibliographic database
-
S. Toney. Cleanup and deduplication of an international bibliographic database. Information Technology and Libraries, 11(1):19-28, 1992.
-
(1992)
Information Technology and Libraries
, vol.11
, Issue.1
, pp. 19-28
-
-
Toney, S.1
-
30
-
-
0042868698
-
Support vector machine active learning with applications to text classification
-
Nov.
-
S. Tong and D. Koller. Support vector machine active learning with applications to text classification. Journal of Machine Learning Research, 2:45-66, Nov. 2001.
-
(2001)
Journal of Machine Learning Research
, vol.2
, pp. 45-66
-
-
Tong, S.1
Koller, D.2
-
31
-
-
0003363140
-
Matching and record linkage
-
In B. G. C. et al, editor; New York: J. Wiley
-
W.E. Winkler. Matching and record linkage. In B. G. C. et al, editor, Business Survey Methods, pages 355-384. New York: J. Wiley, 1995. Available from http://www.census.gov/
-
(1995)
Business Survey Methods
, pp. 355-384
-
-
Winkler, W.E.1
-
32
-
-
0012866045
-
The state of record linkage and current research problems
-
RR99/04
-
W.E. Winkler. The state of record linkage and current research problems. RR99/04, http://www.census.gov/srd/papers/pdf/rr99-04.pdf 1999.
-
(1999)
-
-
Winkler, W.E.1
-
34
-
-
0005004572
-
A probability analysis on the value of unlabeled data for classification problems
-
Morgan Kaufmann, San Francisco, CA
-
T. Zhang and F.J. Oles. A probability analysis on the value of unlabeled data for classification problems. In Proc. 17th International Conf. on Machine Learning, pages 1191-1198. Morgan Kaufmann, San Francisco, CA, 2000.
-
(2000)
Proc. 17th International Conf. on Machine Learning
, pp. 1191-1198
-
-
Zhang, T.1
Oles, F.J.2