2002, Pages 269-278

Interactive deduplication using active learning

Author keywords

[No Author keywords available]

Indexed keywords

CODING ERRORS; DATA RECORDING; DATABASE SYSTEMS; INTERACTIVE COMPUTER SYSTEMS; WEBSITES

EID: 0242456811     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/775085.775087     Document Type: Conference Paper
Times cited: 539

References (34)
  • 3
    • 84944949595
    • The effect of adding relevance information in a relevance feedback environment
    • C. Buckley, G. Salton, and J. Allan. The effect of adding relevance information in a relevance feedback environment. In Proc. of SIGIR, 292-300, 1994.
    • (1994) Proc. of SIGIR , pp. 292-300
    • Buckley, C.1    Salton, G.2    Allan, J.3
  • 4
    • 27144489164
    • A tutorial on support vector machines for pattern recognition
    • C.J.C. Burges. A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery, 2(2):121-167, 1998.
    • (1998) Data Mining and Knowledge Discovery , vol.2 , Issue.2 , pp. 121-167
    • Burges, C.J.C.1
  • 6
    • 0028424239
    • Improving generalization with active learning
    • D. Cohn, L. Atlas, and R. Ladner. Improving generalization with active learning. Machine Learning, 15(2):201-221, 1994.
    • (1994) Machine Learning , vol.15 , Issue.2 , pp. 201-221
    • Cohn, D.1    Atlas, L.2    Ladner, R.3
  • 7
    • 0000913324
    • Svmtorch: Support vector machines for large-scale regression problems
    • R. Collobert and S. Bengio. Svmtorch: Support vector machines for large-scale regression problems. Journal of Machine Learning Research, 1:143-160, 2001. Software available from "http://www.idiap.ch/learning/SVMTorch.html".
    • (2001) Journal of Machine Learning Research , vol.1 , pp. 143-160
    • Collobert, R.1    Bengio, S.2
  • 8
    • 0031209604
    • Selective sampling using the query by committee algorithm
    • Y. Freund, H.S. Seung, E. Shamir, and N. Tishby. Selective sampling using the query by committee algorithm. Machine Learning, 28(2-3):133-168, 1997.
    • (1997) Machine Learning , vol.28 , Issue.2-3 , pp. 133-168
    • Freund, Y.1    Seung, H.S.2    Shamir, E.3    Tishby, N.4
  • 11
    • 0013331361
    • Real-world data is dirty: Data cleansing and the merge/purge problem
    • M.A. Hernandez and S.J. Stolfo. Real-world data is dirty: Data cleansing and the merge/purge problem. Data Mining and Knowledge Discovery, 2(1):9-37, 1998.
    • (1998) Data Mining and Knowledge Discovery , vol.2 , Issue.1 , pp. 9-37
    • Hernandez, M.A.1    Stolfo, S.J.2
  • 12
    • 0003897956
    • Identifying and merging related bibliographic records
    • Master's thesis, MIT
    • J. Hylton. Identifying and merging related bibliographic records. Master's thesis, MIT, 1996.
    • (1996)
    • Hylton, J.1
  • 15
    • 0030422272
    • Data mining using MLC++: A machine learning library in C++
    • IEEE Computer Society Press
    • R. Kohavi, D. Sommerfield, and J. Dougherty. Data mining using MLC++: A machine learning library in C++. In Tools with Artificial Intelligence, 234-245. IEEE Computer Society Press, available from http://www.sgi.com/tech/mlc/, 1996.
    • (1996) Tools with Artificial Intelligence , pp. 234-245
    • Kohavi, R.1    Sommerfield, D.2    Dougherty, J.3
  • 16
    • 0032640910
    • Digital libraries and autonomous citation indexing
    • S. Lawrence, C.L. Giles, and K. Bollacker. Digital libraries and autonomous citation indexing. IEEE Computer, 32(6):67-71, 1999.
    • (1999) IEEE Computer , vol.32 , Issue.6 , pp. 67-71
    • Lawrence, S.1    Giles, C.L.2    Bollacker, K.3
  • 18
    • 0242529371
    • Cora: Computer science research paper search engine
    • A. McCallum, K. Nigam, J. Reed, J. Rennie, and K. Seymour. Cora: Computer science research paper search engine. http://cora.whizbang.com/, 2000.
    • (2000)
    • McCallum, A.1    Nigam, K.2    Reed, J.3    Rennie, J.4    Seymour, K.5
  • 19
    • 0034592784
    • Efficient clustering of high-dimensional data sets with application to reference matching
    • A. McCallum, K. Nigam, and L.H. Ungar. Efficient clustering of high-dimensional data sets with application to reference matching. In Knowledge Discovery and Data Mining, 169-178, 2000.
    • (2000) Knowledge Discovery and Data Mining , pp. 169-178
    • McCallum, A.1    Nigam, K.2    Ungar, L.H.3
  • 20
    • 0000314722
    • Employing EM in pool-based active learning for text classification
    • In J. W. Shavlik, editor; Madison, US; Morgan Kaufmann Publishers, San Francisco, US
    • A.K. McCallum and K. Nigam. Employing EM in pool-based active learning for text classification. In J. W. Shavlik, editor, Proceedings of ICML-98, 15th International Conference on Machine Learning, 350-358, Madison, US, 1998. Morgan Kaufmann Publishers, San Francisco, US.
    • (1998) Proceedings of ICML-98, 15th International Conference on Machine Learning , pp. 350-358
    • McCallum, A.K.1    Nigam, K.2
  • 23
    • 0345566149
    • A guided tour to approximate string matching
    • G. Navarro. A guided tour to approximate string matching. ACM Computing Surveys, 33(1):31-88, 2001.
    • (2001) ACM Computing Surveys , vol.33 , Issue.1 , pp. 31-88
    • Navarro, G.1
  • 26
  • 27
    • 0007696417
    • Less is more: Active learning with support vector machines
    • Morgan Kaufmann, San Francisco, CA
    • G. Schohn and D. Cohn. Less is more: Active learning with support vector machines. In Proc. 17th International Conf. on Machine Learning, 839-846. Morgan Kaufmann, San Francisco, CA, 2000.
    • (2000) Proc. 17th International Conf. on Machine Learning , pp. 839-846
    • Schohn, G.1    Cohn, D.2
  • 29
    • 0242445793
    • Cleanup and deduplication of an international bibliographic database
    • S. Toney. Cleanup and deduplication of an international bibliographic database. Information Technology and Libraries, 11(1):19-28, 1992.
    • (1992) Information Technology and Libraries , vol.11 , Issue.1 , pp. 19-28
    • Toney, S.1
  • 30
    • 0042868698
    • Support vector machine active learning with applications to text classification
    • Nov.
    • S. Tong and D. Koller. Support vector machine active learning with applications to text classification. Journal of Machine Learning Research, 2:45-66, Nov. 2001.
    • (2001) Journal of Machine Learning Research , vol.2 , pp. 45-66
    • Tong, S.1    Koller, D.2
  • 31
    • 0003363140
    • Matching and record linkage
    • In B. G. C. et al, editor; New York: J. Wiley
    • W.E. Winkler. Matching and record linkage. In B. G. C. et al, editor, Business Survey Methods, pages 355-384. New York: J. Wiley, 1995. Available from http://www.census.gov/
    • (1995) Business Survey Methods , pp. 355-384
    • Winkler, W.E.1
  • 32
    • 0012866045
    • The state of record linkage and current research problems
    • RR99/04
    • W.E. Winkler. The state of record linkage and current research problems. RR99/04, http://www.census.gov/srd/papers/pdf/rr99-04.pdf, 1999.
    • (1999)
    • Winkler, W.E.1
  • 34
    • 0005004572
    • A probability analysis on the value of unlabeled data for classification problems
    • Morgan Kaufmann, San Francisco, CA
    • T. Zhang and F.J. Oles. A probability analysis on the value of unlabeled data for classification problems. In Proc. 17th International Conf. on Machine Learning, pages 1191-1198. Morgan Kaufmann, San Francisco, CA, 2000.
    • (2000) Proc. 17th International Conf. on Machine Learning , pp. 1191-1198
    • Zhang, T.1    Oles, F.J.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.