-
1
-
-
0023041177
-
A bit-string longestcommon-subsequence algorithm
-
DOI: 10.1016/0020-0190(86)90091-8
-
Allison, L. and T.I. Dix, 1986. A bit-string longestcommon-subsequence algorithm. Inform. Process. Lett., 23: 305-310. DOI: 10.1016/0020-0190(86)90091-8
-
(1986)
Inform. Process. Lett.
, vol.23
, pp. 305-310
-
-
Allison, L.1
Dix, T.I.2
-
2
-
-
79251506653
-
Statistical bayesian learning for automatic arabic text categorization
-
DOI: 10.3844/jcssp.2011.39.45
-
Al-Salemi, B. and M.J.A. Aziz, 2011. Statistical bayesian learning for automatic arabic text categorization. J. Comput. Sci., 7: 39-45. DOI: 10.3844/jcssp.2011.39.45
-
(2011)
J. Comput. Sci.
, vol.7
, pp. 39-45
-
-
Al-Salemi, B.1
Aziz, M.J.A.2
-
3
-
-
44649135932
-
A two-step classification approach to unsupervised record linkage
-
Australian Computer Society, Inc. Darlinghurst, Australia
-
Christen, P., 2007. A two-step classification approach to unsupervised record linkage. Proceedings of the 6th Australasian Conference on Data Mining and Analytics (AusDM'07), Australian Computer Society, Inc. Darlinghurst, Australia, pp: 111-119.
-
(2007)
Proceedings of the 6th Australasian Conference on Data Mining and Analytics (AusDM'07)
, pp. 111-119
-
-
Christen, P.1
-
4
-
-
65449139594
-
Automatic record linkage using seeded nearest neighbour and support vector machine classification
-
(KDD'08), ACM New York, NY, USA.,DOI: 10.1145/1401890.1401913
-
Christen, P., 2008. Automatic record linkage using seeded nearest neighbour and support vector machine classification. Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, (KDD'08), ACM New York, NY, USA., pp: 151-159. DOI: 10.1145/1401890.1401913
-
(2008)
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
, pp. 151-159
-
-
Christen, P.1
-
5
-
-
84884417241
-
Preparation of name and address data for record linkage using hidden Markov models
-
DOI: 10.1186/1472-6947-2-9
-
Churches, T., P. Christen, K. Lim and J.X. Zhu, 2002. Preparation of name and address data for record linkage using hidden Markov models. BMC Med. Inform. Decision Mak., 2: 9-9. DOI: 10.1186/1472-6947-2-9
-
(2002)
BMC Med. Inform. Decision Mak.
, vol.2
, pp. 9-19
-
-
Churches, T.1
Christen, P.2
Lim, K.3
Zhu, J.X.4
-
6
-
-
0036203458
-
Tailor: A record linkage toolbox
-
Feb. 26-1 Mar., San Jose, CA, USA., DOI: 10.1109/ICDE.2002.994694
-
Elfeky, M.G., A.K. Elmagarmid and V.S. Verykios, 2002. Tailor: A record linkage toolbox. Proceedings of the 18th International Conference on Data Engineering, Feb. 26-1 Mar., San Jose, CA, USA., pp: 17-28. DOI: 10.1109/ICDE.2002.994694
-
(2002)
Proceedings of the 18th International Conference on Data Engineering
, pp. 17-28
-
-
Elfeky, M.G.1
Elmagarmid, A.K.2
Verykios, V.S.3
-
7
-
-
84947399464
-
A theory for record linkage
-
DOI: 10.2307/2286061
-
Fellegi, I.P. and A.B. Sunter, 1969. A theory for record linkage. J. Am. Stat. Assoc., 64: 1183-1210. DOI: 10.2307/2286061
-
(1969)
J. Am. Stat. Assoc.
, vol.64
, pp. 1183-1210
-
-
Fellegi, I.P.1
Sunter, A.B.2
-
8
-
-
37149056535
-
Decision Models for Record Linkage
-
DOI: 10.1007/11677437_12
-
Gu, L. and R. Baxter, 2006. Decision Models for Record Linkage. Data Min., 3755: 146-160. DOI: 10.1007/11677437_12
-
(2006)
Data Min
, vol.3755
, pp. 146-160
-
-
Gu, L.1
Baxter, R.2
-
9
-
-
0003585297
-
-
2nd Edn., Morgan Kaufmann, USA., ISBN-10: 1558609016
-
Han, J. and M. Kamber, 2006. Data mining: concepts and techniques. 2nd Edn., Morgan Kaufmann, USA., ISBN-10: 1558609016, pp: 800.
-
(2006)
Data mining: concepts and techniques
, pp. 800
-
-
Han, J.1
Kamber, M.2
-
10
-
-
70349826301
-
Creating probabilistic databases from duplicated data
-
DOI: 10.1007/s00778-009-0161-2
-
Hassanzadeh, O. and R.J. Miller, 2009. Creating probabilistic databases from duplicated data. VLDB J., 18: 1141-1166. DOI: 10.1007/s00778-009-0161-2
-
(2009)
VLDB J
, vol.18
, pp. 1141-1166
-
-
Hassanzadeh, O.1
Miller, R.J.2
-
11
-
-
84976856849
-
The merge/purge problem for large databases
-
(ICMD'95), ACM New York, NY, USA., DOI: 10.1145/223784.223807
-
Hernández, M.A. and S. Stolfo, 1995. The merge/purge problem for large databases. Proceedings of the International Conference on Management of Data, (ICMD'95), ACM New York, NY, USA., pp: 127-138. DOI: 10.1145/223784.223807
-
(1995)
Proceedings of the International Conference on Management of Data
, pp. 127-138
-
-
Hernández, M.A.1
Stolfo, S.2
-
12
-
-
0013331361
-
Real-world data is dirty: Data cleansing and the merge/purge problem
-
DOI: 10.1023/A:1009761603038
-
Hernández, M.A. and S.J. Stolfo, 1998. Real-world data is dirty: Data cleansing and the merge/purge problem. Data Min. Know. Discovery, 2: 9-37. DOI: 10.1023/A:1009761603038
-
(1998)
Data Min. Know. Discovery
, vol.2
, pp. 9-37
-
-
Hernández, M.A.1
Stolfo, S.J.2
-
13
-
-
49149122316
-
Semantic text similarity using corpus-based word similarity and string similarity
-
DOI: 10.1145/1376815.1376819
-
Islam, A. and D. Inkpen, 2008. Semantic text similarity using corpus-based word similarity and string similarity. ACM Trans. Knowl. Discov. Data, 2: 1-25. DOI: 10.1145/1376815.1376819
-
(2008)
ACM Trans. Knowl. Discov. Data
, vol.2
, pp. 1-25
-
-
Islam, A.1
Inkpen, D.2
-
14
-
-
84950419860
-
Advances in record-linkage methodology as applied to matching the 1985 census of Tampa
-
DOI: 10.2307/2289924
-
Jaro, M.A., 1989. Advances in record-linkage methodology as applied to matching the 1985 census of Tampa. Florida. J. Am. Stat. Assoc., 84: 414-420. DOI: 10.2307/2289924
-
(1989)
Florida. J. Am. Stat. Assoc.
, vol.84
, pp. 414-420
-
-
Jaro, M.A.1
-
15
-
-
72649095071
-
Frameworks for entity matching: A comparison
-
DOI: 10.1016/j.datak.2009.10.003
-
Kopcke, H. and E. Rahm, 2010. Frameworks for entity matching: A comparison. Data Knowl. Eng., 69: 197-210. DOI: 10.1016/j.datak.2009.10.003
-
(2010)
Data Knowl. Eng.
, vol.69
, pp. 197-210
-
-
Kopcke, H.1
Rahm, E.2
-
16
-
-
77952496732
-
Differential diagnosis knowledge building by using CUC-C4.5 framework
-
DOI: 10.3844/jcssp.2010.180.185
-
Kusrini, S. Hartati, R. Wardoyo and A. Harjoko, 2010. Differential diagnosis knowledge building by using CUC-C4.5 framework. J. Comput. Sci., 6: 180-185. DOI: 10.3844/jcssp.2010.180.185
-
(2010)
J. Comput. Sci.
, vol.6
, pp. 180-185
-
-
Kusrini, S.H.1
Wardoyo, R.2
Harjoko, A.3
-
17
-
-
77954832863
-
A review of nearest neighbor-support vector machines hybrid classification models
-
DOI: 10.3923/jas.2010.1841.1858
-
Lee, L.H., C.H. Wan, T.F. Yong and H.M. Kok, 2011. A review of nearest neighbor-support vector machines hybrid classification models. J. Applied Sci., 10: 1841-1858. DOI: 10.3923/jas.2010.1841.1858
-
(2011)
J. Applied Sci.
, vol.10
, pp. 1841-1858
-
-
Lee, L.H.1
Wan, C.H.2
Yong, T.F.3
Kok, H.M.4
-
18
-
-
77953790411
-
Tournament structure ranking techniques for Bayesian text classification with highly similar categories
-
Lee, L.H., D. Isa, W.O. Choo and W.Y. Chue, 2010. Tournament structure ranking techniques for Bayesian text classification with highly similar categories. J. Applied Sci., 10: 1243-1254
-
(2010)
J. Applied Sci.
, vol.10
, pp. 1243-1254
-
-
Lee, L.H.1
Isa, D.2
Choo, W.O.3
Chue, W.Y.4
-
19
-
-
0034592784
-
Efficient clustering of high-dimensional data sets with application to reference matching
-
(KDD'00), ACM New York, NY, USA., DOI: 10.1145/347090.347123
-
McCallum, A., K. Nigam and L.H. Ungar, 2000. Efficient clustering of high-dimensional data sets with application to reference matching. Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, (KDD'00), ACM New York, NY, USA., pp: 169-178. DOI: 10.1145/347090.347123
-
(2000)
Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
, pp. 169-178
-
-
McCallum, A.1
Nigam, K.2
Ungar, L.H.3
-
20
-
-
44649150596
-
A bayesian networks in intrusion detection systems
-
DOI: 10.3844/jcssp.2007.259.265
-
Mehdi, M., S. Zair, A. Anou and M. Bensebti, 2007. A bayesian networks in intrusion detection systems. J. Comput. Sci., 3: 259-265. DOI: 10.3844/jcssp.2007.259.265
-
(2007)
J. Comput. Sci.
, vol.3
, pp. 259-265
-
-
Mehdi, M.1
Zair, S.2
Anou, A.3
Bensebti, M.4
-
21
-
-
79251504274
-
Dynamic bayesian networks in classification-and-ranking architecture of response generation
-
DOI: 10.3844/jcssp.2011.59.64
-
Mustapha, A., M.N. Sulaiman, R. Mahmod and M.H. Selamat, 2011. Dynamic bayesian networks in classification-and-ranking architecture of response generation. J. Comput. Sci., 7: 59-64. DOI: 10.3844/jcssp.2011.59.64
-
(2011)
J. Comput. Sci.
, vol.7
, pp. 59-64
-
-
Mustapha, A.1
Sulaiman, M.N.2
Mahmod, R.3
Selamat, M.H.4
-
22
-
-
0037867900
-
Two approaches to handling noisy variation in text mining
-
(ICML-2002), Sydney, Australia
-
Nahm, U.Y., M. Bilenko and R.J. Mooney, 2002. Two approaches to handling noisy variation in text mining. Proceedings of the Papers from the 19th International Conference on Machine Learning Workshop on Text Learning, (ICML-2002), Sydney, Australia, pp: 18-27.
-
(2002)
Proceedings of the Papers from the 19th International Conference on Machine Learning Workshop on Text Learning
, pp. 18-27
-
-
Nahm, U.Y.1
Bilenko, M.2
Mooney, R.J.3
-
23
-
-
0001592068
-
Automatic linkage of vital records
-
DOI: 10.1126/science.130.3381.954
-
Newcombe, H.B., J.M. Kennedy, S.J. Axford and A.P. James, 1959. Automatic linkage of vital records. Science, 130: 954-959. DOI: 10.1126/science.130.3381.954
-
(1959)
Science
, vol.130
, pp. 954-959
-
-
Newcombe, H.B.1
Kennedy, J.M.2
Axford, S.J.3
James, A.P.4
-
24
-
-
79952315725
-
A trigram hidden Markov model for metadata extraction from heterogeneous references
-
DOI: 10.1016/j.ins.2011.01.014
-
Ojokoh, B., M. Zhang, J. Tang, 2011. A trigram hidden Markov model for metadata extraction from heterogeneous references. Inform. Sci., 181: 1538-1551. DOI: 10.1016/j.ins.2011.01.014
-
(2011)
Inform. Sci.
, vol.181
, pp. 1538-1551
-
-
Ojokoh, B.1
Zhang, M.2
Tang, J.3
-
25
-
-
80052861770
-
Unbalance quantitative structure activity relationship problem reduction in drug design
-
DOI: 10.3844/jcssp.2009.764.772
-
Pugazhenthi, D. and S.P. Rajagopalan, 2009. Unbalance quantitative structure activity relationship problem reduction in drug design. J. Comput. Sci., 5: 764-772. DOI: 10.3844/jcssp.2009.764.772
-
(2009)
J. Comput. Sci.
, vol.5
, pp. 764-772
-
-
Pugazhenthi, D.1
Rajagopalan, S.P.2
-
26
-
-
0002442571
-
Discovering Rules by Induction from Large Collections of Examples
-
Age, D. Michie, (Ed.). Edinburgh: Edinburgh University Press
-
Quinlan, J.R., 1979. Discovering Rules by Induction from Large Collections of Examples. In: Expert Systems in the Micro-Electronic Age, D. Michie, (Ed.). Edinburgh: Edinburgh University Press, pp: 168-201.
-
(1979)
Expert Systems in the Micro-Electronic
, pp. 168-201
-
-
Quinlan, J.R.1
-
27
-
-
0003500248
-
-
1st Edn., Morgan Kaufmann, USA., ISBN-10: 1558602380
-
Quinlan, J.R., 1992. C4.5: programs for machine learning. 1st Edn., Morgan Kaufmann, USA., ISBN-10: 1558602380, pp: 302.
-
(1992)
C4.5:programs for machine learning
, pp. 302
-
-
Quinlan, J.R.1
-
28
-
-
0242456811
-
Interactive deduplication using active learning
-
(KDD'02), ACM New York, NY, USA., DOI: 10.1145/775047.775087
-
Sarawagi, S. and A. Bhamidipaty, 2002. Interactive deduplication using active learning. Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, (KDD'02), ACM New York, NY, USA., pp: 269-278. DOI: 10.1145/775047.775087
-
(2002)
Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
, pp. 269-278
-
-
Sarawagi, S.1
Bhamidipaty, A.2
-
29
-
-
79956298114
-
Feature-based entity matching: The FBEM model, implementation, evaluation
-
DOI: 10.1007/978-3-642-13094-6_15
-
Stoermer, H., N. Rassadko and N. Vaidya, 2010. Feature-based entity matching: The FBEM model, implementation, evaluation. Advanced Inform. Syst. Eng., 6051: 180-193. DOI: 10.1007/978-3-642-13094-6_15
-
(2010)
Advanced Inform. Syst. Eng.
, vol.6051
, pp. 180-193
-
-
Stoermer, H.1
Rassadko, N.2
Vaidya, N.3
-
30
-
-
1142279457
-
Robust and efficient fuzzy match for online data cleaning
-
(SIGMOD'03 ACM New York, NY, USA., DOI: 10.1145/872757.872796
-
Surajit, C., K. Ganjam, V. Ganti and R. Motwani, 2003. Robust and efficient fuzzy match for online data cleaning. Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, (SIGMOD'03 ACM New York, NY, USA., pp: 313-324. DOI: 10.1145/872757.872796
-
(2003)
Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data
, pp. 313-324
-
-
Surajit, C.1
Ganjam, K.2
Ganti, V.3
Motwani, R.4
-
31
-
-
77956263873
-
Log data approach to acquisition of optimal Bayesian learner model
-
DOI: 10.3844/ajassp.2009.913.921
-
Ting, C.Y. and S. Phon-Amnuaisuk, 2009. Log data approach to acquisition of optimal Bayesian learner model. Am. J. Applied Sci., 6: 913-921. DOI: 10.3844/ajassp.2009.913.921
-
(2009)
Am. J. Applied Sci.
, vol.6
, pp. 913-921
-
-
Ting, C.Y.1
Phon-Amnuaisuk, S.2
-
32
-
-
80052862250
-
Forecasting daily demand in cash supply chains
-
DOI: 10.3844/ajebasp.2010.377.383
-
Wagner, M., 2010. Forecasting daily demand in cash supply chains. Am. J. Econ. Bus. Admin., 2: 377-383. DOI: 10.3844/ajebasp.2010.377.383
-
(2010)
Am. J. Econ. Bus. Admin.
, vol.2
, pp. 377-383
-
-
Wagner, M.1
-
33
-
-
0035786729
-
Automated name authority control
-
(JCDL'01), ACM New York, NY, USA., DOI: 10.1145/379437.379441
-
Warnner, J.W. and E.W. Brown, 2001. Automated name authority control. Proceedings of the 1st ACM/IEEE-CS Joint Conference on Digital Libraries, (JCDL'01), ACM New York, NY, USA., pp: 21-22. DOI: 10.1145/379437.379441
-
(2001)
Proceedings of the 1st ACM/IEEE-CS Joint Conference on Digital Libraries
, pp. 21-22
-
-
Warnner, J.W.1
Brown, E.W.2
|