-
2
-
-
84976810280
-
Copy Detection Mechanisms for Digital Documents
-
San Jose, California, USA, May 22-25
-
S. Brin, J. Davis and H. Garcia-Molina, "Copy Detection Mechanisms for Digital Documents", in the ACM SIGMOD International Conference on Management of Data (San Jose, California, USA, May 22-25 1995), 1995, pp. 398-409.
-
(1995)
ACM SIGMOD International Conference on Management of Data
, pp. 398-409
-
-
Brin, S.1
Davis, J.2
Garcia-Molina, H.3
-
3
-
-
77749264172
-
Combined Syntactical Structures and Sequence Alignment Approach to Document Similarity Calculation for Copy Detection
-
M.Sc. thesis, Department of Computer Science, Collage of Science, Sultan Qaboos University, Muscat, Oman
-
A. Al-Tobi, "Combined Syntactical Structures and Sequence Alignment Approach to Document Similarity Calculation for Copy Detection", M.Sc. thesis, Department of Computer Science, Collage of Science, Sultan Qaboos University, Muscat, Oman, 2008.
-
(2008)
-
-
Al-Tobi, A.1
-
4
-
-
84931831899
-
Using copy-detection and text comparison algorithms for cross-referencing multiple editions of literary works
-
Darmstadt, Germany, September 4-9
-
A. Zaslavsky, A. Bia, and K. Monostori, "Using copy-detection and text comparison algorithms for cross-referencing multiple editions of literary works", in Proceedings of the 5th European Conference on Research and Advanced Technology for Digital Libraries (Darmstadt, Germany, September 4-9 2001), 2001, pp 103-114,.
-
(2001)
Proceedings of the 5th European Conference on Research and Advanced Technology for Digital Libraries
, pp. 103-114
-
-
Zaslavsky, A.1
Bia, A.2
Monostori, K.3
-
5
-
-
84892758238
-
CHECK: A document plagiarism detection system
-
San Jose, California, USA, February 28, March 1
-
A. Si, H.V. Leong and R.W.H. Lau, "CHECK: A document plagiarism detection system", in Proceedings of ACM Symposium for Applied Computing, ACM (San Jose, California, USA, February 28 - March 1 1997), 1997, pp. 70-77.
-
(1997)
Proceedings of ACM Symposium for Applied Computing, ACM
, pp. 70-77
-
-
Si, A.1
Leong, H.V.2
Lau, R.W.H.3
-
6
-
-
0013273370
-
SCAM: A Copy Detection Mechanism for Digital Documents
-
Austin, Texas, USA, June 11-13
-
N. Shivakumar and H. Garcia-Molina, "SCAM: A Copy Detection Mechanism for Digital Documents", in Proceedings of 2nd International Conference in Theory and Practice of Digital Libraries (Austin, Texas, USA, June 11-13 1995), 1995.
-
(1995)
Proceedings of 2nd International Conference in Theory and Practice of Digital Libraries
-
-
Shivakumar, N.1
Garcia-Molina, H.2
-
7
-
-
77749237100
-
-
REUTERS, Reuters Corpus (1: English Language, 1996-08-20 to 1997-08-19), Released date: November 2000, NIST, 2000.
-
REUTERS, Reuters Corpus (Volume 1: English Language, 1996-08-20 to 1997-08-19), Released date: November 2000, NIST, 2000.
-
-
-
-
8
-
-
34250698372
-
A Dual-method Model for Copy Detection
-
IEEE Hong Kong Convention and Exhibition Centre, Hong Kong, December 18-22
-
Y. Liu and L. Liang, "A Dual-method Model for Copy Detection", in Proceedings of the IEEE/WIC/ACM international conference on Web Intelligence and Intelligent Agent Technology, IEEE (Hong Kong Convention and Exhibition Centre, Hong Kong, December 18-22 2006), 2006, pp. 634-637.
-
(2006)
Proceedings of the IEEE/WIC/ACM international conference on Web Intelligence and Intelligent Agent Technology
, pp. 634-637
-
-
Liu, Y.1
Liang, L.2
-
9
-
-
32344441912
-
Finding Similar Files in Large Document Repositories
-
Chicago, Illinois, USA, August
-
G. Forman, K. Eshghi and S. Chiocchetti, "Finding Similar Files in Large Document Repositories", in Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Chicago, Illinois, USA, August 2005, pp 394-400.
-
(2005)
Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
, pp. 394-400
-
-
Forman, G.1
Eshghi, K.2
Chiocchetti, S.3
-
10
-
-
62949125921
-
Use of Text Syntactical Structures in Detection of Document Duplicates
-
University of East London, London. UK, November 13-16
-
M. Elhadi and A. Al-Tobi, "Use of Text Syntactical Structures in Detection of Document Duplicates", in Third IEEE International Conference on Digital Information Management (University of East London, London. UK, November 13-16 2008), 2008.
-
(2008)
Third IEEE International Conference on Digital Information Management
-
-
Elhadi, M.1
Al-Tobi, A.2
-
11
-
-
0344756842
-
Modern Information Retrieval: A Brief Overview
-
March
-
A. Singhal, "Modern Information Retrieval: A Brief Overview", IEEE Data Engin. Bulletin, Vol. 24, No. 4, pp. 35-43, March 2001.
-
(2001)
IEEE Data Engin. Bulletin
, vol.24
, Issue.4
, pp. 35-43
-
-
Singhal, A.1
-
12
-
-
0025183708
-
Basic Local Alignment Search Tool
-
October
-
S.F. Altschul, W. Gish, W. Miller, E.W. Myers and D.J. Lipman, "Basic Local Alignment Search Tool", Journal of Molecular Biology, Vol. 215, No. 3, pp. 403-410, October 1990.
-
(1990)
Journal of Molecular Biology
, vol.215
, Issue.3
, pp. 403-410
-
-
Altschul, S.F.1
Gish, W.2
Miller, W.3
Myers, E.W.4
Lipman, D.J.5
-
13
-
-
77749273600
-
-
class notes for CSE 591: Computational Molecular Biology, Depar. of Computer Science & Engineering, Arizona State University, Spring
-
"Local Alignment: Smith-Waterman algorithm", class notes for CSE 591: Computational Molecular Biology, Depar. of Computer Science & Engineering, Arizona State University, Spring 2003.
-
(2003)
Local Alignment: Smith-Waterman algorithm
-
-
-
14
-
-
0010649742
-
-
Second Edition, Oxford University Press Inc, New York, USA
-
A. M. Lesk, Introduction to Bioinformatics, Second Edition, Oxford University Press Inc., New York, USA, 2005.
-
(2005)
Introduction to Bioinformatics
-
-
Lesk, A.M.1
-
15
-
-
1142267351
-
Winnowing: Local Algorithms for Document Fingerprinting
-
San Diego,California, USA, June
-
S. Schleimer, D. S. Wilkerson and A. Aiken, "Winnowing: Local Algorithms for Document Fingerprinting", in Proceedings of the 2003 ACM SIGMOD Inter. Conf. on Management of Data, San Diego,California, USA, June 2003, pp. 76-85.
-
(2003)
Proceedings of the 2003 ACM SIGMOD Inter. Conf. on Management of Data
, pp. 76-85
-
-
Schleimer, S.1
Wilkerson, D.S.2
Aiken, A.3
-
16
-
-
77749273607
-
Institut für Maschinelle Sprachverarbeitung, Universität Stuttgart, Germany
-
Lexicon and Textcorpora Group
-
Lexicon and Textcorpora Group, Institut für Maschinelle Sprachverarbeitung, Universität Stuttgart, Germany, "TreeTagger - a language independent part-of-speech tagger", 2003, http://www.ims.uni- stuttgart.de/projekte/corplex/TreeTagger.
-
(2003)
TreeTagger - a language independent part-of-speech tagger
-
-
-
17
-
-
76649139963
-
A survey of machine learning approaches to analysis of large corpora
-
SProLaC, Lancaster University, UK, March 28, 31
-
X.R. Hu and E. Atwell, "A survey of machine learning approaches to analysis of large corpora", in Proceedings of the Workshop on Shallow Processing of Large Corpora (SProLaC), (Lancaster University, UK, March 28 - 31 2003), 2003, pp. 45-52.
-
(2003)
Proceedings of the Workshop on Shallow Processing of Large Corpora
, pp. 45-52
-
-
Hu, X.R.1
Atwell, E.2
-
18
-
-
0000329809
-
General Methods of Sequence Comparison
-
M.S. Waterman, "General Methods of Sequence Comparison", Bulletin of Math. Biology, Vol. 46, No. 4, pp. 473-500, 1984.
-
(1984)
Bulletin of Math. Biology
, vol.46
, Issue.4
, pp. 473-500
-
-
Waterman, M.S.1
-
19
-
-
77749237095
-
Parts of Speech
-
ELC Courses
-
ELC Courses, English Language Centre, University of Victoria, "Parts of Speech", 1997, http://web2.uvcs.uvic.ca/elc/StudyZone/330/grammar/ parts.htm.
-
(1997)
-
-
-
20
-
-
0002363874
-
Probabilistic Part-of-Speech Tagging Using Decision Trees
-
Manchester, UK, September
-
H. Schmid, "Probabilistic Part-of-Speech Tagging Using Decision Trees", in Proceedings of International Conference on New Methods in Language Processing, (Manchester, UK, September 1994), 1994, pp. 44-49.
-
(1994)
Proceedings of International Conference on New Methods in Language Processing
, pp. 44-49
-
-
Schmid, H.1
-
21
-
-
0042096783
-
First refinments in Part-of-Speech Tagging With an Application To German
-
Kyoto, Japan, March
-
H. Schmid, "First refinments in Part-of-Speech Tagging With an Application To German", in Proceedings of the 14th International Conference on Computational Linguistics, (Kyoto, Japan, March 1995), 1995, pp. 172-176.
-
(1995)
Proceedings of the 14th International Conference on Computational Linguistics
, pp. 172-176
-
-
Schmid, H.1
-
22
-
-
33646683083
-
Algorithmic Detection of Semantic Similarity
-
Chiba, Japan, May 10-14
-
A.G. Maguitman, F. Menczer, H. Roinestad and A. Vespignani, "Algorithmic Detection of Semantic Similarity", in Proceedings of the 14th international conference on World Wide Web, (Chiba, Japan, May 10-14 2005), 2005, pp. 107-116.
-
(2005)
Proceedings of the 14th international conference on World Wide Web
, pp. 107-116
-
-
Maguitman, A.G.1
Menczer, F.2
Roinestad, H.3
Vespignani, A.4
-
23
-
-
33750693384
-
Corpus-based and Knowledge-based Measures of Text Semantic Similarity
-
Boston, Massachusetts, USA, July 16-20
-
R. Mihalcea, C. Corley and C. Strapparava, "Corpus-based and Knowledge-based Measures of Text Semantic Similarity", in Proceedings of The Twenty-First National Conference on Artificial Intelligence and the Eighteenth Innovative Applications of Artificial Intelligence Conference, (Boston, Massachusetts, USA, July 16-20, 2006), 2006.
-
(2006)
Proceedings of The Twenty-First National Conference on Artificial Intelligence and the Eighteenth Innovative Applications of Artificial Intelligence Conference
-
-
Mihalcea, R.1
Corley, C.2
Strapparava, C.3
-
24
-
-
33748374589
-
Comparison of Overlap Detection Techniques
-
Amsterdam, The Netherlands, April 21-24
-
K. Monostori, R. Finkel, A. Zaslavsky, G. Hodasz and M. Pataki, "Comparison of Overlap Detection Techniques", International Conference on Computational Science, (Amsterdam, The Netherlands, April 21-24 2002), 2002, pp 51-60.
-
(2002)
International Conference on Computational Science
, pp. 51-60
-
-
Monostori, K.1
Finkel, R.2
Zaslavsky, A.3
Hodasz, G.4
Pataki, M.5
-
25
-
-
33750236086
-
PPChecker: Plagiarism Pattern Checker in Document Copy Detection
-
Brno, Czech Republic, September 11-15
-
N. Kang, A. Gelbukh and S. Han, "PPChecker: Plagiarism Pattern Checker in Document Copy Detection", in Proceedings of Text, Speech and Dialogue 9th International Conference, (Brno, Czech Republic, September 11-15 2006), 2006, pp 661-667.
-
(2006)
Proceedings of Text, Speech and Dialogue 9th International Conference
, pp. 661-667
-
-
Kang, N.1
Gelbukh, A.2
Han, S.3
-
27
-
-
84957878343
-
Part-of-Speech Tagging Using Progol
-
Prague, Czech Republic, September 17-20
-
J. Cussens, "Part-of-Speech Tagging Using Progol", in Proceedings of the 7th International Workshop (Inductive Logic Programming), (Prague, Czech Republic, September 17-20 1997), 1997, pp. 93-108.
-
(1997)
Proceedings of the 7th International Workshop (Inductive Logic Programming)
, pp. 93-108
-
-
Cussens, J.1
-
28
-
-
76649099372
-
-
University of Ottawa, Accessed: 25th Sep 2008
-
H. MacFadyen, University of Ottawa, "The Parts of Speech", 2007, http://www.arts.uottawa.ca/writcent/hypergrammar/partsp.html, [Accessed: 25th Sep 2008].
-
(2007)
The Parts of Speech
-
-
MacFadyen, H.1
-
29
-
-
77749264173
-
Viterbi algorithm
-
Wikipedia®, Wikimedia Foundation, Inc
-
Wikipedia® , Wikimedia Foundation, Inc., "Viterbi algorithm", 8th Sept 2008, http://en.wikipedia.org/wiki/Viterbi-algorithm..
-
(2008)
8th Sept
-
-
-
30
-
-
0040639071
-
MBT: A Memory-Based Part of Speech Tagger Generator
-
University of Copenhagen, Copenhagen, Denmark, August 5-9
-
W. Daelemans, J. Zavrel, P. Berck and S. Gillis, "MBT: A Memory-Based Part of Speech Tagger Generator", in Proceedings of Fourth Workshop on Very Large Corpora (WVLC), (University of Copenhagen, Copenhagen, Denmark, August 5-9 1996), 1996, pp. 14-27.
-
(1996)
Proceedings of Fourth Workshop on Very Large Corpora (WVLC)
, pp. 14-27
-
-
Daelemans, W.1
Zavrel, J.2
Berck, P.3
Gillis, S.4
-
31
-
-
77749237087
-
-
C. J. van RIJSBERGEN, INFORMATION RETRIEVAL, Department of Computing Science, University of Glasgow, London: Butterworths, 1979.
-
C. J. van RIJSBERGEN, INFORMATION RETRIEVAL, Department of Computing Science, University of Glasgow, London: Butterworths, 1979.
-
-
-
-
33
-
-
26844519620
-
-
Department of Computing Science, Cornell University, Ithaca, N.Y
-
G. Salton, "Automatic Content Analysis in Information Retrieval", Department of Computing Science, Cornell University, Ithaca, N.Y., 1968.
-
(1968)
Automatic Content Analysis in Information Retrieval
-
-
Salton, G.1
-
34
-
-
77749273545
-
-
Cognitive Science in Context Laboratory, Cornell University, New York, U.S
-
Yonghong Mao, Natural Language Processing Module (Part of Speech Tagging and Sentence Parsing), Cognitive Science in Context Laboratory, Cornell University, New York, U.S., 1997.
-
(1997)
Natural Language Processing Module (Part of Speech Tagging and Sentence Parsing)
-
-
Mao, Y.1
-
35
-
-
77749237092
-
diff
-
Wikipedia®, Wikimedia Foundation, Inc
-
Wikipedia® , Wikimedia Foundation, Inc., "diff", 25th Sep 2007, http://en.wikipedia.org/wiki/Diff.
-
(2007)
25th Sep
-
-
-
36
-
-
77749237084
-
-
Algorithmist, GNU Free Documentation License, Longest Common Subsequence, 23rd Oct 2006, http://www.algorithmist.com/index.php/Longest- Common-Subsequence.
-
Algorithmist, GNU Free Documentation License, "Longest Common Subsequence", 23rd Oct 2006, http://www.algorithmist.com/index.php/Longest- Common-Subsequence.
-
-
-
-
38
-
-
77749237088
-
Part-of-speech tagging
-
Wikipedia®, Wikimedia Foundation, Inc
-
Wikipedia® , Wikimedia Foundation, Inc., "Part-of-speech tagging", 8th Sept 2008, http://en.wikipedia.org/wiki/Part-ofspeech- tagging.
-
(2008)
8th Sept
-
-
|