-
1
-
-
84857600027
-
Towards linguistically searchable text
-
Bilbao, Basque Country
-
Alcázar, Asier. 2006. Towards Linguistically Searchable Text. In Proceedings of the BIDE 2005, Bilbao, Basque Country.
-
(2006)
Proceedings of the BIDE 2005
-
-
Alcázar, A.1
-
2
-
-
80053399153
-
-
editors. University of the Basque Country
-
Alegria, Iñaki, Mikel L. Forcada, and Kepa Sarasola, editors. 2009. Proceedings of the SEPLN 2009 Workshop on Information Retrieval and Information Extraction for Less Resourced Languages, Donostia, Basque Country. University of the Basque Country.
-
(2009)
Proceedings of the SEPLN 2009 Workshop on Information Retrieval and Information Extraction for Less Resourced Languages, Donostia, Basque Country
-
-
Alegria, I.1
Forcada, M.L.2
Sarasola, K.3
-
3
-
-
84885224337
-
On cross-lingual plagiarism analysis using a statistical model
-
Stein, Stamatatos, and Koppel, editors, Patras, Greece. CEUR-WS.org
-
Barrón-Cedeño, Alberto, Paolo Rosso, David Pinto, and Alfons Juan. 2008. On Cross-lingual Plagiarism Analysis Using a Statistical Model. In Stein, Stamatatos, and Koppel, editors, ECAI 2008 Workshop on Uncovering Plagiarism, Authorship, and Social Software Misuse (PAN 2008), pages 9-13, Patras, Greece. CEUR-WS.org.
-
(2008)
ECAI 2008 Workshop on Uncovering Plagiarism, Authorship, and Social Software Misuse (PAN 2008)
, pp. 9-13
-
-
Barrón-Cedeño, A.1
Rosso, P.2
Pinto, D.3
Juan, A.4
-
4
-
-
78650486144
-
Monolingual text similarity measures: A comparison of models over wikipedia articles revisions
-
Sharma, Verma, and Sangal, editors, Hyderabad, India. Macmillan Publishers
-
Barrón-Cedeño, Alberto, Andreas Eiselt, and Paolo Rosso. 2009. Monolingual Text Similarity Measures: A Comparison of Models over Wikipedia Articles Revisions. In Sharma, Verma, and Sangal, editors, ICON 2009, pages 29-38, Hyderabad, India. Macmillan Publishers.
-
(2009)
ICON 2009
, pp. 29-38
-
-
Barrón-Cedeño, A.1
Eiselt, A.2
Rosso, P.3
-
5
-
-
0031346696
-
On the resemblance and containment of documents
-
IEEE Computer Society
-
Broder, Andrei Z. 1997. On the Resemblance and Containment of Documents. In Compression and Complexity of Sequences (SEQUENCES'97), pages 21-29. IEEE Computer Society.
-
(1997)
Compression and Complexity of Sequences (SEQUENCES'97)
, pp. 21-29
-
-
Broder, A.Z.1
-
6
-
-
85044611587
-
The mathematics of statisticalmachine translation: Parameter estimation
-
Brown, Peter F., Stephen A. Della Pietra, Vincent J. Della Pietra, and Robert L. Mercer. 1993. The Mathematics of StatisticalMachine Translation: Parameter Estimation. Computational Linguistics, 19(2):263-311.
-
(1993)
Computational Linguistics
, vol.19
, Issue.2
, pp. 263-311
-
-
Brown, P.F.1
Della Pietra, S.A.2
Della Pietra, V.J.3
Mercer, R.L.4
-
9
-
-
84977531644
-
Building and annotating a corpus for the study of journalistic text reuse
-
Las Palmas, Spain
-
Clough, Paul, Robert Gaizauskas, and Scott Piao. 2002. Building and Annotating a Corpus for the Study of Journalistic Text Reuse. In Proceedings of the 3rd International Conference on Language Resources and Evaluation (LREC 2002), volume V, pages 1678-1691, Las Palmas, Spain.
-
(2002)
Proceedings of the 3rd International Conference on Language Resources and Evaluation (LREC 2002)
, vol.V
, pp. 1678-1691
-
-
Clough, P.1
Gaizauskas, R.2
Piao, S.3
-
10
-
-
20444470607
-
Automatic cross-language retrieval using latent semantic indexing
-
Stanford University
-
Dumais, Susan T., Todd A. Letsche, Michael L. Littman, and Thomas K. Landauer. 1997. Automatic Cross-Language Retrieval Using Latent Semantic Indexing. In AAAI-97 Spring Symposium Series: Cross-Language Text and Speech Retrieval, pages 24-26. Stanford University.
-
(1997)
AAAI-97 Spring Symposium Series: Cross-Language Text and Speech Retrieval
, pp. 24-26
-
-
Dumais, S.T.1
Letsche, T.A.2
Littman, M.L.3
Landauer, T.K.4
-
12
-
-
33745868242
-
N-gram-based author profiles for authorship attribution
-
Halifax, Canada
-
Keselj, Vlado, Fuchun Peng, Nick Cercone, and Calvin Thomas. 2003. N-gram-based Author Profiles for Authorship Attribution. In Proceedings of the Conference Pacific Association for Computational Linguistics, PACLING'03, pages 255-264, Halifax, Canada.
-
(2003)
Proceedings of the Conference Pacific Association for Computational Linguistics, PACLING'03
, pp. 255-264
-
-
Keselj, V.1
Peng, F.2
Cercone, N.3
Thomas, C.4
-
13
-
-
85110867932
-
Moses: Open source toolkit for statistical machine translation
-
demonstration session, Prague, Czech Republic
-
Koehn, Philipp, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Moran, Richard Zens, Chris Dyer, Ondrej Bojar, Alexandra Constantin, and Evan Herbst. 2007. Moses: Open Source Toolkit for Statistical Machine Translation. In Annual Meeting of the Association for Computational Linguistics (ACL), demonstration session, Prague, Czech Republic.
-
(2007)
Annual Meeting of the Association for Computational Linguistics (ACL)
-
-
Koehn, P.1
Hoang, H.2
Birch, A.3
Callison-Burch, C.4
Federico, M.5
Bertoldi, N.6
Cowan, B.7
Shen, W.8
Moran, C.9
Zens, R.10
Dyer, C.11
Bojar, O.12
Constantin, A.13
Herbst, E.14
-
14
-
-
33748759334
-
Plagiarism - A survey
-
Maurer, Hermann, Frank Kappe, and Bilal Zaka. 2006. Plagiarism - A Survey. Journal of Universal Computer Science, 12(8):1050-1084. (Pubitemid 44413385)
-
(2006)
Journal of Universal Computer Science
, vol.12
, Issue.8
, pp. 1050-1084
-
-
Maurer, H.1
Kappe, F.2
Zaka, B.3
-
15
-
-
3843127500
-
Character n-gram tokenization for European language text retrieval
-
Mcnamee, Paul and James Mayfield. 2004. Character N-Gram Tokenization for European Language Text Retrieval. Information Retrieval, 7(1-2):73-97. (Pubitemid 39046509)
-
(2004)
Information Retrieval
, vol.7
, Issue.1-2
, pp. 73-97
-
-
McNamee, P.1
Mayfield, J.2
-
17
-
-
0042879653
-
A systematic comparison of various statistical alignment models
-
DOI 10.1162/089120103321337421
-
Och, Frank Josef and Hermann Ney. 2003. A Systematic Comparison of Various Statistical Alignment Models. Computational Linguistics, 29(1):19-51. See also http://www.fjoch.com/GIZA++.html. (Pubitemid 37049767)
-
(2003)
Computational Linguistics
, vol.29
, Issue.1
, pp. 19-51
-
-
Och, F.J.1
Ney, H.2
-
18
-
-
84940363657
-
A statistical approach to crosslingual natural language tasks
-
Pinto, David, Jorge Civera, Alberto Barrón-Cedeño, Alfons Juan, and Paolo Rosso. 2009. A Statistical Approach to Crosslingual Natural Language Tasks. Journal of Algorithms, 64(1):51-60.
-
(2009)
Journal of Algorithms
, vol.64
, Issue.1
, pp. 51-60
-
-
Pinto, D.1
Civera, J.2
Barrón-Cedeño, A.3
Juan, A.4
Rosso, P.5
-
19
-
-
41849092199
-
A Wikipedia-based multilingual retrieval model
-
DOI 10.1007/978-3-540-78646-7-51, Advances in Information Retrieval - 30th European Conference on IR Research, ECIR 2008, Proceedings
-
Potthast, Martin, Benno Stein, and Maik Anderka. 2008. A Wikipedia-Based Multilingual Retrieval Model. In Macdonald, Ounis, Plachouras, Ruthven, and White, editors, 30th European Conference on IR Research, ECIR 2008, Glasgow, volume 4956 LNCS of Lecture Notes in Computer Science, pages 522-530, Berlin Heidelberg NewYork. Springer. (Pubitemid 351499100)
-
(2008)
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
, vol.LNCS4956
, pp. 522-530
-
-
Potthast, M.1
Stein, B.2
Anderka, M.3
-
20
-
-
84881310726
-
Overview of the 1st international competition on plagiarism detection
-
Stein, Rosso, Stamatatos, Koppel, and Agirre, editors, San Sebastian, Spain. CEUS-WS.org
-
Potthast, Martin, Benno Stein, Andreas Eiselt, Alberto Barrón-Cedeño, and Paolo Rosso. 2009. Overview of the 1st International Competition on Plagiarism Detection. In Stein, Rosso, Stamatatos, Koppel, and Agirre, editors, SEPLN 2009 Workshop on Uncovering Plagiarism, Authorship, and Social Software Misuse (PAN 09), pages 1-9, San Sebastian, Spain. CEUS-WS.org.
-
(2009)
SEPLN 2009 Workshop on Uncovering Plagiarism, Authorship, and Social Software Misuse (PAN 09)
, pp. 1-9
-
-
Potthast, M.1
Stein, B.2
Eiselt, A.3
Barrón-Cedeño, A.4
Rosso, P.5
-
21
-
-
79952246356
-
Cross-language plagiarism detection
-
Potthast, Martin, Alberto Barrón-Cedeño, Benno Stein, and Paolo Rosso. 2010. Cross-Language Plagiarism Detection. Language Resources and Evaluation, Special Issue on Plagiarism and Authorship Analysis.
-
(2010)
Language Resources and Evaluation, Special Issue on Plagiarism and Authorship Analysis
-
-
Potthast, M.1
Barrón-Cedeño, A.2
Stein, B.3
Rosso, P.4
-
22
-
-
78049371317
-
Automatic identification of document translations in large multilingual document collections
-
Borovets, Bulgaria
-
Pouliquen, Bruno, Ralf Steinberger, and Camelia Ignat. 2003. Automatic Identification of Document Translations in Large Multilingual Document Collections. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP-2003), pages 401-408, Borovets, Bulgaria.
-
(2003)
Proceedings of the International Conference on Recent Advances In Natural Language Processing (RANLP-2003)
, pp. 401-408
-
-
Pouliquen, B.1
Steinberger, R.2
Ignat, C.3
-
25
-
-
36448995739
-
Strategies for retrieving plagiarized documents
-
DOI 10.1145/1277741.1277928, Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07
-
Stein, Benno, Sven Meyer zu Eissen, and Martin Potthast. 2007. Strategies for Retrieving Plagiarized Documents. In Clarke, Fuhr, Kando, Kraaij, and de Vries, editors, Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 825-826, Amsterdam, The Netherlands. ACM. (Pubitemid 350165089)
-
(2007)
Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07
, pp. 825-826
-
-
Stein, B.1
Zu Eissen, S.M.2
Potthast, M.3
-
27
-
-
85037539156
-
The JRC-acquis: A multilingual aligned parallel corpus with 20+ languages
-
Genoa, Italy
-
Steinberger, Ralf, Bruno Pouliquen, Anna Widiger, Camelia Ignat, Tomaz Erjavec, Dan Tufis, and Dániel Varga. 2006. The JRC-Acquis: A multilingual aligned parallel corpus with 20+ languages. In Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006), volume 9, Genoa, Italy.
-
(2006)
Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006)
, vol.9
-
-
Steinberger, R.1
Pouliquen, B.2
Widiger, A.3
Ignat, C.4
Erjavec, T.5
Tufis, D.6
Varga, D.7
-
28
-
-
84891308106
-
SRILM - An extensible language modeling toolkit
-
Denver, Colorado. Wikipedia. 2010a. Basque language. [Online; accessed 5-February-2010]
-
Stolcke, Andreas. 2002. SRILM - An Extensible LanguageModeling toolkit. In Intl. Conference on Spoken Language Processing, Denver, Colorado. Wikipedia. 2010a. Basque language. [Online; accessed 5-February-2010].
-
(2002)
Intl. Conference on Spoken Language Processing
-
-
Stolcke, A.1
-
29
-
-
80053395051
-
-
Wikipedia.| Partido Socialista Europeo | Europako Alderdi Sozialista. [Online; accessed 10-February-2010]
-
Wikipedia. 2010b. Party of European Socialists | Partido Socialista Europeo | Europako Alderdi Sozialista. [Online; accessed 10-February-2010].
-
(2010)
Party of European Socialists
-
-
|