-
1
-
-
84876788506
-
Developing a large semantically annotated corpus
-
Istanbul, Turkey
-
Valerio Basile, Johan Bos, Kilian Evang, and Noortje Venhuizen. 2012. Developing a large semantically annotated corpus. In Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC 2012), pages 3196-3200, Istanbul, Turkey.
-
(2012)
Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC 2012)
, pp. 3196-3200
-
-
Basile, V.1
Bos, J.2
Evang, K.3
Venhuizen, N.4
-
2
-
-
84926297353
-
I testi del web: Una proposta di classificazione sulla base del corpus PAISÀ
-
M. Cerruti, E. Corino, and C. Onesti, editors Carocci, Roma
-
Claudia Borghetti, Sara Castagnoli, and Marco Brunello. 2011. I testi del web: una proposta di classificazione sulla base del corpus PAISÀ. In M. Cerruti, E. Corino, and C. Onesti, editors, Formale e informale. La variazione di registro nella comunicazione elettronica, pages 147-170. Carocci, Roma.
-
(2011)
Formale e Informale. la Variazione di Registro nella Comunicazione Elettronica
, pp. 147-170
-
-
Borghetti, C.1
Castagnoli, S.2
Brunello, M.3
-
3
-
-
84906924739
-
Text segmentation with character-level text embeddings
-
Speech and Language Processing, Atlanta, USA
-
Grzegorz Chrupała. 2013. Text segmentation with character-level text embeddings. In ICML Workshop on Deep Learning for Audio, Speech and Language Processing, Atlanta, USA.
-
(2013)
ICML Workshop on Deep Learning for Audio
-
-
Chrupała, G.1
-
4
-
-
84875542948
-
Tokenization: Returning to a long solved problem - A survey, contrastive experiment, recommendations, and toolkit
-
Jeju Island, Korea. Association for Computational Linguistics
-
Rebecca Dridan and Stephan Oepen. 2012. Tokenization: Returning to a long solved problem - a survey, contrastive experiment, recommendations, and toolkit. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 378-382, Jeju Island, Korea. Association for Computational Linguistics.
-
(2012)
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
, pp. 378-382
-
-
Dridan, R.1
Oepen, S.2
-
5
-
-
26444565569
-
Finding structure in time
-
Jeffrey L. Elman. 1990. Finding structure in time. Cognitive science, 14(2):179-211.
-
(1990)
Cognitive Science
, vol.14
, Issue.2
, pp. 179-211
-
-
Elman, J.L.1
-
6
-
-
0001419757
-
Distributed representations, simple recurrent networks, and grammatical structure
-
Jeffrey L. Elman. 1991. Distributed representations, simple recurrent networks, and grammatical structure. Machine learning, 7(2):195-225.
-
(1991)
Machine Learning
, vol.7
, Issue.2
, pp. 195-225
-
-
Elman, J.L.1
-
7
-
-
84875512745
-
Machine learning for high-quality tokenization - Replicating variable tokenization schemes
-
A. Gelbukh, editor Berlin Heidelberg. Springer-Verlag
-
Murhaf Fares, Stephan Oepen, and Zhang Yi. 2013. Machine learning for high-quality tokenization - replicating variable tokenization schemes. In A. Gelbukh, editor, CICLING 2013, Volume 7816 of Lecture Notes in Computer Science, pages 231-244, Berlin Heidelberg. Springer-Verlag.
-
(2013)
CICLING 2013, Volume 7816 of Lecture Notes in Computer Science
, pp. 231-244
-
-
Fares, M.1
Oepen, S.2
Yi, Z.3
-
8
-
-
9944228797
-
Tokenization
-
Hans van Halteren, editor Kluwer Academic Publishers, Dordrecht
-
Gregory Grefenstette. 1999. Tokenization. In Hans van Halteren, editor, Syntactic Wordclass Tagging, pages 117-133. Kluwer Academic Publishers, Dordrecht.
-
(1999)
Syntactic Wordclass Tagging
, pp. 117-133
-
-
Grefenstette, G.1
-
10
-
-
33845487544
-
Unsupervised multilingual sentence boundary detection
-
Tibor Kiss and Jan Strunk. 2006. Unsupervised multilingual sentence boundary detection. Computational Linguistics, 32(4):485-525.
-
(2006)
Computational Linguistics
, vol.32
, Issue.4
, pp. 485-525
-
-
Kiss, T.1
Strunk, J.2
-
11
-
-
0142192295
-
Conditional random fields: Probabilistic models for segmenting and labeling sequence data
-
John Lafferty, Andrew McCallum, and Fernando Pereira. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of ICML-01, pages 282-289.
-
(2001)
Proceedings of ICML-01
, pp. 282-289
-
-
Lafferty, J.1
McCallum, A.2
Pereira, F.3
-
12
-
-
84859972823
-
Practical very large scale CRFs
-
Uppsala, Sweden, July. Association for Computational Linguistics
-
Thomas Lavergne, Olivier Cappé, and François Yvon. 2010. Practical very large scale CRFs. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pages 504-513, Uppsala, Sweden, July. Association for Computational Linguistics.
-
(2010)
Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
, pp. 504-513
-
-
Lavergne, T.1
Cappé, O.2
Yvon, F.3
-
13
-
-
0039484386
-
Periods, capitalized words, etc
-
Andrei Mikheev. 2002. Periods, capitalized words, etc. Computational Linguistics, 28(3):289-318.
-
(2002)
Computational Linguistics
, vol.28
, Issue.3
, pp. 289-318
-
-
Mikheev, A.1
-
15
-
-
77956331067
-
TwNC: A multifaceted dutch news corpus
-
Roeland Ordelman, Franciska de Jong, Arjan van Hessen, and Hendri Hondorp. 2007. TwNC: a multifaceted Dutch news corpus. ELRA Newsleter, 12(3/4):4-7.
-
(2007)
ELRA Newsleter
, vol.12
, Issue.3-4
, pp. 4-7
-
-
Ordelman, R.1
De Jong, F.2
Van Hessen, A.3
Hondorp, H.4
-
16
-
-
0347138625
-
Adaptive multilingual sentence boundary disambiguation
-
David D. Palmer and Marti A. Hearst. 1997. Adaptive multilingual sentence boundary disambiguation. Computational Linguistics, 23(2):241-267.
-
(1997)
Computational Linguistics
, vol.23
, Issue.2
, pp. 241-267
-
-
Palmer, D.D.1
Hearst, M.A.2
-
17
-
-
84881219500
-
A maximum entropy approach to identifying sentence boundaries
-
Washington, DC, USA. Association for Computational Linguistics
-
Jeffrey C. Reynar and Adwait Ratnaparkhi. 1997. A maximum entropy approach to identifying sentence boundaries. In Proceedings of the Fifth Conference on Applied Natural Language Processing, pages 16-19, Washington, DC, USA. Association for Computational Linguistics.
-
(1997)
Proceedings of the Fifth Conference on Applied Natural Language Processing
, pp. 16-19
-
-
Reynar, J.C.1
Ratnaparkhi, A.2
-
18
-
-
85118952085
-
Some applications of tree-based modelling to speech and language
-
Stroudsburg, PA, USA. Association for Computational Linguistics
-
Michael D. Riley. 1989. Some applications of tree-based modelling to speech and language. In Proceedings of the workshop on Speech and Natural Language, HLT '89, pages 339-352, Stroudsburg, PA, USA. Association for Computational Linguistics.
-
(1989)
Proceedings of the Workshop on Speech and Natural Language, HLT '89
, pp. 339-352
-
-
Riley, M.D.1
-
20
-
-
77954190037
-
Sentence and token splitting based on conditional random fields
-
Melbourne, Australia
-
Katrin Tomanek, Joachim Wermter, and Udo Hahn. 2007. Sentence and token splitting based on conditional random fields. In Proceedings of the 10th Conference of the Pacific Association for Computational Linguistics, pages 49-57, Melbourne, Australia.
-
(2007)
Proceedings of the 10th Conference of the Pacific Association for Computational Linguistics
, pp. 49-57
-
-
Tomanek, K.1
Wermter, J.2
Hahn, U.3
|