-
1
-
-
0032677683
-
An efficient, probabilistically sound algorithm for segmentation and word discovery
-
Brent, Michael R. 1999. An efficient, probabilistically sound algorithm for segmentation and word discovery. Machine Learning, 34(1-3):71-105.
-
(1999)
Machine Learning
, vol.34
, Issue.1-3
, pp. 71-105
-
-
Brent, M.R.1
-
4
-
-
84870546001
-
Improving successor variety for morphological segmentation
-
Çöltekin, Çăgrı. 2010. Improving successor variety for morphological segmentation. LOT Occasional Series, 16:13-28.
-
(2010)
LOT Occasional Series
, vol.16
, pp. 13-28
-
-
Çöltekin, Ç.1
-
5
-
-
34547975610
-
Risks of semi-supervised learning: How unlabelled data can degrade performance of generative classifiers
-
Olivier Chapelle, Bernhard Schölkopf, Alexander Zien, editors, MIT press
-
Cozman, Fabio and Ira Cohen. 2006. Risks of semi-supervised learning: How unlabelled data can degrade performance of generative classifiers. In Olivier Chapelle, Bernhard Schölkopf, Alexander Zien, editors, Semi-supervised learning, pages 57-72. MIT press.
-
(2006)
Semi-supervised learning
, pp. 57-72
-
-
Cozman, F.1
Cohen, I.2
-
6
-
-
1942452771
-
Semi-supervised learning of mixture models
-
Washington, DC
-
Cozman, Fabio Gagliardi, Ira Cohen, and Marcelo Cesar Cirelo. 2003. Semi-supervised learning of mixture models. In Proceedings of the 20th International Conference on Machine Learning (ICML-2003), pages 99-106, Washington, DC.
-
(2003)
Proceedings of the 20th International Conference on Machine Learning (ICML-2003)
, pp. 99-106
-
-
Cozman, F.G.1
Cohen, I.2
Cirelo, M.C.3
-
7
-
-
37849048345
-
Morph-based speech recognition and modeling of out-of-vocabulary words across languages
-
Creutz, Mathias, Teemu Hirsimäki, Mikko Kurimo, Antti Puurula, Janne Pylkkönen, Vesa Siivola, Matti Varjokallio, Ebru Arisoy, Murat Saraçlar, and Andreas Stolcke. 2007. Morph-based speech recognition and modeling of out-of-vocabulary words across languages. ACM Transactions on Speech and Language Processing, 5(1):3:1-3:29.
-
(2007)
ACM Transactions on Speech and Language Processing
, vol.5
, Issue.1
, pp. 3:1-3:29
-
-
Creutz, M.1
Hirsimäki, T.2
Kurimo, M.3
Puurula, A.4
Pylkkönen, J.5
Siivola, V.6
Varjokallio, M.7
Arisoy, E.8
Saraçlar, M.9
Stolcke, A.10
-
10
-
-
33846987588
-
Unsupervised models for morpheme segmentation and morphology learning
-
Creutz, Mathias and Krista Lagus. 2007. Unsupervised models for morpheme segmentation and morphology learning. ACM Transactions on Speech and Language Processing, 4(1):1-27.
-
(2007)
ACM Transactions on Speech and Language Processing
, vol.4
, Issue.1
, pp. 1-27
-
-
Creutz, M.1
Lagus, K.2
-
11
-
-
77958047314
-
Minimum Bayes risk combination of translation hypotheses from alternative morphological decompositions
-
Boulder, CO
-
de Gispert, Adrià, Sami Virpioja, Mikko Kurimo, and William Byrne. 2009. Minimum Bayes risk combination of translation hypotheses from alternative morphological decompositions. In Proceedings of Human Language Technologies: The 2009. Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL HLT 2009), pages 73-76, Boulder, CO.
-
(2009)
Proceedings of Human Language Technologies: The 2009. Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL HLT 2009)
, pp. 73-76
-
-
de Gispert, A.1
Virpioja, S.2
Kurimo, M.3
Byrne, W.4
-
12
-
-
85015565735
-
Sequence segmentation by enumeration: An exploration
-
Eger, Steffen. 2013. Sequence segmentation by enumeration: An exploration. The Prague Bulletin of Mathematical Linguistics, 100:113-131.
-
(2013)
The Prague Bulletin of Mathematical Linguistics
, vol.100
, pp. 113-131
-
-
Eger, S.1
-
16
-
-
84959912577
-
Morfessor FlatCat: An HMM-based method for unsupervised and semi-supervised learning of morphology
-
Dublin
-
Grönroos, Stig-Arne, Sami Virpioja, Peter Smit, and Mikko Kurimo. 2014. Morfessor FlatCat: An HMM-based method for unsupervised and semi-supervised learning of morphology. In Proceedings of the 25th International Conference on Computational Linguistics (COLING 2014), pages 1177-1185, Dublin.
-
(2014)
Proceedings of the 25th International Conference on Computational Linguistics (COLING 2014)
, pp. 1177-1185
-
-
Grönroos, S.-A.1
Virpioja, S.2
Smit, P.3
Kurimo, M.4
-
17
-
-
79958736799
-
Unsupervised learning of morphology
-
Hammarström, Harald and Lars Borin. 2011. Unsupervised learning of morphology. Computational Linguistics, 37(2):309-350.
-
(2011)
Computational Linguistics
, vol.37
, Issue.2
, pp. 309-350
-
-
Hammarström, H.1
Borin, L.2
-
18
-
-
0001074490
-
From phoneme to morpheme
-
Harris, Zellig S. 1955. From phoneme to morpheme. Language, 31(2):190-222.
-
(1955)
Language
, vol.31
, Issue.2
, pp. 190-222
-
-
Harris, Z.S.1
-
19
-
-
33746524944
-
Unlimited vocabulary speech recognition with morph language models applied to Finnish
-
Hirsimäki, Teemu, Mathias Creutz, Vesa Siivola, Mikko Kurimo, Sami Virpioja, and Janne Pylkkönen. 2006. Unlimited vocabulary speech recognition with morph language models applied to Finnish. Computer Speech and Language, 20(4):515-541.
-
(2006)
Computer Speech and Language
, vol.20
, Issue.4
, pp. 515-541
-
-
Hirsimäki, T.1
Creutz, M.2
Siivola, V.3
Kurimo, M.4
Virpioja, S.5
Pylkkönen, J.6
-
20
-
-
84860537772
-
Semi-supervised conditional random fields for improved sequence segmentation and labeling
-
Sidney
-
Jiao, Feng, Shaojun Wang, Chi-Hoon Lee, Russell Greiner, and Dale Schuurmans. 2006. Semi-supervised conditional random fields for improved sequence segmentation and labeling. In Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics (COLING/ACL 2006), pages 209-216, Sidney.
-
(2006)
Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics (COLING/ACL 2006)
, pp. 209-216
-
-
Jiao, F.1
Wang, S.2
Lee, C.-H.3
Greiner, R.4
Schuurmans, D.5
-
25
-
-
67349214856
-
Adaptor grammars: A framework for specifying compositional nonparametric Bayesian models
-
Vancouver
-
Johnson, Mark, Thomas L. Griffiths, and Sharon Goldwater. 2006. Adaptor grammars: A framework for specifying compositional nonparametric Bayesian models. In Advances in Neural Information Processing Systems, pages 641-648, Vancouver.
-
(2006)
Advances in Neural Information Processing Systems
, pp. 641-648
-
-
Johnson, M.1
Griffiths, T.L.2
Goldwater, S.3
-
26
-
-
84863393622
-
Bayesian inference for PCFGs via Markov chain Monte Carlo
-
Rochester, NY
-
Johnson, Mark, Thomas L. Griffiths, and Sharon Goldwater. 2007. Bayesian inference for PCFGs via Markov chain Monte Carlo. In Proceedings of Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics (NAACL HLT 2007), pages 139-146, Rochester, NY.
-
(2007)
Proceedings of Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics (NAACL HLT 2007)
, pp. 139-146
-
-
Johnson, M.1
Griffiths, T.L.2
Goldwater, S.3
-
27
-
-
78049265455
-
The Estonian reference corpus: Its composition and morphology-aware user interface
-
Riga
-
Kaalep, Heiki-Jaan, Kadri Muischnek, Kristel Uiboaed, and Kaarel Veskis. 2010. The Estonian reference corpus: Its composition and morphology-aware user interface. In Proceedings of the 2010. Conference on Human Language Technologies-The Baltic Perspective: Proceedings of the Fourth International Conference Baltic (HLT 2010), pages 143-146, Riga.
-
(2010)
Proceedings of the 2010. Conference on Human Language Technologies-The Baltic Perspective: Proceedings of the Fourth International Conference Baltic (HLT 2010)
, pp. 143-146
-
-
Kaalep, H.-J.1
Muischnek, K.2
Uiboaed, K.3
Veskis, K.4
-
31
-
-
84922022679
-
Overview and results of Morpho Challenge 2009
-
Corfu
-
Kurimo, Mikko, Sami Virpioja, Ville Turunen, Graeme W. Blackwood, and William Byrne. 2009. Overview and results of Morpho Challenge 2009. In Working Notes for the CLEF 2009. Workshop, pages 578-597, Corfu.
-
(2009)
Working Notes for the CLEF 2009. Workshop
, pp. 578-597
-
-
Kurimo, M.1
Virpioja, S.2
Turunen, V.3
Blackwood, G.W.4
Byrne, W.5
-
32
-
-
0142192295
-
Conditional random fields: Probabilistic models for segmenting and labeling sequence data
-
Williamstown, MA
-
Lafferty, John, Andrew McCallum, and Fernando C. N. Pereira. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of the 18th International Conference on Machine Learning (ICML 2001), pages 282-289, Williamstown, MA.
-
(2001)
Proceedings of the 18th International Conference on Machine Learning (ICML 2001)
, pp. 282-289
-
-
Lafferty, J.1
McCallum, A.2
Fernando Pereira, C.N.3
-
33
-
-
84862301175
-
Modeling syntactic context improves morphological segmentation
-
Portland, OR
-
Lee, Yoong Keok, Aria Haghighi, and Regina Barzilay. 2011. Modeling syntactic context improves morphological segmentation. In Proceedings of the Fifteenth Conference on Computational Natural Language Learning (CoNLL 2011), pages 1-9, Portland, OR.
-
(2011)
Proceedings of the Fifteenth Conference on Computational Natural Language Learning (CoNLL 2011)
, pp. 1-9
-
-
Lee, Y.K.1
Haghighi, A.2
Barzilay, R.3
-
38
-
-
84926052215
-
Morphological segmentation for keyword spotting
-
Doha
-
Narasimhan, Karthik, Damianos Karakos, Richard Schwartz, Stavros Tsakalidis, and Regina Barzilay. 2014. Morphological segmentation for keyword spotting. In Proceedings of the 2014. Conference on Empirical Methods in Natural Language Processing (EMNLP 2014), pages 880-885, Doha.
-
(2014)
Proceedings of the 2014. Conference on Empirical Methods in Natural Language Processing (EMNLP 2014)
, pp. 880-885
-
-
Narasimhan, K.1
Karakos, D.2
Schwartz, R.3
Tsakalidis, S.4
Barzilay, R.5
-
40
-
-
0033886806
-
Text classification from labeled and unlabeled documents using EM
-
Nigam, Kamal, Andrew Kachites McCallum, Sebastian Thrun, and Tom Mitchell. 2000. Text classification from labeled and unlabeled documents using EM. Machine Learning, 39(2-3):103-134.
-
(2000)
Machine Learning
, vol.39
, Issue.2-3
, pp. 103-134
-
-
Nigam, K.1
Andrew Kachites McCallum, S.T.2
Mitchell, T.3
-
42
-
-
84858427034
-
Unsupervised morphological segmentation with log-linear models
-
Boulder, CO
-
Poon, Hoifung, Colin Cherry, and Kristina Toutanova. 2009. Unsupervised morphological segmentation with log-linear models. In Proceedings of Human Language Technologies: The 2009. Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL HLT 2009), pages 209-217, Boulder, CO.
-
(2009)
Proceedings of Human Language Technologies: The 2009. Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL HLT 2009)
, pp. 209-217
-
-
Poon, H.1
Cherry, C.2
Toutanova, K.3
-
43
-
-
84945270809
-
Co-learning of word representations and morpheme representations
-
Dublin
-
Qiu, Siyu, Qing Cui, Jiang Bian, Bin Gao, and Tie-Yan Liu. 2014. Co-learning of word representations and morpheme representations. In Proceedings of the 25th International Conference on Computational Linguistics (COLING 2014), pages 141-150, Dublin.
-
(2014)
Proceedings of the 25th International Conference on Computational Linguistics (COLING 2014)
, pp. 141-150
-
-
Qiu, S.1
Cui, Q.2
Bian, J.3
Gao, B.4
Liu, T.-Y.5
-
45
-
-
85072755546
-
Supervised morphological segmentation in a low-resource learning setting using conditional random fields
-
Sofia
-
Ruokolainen, Teemu, Oskar Kohonen, Sami Virpioja, and Mikko Kurimo. 2013. Supervised morphological segmentation in a low-resource learning setting using conditional random fields. In Proceedings of the 17th Conference on Computational Natural Language Learning (CoNLL 2013), pages 29-37, Sofia.
-
(2013)
Proceedings of the 17th Conference on Computational Natural Language Learning (CoNLL 2013)
, pp. 29-37
-
-
Ruokolainen, T.1
Kohonen, O.2
Virpioja, S.3
Kurimo, M.4
-
46
-
-
85122039144
-
Painless semi-supervised morphological segmentation using conditional random fields
-
Gothenburg
-
Ruokolainen, Teemu, Oskar Kohonen, Sami Virpioja, and Mikko Kurimo. 2014. Painless semi-supervised morphological segmentation using conditional random fields. In Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2014), pages 84-89. Gothenburg.
-
(2014)
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2014)
, pp. 84-89
-
-
Ruokolainen, T.1
Kohonen, O.2
Virpioja, S.3
Kurimo, M.4
-
49
-
-
84937825475
-
Morfessor 2.0: Toolkit for statistical morphological segmentation
-
Gothenburg
-
Smit, Peter, Sami Virpioja, Stig-Arne Grönroos, and Mikko Kurimo. 2014. Morfessor 2.0: Toolkit for statistical morphological segmentation. In Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2014), pages 21-24, Gothenburg.
-
(2014)
Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2014)
, pp. 21-24
-
-
Smit, P.1
Virpioja, S.2
Grönroos, S.-A.3
Kurimo, M.4
-
51
-
-
84878178394
-
Unsupervised morphology rivals supervised morphology for Arabic MT
-
Jeju Island
-
Stallard, David, Jacob Devlin, Michael Kayser, Yoong Keok Lee, and Regina Barzilay. 2012. Unsupervised morphology rivals supervised morphology for Arabic MT. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL 2012), pages 322-327, Jeju Island.
-
(2012)
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL 2012)
, pp. 322-327
-
-
Stallard, D.1
Devlin, J.2
Kayser, M.3
Lee, Y.K.4
Barzilay, R.5
-
54
-
-
80455130027
-
Speech retrieval from unsegmented Finnish audio using statistical morphemelike units for segmentation, recognition, and retrieval
-
Turunen, Ville and Mikko Kurimo. 2011. Speech retrieval from unsegmented Finnish audio using statistical morphemelike units for segmentation, recognition, and retrieval. ACM Transactions on Speech and Language Processing, 8(1):1:1-1:25.
-
(2011)
ACM Transactions on Speech and Language Processing
, vol.8
, Issue.1
, pp. 1:1-1:25
-
-
Turunen, V.1
Kurimo, M.2
-
57
-
-
84907015223
-
-
Report 25/2013 in Aalto University publication series SCIENCE + TECHNOLOGY, Department of Signal Processing and Acoustics, Aalto University, Helsinki, Finland
-
Virpioja, Sami, Peter Smit, Stig-Arne Grönroos, and Mikko Kurimo. 2013. Morfessor 2.0: Python implementation and extensions for Morfessor Baseline. Report 25/2013 in Aalto University publication series SCIENCE + TECHNOLOGY, Department of Signal Processing and Acoustics, Aalto University, Helsinki, Finland.
-
(2013)
Morfessor 2.0: Python implementation and extensions for Morfessor Baseline
-
-
Virpioja, S.1
Smit, P.2
Grönroos, S.-A.3
Kurimo, M.4
-
58
-
-
84861519211
-
Empirical comparison of evaluation methods for unsupervised learning of morphology
-
Virpioja, Sami, Ville Turunen, Sebastian Spiegler, Oskar Kohonen, and Mikko Kurimo. 2011. Empirical comparison of evaluation methods for unsupervised learning of morphology. Traitement Automatique des Langues, 52(2):45-90.
-
(2011)
Traitement Automatique des Langues
, vol.52
, Issue.2
, pp. 45-90
-
-
Virpioja, S.1
Turunen, V.2
Spiegler, S.3
Kohonen, O.4
Kurimo, M.5
-
59
-
-
84863351869
-
A rate distortion approach for semi-supervised conditional random fields
-
Vancouver
-
Wang, Yang, Gholamreza Haffari, Shaojun Wang, and Greg Mori. 2009. A rate distortion approach for semi-supervised conditional random fields. In Advances in Neural Information Processing Systems (NIPS), pages 2008-2016, Vancouver.
-
(2009)
Advances in Neural Information Processing Systems (NIPS)
, pp. 2008-2016
-
-
Wang, Y.1
Haffari, G.2
Wang, S.3
Mori, G.4
-
60
-
-
85123581800
-
Improving Chinese word segmentation and POS tagging with semi-supervised methods using large auto-analyzed data
-
Chiang Mai
-
Wang, Yiou, Yoshimasa Tsuruoka Jun'ichi Kazama, Yoshimasa Tsuruoka, Wenliang Chen, Yujie Zhang, and Kentaro Torisawa. 2011. Improving Chinese word segmentation and POS tagging with semi-supervised methods using large auto-analyzed data. In Proceedings of the 5th International Joint Conference on Natural Language Processing, pages 309-317, Chiang Mai.
-
(2011)
Proceedings of the 5th International Joint Conference on Natural Language Processing
, pp. 309-317
-
-
Wang, Y.1
Yoshimasa Tsuruoka Jun'ichi Kazama, Y.T.2
Chen, W.3
Zhang, Y.4
Torisawa, K.5
|