SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Meeting of the Association for Computational Linguistics

Volumn 2010-July, Issue , 2010, Pages 384-394

Word representations: A simple and general method for semi-supervised learning

(3) Turian, Joseph a Ratinov, Lev b Bengio, Yoshua a

a UNIVERSITÉ DE MONTRÉAL (Canada)

b University of Illinois at Urbana Champaign ^* (United States)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL LINGUISTICS; EMBEDDINGS; NATURAL LANGUAGE PROCESSING SYSTEMS;

EMBEDDINGS; GENERAL METHOD; NLP SYSTEMS; SEMI-SUPERVISED LEARNING; SIMPLE METHOD; SIMPLE++; STATE OF THE ART; WORD REPRESENTATIONS;

SUPERVISED LEARNING;

EID: 85118455913 PISSN: 0736587X EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (279)

References (52)

1
- 84859921107
- A high-performance semi-supervised learning method for text chunking
- Ando, R., & Zhang, T. (2005). A high-performance semi-supervised learning method for text chunking. ACL.
- (2005) ACL
- Ando, R.¹ Zhang, T.²

2
- 79959407847
- Neural net language models
- Bengio, Y. (2008). Neural net language models. Scholarpedia, 3, 3881.
- (2008) Scholarpedia , vol.3 , pp. 3881
- Bengio, Y.¹

3
- 84899005563
- A neural probabilistic language model
- Bengio, Y., Ducharme, R., & Vincent, P. (2001). A neural probabilistic language model. NIPS.
- (2001) NIPS
- Bengio, Y.¹ Ducharme, R.² Vincent, P.³

4
- 0142166851
- A neural probabilistic language model
- Bengio, Y., Ducharme, R., Vincent, P., & Jauvin, C. (2003). A neural probabilistic language model. Journal of Machine Learning Research, 3, 1137-1155.
- (2003) Journal of Machine Learning Research , vol.3 , pp. 1137-1155
- Bengio, Y.¹ Ducharme, R.² Vincent, P.³ Jauvin, C.⁴

5
- 71149116544
- Curriculum learning
- Bengio, Y., Louradour, J., Collobert, R., & Weston, J. (2009). Curriculum learning. ICML.
- (2009) ICML
- Bengio, Y.¹ Louradour, J.² Collobert, R.³ Weston, J.⁴

6
- 10944221006
- Quick training of probabilistic neural nets by importance sampling
- Bengio, Y., & Sénécal, J.-S. (2003). Quick training of probabilistic neural nets by importance sampling. AISTATS.
- (2003) AISTATS
- Bengio, Y.¹ Sénécal, J.-S.²

7
- 0141607824
- Latent dirichlet allocation
- Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent dirichlet allocation. Journal of Machine Learning Research, 3, 993-1022.
- (2003) Journal of Machine Learning Research , vol.3 , pp. 993-1022
- Blei, D. M.¹ Ng, A. Y.² Jordan, M. I.³

8
- 85022919385
- Class-based n-gram models of natural language
- Brown, P. F., deSouza, P. V., Mercer, R. L., Pietra, V. J. D., & Lai, J. C. (1992). Class-based n-gram models of natural language. Computational Linguistics, 18, 467-479.
- (1992) Computational Linguistics , vol.18 , pp. 467-479
- Brown, P. F.¹ deSouza, P. V.² Mercer, R. L.³ Pietra, V. J. D.⁴ Lai, J. C.⁵

9
- 84866875199
- Improving generative statistical parsing with semi-supervised word clustering
- Candito, M., & Crabbé, B. (2009). Improving generative statistical parsing with semi-supervised word clustering. IWPT (pp. 138-141).
- (2009) IWPT , pp. 138-141
- Candito, M.¹ Crabbé, B.²

10
- 56449095373
- A unified architecture for natural language processing: Deep neural networks with multitask learning
- Collobert, R., & Weston, J. (2008). A unified architecture for natural language processing: Deep neural networks with multitask learning. ICML.
- (2008) ICML
- Collobert, R.¹ Weston, J.²

11
- 80053402648
- Semi-supervised semantic role labeling using the Latent Words Language Model
- Deschacht, K., & Moens, M.-F. (2009). Semi-supervised semantic role labeling using the Latent Words Language Model. EMNLP (pp. 21-29).
- (2009) EMNLP , pp. 21-29
- Deschacht, K.¹ Moens, M.-F.²

12
- 85031726785
- Using latent semantic analysis to improve access to textual information
- ACM
- Dumais, S. T., Furnas, G. W., Landauer, T. K., Deerwester, S., & Harshman, R. (1988). Using latent semantic analysis to improve access to textual information. SIGCHI Conference on Human Factors in Computing Systems (pp. 281-285). ACM.
- (1988) SIGCHI Conference on Human Factors in Computing Systems , pp. 281-285
- Dumais, S. T.¹ Furnas, G. W.² Landauer, T. K.³ Deerwester, S.⁴ Harshman, R.⁵

13
- 0027636611
- Learning and development in neural networks: The importance of starting small
- Elman, J. L. (1993). Learning and development in neural networks: The importance of starting small. Cognition, 48, 781-799.
- (1993) Cognition , vol.48 , pp. 781-799
- Elman, J. L.¹

14
- 84874625722
- Enhancing unlexicalized parsing performance using a wide coverage lexicon, fuzzy tag-set mapping, and EM-HMM-based lexical probabilities
- Goldberg, Y., Tsarfaty, R., Adler, M., & Elhadad, M. (2009). Enhancing unlexicalized parsing performance using a wide coverage lexicon, fuzzy tag-set mapping, and EM-HMM-based lexical probabilities. EACL.
- (2009) EACL
- Goldberg, Y.¹ Tsarfaty, R.² Adler, M.³ Elhadad, M.⁴

15
- 0345331815
- Self-organizing maps of words for natural language processing applications
- Honkela, T. (1997). Self-organizing maps of words for natural language processing applications. Proceedings of the International ICSC Symposium on Soft Computing.
- (1997) Proceedings of the International ICSC Symposium on Soft Computing
- Honkela, T.¹

16
- 0002646713
- Contextual relations of words in grimm tales, analyzed by self-organizing map
- ICANN
- Honkela, T., Pulkki, V., & Kohonen, T. (1995). Contextual relations of words in grimm tales, analyzed by self-organizing map. ICANN.
- (1995)
- Honkela, T.¹ Pulkki, V.² Kohonen, T.³

17
- 80053028502
- Distributional representations for handling sparsity in supervised sequence labeling
- Huang, F., & Yates, A. (2009). Distributional representations for handling sparsity in supervised sequence labeling. ACL.
- (2009) ACL
- Huang, F.¹ Yates, A.²

18
- 0031625017
- Dimensionality reduction by random mapping: Fast similarity computation for clustering
- Kaski, S. (1998). Dimensionality reduction by random mapping: Fast similarity computation for clustering. IJCNN (pp. 413-418).
- (1998) IJCNN , pp. 413-418
- Kaski, S.¹

19
- 80053551637
- Simple semi-supervised dependency parsing
- Koo, T., Carreras, X., & Collins, M. (2008). Simple semi-supervised dependency parsing. ACL (pp. 595-603).
- (2008) ACL , pp. 595-603
- Koo, T.¹ Carreras, X.² Collins, M.³

20
- 80053344402
- An effective two-stage model for exploiting non-local dependencies in named entity recognition
- Krishnan, V., & Manning, C. D. (2006). An effective two-stage model for exploiting non-local dependencies in named entity recognition. COLING-ACL.
- (2006) COLING-ACL
- Krishnan, V.¹ Manning, C. D.²

21
- 80053431219
- An introduction to latent semantic analysis
- Landauer, T. K., Foltz, P. W., & Laham, D. (1998). An introduction to latent semantic analysis. Discourse Processes, 259-284.
- (1998) Discourse Processes , pp. 259-284
- Landauer, T. K.¹ Foltz, P. W.² Laham, D.³

22
- 34547991287
- Semi-supervised sequence modeling with syntactic topic models
- Li, W., & McCallum, A. (2005). Semi-supervised sequence modeling with syntactic topic models. AAAI.
- (2005) AAAI
- Li, W.¹ McCallum, A.²

23
- 69849092237
- Master's thesis, Massachusetts Institute of Technology
- Liang, P. (2005). Semi-supervised learning for natural language. Master's thesis, Massachusetts Institute of Technology.
- (2005) Semi-supervised learning for natural language
- Liang, P.¹

24
- 85185398851
- Phrase clustering for discriminative learning
- Lin, D., & Wu, X. (2009). Phrase clustering for discriminative learning. ACL-IJCNLP (pp. 1030-1038).
- (2009) ACL-IJCNLP , pp. 1030-1038
- Lin, D.¹ Wu, X.²

25
- 0000458784
- Producing highdimensional semantic spaces from lexical co-occurrence
- Lund, K., & Burgess, C. (1996). Producing highdimensional semantic spaces from lexical co-occurrence. Behavior Research Methods, Instrumentation, and Computers, 28, 203-208.
- (1996) Behavior Research Methods, Instrumentation, and Computers , vol.28 , pp. 203-208
- Lund, K.¹ Burgess, C.²

26
- 0001860276
- Semantic and associative priming in highdimensional semantic space
- Lund, K., Burgess, C., & Atchley, R. A. (1995). Semantic and associative priming in highdimensional semantic space. Cognitive Science Proceedings, LEA (pp. 660-665).
- (1995) Cognitive Science Proceedings, LEA , pp. 660-665
- Lund, K.¹ Burgess, C.² Atchley, R. A.³

27
- 0032049073
- Algorithms for bigram and trigram word clustering
- Martin, S., Liermann, J., & Ney, H. (1998). Algorithms for bigram and trigram word clustering. Speech Communication, 24, 19-37.
- (1998) Speech Communication , vol.24 , pp. 19-37
- Martin, S.¹ Liermann, J.² Ney, H.³

28
- 85117730830
- Name tagging with word clusters and discriminative training
- Miller, S., Guinness, J., & Zamanian, A. (2004). Name tagging with word clusters and discriminative training. HLT-NAACL (pp. 337-342).
- (2004) HLT-NAACL , pp. 337-342
- Miller, S.¹ Guinness, J.² Zamanian, A.³

29
- 67650453038
- Three new graphical models for statistical language modelling
- Mnih, A., & Hinton, G. E. (2007). Three new graphical models for statistical language modelling. ICML.
- (2007) ICML
- Mnih, A.¹ Hinton, G. E.²

30
- 84858779990
- A scalable hierarchical distributed language model
- Mnih, A., & Hinton, G. E. (2009). A scalable hierarchical distributed language model. NIPS (pp. 1081-1088).
- (2009) NIPS , pp. 1081-1088
- Mnih, A.¹ Hinton, G. E.²

31
- 34547997987
- Hierarchical probabilistic neural network language model
- Morin, F., & Bengio, Y. (2005). Hierarchical probabilistic neural network language model. AISTATS.
- (2005) AISTATS
- Morin, F.¹ Bengio, Y.²

32
- 85123966307
- Distributional clustering of english words
- Pereira, F., Tishby, N., & Lee, L. (1993). Distributional clustering of english words. ACL (pp. 183-190).
- (1993) ACL , pp. 183-190
- Pereira, F.¹ Tishby, N.² Lee, L.³

33
- 84862300668
- Design challenges and misconceptions in named entity recognition
- Ratinov, L., & Roth, D. (2009). Design challenges and misconceptions in named entity recognition. CoNLL.
- (2009) CoNLL
- Ratinov, L.¹ Roth, D.²

34
- 34249971816
- Self-organizing semantic maps
- Ritter, H., & Kohonen, T. (1989). Self-organizing semantic maps. Biological Cybernetics, 241-254.
- (1989) Biological Cybernetics , pp. 241-254
- Ritter, H.¹ Kohonen, T.²

35
- 74549202026
- Vector-based semantic analysis: Representing word meanings based on random labels
- Sahlgren, M. (2001). Vector-based semantic analysis: Representing word meanings based on random labels. Proceedings of the Semantic Knowledge Acquisition and Categorisation Workshop, ESSLLI.
- (2001) Proceedings of the Semantic Knowledge Acquisition and Categorisation Workshop, ESSLLI
- Sahlgren, M.¹

36
- 76649115181
- An introduction to random indexing
- Sahlgren, M. (2005). An introduction to random indexing. Methods and Applications of Semantic Indexing Workshop at the 7th International Conference on Terminology and Knowledge Engineering (TKE).
- (2005) Methods and Applications of Semantic Indexing Workshop at the 7th International Conference on Terminology and Knowledge Engineering (TKE)
- Sahlgren, M.¹

37
- 34347351294
- Doctoral dissertation, Stockholm University
- Sahlgren, M. (2006). The word-space model: Using distributional analysis to represent syntagmatic and paradigmatic relations between words in high-dimensional vector spaces. Doctoral dissertation, Stockholm University.
- (2006) The word-space model: Using distributional analysis to represent syntagmatic and paradigmatic relations between words in high-dimensional vector spaces
- Sahlgren, M.¹

38
- 0346346018
- Introduction to the CoNLL-2000 shared task: Chunking
- CoNLL
- Sang, E. T., & Buchholz, S. (2000). Introduction to the CoNLL-2000 shared task: Chunking. CoNLL.
- (2000)
- Sang, E. T.¹ Buchholz, S.²

39
- 0036293862
- Connectionist language modeling for large vocabulary continuous speech recognition
- Orlando, Florida
- Schwenk, H., & Gauvain, J.-L. (2002). Connectionist language modeling for large vocabulary continuous speech recognition. International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 765-768). Orlando, Florida.
- (2002) International Conference on Acoustics, Speech and Signal Processing (ICASSP) , pp. 765-768
- Schwenk, H.¹ Gauvain, J.-L.²

40
- 85043116988
- Shallow parsing with conditional random fields
- Sha, F., & Pereira, F. C. N. (2003). Shallow parsing with conditional random fields. HLT-NAACL.
- (2003) HLT-NAACL
- Sha, F.¹ Pereira, F. C. N.²

41
- 84858423134
- From baby steps to leapfrog: How “less is more” in unsupervised dependency parsing
- Spitkovsky, V., Alshawi, H., & Jurafsky, D. (2010). From baby steps to leapfrog: How “less is more” in unsupervised dependency parsing. NAACL-HLT.
- (2010) NAACL-HLT
- Spitkovsky, V.¹ Alshawi, H.² Jurafsky, D.³

42
- 84859884966
- Semi-supervised sequential labeling and segmentation using giga-word scale unlabeled data
- Suzuki, J., & Isozaki, H. (2008). Semi-supervised sequential labeling and segmentation using giga-word scale unlabeled data. ACL-08: HLT (pp. 665-673).
- (2008) ACL-08: HLT , pp. 665-673
- Suzuki, J.¹ Isozaki, H.²

43
- 80053399428
- An empirical study of semi-supervised structured conditional models for dependency parsing
- Suzuki, J., Isozaki, H., Carreras, X., & Collins, M. (2009). An empirical study of semi-supervised structured conditional models for dependency parsing. EMNLP.
- (2009) EMNLP
- Suzuki, J.¹ Isozaki, H.² Carreras, X.³ Collins, M.⁴

44
- 84859979112
- A preliminary evaluation of word representations for named-entity recognition
- Turian, J., Ratinov, L., Bengio, Y., & Roth, D. (2009). A preliminary evaluation of word representations for named-entity recognition. NIPS Workshop on Grammar Induction, Representation of Language and Language Learning.
- (2009) NIPS Workshop on Grammar Induction, Representation of Language and Language Learning
- Turian, J.¹ Ratinov, L.² Bengio, Y.³ Roth, D.⁴

45
- 77952700189
- From frequency to meaning: Vector space models of semantics
- Turney, P. D., & Pantel, P. (2010). From frequency to meaning: Vector space models of semantics. Journal of Artificial Intelligence Research.
- (2010) Journal of Artificial Intelligence Research
- Turney, P. D.¹ Pantel, P.²

46
- 2242436639
- Hierarchical clustering of words
- Ushioda, A. (1996). Hierarchical clustering of words. COLING (pp. 1159-1162).
- (1996) COLING , pp. 1159-1162
- Ushioda, A.¹

47
- 44249114311
- Comparison of independent component analysis and singular value decomposition in word context analysis
- Väyrynen, J., & Honkela, T. (2005). Comparison of independent component analysis and singular value decomposition in word context analysis. AKRR'05, International and Interdisciplinary Conference on Adaptive Knowledge Representation and Reasoning.
- (2005) AKRR'05, International and Interdisciplinary Conference on Adaptive Knowledge Representation and Reasoning
- Väyrynen, J.¹ Honkela, T.²

48
- 84859958341
- Word category maps based on emergent features created by ICA
- Finnish Artificial Intelligence Society
- Väyrynen, J. J., & Honkela, T. (2004). Word category maps based on emergent features created by ICA. Proceedings of the STeP'2004 Cognition + Cybernetics Symposium (pp. 173-185). Finnish Artificial Intelligence Society.
- (2004) Proceedings of the STeP'2004 Cognition + Cybernetics Symposium , pp. 173-185
- Väyrynen, J. J.¹ Honkela, T.²

49
- 84859968879
- Towards explicit semantic features using independent component analysis
- Stockholm, Sweden: Swedish Institute of Computer Science
- Väyrynen, J. J., Honkela, T., & Lindqvist, L. (2007). Towards explicit semantic features using independent component analysis. Proceedings of the Workshop Semantic Content Acquisition and Representation (SCAR). Stockholm, Sweden: Swedish Institute of Computer Science.
- (2007) Proceedings of the Workshop Semantic Content Acquisition and Representation (SCAR)
- Väyrynen, J. J.¹ Honkela, T.² Lindqvist, L.³

50
- 78649933846
- Software framework for topic modelling with large corpora
- Řehůřek, R., & Sojka, P. (2010). Software framework for topic modelling with large corpora. LREC.
- (2010) LREC
- Řehůřek, R.¹ Sojka, P.²

51
- 84919495233
- A robust risk minimization based named entity recognition system
- Zhang, T., & Johnson, D. (2003). A robust risk minimization based named entity recognition system. CoNLL.
- (2003) CoNLL
- Zhang, T.¹ Johnson, D.²

52
- 84859984064
- Multilingual dependency learning: a huge feature engineering method to semantic dependency parsing
- Zhao, H., Chen, W., Kit, C., & Zhou, G. (2009). Multilingual dependency learning: a huge feature engineering method to semantic dependency parsing. CoNLL (pp. 55-60).
- (2009) CoNLL , pp. 55-60
- Zhao, H.¹ Chen, W.² Kit, C.³ Zhou, G.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.