SCOPUS 정보 검색 플랫폼

ACL-IJCNLP 2009 - Joint Conf. of the 47th Annual Meeting of the Association for Computational Linguistics and 4th Int. Joint Conf. on Natural Language Processing of the AFNLP, Proceedings of the Conf.

Volumn , Issue , 2009, Pages 477-485

Stochastic gradient descent training for L1-regularized log-linear models with cumulative penalty

(3) Tsuruoka, Yoshimasa a Tsujii, Jun'ichi a,b Ananiadou, Sophia a

a UNIVERSITY OF MANCHESTER (United Kingdom)

b UNIVERSITY OF TOKYO (Japan)

Author keywords

[No Author keywords available]

Indexed keywords

CHARACTER RECOGNITION; COMPUTATIONAL LINGUISTICS; GRADIENT METHODS; NEWTON-RAPHSON METHOD; REGRESSION ANALYSIS; SPEECH RECOGNITION; STOCHASTIC MODELS; STOCHASTIC SYSTEMS;

APPROXIMATE GRADIENT; BATCH TRAINING ALGORITHMS; LEARNING FRAMEWORKS; NAMED ENTITY RECOGNITION; NATURAL LANGUAGE PROCESSING; PART OF SPEECH TAGGING; QUASI-NEWTON METHODS; STOCHASTIC GRADIENT DESCENT;

NATURAL LANGUAGE PROCESSING SYSTEMS;

EID: 80052416457 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.3115/1687878.1687946 Document Type: Conference Paper

Times cited : (207)

References (28)

1
- 34547984768
- Scalable training of L1-regularized log-linear models
- Galen Andrew and Jianfeng Gao. 2007. Scalable training of L1-regularized log-linear models. In Proceedings of ICML, pages 33-40.
- (2007) Proceedings of ICML , pp. 33-40
- Andrew, G.¹ Gao, J.²

2
- 64149111796
- Technical report, Alias-I
- Bob Carpenter. 2008. Lazy sparse stochastic gradient descent for regularized multinomial logistic regression. Technical report, Alias-i.
- (2008) Lazy Sparse Stochastic Gradient Descent for Regularized Multinomial Logistic Regression
- Carpenter, B.¹

3
- 85149103496
- Parsing the WSJ using CCG and log-linear models
- Stephen Clark and James R. Curran. 2004. Parsing the WSJ using CCG and log-linear models. In Proceedings of COLING 2004, pages 103-110.
- (2004) Proceedings of COLING 2004 , pp. 103-110
- Clark, S.¹ Curran, J.R.²

4
- 80055033732
- Semantic role labeling with tree conditional random fields
- Trevor Cohn and Philip Blunsom. 2005. Semantic role labeling with tree conditional random fields. In Proceedings of CoNLL, pages 169-172.
- (2005) Proceedings of CoNLL , pp. 169-172
- Cohn, T.¹ Blunsom, P.²

5
- 50949133940
- Exponentiated gradient algorithms for conditional random fields and max-margin Markov networks
- Michael Collins, Amir Globerson, Terry Koo, Xavier Carreras, and Peter L. Bartlett. 2008. Exponentiated gradient algorithms for conditional random fields and max-margin markov networks. The Journal of Machine Learning Research (JMLR), 9:1775-1822.
- (2008) The Journal of Machine Learning Research (JMLR) , vol.9 , pp. 1775-1822
- Collins, M.¹ Globerson, A.² Koo, T.³ Carreras, X.⁴ Bartlett, P.L.⁵

6
- 85127836544
- Discriminative training methods for hidden Markov models: Theory and experiments with perceptron algorithms
- Michael Collins. 2002. Discriminative training methods for hidden markov models: Theory and experiments with perceptron algorithms. In Proceedings of EMNLP, pages 1-8.
- (2002) Proceedings of EMNLP , pp. 1-8
- Collins, M.¹

7
- 0000595242
- Note on learning rate schedules for stochastic optimization
- Christian Darken and John Moody. 1990. Note on learning rate schedules for stochastic optimization. In Proceedings of NIPS, pages 832-838.
- (1990) Proceedings of NIPS , pp. 832-838
- Darken, C.¹ Moody, J.²

8
- 84859883820
- Online and batch learning using forward-looking subgradients
- Juhn Duchi and Yoram Singer. 2008. Online and batch learning using forward-looking subgradients. In NIPS Workshop: OPT 2008 Optimization for Machine Learning.
- (2008) NIPS Workshop: OPT 2008 Optimization for Machine Learning
- Duchi, J.¹ Singer, Y.²

9
- 56449092085
- Efficient projections onto the l1-ball for learning in high dimensions
- Juhn Duchi, Shai Shalev-Shwartz, Yoram Singer, and Tushar Chandra. 2008. Efficient projections onto the l1-ball for learning in high dimensions. In Proceedings of ICML, pages 272-279.
- (2008) Proceedings of ICML , pp. 272-279
- Duchi, J.¹ Shalev-Shwartz, S.² Singer, Y.³ Chandra, T.⁴

10
- 72449211489
- Efficient, feature-based, conditional random field parsing
- Jenny Rose Finkel, Alex Kleeman, and Christopher D. Manning. 2008. Efficient, feature-based, conditional random field parsing. In Proceedings of ACL-08:HLT, pages 959-967.
- (2008) Proceedings of ACL-08:HLT , pp. 959-967
- Finkel, J.R.¹ Kleeman, A.² Manning, C.D.³

11
- 84860542469
- A comparative study of parameter estimation methods for statistical natural language processing
- Jianfeng Gao, Galen Andrew, Mark Johnson, and Kristina Toutanova. 2007. A comparative study of parameter estimation methods for statistical natural language processing. In Proceedings of ACL, pages 824-831.
- (2007) Proceedings of ACL , pp. 824-831
- Gao, J.¹ Andrew, G.² Johnson, M.³ Toutanova, K.⁴

12
- 49749132313
- Training conditional random fields by periodic step size adaptation for large-scale textmining
- Han-Shen Huang, Yu-Ming Chang, and Chun-Nan Hsu. 2007. Training conditional random fields by periodic step size adaptation for large-scale textmining. In Proceedings of ICDM, pages 511-516.
- (2007) Proceedings of ICDM , pp. 511-516
- Huang, H.-S.¹ Chang, Y.-M.² Hsu, C.-N.³

13
- 9444232930
- Evaluation and extension of maximum entropy models with inequality constraints
- Jun'ichi Kazama and Jun'ichi Tsujii. 2003. Evaluation and extension of maximum entropy models with inequality constraints. In Proceedings of EMNLP 2003.
- (2003) Proceedings of EMNLP 2003
- Kazama, J.¹ Tsujii, J.²

14
- 16244414494
- Introduction to the bio-entity recognition task at JNLPBA
- J.-D. Kim, T. Ohta, Y. Tsuruoka, Y. Tateisi, and N. Collier. 2004. Introduction to the bio-entity recognition task at JNLPBA. In Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications (JNLPBA), pages 70-75.
- (2004) Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and Its Applications (JNLPBA) , pp. 70-75
- Kim, J.-D.¹ Ohta, T.² Tsuruoka, Y.³ Tateisi, Y.⁴ Collier, N.⁵

15
- 0142192295
- Conditional random fields: Probabilistic models for segmenting and labeling sequence data
- John Lafferty, Andrew McCallum, and Fernando Pereira. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of ICML, pages 282-289.
- (2001) Proceedings of ICML , pp. 282-289
- Lafferty, J.¹ McCallum, A.² Pereira, F.³

16
- 64149115569
- Sparse online learning via truncated gradient
- John Langford, Lihong Li, and Tong Zhang. 2009. Sparse online learning via truncated gradient. The Journal of Machine Learning Research (JMLR), 10:777-801.
- (2009) The Journal of Machine Learning Research (JMLR) , vol.10 , pp. 777-801
- Langford, J.¹ Li, L.² Zhang, T.³

17
- 33750695296
- Efficient l1 regularized logistic regression
- Su-In Lee, Honglak Lee, Pieter Abbeel, and Andrew Y. Ng. 2006. Efficient l1 regularized logistic regression. In Proceedings of AAAI-06, pages 401-408.
- (2006) Proceedings of AAAI-06 , pp. 401-408
- Lee, S.-I.¹ Lee, H.² Abbeel, P.³ Ng, A.Y.⁴

18
- 34249852033
- Building a large annotated corpus of English: The Penn Treebank
- Mitchell P. Marcus, Beatrice Santorini, and Mary Ann Marcinkiewicz. 1994. Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics, 19(2):313-330.
- (1994) Computational Linguistics , vol.19 , Issue.2 , pp. 313-330
- Marcus, M.P.¹ Santorini, B.² Marcinkiewicz, M.A.³

19
- 84966262179
- Updating quasi-Newton matrices with limited storage
- Jorge Nocedal. 1980. Updating quasi-newton matrices with limited storage. Mathematics of Computation, 35(151):773-782.
- (1980) Mathematics of Computation , vol.35 , Issue.151 , pp. 773-782
- Nocedal, J.¹

20
- 84860514602
- Improving the scalability of semi-Markov conditional random fields for named entity recognition
- Daisuke Okanohara, Yusuke Miyao, Yoshimasa Tsuruoka, and Jun'ichi Tsujii. 2006. Improving the scalability of semi-markov conditional random fields for named entity recognition. In Proceedings of COLING/ACL, pages 465-472.
- (2006) Proceedings of COLING/ACL , pp. 465-472
- Okanohara, D.¹ Miyao, Y.² Tsuruoka, Y.³ Tsujii, J.⁴

21
- 85124016637
- A maximum entropy model for part-of-speech tagging
- Adwait Ratnaparkhi. 1996. A maximum entropy model for part-of-speech tagging. In Proceedings of EMNLP 1996, pages 133-142.
- (1996) Proceedings of EMNLP 1996 , pp. 133-142
- Ratnaparkhi, A.¹

22
- 84860520429
- Guided learning for bidirectional sequence classification
- Libin Shen, Giorgio Satta, and Aravind Joshi. 2007. Guided learning for bidirectional sequence classification. In Proceedings of ACL, pages 760-767.
- (2007) Proceedings of ACL , pp. 760-767
- Shen, L.¹ Satta, G.² Joshi, A.³

23
- 80053375671
- Dependency parsing by belief propagation
- David Smith and Jason Eisner. 2008. Dependency parsing by belief propagation. In Proceedings of EMNLP, pages 145-156.
- (2008) Proceedings of EMNLP , pp. 145-156
- Smith, D.¹ Eisner, J.²

24
- 0013025914
- Wiley-IEEE
- James C. Spall. 2005. Introduction to Stochastic Search and Optimization. Wiley-IEEE.
- (2005) Introduction to Stochastic Search and Optimization
- Spall, J.C.¹

25
- 84860527068
- A discriminative global training algorithm for statistical MT
- Christoph Tillmann and Tong Zhang. 2006. A discriminative global training algorithm for statistical MT. In Proceedings of COLING/ACL, pages 721-728.
- (2006) Proceedings of COLING/ACL , pp. 721-728
- Tillmann, C.¹ Zhang, T.²

26
- 47749146046
- Joint learning improves semantic role labeling
- Kristina Toutanova, Aria Haghighi, and Christopher Manning. 2005. Joint learning improves semantic role labeling. In Proceedings of ACL, pages 589-596.
- (2005) Proceedings of ACL , pp. 589-596
- Toutanova, K.¹ Haghighi, A.² Manning, C.³

27
- 33749243756
- Accelerated training of conditional random fields with stochastic gradient methods
- S. V. N. Vishwanathan, Nicol N. Schraudolph,MarkW. Schmidt, and Kevin P. Murphy. 2006. Accelerated training of conditional random fields with stochastic gradient methods. In Proceedings of ICML, pages 969-976.
- (2006) Proceedings of ICML , pp. 969-976
- Vishwanathan, S.V.N.¹ Schraudolph, N.N.² Schmidt, M.W.³ Murphy, K.P.⁴

28
- 84862290737
- Leveraging machine readable dictionaries in discriminative sequence models
- Ben Wellner and Marc Vilain. 2006. Leveraging machine readable dictionaries in discriminative sequence models. In Proceedings of LREC 2006.
- (2006) Proceedings of LREC 2006
- Wellner, B.¹ Vilain, M.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.