메뉴 건너뛰기




Volumn , Issue , 2009, Pages 477-485

Stochastic gradient descent training for L1-regularized log-linear models with cumulative penalty

Author keywords

[No Author keywords available]

Indexed keywords

CHARACTER RECOGNITION; COMPUTATIONAL LINGUISTICS; GRADIENT METHODS; NEWTON-RAPHSON METHOD; REGRESSION ANALYSIS; SPEECH RECOGNITION; STOCHASTIC MODELS; STOCHASTIC SYSTEMS;

EID: 80052416457     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.3115/1687878.1687946     Document Type: Conference Paper
Times cited : (207)

References (28)
  • 1
    • 34547984768 scopus 로고    scopus 로고
    • Scalable training of L1-regularized log-linear models
    • Galen Andrew and Jianfeng Gao. 2007. Scalable training of L1-regularized log-linear models. In Proceedings of ICML, pages 33-40.
    • (2007) Proceedings of ICML , pp. 33-40
    • Andrew, G.1    Gao, J.2
  • 3
    • 85149103496 scopus 로고    scopus 로고
    • Parsing the WSJ using CCG and log-linear models
    • Stephen Clark and James R. Curran. 2004. Parsing the WSJ using CCG and log-linear models. In Proceedings of COLING 2004, pages 103-110.
    • (2004) Proceedings of COLING 2004 , pp. 103-110
    • Clark, S.1    Curran, J.R.2
  • 4
    • 80055033732 scopus 로고    scopus 로고
    • Semantic role labeling with tree conditional random fields
    • Trevor Cohn and Philip Blunsom. 2005. Semantic role labeling with tree conditional random fields. In Proceedings of CoNLL, pages 169-172.
    • (2005) Proceedings of CoNLL , pp. 169-172
    • Cohn, T.1    Blunsom, P.2
  • 6
    • 85127836544 scopus 로고    scopus 로고
    • Discriminative training methods for hidden Markov models: Theory and experiments with perceptron algorithms
    • Michael Collins. 2002. Discriminative training methods for hidden markov models: Theory and experiments with perceptron algorithms. In Proceedings of EMNLP, pages 1-8.
    • (2002) Proceedings of EMNLP , pp. 1-8
    • Collins, M.1
  • 7
    • 0000595242 scopus 로고
    • Note on learning rate schedules for stochastic optimization
    • Christian Darken and John Moody. 1990. Note on learning rate schedules for stochastic optimization. In Proceedings of NIPS, pages 832-838.
    • (1990) Proceedings of NIPS , pp. 832-838
    • Darken, C.1    Moody, J.2
  • 9
    • 56449092085 scopus 로고    scopus 로고
    • Efficient projections onto the l1-ball for learning in high dimensions
    • Juhn Duchi, Shai Shalev-Shwartz, Yoram Singer, and Tushar Chandra. 2008. Efficient projections onto the l1-ball for learning in high dimensions. In Proceedings of ICML, pages 272-279.
    • (2008) Proceedings of ICML , pp. 272-279
    • Duchi, J.1    Shalev-Shwartz, S.2    Singer, Y.3    Chandra, T.4
  • 10
    • 72449211489 scopus 로고    scopus 로고
    • Efficient, feature-based, conditional random field parsing
    • Jenny Rose Finkel, Alex Kleeman, and Christopher D. Manning. 2008. Efficient, feature-based, conditional random field parsing. In Proceedings of ACL-08:HLT, pages 959-967.
    • (2008) Proceedings of ACL-08:HLT , pp. 959-967
    • Finkel, J.R.1    Kleeman, A.2    Manning, C.D.3
  • 11
    • 84860542469 scopus 로고    scopus 로고
    • A comparative study of parameter estimation methods for statistical natural language processing
    • Jianfeng Gao, Galen Andrew, Mark Johnson, and Kristina Toutanova. 2007. A comparative study of parameter estimation methods for statistical natural language processing. In Proceedings of ACL, pages 824-831.
    • (2007) Proceedings of ACL , pp. 824-831
    • Gao, J.1    Andrew, G.2    Johnson, M.3    Toutanova, K.4
  • 12
    • 49749132313 scopus 로고    scopus 로고
    • Training conditional random fields by periodic step size adaptation for large-scale textmining
    • Han-Shen Huang, Yu-Ming Chang, and Chun-Nan Hsu. 2007. Training conditional random fields by periodic step size adaptation for large-scale textmining. In Proceedings of ICDM, pages 511-516.
    • (2007) Proceedings of ICDM , pp. 511-516
    • Huang, H.-S.1    Chang, Y.-M.2    Hsu, C.-N.3
  • 13
    • 9444232930 scopus 로고    scopus 로고
    • Evaluation and extension of maximum entropy models with inequality constraints
    • Jun'ichi Kazama and Jun'ichi Tsujii. 2003. Evaluation and extension of maximum entropy models with inequality constraints. In Proceedings of EMNLP 2003.
    • (2003) Proceedings of EMNLP 2003
    • Kazama, J.1    Tsujii, J.2
  • 15
    • 0142192295 scopus 로고    scopus 로고
    • Conditional random fields: Probabilistic models for segmenting and labeling sequence data
    • John Lafferty, Andrew McCallum, and Fernando Pereira. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of ICML, pages 282-289.
    • (2001) Proceedings of ICML , pp. 282-289
    • Lafferty, J.1    McCallum, A.2    Pereira, F.3
  • 17
    • 33750695296 scopus 로고    scopus 로고
    • Efficient l1 regularized logistic regression
    • Su-In Lee, Honglak Lee, Pieter Abbeel, and Andrew Y. Ng. 2006. Efficient l1 regularized logistic regression. In Proceedings of AAAI-06, pages 401-408.
    • (2006) Proceedings of AAAI-06 , pp. 401-408
    • Lee, S.-I.1    Lee, H.2    Abbeel, P.3    Ng, A.Y.4
  • 18
    • 34249852033 scopus 로고
    • Building a large annotated corpus of English: The Penn Treebank
    • Mitchell P. Marcus, Beatrice Santorini, and Mary Ann Marcinkiewicz. 1994. Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics, 19(2):313-330.
    • (1994) Computational Linguistics , vol.19 , Issue.2 , pp. 313-330
    • Marcus, M.P.1    Santorini, B.2    Marcinkiewicz, M.A.3
  • 19
    • 84966262179 scopus 로고
    • Updating quasi-Newton matrices with limited storage
    • Jorge Nocedal. 1980. Updating quasi-newton matrices with limited storage. Mathematics of Computation, 35(151):773-782.
    • (1980) Mathematics of Computation , vol.35 , Issue.151 , pp. 773-782
    • Nocedal, J.1
  • 20
    • 84860514602 scopus 로고    scopus 로고
    • Improving the scalability of semi-Markov conditional random fields for named entity recognition
    • Daisuke Okanohara, Yusuke Miyao, Yoshimasa Tsuruoka, and Jun'ichi Tsujii. 2006. Improving the scalability of semi-markov conditional random fields for named entity recognition. In Proceedings of COLING/ACL, pages 465-472.
    • (2006) Proceedings of COLING/ACL , pp. 465-472
    • Okanohara, D.1    Miyao, Y.2    Tsuruoka, Y.3    Tsujii, J.4
  • 21
    • 85124016637 scopus 로고    scopus 로고
    • A maximum entropy model for part-of-speech tagging
    • Adwait Ratnaparkhi. 1996. A maximum entropy model for part-of-speech tagging. In Proceedings of EMNLP 1996, pages 133-142.
    • (1996) Proceedings of EMNLP 1996 , pp. 133-142
    • Ratnaparkhi, A.1
  • 22
    • 84860520429 scopus 로고    scopus 로고
    • Guided learning for bidirectional sequence classification
    • Libin Shen, Giorgio Satta, and Aravind Joshi. 2007. Guided learning for bidirectional sequence classification. In Proceedings of ACL, pages 760-767.
    • (2007) Proceedings of ACL , pp. 760-767
    • Shen, L.1    Satta, G.2    Joshi, A.3
  • 23
    • 80053375671 scopus 로고    scopus 로고
    • Dependency parsing by belief propagation
    • David Smith and Jason Eisner. 2008. Dependency parsing by belief propagation. In Proceedings of EMNLP, pages 145-156.
    • (2008) Proceedings of EMNLP , pp. 145-156
    • Smith, D.1    Eisner, J.2
  • 25
    • 84860527068 scopus 로고    scopus 로고
    • A discriminative global training algorithm for statistical MT
    • Christoph Tillmann and Tong Zhang. 2006. A discriminative global training algorithm for statistical MT. In Proceedings of COLING/ACL, pages 721-728.
    • (2006) Proceedings of COLING/ACL , pp. 721-728
    • Tillmann, C.1    Zhang, T.2
  • 26
    • 47749146046 scopus 로고    scopus 로고
    • Joint learning improves semantic role labeling
    • Kristina Toutanova, Aria Haghighi, and Christopher Manning. 2005. Joint learning improves semantic role labeling. In Proceedings of ACL, pages 589-596.
    • (2005) Proceedings of ACL , pp. 589-596
    • Toutanova, K.1    Haghighi, A.2    Manning, C.3
  • 27
    • 33749243756 scopus 로고    scopus 로고
    • Accelerated training of conditional random fields with stochastic gradient methods
    • S. V. N. Vishwanathan, Nicol N. Schraudolph,MarkW. Schmidt, and Kevin P. Murphy. 2006. Accelerated training of conditional random fields with stochastic gradient methods. In Proceedings of ICML, pages 969-976.
    • (2006) Proceedings of ICML , pp. 969-976
    • Vishwanathan, S.V.N.1    Schraudolph, N.N.2    Schmidt, M.W.3    Murphy, K.P.4
  • 28
    • 84862290737 scopus 로고    scopus 로고
    • Leveraging machine readable dictionaries in discriminative sequence models
    • Ben Wellner and Marc Vilain. 2006. Leveraging machine readable dictionaries in discriminative sequence models. In Proceedings of LREC 2006.
    • (2006) Proceedings of LREC 2006
    • Wellner, B.1    Vilain, M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.