SCOPUS 정보 검색 플랫폼

Volumn 11, Issue , 2010, Pages 815-848

Iterative scaling and coordinate descent methods for maximum entropy models

(4) Huang, Fang Lan a Hsieh, Cho Jui a Chang, Kai Wei a Lin, Chih Jen a

Author keywords

Coordinate descent; Iterative scaling; Maximum entropy; Natural language processing; Optimization

Indexed keywords

CONVERGENCE RESULTS; COORDINATE DESCENT; LINEAR SVM; MAXIMUM ENTROPY; MAXIMUM ENTROPY MODELS; NATURAL LANGUAGE PROCESSING; SCALING METHOD; UNIFIED FRAMEWORK;

COMPUTATIONAL COMPLEXITY; COMPUTATIONAL LINGUISTICS; ECHO SUPPRESSION; ENTROPY; NATURAL LANGUAGE PROCESSING SYSTEMS;

ITERATIVE METHODS;

EID: 77949528697 PISSN: 15324435 EISSN: 15337928 Source Type: Journal
DOI: None Document Type: Article

Times cited : (31)

References (35)

1
- 73549088698
- Scalable training of L1-regularized log-linear models
- Galen Andrew and Jianfeng Gao. Scalable training of L1-regularized log-linear models. In Proceedings of the Twenty Fourth International Conference on Machine Learning (ICML), 2007.
- (2007) Proceedings of the Twenty Fourth International Conference on Machine Learning (ICML)
- Andrew, G.¹ Gao, J.²

2
- 0004147916
- Addison-Wesley, second edition
- Tom M. Apostol. Mathematical Analysis. Addison-Wesley, second edition, 1974.
- (1974) Mathematical Analysis
- Apostol, T.M.¹

3
- 77949533802
- URL
- Jason Baldridge, Tom Morton, and Gann Bierner. OpenNLP package, 2001. URL http://opennlp.sourceforge.net/.
- (2001)
- Baldridge, J.¹ Morton, T.² Bierner, G.³

4
- 0002652285
- A maximum entropy approach to natural language processing
- Adam L. Berger, Vincent J. Della Pietra, and Stephen A. Della Pietra. A maximum entropy approach to natural language processing. Computational Linguistics, 22(1):39-71, 1996.
- (1996) Computational Linguistics , vol.22 , Issue.1 , pp. 39-71
- Berger, A.L.¹ Pietra Della, V.J.² Pietra Della, S.A.³

5
- 0003713964
- Athena Scientific, Belmont, MA 02178-9998, second edition
- Dimitri P. Bertsekas. Nonlinear Programming. Athena Scientific, Belmont, MA 02178-9998, second edition, 1999.
- (1999) Nonlinear Programming
- Bertsekas, D.P.¹

6
- 33947180792
- Stochastic learning
- Olivier Bousquet and Ulrike von Luxburg, editors, Lecture Notes in Artificial Intelligence, LNAI 3176. Springer Verlag
- Léon Bottou. Stochastic learning. In Olivier Bousquet and Ulrike von Luxburg, editors, Advanced Lectures on Machine Learning, Lecture Notes in Artificial Intelligence, LNAI 3176, pages 146-168. Springer Verlag, 2004.
- (2004) Advanced Lectures on Machine Learning , pp. 146-168
- Bottou, L.¹

7
- 48849104146
- Coordinate descent method for large-scale L2-loss linear SVM
- URL
- Kai-Wei Chang, Cho-Jui Hsieh, and Chih-Jen Lin. Coordinate descent method for large-scale L2-loss linear SVM. Journal of Machine Learning Research, 9:1369-1398, 2008. URL http://www.csie.ntu.edu.tw/cjlin/papers/cdl2.pdf.
- (2008) Journal of Machine Learning Research , vol.9 , pp. 1369-1398
- Chang, K.-W.¹ Hsieh, C.-J.² Lin, C.-J.³

8
- 0033887568
- A survey of smoothing techniques for ME models
- January
- Stanley F. Chen and Ronald Rosenfeld. A survey of smoothing techniques for ME models. IEEE Transactions on Speech and Audio Processing, 8(1):37-50, January 2000.
- (2000) IEEE Transactions on Speech and Audio Processing , vol.8 , Issue.1 , pp. 37-50
- Chen, S.F.¹ Rosenfeld, R.²

9
- 0036643072
- Logistic regression, AdaBoost and Bregman distances
- Michael Collins, Robert E. Schapire, and Yoram Singer. Logistic regression, AdaBoost and Bregman distances. Machine Learning, 48(1-3):253-285, 2002.
- (2002) Machine Learning , vol.48 , Issue.1-3 , pp. 253-285
- Collins, M.¹ Schapire, R.E.² Singer, Y.³

10
- 50949133940
- Exponentiated gradient algorithms for conditional random fields and max-margin Markov networks
- Michael Collins, Amir Globerson, Terry Koo, Xavier Carreras, and Peter Bartlett. Exponentiated gradient algorithms for conditional random fields and max-margin Markov networks. Journal of Machine Learning Research, 9:1775-1822, 2008.
- (2008) Journal of Machine Learning Research , vol.9 , pp. 1775-1822
- Collins, M.¹ Globerson, A.² Koo, T.³ Carreras, X.⁴ Bartlett, P.⁵

11
- 0001573124
- Generalized iterative scaling for log-linear models
- John N. Darroch and Douglas Ratcliff. Generalized iterative scaling for log-linear models. The Annals of Mathematical Statistics, 43(5):1470-1480, 1972.
- (1972) The Annals of Mathematical Statistics , vol.43 , Issue.5 , pp. 1470-1480
- Darroch, J.N.¹ Ratcliff, D.²

12
- 77949412685
- URL
- Hal Daumé, III. Notes on CG and LM-BFGS optimization of logistic regression. 2004. URL http://www.cs.utah.edu/hal/megam/.
- (2004) Notes on CG and LM-BFGS Optimization of Logistic Regression
- Daumé Iii, H.¹

13
- 0031120321
- Inducing features of random fields
- Stephen Della Pietra, Vincent Della Pietra, and John Lafferty. Inducing features of random fields. IEEE Transactions on Pattern Analysis and Machine Intelligence, 19(4):380-393, 1997.
- (1997) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.19 , Issue.4 , pp. 380-393
- Della Pietra, S.¹ Della Pietra, V.² Lafferty, J.³

14
- 14344254638
- Performance guarantees for regularized maximum entropy density estimation
- New York. ACM press
- Miroslav Dudík, Steven J. Phillips, and Robert E. Schapire. Performance guarantees for regularized maximum entropy density estimation. In Proceedings of the 17th Annual Conference on Computational Learning Theory, pages 655-662, New York, 2004. ACM press.
- (2004) Proceedings of the 17th Annual Conference on Computational Learning Theory , pp. 655-662
- Dudík, M.¹ Phillips, S.J.² Schapire, R.E.³

15
- 0003768769
- John Wiley and Sons
- Roger Fletcher. Practical Methods of Optimization. John Wiley and Sons, 1987.
- (1987) Practical Methods of Optimization
- Fletcher, R.¹

16
- 62549130467
- Jerome Friedman, Trevor Hastie, and Robert Tibshirani. Regularization paths for generalized linear models via coordinate descent. 2008.
- (2008) Regularization Paths for Generalized Linear Models Via Coordinate Descent
- Friedman, J.¹ Hastie, T.² Tibshirani, R.³

17
- 84860542469
- A comparative study of parameter estimation methods statistical natural language processing
- Jianfeng Gao, Galen Andrew, Mark Johnson, and Kristina Toutanova. A comparative study of parameter estimation methods statistical natural language processing. In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics (ACL), pages 824-831, 2007.
- (2007) Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics (ACL) , pp. 824-831
- Gao, J.¹ Andrew, G.² Johnson, M.³ Toutanova, K.⁴

18
- 34548105186
- Large-scale Bayesian logistic regression for text categorization
- Alexandar Genkin, David D. Lewis, and David Madigan. Large-scale Bayesian logistic regression for text categorization. Technometrics, 49(3):291-304, 2007.
- (2007) Technometrics , vol.49 , Issue.3 , pp. 291-304
- Genkin, A.¹ Lewis, D.D.² Madigan, D.³

19
- 33745800994
- Sequential conditional generalized iterative scaling
- Joshua Goodman. Sequential conditional generalized iterative scaling. In Proceedings of the 40th Annual Meeting of the Association of Computational Linguistics (ACL), pages 9-16, 2002.
- (2002) Proceedings of the 40th Annual Meeting of the Association of Computational Linguistics (ACL) , pp. 9-16
- Goodman, J.¹

20
- 0032665409
- Globally convergent block-coordinate techniques for unconstrained optimization
- Luigi Grippo and Marco Sciandrone. Globally convergent block-coordinate techniques for unconstrained optimization. Optimization Methods and Software, 10:587-637, 1999.
- (1999) Optimization Methods and Software , vol.10 , pp. 587-637
- Grippo, L.¹ Sciandrone, M.²

21
- 84859908589
- Iterative scaling and coordinate descent methods for maximum entropy
- Short paper
- Fang-Lan Huang, Cho-Jui Hsieh, Kai-Wei Chang, and Chih-Jen Lin. Iterative scaling and coordinate descent methods for maximum entropy. In Proceedings of the 47th Annual Meeting of the Association of Computational Linguistics (ACL), 2009. Short paper.
- (2009) Proceedings of the 47th Annual Meeting of the Association of Computational Linguistics (ACL)
- Huang, F.-L.¹ Hsieh, C.-J.² Chang, K.-W.³ Lin, C.-J.⁴

22
- 1942484964
- A faster iterative scaling algorithm for conditional exponential model
- Rong Jin, Rong Yan, Jian Zhang, and Alex G. Hauptmann. A faster iterative scaling algorithm for conditional exponential model. In Proceedings of the Twentieth International Conference on Machine Learning (ICML), 2003.
- (2003) Proceedings of the Twentieth International Conference on Machine Learning (ICML)
- Jin, R.¹ Yan, R.² Zhang, J.³ Hauptmann, A.G.⁴

23
- 30044437592
- A fast dual algorithm for kernel logistic regression
- S. Sathiya Keerthi, Kaibo Duan, Shirish Shevade, and Aun Neow Poo. A fast dual algorithm for kernel logistic regression. Machine Learning, 61:151-165, 2005.
- (2005) Machine Learning , vol.61 , pp. 151-165
- Keerthi, S.S.¹ Duan, K.² Shevade, S.³ Poo, A.N.⁴

24
- 34547688865
- An interior-point method for large-scale l1-regularized logistic regression
- URL
- Kwangmoo Koh, Seung-Jean Kim, and Stephen Boyd. An interior-point method for large-scale l1-regularized logistic regression. Journal of Machine Learning Research, 8:1519-1555, 2007. URL http://www.stanford.edu/boyd/l1-logistic-reg. html.
- (2007) Journal of Machine Learning Research , vol.8 , pp. 1519-1555
- Koh, K.¹ Kim, S.-J.² Boyd, S.³

25
- 77950023906
- Optimization transfer using surrogate objective functions
- March
- Kenneth Lange, David R. Hunter, and Ilsoon Yang. Optimization transfer using surrogate objective functions. Journal of Computational and Graphical Statistics, 9(1):1-20, March 2000.
- (2000) Journal of Computational and Graphical Statistics , vol.9 , Issue.1 , pp. 1-20
- Lange, K.¹ Hunter, D.R.² Yang, I.³

26
- 44649088319
- Trust region Newton method for largescale logistic regression
- URL
- Chih-Jen Lin, Ruby C. Weng, and S. Sathiya Keerthi. Trust region Newton method for largescale logistic regression. Journal of Machine Learning Research, 9:627-650, 2008. URL http://www.csie.ntu.edu.tw/cjlin/papers/logistic.pdf.
- (2008) Journal of Machine Learning Research , vol.9 , pp. 627-650
- Lin, C.-J.¹ Weng, R.C.² Keerthi, S.S.³

27
- 33646887390
- On the limited memory BFGS method for large scale optimization
- Dong C. Liu and Jorge Nocedal. On the limited memory BFGS method for large scale optimization. Mathematical Programming, 45(1):503-528, 1989.
- (1989) Mathematical Programming , vol.45 , Issue.1 , pp. 503-528
- Liu, D.C.¹ Nocedal, J.²

28
- 0026678659
- On the convergence of coordinate descent method for convex differentiable minimization
- Zhi-Quan Luo and Paul Tseng. On the convergence of coordinate descent method for convex differentiable minimization. Journal of Optimization Theory and Applications, 72(1):7-35, 1992.
- (1992) Journal of Optimization Theory and Applications , vol.72 , Issue.1 , pp. 7-35
- Luo, Z.-Q.¹ Tseng, P.²

29
- 1042264823
- A comparison of algorithms for maximum entropy parameter estimation
- Association for Computational Linguistics
- Robert Malouf. A comparison of algorithms for maximum entropy parameter estimation. In Proceedings of the 6th conference on Natural language learning, pages 1-7. Association for Computational Linguistics, 2002.
- (2002) Proceedings of the 6th Conference on Natural Language Learning , pp. 1-7
- Malouf, R.¹

30
- 84876798255
- Online learning of approximate dependency parsing algorithms
- Ryan McDonald and Fernando Pereira. Online learning of approximate dependency parsing algorithms. In Proceedings of 11th Conference of the European Chapter of the Association for Computational Linguistics (EACL), pages 81-88, 2006.
- (2006) Proceedings of 11th Conference of the European Chapter of the Association for Computational Linguistics (EACL) , pp. 81-88
- McDonald, R.¹ Pereira, F.²

31
- 21244446518
- URL
- Thomas P. Minka. A comparison of numerical optimizers for logistic regression, 2003. URL http://research.microsoft.com/minka/papers/logreg/.
- (2003) A Comparison of Numerical Optimizers for Logistic Regression
- Minka, T.P.¹

32
- 0009313469
- PhD thesis, University of Pennsylvania
- Adwait Ratnaparkhi. Maximum Entropy Models For Natural Language Ambiguity Resolution. PhD thesis, University of Pennsylvania, 1998.
- (1998) Maximum Entropy Models for Natural Language Ambiguity Resolution
- Ratnaparkhi, A.¹

33
- 72449211086
- A stochastic quasi-Newton method for online convex optimization
- Nicol N. Schraudolph, Jin Yu, and Simon Gunter. A stochastic quasi-Newton method for online convex optimization. In Proceedings of the 11th International Conference Artificial Intelligence and Statistics (AISTATS), pages 433-440, 2007.
- (2007) Proceedings of the 11th International Conference Artificial Intelligence and Statistics (AISTATS) , pp. 433-440
- Schraudolph, N.N.¹ Yu, J.² Gunter, S.³

34
- 33749243756
- Accelerated training of conditional random fields with stochastic gradient methods
- S.V.N. Vishwanathan, Nicol N. Schraudolph, Mark W. Schmidt, and Kevin Murphy. Accelerated training of conditional random fields with stochastic gradient methods. In Proceedings of the 23rd International Conference on Machine Learning (ICML), pages 969-976, 2006.
- (2006) Proceedings of the 23rd International Conference on Machine Learning (ICML) , pp. 969-976
- Vishwanathan, S.V.N.¹ Schraudolph, N.N.² Schmidt, M.W.³ Murphy, K.⁴

35
- 35148838927
- Surrogate maximization/minimization algorithms and extensions
- October
- Zhihua Zhang, James T. Kwok, and Dit-Yan Yeung. Surrogate maximization/minimization algorithms and extensions. Machine Learning, 69(1):1-33, October 2007.
- (2007) Machine Learning , vol.69 , Issue.1 , pp. 1-33
- Zhang, Z.¹ Kwok, J.T.² Yeung, D.-Y.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.