-
4
-
-
0002652285
-
A maximum entropy approach to natural language processing
-
Adam L. Berger, Vincent J. Della Pietra, and Stephen A. Della Pietra. A maximum entropy approach to natural language processing. Computational Linguistics, 22(1):39-71, 1996.
-
(1996)
Computational Linguistics
, vol.22
, Issue.1
, pp. 39-71
-
-
Berger, A.L.1
Della Pietra, V.J.2
Della Pietra, S.A.3
-
5
-
-
0003713964
-
-
Athena Scientific, Belmont, MA 02178-9998, second edition
-
Dimitri P. Bertsekas. Nonlinear Programming. Athena Scientific, Belmont, MA 02178-9998, second edition, 1999.
-
(1999)
Nonlinear Programming
-
-
Bertsekas, D.P.1
-
6
-
-
33947180792
-
Stochastic learning
-
Olivier Bousquet and Ulrike von Luxburg, editors, Lecture Notes in Artificial Intelligence, LNAI 3176. Springer Verlag
-
Léon Bottou. Stochastic learning. In Olivier Bousquet and Ulrike von Luxburg, editors, Advanced Lectures on Machine Learning, Lecture Notes in Artificial Intelligence, LNAI 3176, pages 146-168. Springer Verlag, 2004.
-
(2004)
Advanced Lectures on Machine Learning
, pp. 146-168
-
-
Bottou, L.1
-
7
-
-
48849104146
-
Coordinate descent method for large-scale L2-loss linear SVM
-
URL
-
Kai-Wei Chang, Cho-Jui Hsieh, and Chih-Jen Lin. Coordinate descent method for large-scale L2-loss linear SVM. Journal of Machine Learning Research, 9:1369-1398, 2008. URL http://www.csie.ntu.edu.tw/cjlin/papers/cdl2.pdf.
-
(2008)
Journal of Machine Learning Research
, vol.9
, pp. 1369-1398
-
-
Chang, K.-W.1
Hsieh, C.-J.2
Lin, C.-J.3
-
9
-
-
0036643072
-
Logistic regression, AdaBoost and Bregman distances
-
Michael Collins, Robert E. Schapire, and Yoram Singer. Logistic regression, AdaBoost and Bregman distances. Machine Learning, 48(1-3):253-285, 2002.
-
(2002)
Machine Learning
, vol.48
, Issue.1-3
, pp. 253-285
-
-
Collins, M.1
Schapire, R.E.2
Singer, Y.3
-
10
-
-
50949133940
-
Exponentiated gradient algorithms for conditional random fields and max-margin Markov networks
-
Michael Collins, Amir Globerson, Terry Koo, Xavier Carreras, and Peter Bartlett. Exponentiated gradient algorithms for conditional random fields and max-margin Markov networks. Journal of Machine Learning Research, 9:1775-1822, 2008.
-
(2008)
Journal of Machine Learning Research
, vol.9
, pp. 1775-1822
-
-
Collins, M.1
Globerson, A.2
Koo, T.3
Carreras, X.4
Bartlett, P.5
-
11
-
-
0001573124
-
Generalized iterative scaling for log-linear models
-
John N. Darroch and Douglas Ratcliff. Generalized iterative scaling for log-linear models. The Annals of Mathematical Statistics, 43(5):1470-1480, 1972.
-
(1972)
The Annals of Mathematical Statistics
, vol.43
, Issue.5
, pp. 1470-1480
-
-
Darroch, J.N.1
Ratcliff, D.2
-
13
-
-
0031120321
-
Inducing features of random fields
-
Stephen Della Pietra, Vincent Della Pietra, and John Lafferty. Inducing features of random fields. IEEE Transactions on Pattern Analysis and Machine Intelligence, 19(4):380-393, 1997.
-
(1997)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.19
, Issue.4
, pp. 380-393
-
-
Della Pietra, S.1
Della Pietra, V.2
Lafferty, J.3
-
17
-
-
84860542469
-
A comparative study of parameter estimation methods for statistical natural language processing
-
Jianfeng Gao, Galen Andrew, Mark Johnson, and Kristina Toutanova. A comparative study of parameter estimation methods for statistical natural language processing. In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics (ACL), pages 824-831, 2007.
-
(2007)
Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics (ACL)
, pp. 824-831
-
-
Gao, J.1
Andrew, G.2
Johnson, M.3
Toutanova, K.4
-
18
-
-
34548105186
-
Large-scale Bayesian logistic regression for text categorization
-
Alexander Genkin, David D. Lewis, and David Madigan. Large-scale Bayesian logistic regression for text categorization. Technometrics, 49(3):291-304, 2007.
-
(2007)
Technometrics
, vol.49
, Issue.3
, pp. 291-304
-
-
Genkin, A.1
Lewis, D.D.2
Madigan, D.3
-
20
-
-
0032665409
-
Globally convergent block-coordinate techniques for unconstrained optimization
-
Luigi Grippo and Marco Sciandrone. Globally convergent block-coordinate techniques for unconstrained optimization. Optimization Methods and Software, 10:587-637, 1999.
-
(1999)
Optimization Methods and Software
, vol.10
, pp. 587-637
-
-
Grippo, L.1
Sciandrone, M.2
-
23
-
-
30044437592
-
A fast dual algorithm for kernel logistic regression
-
S. Sathiya Keerthi, Kaibo Duan, Shirish Shevade, and Aun Neow Poo. A fast dual algorithm for kernel logistic regression. Machine Learning, 61:151-165, 2005.
-
(2005)
Machine Learning
, vol.61
, pp. 151-165
-
-
Keerthi, S.S.1
Duan, K.2
Shevade, S.3
Poo, A.N.4
-
24
-
-
34547688865
-
An interior-point method for large-scale l1-regularized logistic regression
-
URL
-
Kwangmoo Koh, Seung-Jean Kim, and Stephen Boyd. An interior-point method for large-scale l1-regularized logistic regression. Journal of Machine Learning Research, 8:1519-1555, 2007. URL http://www.stanford.edu/boyd/l1-logistic-reg.html.
-
(2007)
Journal of Machine Learning Research
, vol.8
, pp. 1519-1555
-
-
Koh, K.1
Kim, S.-J.2
Boyd, S.3
-
25
-
-
77950023906
-
Optimization transfer using surrogate objective functions
-
March
-
Kenneth Lange, David R. Hunter, and Ilsoon Yang. Optimization transfer using surrogate objective functions. Journal of Computational and Graphical Statistics, 9(1):1-20, March 2000.
-
(2000)
Journal of Computational and Graphical Statistics
, vol.9
, Issue.1
, pp. 1-20
-
-
Lange, K.1
Hunter, D.R.2
Yang, I.3
-
26
-
-
44649088319
-
Trust region Newton method for large-scale logistic regression
-
URL
-
Chih-Jen Lin, Ruby C. Weng, and S. Sathiya Keerthi. Trust region Newton method for large-scale logistic regression. Journal of Machine Learning Research, 9:627-650, 2008. URL http://www.csie.ntu.edu.tw/cjlin/papers/logistic.pdf.
-
(2008)
Journal of Machine Learning Research
, vol.9
, pp. 627-650
-
-
Lin, C.-J.1
Weng, R.C.2
Keerthi, S.S.3
-
27
-
-
33646887390
-
On the limited memory BFGS method for large scale optimization
-
Dong C. Liu and Jorge Nocedal. On the limited memory BFGS method for large scale optimization. Mathematical Programming, 45(1):503-528, 1989.
-
(1989)
Mathematical Programming
, vol.45
, Issue.1
, pp. 503-528
-
-
Liu, D.C.1
Nocedal, J.2
-
28
-
-
0026678659
-
On the convergence of coordinate descent method for convex differentiable minimization
-
Zhi-Quan Luo and Paul Tseng. On the convergence of coordinate descent method for convex differentiable minimization. Journal of Optimization Theory and Applications, 72(1):7-35, 1992.
-
(1992)
Journal of Optimization Theory and Applications
, vol.72
, Issue.1
, pp. 7-35
-
-
Luo, Z.-Q.1
Tseng, P.2
-
29
-
-
1042264823
-
A comparison of algorithms for maximum entropy parameter estimation
-
Association for Computational Linguistics
-
Robert Malouf. A comparison of algorithms for maximum entropy parameter estimation. In Proceedings of the 6th conference on Natural language learning, pages 1-7. Association for Computational Linguistics, 2002.
-
(2002)
Proceedings of the 6th Conference on Natural Language Learning
, pp. 1-7
-
-
Malouf, R.1
-
34
-
-
33749243756
-
Accelerated training of conditional random fields with stochastic gradient methods
-
S.V.N. Vishwanathan, Nicol N. Schraudolph, Mark W. Schmidt, and Kevin Murphy. Accelerated training of conditional random fields with stochastic gradient methods. In Proceedings of the 23rd International Conference on Machine Learning (ICML), pages 969-976, 2006.
-
(2006)
Proceedings of the 23rd International Conference on Machine Learning (ICML)
, pp. 969-976
-
-
Vishwanathan, S.V.N.1
Schraudolph, N.N.2
Schmidt, M.W.3
Murphy, K.4
-
35
-
-
35148838927
-
Surrogate maximization/minimization algorithms and extensions
-
October
-
Zhihua Zhang, James T. Kwok, and Dit-Yan Yeung. Surrogate maximization/minimization algorithms and extensions. Machine Learning, 69(1):1-33, October 2007.
-
(2007)
Machine Learning
, vol.69
, Issue.1
, pp. 1-33
-
-
Zhang, Z.1
Kwok, J.T.2
Yeung, D.-Y.3