-
1
-
-
0000396062
-
Natural gradient works efficiently in learning
-
10.1162/089976698300017746
-
S.-I. Amari 1998 Natural gradient works efficiently in learning Neural Computation 10 251 276 10.1162/089976698300017746
-
(1998)
Neural Computation
, vol.10
, pp. 251-276
-
-
Amari, S.-I.1
-
2
-
-
0002906163
-
Improving the convergence of back-propagation learning with second-order methods
-
Morgan Kaufmann San Mateo
-
Becker, S., & LeCun, Y. (1988). Improving the convergence of back-propagation learning with second-order methods. In Proceedings of the 1988 connectionist models summer school (pp. 29-37). San Mateo: Morgan Kaufmann.
-
(1988)
Proceedings of the 1988 Connectionist Models Summer School
, pp. 29-37
-
-
Becker, S.1
Lecun, Y.2
-
4
-
-
34547991424
-
Solving multiclass support vector machines with LaRank
-
DOI 10.1145/1273496.1273508, Proceedings, Twenty-Fourth International Conference on Machine Learning, ICML 2007
-
Bordes, A., Bottou, L., Gallinari, P., & Weston, J. (2007). Solving multiclass support vector machines with LaRank. In Proceedings of the 24th international conference on machine learning (ICML'07) (pp. 89-96). New York: ACM. (Pubitemid 47275053)
-
(2007)
ACM International Conference Proceeding Series
, vol.227
, pp. 89-96
-
-
Bordes, A.1
Bottou, L.2
Gallinari, P.3
Weston, J.4
-
8
-
-
17444425307
-
On-line learning for very large data sets
-
DOI 10.1002/asmb.538
-
L. Bottou Y. LeCun 2005 On-line learning for very large data sets Applied Stochastic Models in Business and Industry 21 2 137 151 1091.68063 10.1002/asmb.538 2137546 (Pubitemid 40541359)
-
(2005)
Applied Stochastic Models in Business and Industry
, vol.21
, Issue.2
, pp. 137-151
-
-
Bottou, L.1
Lecun, Y.2
-
9
-
-
38849166405
-
-
L. Bottou O. Chapelle D. DeCoste J. Weston (eds). MIT Press Cambridge
-
Bottou, L., Chapelle, O., DeCoste, D., & Weston, J. (Eds.) (2007). Large scale kernel machines. Cambridge: MIT Press.
-
(2007)
Large Scale Kernel Machines
-
-
-
12
-
-
0004014502
-
A Gaussian prior for smoothing maximum entropy models
-
School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
-
Chen, S. F., & Rosenfeld, R. (1999). A Gaussian prior for smoothing maximum entropy models. Technical Report CMU-CS-99-108, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA.
-
(1999)
Technical Report CMU-CS-99-108
-
-
Chen, S.F.1
Rosenfeld, R.2
-
16
-
-
0032751314
-
On computing the largest fraction of missing information for the EM algorithm and the worst linear function for data augmentation
-
DOI 10.1016/S0167-9473(99)00003-1, PII S0167947399000031
-
C. Fraley 1999 On computing the largest fraction of missing information for the EM algorithm and the worst linear function for data augmentation Computational Statistics and Data Analysis 31 1 13 26 1063.62507 10.1016/S0167-9473(99)00003-1 1705719 (Pubitemid 29520103)
-
(1999)
Computational Statistics and Data Analysis
, vol.31
, Issue.1
, pp. 13-26
-
-
Fraley, C.1
-
18
-
-
72449142210
-
Global and componentwise extrapolation for accelerating data mining from large incomplete data sets with the EM algorithm
-
Hong Kong, China
-
Hsu, C.-N., Huang, H.-S., & Yang, B.-H. (2006). Global and componentwise extrapolation for accelerating data mining from large incomplete data sets with the EM algorithm. In Proceedings of the sixth IEEE international conference on data mining (ICDM'06) (pp. 265-274), Hong Kong, China.
-
(2006)
Proceedings of the Sixth IEEE International Conference on Data Mining (ICDM'06)
, pp. 265-274
-
-
Hsu, C.-N.1
Huang, H.-S.2
Yang, B.-H.3
-
19
-
-
46249123634
-
Integrating high dimensional bi-directional parsing models for gene mention tagging
-
DOI 10.1093/bioinformatics/btn183
-
C.-N. Hsu Y.-M. Chang C.-J. Kuo Y.-S. Lin H.-S. Huang I.-F. Chung 2008 Integrating high dimensional bi-directional parsing models for gene mention tagging Bioinformatics 24 13 i286 i294 10.1093/bioinformatics/btn183 Proceedings of ISMB-2008 (Pubitemid 351911684)
-
(2008)
Bioinformatics
, vol.24
, Issue.13
-
-
Hsu, C.-N.1
Chang, Y.-M.2
Kuo, C.-J.3
Lin, Y.-S.4
Huang, H.-S.5
Chung, I.-F.6
-
21
-
-
67649541404
-
Global and componentwise extrapolations for accelerating training of Bayesian networks and conditional random fields
-
10.1007/s10618-009-0128-3
-
H.-S. Huang B.-H. Yang Y.-M. Chang C.-N. Hsu 2009 Global and componentwise extrapolations for accelerating training of Bayesian networks and conditional random fields Data Mining and Knowledge Discovery 19 1 58 91 10.1007/s10618-009-0128-3
-
(2009)
Data Mining and Knowledge Discovery
, vol.19
, Issue.1
, pp. 58-91
-
-
Huang, H.-S.1
Yang, B.-H.2
Chang, Y.-M.3
Hsu, C.-N.4
-
22
-
-
0002714543
-
Making large-scale SVM learning practical
-
B. Schölkopf C. J. C. Burges A. J. Smola (eds). MIT Press Cambridge. Chap. 11
-
Joachims, T. (1998). Making large-scale SVM learning practical. In B. Schölkopf, C. J. C. Burges, & A. J. Smola (Eds.), Advances in kernel methods: support vector learning (Chap. 11, pp. 169-184). Cambridge: MIT Press.
-
(1998)
Advances in Kernel Methods: Support Vector Learning
, pp. 169-184
-
-
Joachims, T.1
-
24
-
-
46249127243
-
-
Available under LGPL from the following URL
-
Kudo, T. (2006). CRF++: Yet another CRF toolkit. Available under LGPL from the following URL: http://crfpp.sourceforge.net/.
-
(2006)
CRF++: Yet Another CRF Toolkit
-
-
Kudo, T.1
-
27
-
-
0001857994
-
Efficient backprop
-
G. Orr K. Muller (eds). Springer Berlin
-
LeCun, Y., Bottou, L., Orr, G. B., & Muller, K.-R. (1998a). Efficient backprop. In G. Orr & K. Muller (Eds.), Neural networks: tricks of the trade. Berlin: Springer.
-
(1998)
Neural Networks: Tricks of the Trade
-
-
Lecun, Y.1
Bottou, L.2
Orr, G.B.3
Muller, K.-R.4
-
28
-
-
0032203257
-
Gradient-based learning applied to document recognition
-
10.1109/5.726791
-
Y. LeCun L. Bottou Y. Bengio P. Haffner 1998 Gradient-based learning applied to document recognition Proceedings of the IEEE 86 11 2278 2324 10.1109/5.726791
-
(1998)
Proceedings of the IEEE
, vol.86
, Issue.11
, pp. 2278-2324
-
-
Lecun, Y.1
Bottou, L.2
Bengio, Y.3
Haffner, P.4
-
29
-
-
84947927646
-
Object recognition with gradient-based learning
-
Springer London. 10.1007/3-540-46805-6_19
-
LeCun, Y., Haffner, P., Bottou, L., & Bengio, Y. (1999). Object recognition with gradient-based learning. In Shape, contour and grouping in computer vision (p. 319). London: Springer.
-
(1999)
Shape, Contour and Grouping in Computer Vision
, pp. 319
-
-
Lecun, Y.1
Haffner, P.2
Bottou, L.3
Bengio, Y.4
-
30
-
-
85162000799
-
Topmoumoute online natural gradient algorithm
-
MIT Press Cambridge
-
LeRoux, N., Manzagol, P.-A., & Bengio, Y. (2008). Topmoumoute online natural gradient algorithm. In Advances in neural information processing systems, 20 (NIPS 2007). Cambridge: MIT Press.
-
(2008)
Advances in Neural Information Processing Systems, 20 (NIPS 2007)
-
-
Leroux, N.1
Manzagol, P.-A.2
Bengio, Y.3
-
33
-
-
38149148314
-
On the global and componentwise rates of convergence of the EM algorithm
-
0818.65153 10.1016/0024-3795(94)90363-8 1274429
-
X.-L. Meng D. B. Rubin 1994 On the global and componentwise rates of convergence of the EM algorithm Linear Algebra and Its Applications 199 413 425 0818.65153 10.1016/0024-3795(94)90363-8 1274429
-
(1994)
Linear Algebra and Its Applications
, vol.199
, pp. 413-425
-
-
Meng, X.-L.1
Rubin, D.B.2
-
34
-
-
0001955526
-
A statistical study of on-line learning
-
Cambridge University Press Cambridge
-
Murata, N. (1998). A statistical study of on-line learning. In On-line learning in neural networks (pp. 63-92). Cambridge: Cambridge University Press.
-
(1998)
On-line Learning in Neural Networks
, pp. 63-92
-
-
Murata, N.1
-
35
-
-
0032629928
-
Statistical analysis of learning dynamics
-
0922.68094 10.1016/S0165-1684(98)00206-0
-
N. Murata S.-I. Amari 1999 Statistical analysis of learning dynamics Signal Processing 74 1 3 28 0922.68094 10.1016/S0165-1684(98)00206-0
-
(1999)
Signal Processing
, vol.74
, Issue.1
, pp. 3-28
-
-
Murata, N.1
Amari, S.-I.2
-
44
-
-
34547964973
-
Pegasos: Primal Estimated sub-GrAdient SOlver for SVM
-
DOI 10.1145/1273496.1273598, Proceedings, Twenty-Fourth International Conference on Machine Learning, ICML 2007
-
Shalev-Shwartz, S., Singer, Y., & Srebro, N. (2007). Pegasos: Primal Estimated sub-GrAdient SOlver for SVM. In ICML'07: Proceedings of the 24th international conference on machine learning (pp. 807-814). New York: ACM. (Pubitemid 47275141)
-
(2007)
ACM International Conference Proceeding Series
, vol.227
, pp. 807-814
-
-
Shalev-Shwartz, S.1
Singer, Y.2
Srebro, N.3
-
46
-
-
31844438834
-
-
Ph.D. thesis, Stanford University, Stanford, CA, USA. Adviser-Daphne Koller
-
Taskar, B. (2005). Learning structured prediction models: a large margin approach. Ph.D. thesis, Stanford University, Stanford, CA, USA. Adviser-Daphne Koller.
-
(2005)
Learning Structured Prediction Models: A Large Margin Approach
-
-
Taskar, B.1
-
47
-
-
31844442382
-
Learning structured prediction models: A large margin approach
-
ACM New York. 10.1145/1102351.1102464
-
Taskar, B., Chatalbashev, V., Koller, D., & Guestrin, C. (2005). Learning structured prediction models: a large margin approach. In Proceedings of the 22nd international conference on machine learning (ICML'05) (pp. 896-903). New York: ACM.
-
(2005)
Proceedings of the 22nd International Conference on Machine Learning (ICML'05)
, pp. 896-903
-
-
Taskar, B.1
Chatalbashev, V.2
Koller, D.3
Guestrin, C.4
-
50
-
-
34250731290
-
Accelerated training of conditional random fields with stochastic gradient methods
-
Pittsburgh, PA, USA
-
Vishwanathan, S., Schraudolph, N. N., Schmidt, M. W., & Murphy, K. P. (2006a). Accelerated training of conditional random fields with stochastic gradient methods. In Proceedings of the 23rd international conference on machine learning (ICML'06), Pittsburgh, PA, USA.
-
(2006)
Proceedings of the 23rd International Conference on Machine Learning (ICML'06)
-
-
Vishwanathan, S.1
Schraudolph, N.N.2
Schmidt, M.W.3
Murphy, K.P.4