-
1
-
-
34547975052
-
Scaling learning algorithms toward AI
-
L. Bottou, O. Chapelle, D. DeCoste, and J. Weston, editors MIT Press
-
Y. Bengio and Y. LeCun. Scaling learning algorithms toward AI. In L. Bottou, O. Chapelle, D. DeCoste, and J. Weston, editors, Large-Scale Kernel Machines, Neural Information Processing Series, pages 321-360. MIT Press, 2007.
-
(2007)
Large-Scale Kernel Machines, Neural Information Processing Series
, pp. 321-360
-
-
Bengio, Y.1
LeCun, Y.2
-
2
-
-
84864073449
-
Greedy layer-wise training of deep networks
-
B. Schölkopf, J. Platt, and T. Hofmann, editors MIT Press, Cambridge, MA
-
Y. Bengio, P. Lamblin, D. Popovici, and H. Larochelle. Greedy layer-wise training of deep networks. In B. Schölkopf, J. Platt, and T. Hofmann, editors, Advances in Neural Information Processing Systems (NIPS), Volume 19, pages 153-160. MIT Press, Cambridge, MA, 2007.
-
(2007)
Advances in Neural Information Processing Systems (NIPS)
, vol.19
, pp. 153-160
-
-
Bengio, Y.1
Lamblin, P.2
Popovici, D.3
Larochelle, H.4
-
3
-
-
84857855190
-
Random search for hyperparameter optimization
-
J. Bergstra and Y. Bengio. Random search for hyperparameter optimization. J. Machine Learning Research, 13:281-305, 2012.
-
(2012)
J. Machine Learning Research
, vol.13
, pp. 281-305
-
-
Bergstra, J.1
Bengio, Y.2
-
4
-
-
33846516584
-
-
Springer Series in Information Science and Statistics. Springer-Verlag, Berlin
-
C. M. Bishop. Pattern Recognition and Machine Learning. Springer Series in Information Science and Statistics. Springer-Verlag, Berlin, 2006.
-
(2006)
Pattern Recognition and Machine Learning
-
-
Bishop, C.M.1
-
5
-
-
80051762104
-
Distributed optimization and statistical learning via the alternating direction method of multipliers
-
S. Boyd, N. Parikh, E. Chu, B. Peleato, and J. Eckstein. Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends in Machine Learning, 3(1):1-122, 2011.
-
(2011)
Foundations and Trends in Machine Learning
, vol.3
, Issue.1
, pp. 1-122
-
-
Boyd, S.1
Parikh, N.2
Chu, E.3
Peleato, B.4
Eckstein, J.5
-
8
-
-
84857158204
-
Manifold learning and missing data recovery through unsupervised regression
-
Vancouver, BC, Dec. 11-14
-
M. Á. Carreira-Perpiñán and Z. Lu. Manifold learning and missing data recovery through unsupervised regression. In Proc. of the 12th IEEE Int. Conf. Data Mining (ICDM 2011), pages 1014-1019, Vancouver, BC, Dec. 11-14 2011.
-
(2011)
Proc. of the 12th IEEE Int. Conf. Data Mining (ICDM 2011)
, pp. 1014-1019
-
-
Carreira-Perpiñán, M.A.1
Lu, Z.2
-
10
-
-
33745751396
-
A very fast learning method for neural networks based on sensitivity analysis
-
July
-
E. Castillo, B. Guijarro-Berdiñas, O. Fontenla-Romero, and A. Alonso-Betanzos. A very fast learning method for neural networks based on sensitivity analysis. J. Machine Learning Research, 7:1159-1182, July 2006.
-
(2006)
J. Machine Learning Research
, vol.7
, pp. 1159-1182
-
-
Castillo, E.1
Guijarro-Berdiñas, B.2
Fontenla-Romero, O.3
Alonso-Betanzos, A.4
-
11
-
-
84867614591
-
Scalable stacking and learning for building deep architectures
-
Kyoto, Japan, Mar. 25-30
-
L. Deng, D. Yu, and J. Platt. Scalable stacking and learning for building deep architectures. In Proc. of the IEEE Int. Conf. Acoustics, Speech and Sig. Proc. (ICASSP'12), pages 2133-2136, Kyoto, Japan, Mar. 25-30 2012.
-
(2012)
Proc. of the IEEE Int. Conf. Acoustics, Speech and Sig. Proc. (ICASSP'12)
, pp. 2133-2136
-
-
Deng, L.1
Yu, D.2
Platt, J.3
-
12
-
-
79961226155
-
The difficulty of training deep architectures and the effect of unsupervised pre-training
-
Clearwater Beach, FL, Mar. 21-24
-
D. Erhan, P. A. Manzagol, Y. Bengio, S. Bengio, and P. Vincent. The difficulty of training deep architectures and the effect of unsupervised pre-training. In Proc. of the 12th Int. Workshop on Artificial Intelligence and Statistics (AISTATS 2009), pages 153-160, Clearwater Beach, FL, Mar. 21-24 2009.
-
(2009)
Proc. of the 12th Int. Workshop on Artificial Intelligence and Statistics (AISTATS 2009)
, pp. 153-160
-
-
Erhan, D.1
Manzagol, P.A.2
Bengio, Y.3
Bengio, S.4
Vincent, P.5
-
14
-
-
0011943051
-
Learning by choice of internal representations
-
T. Grossman, R. Meir, and E. Domany. Learning by choice of internal representations. Complex Systems, 2(5):555-575, 1988.
-
(1988)
Complex Systems
, vol.2
, Issue.5
, pp. 555-575
-
-
Grossman, T.1
Meir, R.2
Domany, E.3
-
15
-
-
0003684449
-
-
Springer Series in Statistics. Springer-Verlag, second edition
-
T. J. Hastie, R. J. Tibshirani, and J. H. Friedman. The Elements of Statistical Learning - Data Mining, Inference and Prediction. Springer Series in Statistics. Springer-Verlag, second edition, 2009.
-
(2009)
The Elements of Statistical Learning - Data Mining, Inference and Prediction
-
-
Hastie, T.J.1
Tibshirani, R.J.2
Friedman, J.H.3
-
16
-
-
33746600649
-
Reducing the dimensionality of data with neural networks
-
July 28
-
G. E. Hinton and R. R. Salakhutdinov. Reducing the dimensionality of data with neural networks. Science, 313 (5786):504-507, July 28 2006.
-
(2006)
Science
, vol.313
, Issue.5786
, pp. 504-507
-
-
Hinton, G.E.1
Salakhutdinov, R.R.2
-
17
-
-
70049083257
-
Fast inference in sparse coding algorithms with applications to object recognition
-
New York University, Dec. 4
-
K. Kavukcuoglu, M. Ranzato, and Y. LeCun. Fast inference in sparse coding algorithms with applications to object recognition. Technical Report CBLL-TR-2008-12-01, Dept. of Computer Science, New York University, Dec. 4 2008.
-
(2008)
Technical Report CBLL-TR-2008-12-01, Dept. of Computer Science
-
-
Kavukcuoglu, K.1
Ranzato, M.2
LeCun, Y.3
-
18
-
-
0000638998
-
A cost function for internal representations
-
D. S. Touretzky, editor Morgan Kaufmann, San Mateo, CA
-
A. Krogh, C. J. Thorbergsson, and J. A. Hertz. A cost function for internal representations. In D. S. Touretzky, editor, Advances in Neural Information Processing Systems (NIPS), Volume 2, pages 733-740. Morgan Kaufmann, San Mateo, CA, 1990.
-
(1990)
Advances in Neural Information Processing Systems (NIPS)
, vol.2
, pp. 733-740
-
-
Krogh, A.1
Thorbergsson, C.J.2
Hertz, J.A.3
-
19
-
-
27844605876
-
Probabilistic non-linear principal component analysis with Gaussian process latent variable models
-
Nov.
-
N. Lawrence. Probabilistic non-linear principal component analysis with Gaussian process latent variable models. J. Machine Learning Research, 6:1783-1816, Nov. 2005.
-
(2005)
J. Machine Learning Research
, vol.6
, pp. 1783-1816
-
-
Lawrence, N.1
-
20
-
-
0032203257
-
Gradient-based learning applied to document recognition
-
Nov.
-
Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. Gradient-based learning applied to document recognition. Proc. IEEE, 86(11):2278-2324, Nov. 1998.
-
(1998)
Proc. IEEE
, vol.86
, Issue.11
, pp. 2278-2324
-
-
LeCun, Y.1
Bottou, L.2
Bengio, Y.3
Haffner, P.4
-
21
-
-
0031105693
-
An efficient EM-based training algorithm for feedforward neural networks
-
Mar.
-
S. Ma, C. Ji, and J. Farmer. An efficient EM-based training algorithm for feedforward neural networks. Neural Networks, 10(2):243-256, Mar. 1997.
-
(1997)
Neural Networks
, vol.10
, Issue.2
, pp. 243-256
-
-
Ma, S.1
Ji, C.2
Farmer, J.3
-
22
-
-
0003840341
-
Columbia object image library (COIL-20)
-
Columbia University, Feb.
-
S. A. Nene, S. K. Nayar, and H. Murase. Columbia object image library (COIL-20). Technical Report CUCS-005-96, Dept. of Computer Science, Columbia University, Feb. 1996.
-
(1996)
Technical Report CUCS-005-96, Dept. of Computer Science
-
-
Nene, S.A.1
Nayar, S.K.2
Murase, H.3
-
23
-
-
85098056778
-
-
Springer Series in Operations Research and Financial Engineering. Springer-Verlag, New York, second edition
-
J. Nocedal and S. J. Wright. Numerical Optimization. Springer Series in Operations Research and Financial Engineering. Springer-Verlag, New York, second edition, 2006.
-
(2006)
Numerical Optimization
-
-
Nocedal, J.1
Wright, S.J.2
-
24
-
-
0029938380
-
Emergence of simple-cell receptive field properties by learning a sparse code for natural images
-
June 13
-
B. A. Olshausen and D. J. Field. Emergence of simple-cell receptive field properties by learning a sparse code for natural images. Nature, 381(6583):607-609, June 13 1996.
-
(1996)
Nature
, vol.381
, Issue.6583
, pp. 607-609
-
-
Olshausen, B.A.1
Field, D.J.2
-
26
-
-
84864069017
-
Efficient learning of sparse representations with an energy-based model
-
B. Schölkopf, J. Platt, and T. Hofmann, editors MIT Press, Cambridge, MA
-
M. Ranzato, C. Poultney, S. Chopra, and Y. LeCun. Efficient learning of sparse representations with an energy-based model. In B. Schölkopf, J. Platt, and T. Hofmann, editors, Advances in Neural Information Processing Systems (NIPS), Volume 19, pages 1137-1144. MIT Press, Cambridge, MA, 2007.
-
(2007)
Advances in Neural Information Processing Systems (NIPS)
, vol.19
, pp. 1137-1144
-
-
Ranzato, M.1
Poultney, C.2
Chopra, S.3
LeCun, Y.4
-
27
-
-
0016985417
-
Monotone operators and the proximal point algorithm
-
R. T. Rockafellar. Monotone operators and the proximal point algorithm. SIAM J. Control and Optim., 14(5): 877-898, 1976.
-
(1976)
SIAM J. Control and Optim.
, vol.14
, Issue.5
, pp. 877-898
-
-
Rockafellar, R.T.1
-
28
-
-
0000201186
-
On langevin updating in multilayer perceptrons
-
Sept.
-
T. Rögnvaldsson. On Langevin updating in multilayer perceptrons. Neural Computation, 6(5):916-926, Sept. 1994.
-
(1994)
Neural Computation
, vol.6
, Issue.5
, pp. 916-926
-
-
Rögnvaldsson, T.1
-
29
-
-
0004663635
-
The 'moving targets' training algorithm
-
D. S. Touretzky, editor Morgan Kaufmann, San Mateo, CA
-
R. Rohwer. The 'moving targets' training algorithm. In D. S. Touretzky, editor, Advances in Neural Information Processing Systems (NIPS), Volume 2, pages 558-565. Morgan Kaufmann, San Mateo, CA, 1990.
-
(1990)
Advances in Neural Information Processing Systems (NIPS)
, vol.2
, pp. 558-565
-
-
Rohwer, R.1
-
30
-
-
0022471098
-
Learning representations by back-propagating errors
-
D. E. Rumelhart, G. E. Hinton, and R. J. Williams. Learning representations by back-propagating errors. Nature, 323:533-536, 1986.
-
(1986)
Nature
, vol.323
, pp. 533-536
-
-
Rumelhart, D.E.1
Hinton, G.E.2
Williams, R.J.3
-
31
-
-
0342427298
-
Learning by choice of internal representations: An energy minimization approach
-
D. Saad and E. Marom. Learning by choice of internal representations: An energy minimization approach. Complex Systems, 4(1):107-118, 1990.
-
(1990)
Complex Systems
, vol.4
, Issue.1
, pp. 107-118
-
-
Saad, D.1
Marom, E.2
-
32
-
-
85032751472
-
Large-vocabulary continuous speech recognition systems: A look at some recent advances
-
Nov.
-
G. Saon and J.-T. Chien. Large-vocabulary continuous speech recognition systems: A look at some recent advances. IEEE Signal Processing Magazine, 29(6):18-33, Nov. 2012.
-
(2012)
IEEE Signal Processing Magazine
, vol.29
, Issue.6
, pp. 18-33
-
-
Saon, G.1
Chien, J.-T.2
-
33
-
-
33847380121
-
Robust object recognition with cortex-like mechanisms
-
Mar.
-
T. Serre, L. Wolf, S. Bileschi, M. Riesenhuber, and T. Poggio. Robust object recognition with cortex-like mechanisms. IEEE Trans. Pattern Analysis and Machine Intelligence, 29(3):411-426, Mar. 2007.
-
(2007)
IEEE Trans. Pattern Analysis and Machine Intelligence
, vol.29
, Issue.3
, pp. 411-426
-
-
Serre, T.1
Wolf, L.2
Bileschi, S.3
Riesenhuber, M.4
Poggio, T.5
-
34
-
-
0029322882
-
Reducing data dimensionality through optimizing neural network inputs
-
June
-
S. Tan and M. L. Mavrovouniotis. Reducing data dimensionality through optimizing neural network inputs. AIChE Journal, 41(6):1471-1479, June 1995.
-
(1995)
AIChE Journal
, vol.41
, Issue.6
, pp. 1471-1479
-
-
Tan, S.1
Mavrovouniotis, M.L.2
-
35
-
-
84954229804
-
Nonlinear low-dimensional regression using auxiliary coordinates
-
N. Lawrence and M. Girolami, editors La Palma, Canary Islands, Spain, Apr. 21-23
-
W. Wang and M. Á. Carreira-Perpiñán. Nonlinear low-dimensional regression using auxiliary coordinates. In N. Lawrence and M. Girolami, editors, Proc. of the 15th Int. Workshop on Artificial Intelligence and Statistics (AISTATS 2012), pages 1295-1304, La Palma, Canary Islands, Spain, Apr. 21-23 2012.
-
(2012)
Proc. of the 15th Int. Workshop on Artificial Intelligence and Statistics (AISTATS 2012)
, pp. 1295-1304
-
-
Wang, W.1
Carreira-Perpiñán, M.A.2
|