-
1
-
-
84922389693
-
-
arxiv:1409.0473, Technical report, arXiv preprint .
-
Bahdanau, D., Cho, K., Bengio, Y. (2014). Neural machine translation by jointly learning to align and translate. Technical report, arXiv preprint arxiv:1409.0473.
-
(2014)
Neural machine translation by jointly learning to align and translate
-
-
Bahdanau, D.1
Cho, K.2
Bengio, Y.3
-
3
-
-
84864073449
-
Greedy layer-wise training of deep networks
-
Bengio, Y., Lamblin, P., Popovici, D., Larochelle, H. (2007). Greedy layer-wise training of deep networks. In NIPS.
-
(2007)
NIPS
-
-
Bengio, Y.1
Lamblin, P.2
Popovici, D.3
Larochelle, H.4
-
4
-
-
84861776914
-
Multi-column deep neural network for traffic sign classification
-
Ciresan D., Meier U., Masci J., Schmidhuber J. Multi-column deep neural network for traffic sign classification. Neural Networks 2012, 32:333-338.
-
(2012)
Neural Networks
, vol.32
, pp. 333-338
-
-
Ciresan, D.1
Meier, U.2
Masci, J.3
Schmidhuber, J.4
-
5
-
-
85162069624
-
Phone recognition with the mean-covariance restricted Boltzmann machine
-
Dahl, G.E., Ranzato, M., Mohamed, A., Hinton, G.E. (2010). Phone recognition with the mean-covariance restricted Boltzmann machine. In NIPS.
-
(2010)
NIPS
-
-
Dahl, G.E.1
Ranzato, M.2
Mohamed, A.3
Hinton, G.E.4
-
6
-
-
79959842828
-
Binary coding of speech spectrograms using a deep auto-encoder
-
Deng, L., Seltzer, M., Yu, D., Acero, A., Mohamed, A., Hinton, G. (2010). Binary coding of speech spectrograms using a deep auto-encoder. In Interspeech.
-
(2010)
Interspeech
-
-
Deng, L.1
Seltzer, M.2
Yu, D.3
Acero, A.4
Mohamed, A.5
Hinton, G.6
-
7
-
-
84906921986
-
Fast and robust neural network joint models for statistical machine translation
-
Devlin, J., Zbib, R., Huang, Z., Lamar, T., Schwartz, R., Makhoul, J. (2014). Fast and robust neural network joint models for statistical machine translation. In ACL.
-
(2014)
ACL
-
-
Devlin, J.1
Zbib, R.2
Huang, Z.3
Lamar, T.4
Schwartz, R.5
Makhoul, J.6
-
8
-
-
77949522811
-
Why does unsupervised pre-training help deep learning?
-
Erhan D., Bengio Y., Courville A., Manzagol P.-A., Vincent P., Bengio S. Why does unsupervised pre-training help deep learning?. The Journal of Machine Learning Research 2010, 11:625-660.
-
(2010)
The Journal of Machine Learning Research
, vol.11
, pp. 625-660
-
-
Erhan, D.1
Bengio, Y.2
Courville, A.3
Manzagol, P.-A.4
Vincent, P.5
Bengio, S.6
-
9
-
-
84876258641
-
Learning hierarchical features for scene labeling
-
Farabet C., Couprie C., Najman L., LeCun Y. Learning hierarchical features for scene labeling. IEEE Transactions on Pattern Analysis and Machine Intelligence 2013, 35(8):1915-1929.
-
(2013)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.35
, Issue.8
, pp. 1915-1929
-
-
Farabet, C.1
Couprie, C.2
Najman, L.3
LeCun, Y.4
-
11
-
-
80053443013
-
Domain adaptation for large-scale sentiment classification: A deep learning approach
-
Glorot, X., Bordes, A., Bengio, Y. (2011b). Domain adaptation for large-scale sentiment classification: A deep learning approach. In ICML.
-
(2011)
ICML
-
-
Glorot, X.1
Bordes, A.2
Bengio, Y.3
-
13
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition
-
Hinton G., Deng L., Dahl G.E., Mohamed A., Jaitly N., Senior A., Vanhoucke V., Nguyen P., Sainath T., Kingsbury B. Deep neural networks for acoustic modeling in speech recognition. IEEE Signal Processing Magazine 2012, 29(6):82-97.
-
(2012)
IEEE Signal Processing Magazine
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Dahl, G.E.3
Mohamed, A.4
Jaitly, N.5
Senior, A.6
Vanhoucke, V.7
Nguyen, P.8
Sainath, T.9
Kingsbury, B.10
-
14
-
-
33745805403
-
A fast learning algorithm for deep belief nets
-
Hinton G.E., Osindero S., Teh Y. A fast learning algorithm for deep belief nets. Neural Computation 2006, 18:1527-1554.
-
(2006)
Neural Computation
, vol.18
, pp. 1527-1554
-
-
Hinton, G.E.1
Osindero, S.2
Teh, Y.3
-
16
-
-
84876231242
-
ImageNet classification with deep convolutional neural networks
-
Krizhevsky, A., Sutskever, I., Hinton, G. (2012). ImageNet classification with deep convolutional neural networks. In NIPS.
-
(2012)
NIPS
-
-
Krizhevsky, A.1
Sutskever, I.2
Hinton, G.3
-
17
-
-
84867135575
-
Building high-level features using large scale unsupervised learning
-
Le, Q., Ranzato, M., Monga, R., Devin, M., Corrado, G., Chen, K., Dean, J., Ng, A. (2012). Building high-level features using large scale unsupervised learning. In ICML.
-
(2012)
ICML
-
-
Le, Q.1
Ranzato, M.2
Monga, R.3
Devin, M.4
Corrado, G.5
Chen, K.6
Dean, J.7
Ng, A.8
-
18
-
-
0000359337
-
Backpropagation applied to handwritten zip code recognition
-
LeCun Y., Boser B., Denker J.S., Henderson D., Howard R.E., Hubbard W., Jackel L.D. Backpropagation applied to handwritten zip code recognition. Neural Computation 1989.
-
(1989)
Neural Computation
-
-
LeCun, Y.1
Boser, B.2
Denker, J.S.3
Henderson, D.4
Howard, R.E.5
Hubbard, W.6
Jackel, L.D.7
-
19
-
-
0032203257
-
Gradient based learning applied to document recognition
-
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P. (1998). Gradient based learning applied to document recognition. In Proc. IEEE.
-
(1998)
Proc IEEE.
-
-
LeCun, Y.1
Bottou, L.2
Bengio, Y.3
Haffner, P.4
-
20
-
-
85161980001
-
Sparse deep belief network model for visual area V2
-
Lee, H., Ekanadham, C., Ng, A.Y. (2008). Sparse deep belief network model for visual area V2. In NIPS.
-
(2008)
NIPS
-
-
Lee, H.1
Ekanadham, C.2
Ng, A.Y.3
-
21
-
-
71149119164
-
Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations
-
Lee, H., Grosse, R., Ranganath, R., Ng, A.Y. (2009a). Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations. In ICML.
-
(2009)
ICML
-
-
Lee, H.1
Grosse, R.2
Ranganath, R.3
Ng, A.Y.4
-
22
-
-
84863380535
-
Unsupervised feature learning for audio classification using convolutional deep belief networks
-
Lee, H., Largman, Y., Pham, P., Ng, A.Y. (2009b). Unsupervised feature learning for audio classification using convolutional deep belief networks. In NIPS.
-
(2009)
NIPS
-
-
Lee, H.1
Largman, Y.2
Pham, P.3
Ng, A.Y.4
-
23
-
-
84872561833
-
Unsupervised and transfer learning challenge: a deep learning approach
-
Mesnil, G., Dauphin, Y., Glorot, X., Rifai, S., Bengio, Y., Goodfellow, I., Lavoie, E., Muller, X., Desjardins, G., Warde-Farley, D., Vincent, P., Courville, A., Bergstra, J. (2011). Unsupervised and transfer learning challenge: a deep learning approach. In JMLR W&CP: Proc. Unsupervised and Transfer Learning, Vol. 7.
-
(2011)
JMLR W&CP: Proc Unsupervised and Transfer Learning
, vol.7
-
-
Mesnil, G.1
Dauphin, Y.2
Glorot, X.3
Rifai, S.4
Bengio, Y.5
Goodfellow, I.6
Lavoie, E.7
Muller, X.8
Desjardins, G.9
Warde-Farley, D.10
Vincent, P.11
Courville, A.12
Bergstra, J.13
-
24
-
-
84898956512
-
Distributed representations of words and phrases and their compositionality
-
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J. (2013). Distributed representations of words and phrases and their compositionality. In NIPS.
-
(2013)
NIPS
-
-
Mikolov, T.1
Sutskever, I.2
Chen, K.3
Corrado, G.S.4
Dean, J.5
-
25
-
-
77956509090
-
Rectified linear units improve restricted Boltzmann machines
-
Nair, V., Hinton, G. (2010). Rectified linear units improve restricted Boltzmann machines. In ICML.
-
(2010)
ICML
-
-
Nair, V.1
Hinton, G.2
-
26
-
-
51949106645
-
Self-taught learning: Transfer learning from unlabeled data
-
Raina, R., Battle, A., Lee, H., Packer, B., Ng, A.Y. (2007). Self-taught learning: Transfer learning from unlabeled data. In ICML.
-
(2007)
ICML
-
-
Raina, R.1
Battle, A.2
Lee, H.3
Packer, B.4
Ng, A.Y.5
-
27
-
-
84864069017
-
Efficient learning of sparse representations with an energy-based model
-
Ranzato, M., Poultney, C., Chopra, S., LeCun, Y. (2007). Efficient learning of sparse representations with an energy-based model. In NIPS.
-
(2007)
NIPS
-
-
Ranzato, M.1
Poultney, C.2
Chopra, S.3
LeCun, Y.4
-
28
-
-
36849095780
-
Restricted Boltzmann machines for collaborative filtering
-
Salakhutdinov, R., Mnih, A., Hinton, G. (2007). Restricted Boltzmann machines for collaborative filtering. In ICML.
-
(2007)
ICML
-
-
Salakhutdinov, R.1
Mnih, A.2
Hinton, G.3
-
30
-
-
84865801985
-
Conversational speech transcription using context-dependent deep neural networks
-
Seide, F., Li, G., Yu, D. (2011). Conversational speech transcription using context-dependent deep neural networks. In Interspeech (pp. 437-440).
-
(2011)
Interspeech
, pp. 437-440
-
-
Seide, F.1
Li, G.2
Yu, D.3
-
31
-
-
84887328988
-
Pedestrian detection with unsupervised multi-stage feature learning
-
Sermanet, P., Kavukcuoglu, K., Chintala, S., LeCun, Y. (2013). Pedestrian detection with unsupervised multi-stage feature learning. In International Conference on Computer Vision and Pattern Recognition, June.
-
(2013)
International Conference on Computer Vision and Pattern Recognition, June
-
-
Sermanet, P.1
Kavukcuoglu, K.2
Chintala, S.3
LeCun, Y.4
-
32
-
-
84910597353
-
-
Technical report, arXiv preprint .
-
Sutskever, I., Vinyals, O., Le, Q.V. (2014). Sequence to sequence learning with neural networks. Technical report, arXiv preprint arxiv:1409.3215.
-
(2014)
Sequence to sequence learning with neural networks
-
-
Sutskever, I.1
Vinyals, O.2
Le, Q.V.3
-
33
-
-
56449119888
-
Deep learning via semi-supervised embedding
-
Weston, J., Ratle, F., Collobert, R. (2008). Deep learning via semi-supervised embedding. In ICML.
-
(2008)
ICML
-
-
Weston, J.1
Ratle, F.2
Collobert, R.3
|