SCOPUS 정보 검색 플랫폼

Volumn , Issue , 2015, Pages 908-916

Auto-sizing neural networks: With applications to n-gram language models

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL LINGUISTICS; MODELING LANGUAGES;

HIDDEN LAYERS; IMPROVE PERFORMANCE; LANGUAGE MODEL; MACHINE TRANSLATIONS; N-GRAM LANGUAGE MODELS; NATURAL LANGUAGES; NEURAL MODELS; OPTIMAL SETTING;

NATURAL LANGUAGE PROCESSING SYSTEMS;

EID: 84959882423 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.18653/v1/d15-1107 Document Type: Conference Paper

Times cited : (24)

References (19)

1
- 84857710417
- Optimization with sparsity-inducing penalties
- Francis Bach, Rodolphe Jenatton, Mien Mairal, and Guillaume Obozinski. 2012. Optimization with sparsity-inducing penalties. Foundations and Trends in Machine Learning, 4(1):1-106.
- (2012) Foundations and Trends in Machine Learning , vol.4 , Issue.1 , pp. 1-106
- Bach, F.¹ Jenatton, R.² Mairal, M.³ Obozinski, G.⁴

2
- 84941201812
- OxLM: A neural language modelling framework for machine translation
- Paul Baltescu, Phil Blunsom, and Hieu Hoang. 2014. OxLM: A neural language modelling framework for machine translation. Prague Bulletin of Mathematical Linguistics, 102(1):81-92.
- (2014) Prague Bulletin of Mathematical Linguistics , vol.102 , Issue.1 , pp. 81-92
- Baltescu, P.¹ Blunsom, P.² Hoang, H.³

3
- 0142166851
- A neural probabilistic language model
- Yoshua Bengio, Rejean Ducharme, Pascal Vincent, and Christian Janvin. 2003. A neural probabilistic language model. J. Machine Learning Research, 3:1137-1155.
- (2003) J. Machine Learning Research , vol.3 , pp. 1137-1155
- Bengio, Y.¹ Ducharme, R.² Vincent, P.³ Janvin, C.⁴

4
- 84906921986
- Fast and robust neural network joint models for statistical machine translation
- Jacob Devlin, Rabih Zbib, Zhongqiang Huang, Thomas Lamar, Richard Schwartz, and John Makhoul. 2014. Fast and robust neural network joint models for statistical machine translation. In Proc. ACL, pages 1370-1380.
- (2014) Proc. ACL , pp. 1370-1380
- Devlin, J.¹ Zbib, R.² Huang, Z.³ Lamar, T.⁴ Schwartz, R.⁵ Makhoul, J.⁶

5
- 75249102673
- Efficient online and batch learning using forward backward splitting
- John Duchi and Yoram Singer. 2009. Efficient online and batch learning using forward backward splitting. J. Machine Learning Research, 10:2899-2934.
- (2009) J. Machine Learning Research , vol.10 , pp. 2899-2934
- Duchi, J.¹ Singer, Y.²

6
- 56449092085
- 1-ball for learning in high dimensions
- 1-ball for learning in high dimensions. In Proc. ICML, pages 272-279.
- (2008) Proc. ICML , pp. 272-279
- Duchi, J.¹ Shalev-Shwartz, S.² Singer, Y.³ Chandra, T.⁴

7
- 85110867932
- Moses: Open source toolkit for statistical machine translation
- Philipp Koehn, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Moran, Richard Zens, Chris Dyer, Ondfej Bojar, Alexandra Constantin, and Evan Herbst. 2007. Moses: Open source toolkit for statistical machine translation. In Proc. ACL, Interactive Poster and Demonstration Sessions, pages 177-180.
- (2007) Proc. ACL, Interactive Poster and Demonstration Sessions , pp. 177-180
- Koehn, P.¹ Hoang, H.² Birch, A.³ Callison-Burch, C.⁴ Federico, M.⁵ Bertoldi, N.⁶ Cowan, B.⁷ Shen, W.⁸ Moran, C.⁹ Zens, R.¹⁰ Dyer, C.¹¹ Bojar, O.¹² Constantin, A.¹³ Herbst, E.¹⁴

8
- 0000494466
- Optimal brain damage
- Yann LeCun, John S. Denker, Sara A. Sofia, Richard E. Howard, and Lawrence D. Jackel. 1989. Optimal brain damage. In Proc. NIPS, volume 2, pages 598-605.
- (1989) Proc. NIPS , vol.2 , pp. 598-605
- Le Cun, Y.¹ Denker, J.S.² Sofia, S.A.³ Howard, R.E.⁴ Jackel, L.D.⁵

9
- 84901784231
- RNNLM-recurrent neural network language modeling toolkit
- Tomas Mikolov, Stefan Kombrink, Anoop Deoras, Lukar Burget, and Jan Cernocky. 2011. RNNLM-recurrent neural network language modeling toolkit. In Proc. ASRU, pages 196-201.
- (2011) Proc. ASRU , pp. 196-201
- Mikolov, T.¹ Kombrink, S.² Deoras, A.³ Burget, L.⁴ Cernocky, J.⁵

10
- 84867118996
- A fast and simple algorithm for training neural probabilistic language models
- Andriy Mnih and Yee Whye Teh. 2012. A fast and simple algorithm for training neural probabilistic language models. In Proc. ICML, pages 1751-1758.
- (2012) Proc. ICML , pp. 1751-1758
- Mnih, A.¹ Teh, Y.W.²

11
- 77956509090
- Rectified linear units improve restricted boltzmann machines
- Vinod Nair and Geoffrey E Hinton. 2010. Rectified linear units improve Restricted Boltzmann Machines. In Proc. ICML, pages 807-814.
- (2010) Proc. ICML , pp. 807-814
- Nair, V.¹ Hinton, G.E.²

12
- 0001765492
- Simplifying neural networks by soft weight-sharing
- Steven J. Nowland and Geoffrey E. Hinton. 1992. Simplifying neural networks by soft weight-sharing. Neural Computation, 4:473-493.
- (1992) Neural Computation , vol.4 , pp. 473-493
- Nowland, S.J.¹ Hinton, G.E.²

13
- 84884129062
- Proximal algorithms
- Neal Parikh and Stephen Boyd. 2014. Proximal algorithms. Foundations and Trends in Optimization, 1(3):127-239.
- (2014) Foundations and Trends in Optimization , vol.1 , Issue.3 , pp. 127-239
- Parikh, N.¹ Boyd, S.²

14
- 71149088514
- 1∞ regularization
- 1∞ regularization. In Proc. ICML, pages 857-864.
- (2009) Proc. ICML , pp. 857-864
- Quattoni, A.¹ Carreras, X.² Collins, M.³ Darrell, T.⁴

15
- 0029732478
- Sequential neural text compression
- Jurgen Schmidhuber and Stefan Heil. 1996. Sequential neural text compression. IEEE Transactions on Neural Networks, 7:142-146.
- (1996) IEEE Transactions on Neural Networks , vol.7 , pp. 142-146
- Schmidhuber, J.¹ Heil, S.²

16
- 84904163933
- Dropout: A simple way to prevent neural networks from overfitting
- Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. 2014. Dropout: A simple way to prevent neural networks from overfitting. J. Machine Learning Research, 15(1):1929-1958.
- (2014) J. Machine Learning Research , vol.15 , Issue.1 , pp. 1929-1958
- Srivastava, N.¹ Hinton, G.² Krizhevsky, A.³ Sutskever, I.⁴ Salakhutdinov, R.⁵

17
- 84924036578
- From feedforward to recurrent LSTM neural networks for language modeling
- Martin Sundermeyer, Hermann Ney, and Ralf Schliiter. 2015. From feedforward to recurrent LSTM neural networks for language modeling. Trans. Audio, Speech, and Language, 23(3):517-529.
- (2015) Trans. Audio, Speech, and Language , vol.23 , Issue.3 , pp. 517-529
- Sundermeyer, M.¹ Ney, H.² Schliiter, R.³

18
- 84926298172
- Decoding with large-scale neural language models improves translation
- Ashish Vaswani, Yinggong Zhao, Victoria Fossum, and David Chiang. 2013. Decoding with large-scale neural language models improves translation. In Proc. EMNLP, pages 1387-1392.
- (2013) Proc. EMNLP , pp. 1387-1392
- Vaswani, A.¹ Zhao, Y.² Fossum, V.³ Chiang, D.⁴

19
- 84988402904
- Can artificial neural networks learn language models?
- Wei Xu and Alexander I. Rudnicky. 2000. Can artificial neural networks learn language models? In Proc. International Conference on Statistical Language Processing, pages M1-13.
- (2000) Proc. International Conference on Statistical Language Processing , pp. M1-M13
- Xu, W.¹ Rudnicky, A.I.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.