SCOPUS Information Search Platform

5th International Conference on Learning Representations, ICLR 2017 - Conference Track Proceedings

Volume , Issue , 2017, Pages

TopicRNN: A recurrent neural network with long-range semantic dependency

(4) Dieng, Adji B a Gao, Jianfeng b Wang, Chong b Paisley, John a

a Columbia University ^* (United States)

b MICROSOFT RESEARCH (United States)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL LINGUISTICS; MODELING LANGUAGES; SEMANTIC WEB; SEMANTICS; SENTIMENT ANALYSIS; STATISTICS; SYNTACTICS;

FEATURE EXTRACTOR; LATENT DIRICHLET ALLOCATION; LATENT TOPIC MODEL; LONG-RANGE DEPENDENCIES; RECURRENT NEURAL NETWORK (RNN); SEMANTIC DEPENDENCY; SEMANTIC STRUCTURES; STATE OF THE ART;

RECURRENT NEURAL NETWORKS;

EID: 85088228479 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (111)

References (35)

1
- 0028392483
- Learning long-term dependencies with gradient descent is difficult
- Y. Bengio, P. Simard, and P. Frasconi. Learning long-term dependencies with gradient descent is difficult. IEEE transactions on neural networks, 5(2):157-166, 1994.
- (1994) IEEE Transactions on Neural Networks , vol.5 , Issue.2 , pp. 157-166
- Bengio, Y.¹ Simard, P.² Frasconi, P.³

2
- 0142166851
- A neural probabilistic language model
- Y. Bengio, R. Ducharme, P. Vincent, and C. Jauvin. A neural probabilistic language model. journal of machine learning research, 3(Feb):1137-1155, 2003.
- (2003) Journal of Machine Learning Research , vol.3 , pp. 1137-1155
- Bengio, Y.¹ Ducharme, R.² Vincent, P.³ Jauvin, C.⁴

3
- 84864066426
- Correlated topic models
- D. Blei and J. Lafferty. Correlated topic models. Advances in neural information processing systems, 18:147, 2006.
- (2006) Advances in Neural Information Processing Systems , vol.18 , pp. 147
- Blei, D.¹ Lafferty, J.²

4
- 76249118968
- Topic models
- D. M. Blei and J. D. Lafferty. Topic models. Text mining: classification, clustering, and applications, 10(71):34, 2009.
- (2009) Text Mining: Classification, Clustering, and Applications , vol.10 , Issue.71 , pp. 34
- Blei, D.M.¹ Lafferty, J.D.²

5
- 0141607824
- Latent dirichlet allocation
- D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. Journal of machine Learning research, 3(Jan):993-1022, 2003.
- (2003) Journal of Machine Learning Research , vol.3 , pp. 993-1022
- Blei, D.M.¹ Ng, A.Y.² Jordan, M.I.³

6
- 0034295822
- Structured language modeling
- C. Chelba and F. Jelinek. Structured language modeling. Computer Speech & Language, 14(4): 283-332, 2000.
- (2000) Computer Speech & Language , vol.14 , Issue.4 , pp. 283-332
- Chelba, C.¹ Jelinek, F.²

7
- 84943795466
- arXiv preprint
- C. Chelba, T. Mikolov, M. Schuster, Q. Ge, T. Brants, P. Koehn, and T. Robinson. One billion word benchmark for measuring progress in statistical language modeling. arXiv preprint arXiv:1312.3005, 2013.
- (2013) One Billion Word Benchmark for Measuring Progress in Statistical Language Modeling
- Chelba, C.¹ Mikolov, T.² Schuster, M.³ Ge, Q.⁴ Brants, T.⁵ Koehn, P.⁶ Robinson, T.⁷

8
- 84961291190
- arXiv preprint
- K. Cho, B. Van Merriënboer, C. Gulcehre, D. Bahdanau, F. Bougares, H. Schwenk, and Y. Bengio. Learning phrase representations using rnn encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078, 2014.
- (2014) Learning Phrase Representations Using Rnn Encoder-Decoder for Statistical Machine Translation
- Cho, K.¹ Van Merriënboer, B.² Gulcehre, C.³ Bahdanau, D.⁴ Bougares, F.⁵ Schwenk, H.⁶ Bengio, Y.⁷

9
- 84965138788
- Semi-supervised sequence learning
- A. M. Dai and Q. V. Le. Semi-supervised sequence learning. In Advances in Neural Information Processing Systems, pages 3079-3087, 2015.
- (2015) Advances in Neural Information Processing Systems , pp. 3079-3087
- Dai, A.M.¹ Le, Q.V.²

10
- 8644247561
- Dependence language model for information retrieval
- J. Gao, J.-Y. Nie, G. Wu, and G. Cao. Dependence language model for information retrieval. In Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval, pages 170-177. ACM, 2004.
- (2004) Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval , pp. 170-177
- Gao, J.¹ Nie, J.-Y.² Wu, G.³ Cao, G.⁴

11
- 84988877050
- arXiv preprint
- S. Ghosh, O. Vinyals, B. Strope, S. Roy, T. Dean, and L. Heck. Contextual lstm (clstm) models for large scale nlp tasks. arXiv preprint arXiv:1602.06291, 2016.
- (2016) Contextual Lstm (Clstm) Models for Large Scale Nlp Tasks
- Ghosh, S.¹ Vinyals, O.² Strope, B.³ Roy, S.⁴ Dean, T.⁵ Heck, L.⁶

12
- 0031573117
- Long short-term memory
- S. Hochreiter and J. Schmidhuber. Long short-term memory. Neural computation, 9(8):1735-1780, 1997.
- (1997) Neural Computation , vol.9 , Issue.8 , pp. 1735-1780
- Hochreiter, S.¹ Schmidhuber, J.²

13
- 84994157341
- arXiv preprint
- Y. Ji, T. Cohn, L. Kong, C. Dyer, and J. Eisenstein. Document context language models. arXiv preprint arXiv:1511.03962, 2015.
- (2015) Document Context Language Models
- Ji, Y.¹ Cohn, T.² Kong, L.³ Dyer, C.⁴ Eisenstein, J.⁵

14
- 85016041195
- arXiv preprint
- Y. Ji, G. Haffari, and J. Eisenstein. A latent variable recurrent neural network for discourse relation language models. arXiv preprint arXiv:1603.01913, 2016.
- (2016) A Latent Variable Recurrent Neural Network for Discourse Relation Language Models
- Ji, Y.¹ Haffari, G.² Eisenstein, J.³

15
- 0033225865
- An introduction to variational methods for graphical models
- M. I. Jordan, Z. Ghahramani, T. S. Jaakkola, and L. K. Saul. An introduction to variational methods for graphical models. Machine learning, 37(2):183-233, 1999.
- (1999) Machine Learning , vol.37 , Issue.2 , pp. 183-233
- Jordan, M.I.¹ Ghahramani, Z.² Jaakkola, T.S.³ Saul, L.K.⁴

16
- 84978840213
- arXiv preprint
- R. Jozefowicz, O. Vinyals, M. Schuster, N. Shazeer, and Y. Wu. Exploring the limits of language modeling. arXiv preprint arXiv:1602.02410, 2016.
- (2016) Exploring the Limits of Language Modeling
- Jozefowicz, R.¹ Vinyals, O.² Schuster, M.³ Shazeer, N.⁴ Wu, Y.⁵

17
- 84941620184
- arXiv preprint
- D. Kingma and J. Ba. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
- (2014) Adam: A Method for Stochastic Optimization
- Kingma, D.¹ Ba, J.²

18
- 84919810317
- arXiv preprint
- D. P. Kingma and M. Welling. Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114, 2013.
- (2013) Auto-Encoding Variational Bayes
- Kingma, D.P.¹ Welling, M.²

19
- 84926067654
- Distributed representations of sentences and documents
- Q. V. Le and T. Mikolov. Distributed representations of sentences and documents. In ICML, volume 14, pages 1188-1196, 2014.
- (2014) ICML , vol.14 , pp. 1188-1196
- Le, Q.V.¹ Mikolov, T.²

20
- 84959935599
- Hierarchical recurrent neural network for document modeling
- R. Lin, S. Liu, M. Yang, M. Li, M. Zhou, and S. Li. Hierarchical recurrent neural network for document modeling. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pages 899-907, 2015.
- (2015) Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing , pp. 899-907
- Lin, R.¹ Liu, S.² Yang, M.³ Li, M.⁴ Zhou, M.⁵ Li, S.⁶

21
- 84859023447
- Learning word vectors for sentiment analysis
- Association for Computational Linguistics
- A. L. Maas, R. E. Daly, P. T. Pham, D. Huang, A. Y. Ng, and C. Potts. Learning word vectors for sentiment analysis. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1, pages 142-150. Association for Computational Linguistics, 2011.
- (2011) Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies , vol.1 , pp. 142-150
- Maas, A.L.¹ Daly, R.E.² Pham, P.T.³ Huang, D.⁴ Ng, A.Y.⁵ Potts, C.⁶

22
- 34249852033
- Building a large annotated corpus of english: The penn treebank
- M. P. Marcus, M. A. Marcinkiewicz, and B. Santorini. Building a large annotated corpus of english: The penn treebank. Computational linguistics, 19(2):313-330, 1993.
- (1993) Computational Linguistics , vol.19 , Issue.2 , pp. 313-330
- Marcus, M.P.¹ Marcinkiewicz, M.A.² Santorini, B.³

23
- 84994155407
- arXiv preprint
- Y. Miao, L. Yu, and P. Blunsom. Neural variational inference for text processing. arXiv preprint arXiv:1511.06038, 2015.
- (2015) Neural Variational Inference for Text Processing
- Miao, Y.¹ Yu, L.² Blunsom, P.³

24
- 84874235486
- Context dependent recurrent neural network language model
- T. Mikolov and G. Zweig. Context dependent recurrent neural network language model. In SLT, pages 234-239, 2012.
- (2012) SLT , pp. 234-239
- Mikolov, T.¹ Zweig, G.²

25
- 80051627816
- Recurrent neural network based language model
- T. Mikolov, M. Karafiát, L. Burget, J. Černocký, and S. Khudanpur. Recurrent neural network based language model. In Interspeech, volume 2, page 3, 2010.
- (2010) Interspeech , vol.2 , pp. 3
- Mikolov, T.¹ Karafiát, M.² Burget, L.³ Cernocky, J.⁴ Khudanpur, S.⁵

26
- 80051643236
- Extensions of recurrent neural network language model
- T. Mikolov, S. Kombrink, L. Burget, J. Černocký, and S. Khudanpur. Extensions of recurrent neural network language model. In 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 5528-5531. IEEE, 2011.
- (2011) 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , pp. 5528-5531
- Mikolov, T.¹ Kombrink, S.² Burget, L.³ Cernocky, J.⁴ Khudanpur, S.⁵

27
- 84939804661
- arXiv preprint
- T. Mikolov, A. Joulin, S. Chopra, M. Mathieu, and M. Ranzato. Learning longer memory in recurrent neural networks. arXiv preprint arXiv:1412.7753, 2014.
- (2014) Learning Longer Memory in Recurrent Neural Networks
- Mikolov, T.¹ Joulin, A.² Chopra, S.³ Mathieu, M.⁴ Ranzato, M.⁵

28
- 80053260943
- Optimizing semantic coherence in topic models
- Association for Computational Linguistics
- D. Mimno, H. M. Wallach, E. Talley, M. Leenders, and A. McCallum. Optimizing semantic coherence in topic models. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, pages 262-272. Association for Computational Linguistics, 2011.
- (2011) Proceedings of the Conference on Empirical Methods in Natural Language Processing , pp. 262-272
- Mimno, D.¹ Wallach, H.M.² Talley, E.³ Leenders, M.⁴ McCallum, A.⁵

29
- 85049078406
- Adversarial training methods for semi-supervised text classification
- T. Miyato, A. M. Dai, and I. Goodfellow. Adversarial training methods for semi-supervised text classification. stat, 1050:7, 2016.
- (2016) Stat , vol.1050 , pp. 7
- Miyato, T.¹ Dai, A.M.² Goodfellow, I.³

30
- 84897497795
- On the difficulty of training recurrent neural networks
- R. Pascanu, T. Mikolov, and Y. Bengio. On the difficulty of training recurrent neural networks. ICML (3), 28:1310-1318, 2013.
- (2013) ICML , vol.28 , Issue.3 , pp. 1310-1318
- Pascanu, R.¹ Mikolov, T.² Bengio, Y.³

31
- 84919908080
- arXiv preprint
- D. J. Rezende, S. Mohamed, and D. Wierstra. Stochastic backpropagation and approximate inference in deep generative models. arXiv preprint arXiv:1401.4082, 2014.
- (2014) Stochastic Backpropagation and Approximate Inference in Deep Generative Models
- Rezende, D.J.¹ Mohamed, S.² Wierstra, D.³

32
- 84904163933
- Dropout: A simple way to prevent neural networks from overfitting
- N. Srivastava, G. E. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov. Dropout: a simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 15(1): 1929-1958, 2014.
- (2014) Journal of Machine Learning Research , vol.15 , Issue.1 , pp. 1929-1958
- Srivastava, N.¹ Hinton, G.E.² Krizhevsky, A.³ Sutskever, I.⁴ Salakhutdinov, R.⁵

33
- 84884966819
- PhD thesis, University of Toronto
- I. Sutskever. Training recurrent neural networks. PhD thesis, University of Toronto, 2013.
- (2013) Training Recurrent Neural Networks
- Sutskever, I.¹

34
- 34250736841
- Topic modeling: Beyond bag-of-words
- H. M. Wallach. Topic modeling: beyond bag-of-words. In Proceedings of the 23rd international conference on Machine learning, pages 977-984. ACM, 2006.
- (2006) Proceedings of the 23rd International Conference on Machine Learning , pp. 977-984
- Wallach, H.M.¹

35
- 79952129745
- Rethinking LDA: Why priors matter
- H. M. Wallach, D. M. Mimno, and A. McCallum. Rethinking lda: Why priors matter. In Advances in neural information processing systems, pages 1973-1981, 2009.
- (2009) Advances in Neural Information Processing Systems , pp. 1973-1981
- Wallach, H.M.¹ Mimno, D.M.² McCallum, A.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.