SCOPUS 정보 검색 플랫폼

5th International Conference on Learning Representations, ICLR 2017 - Conference Track Proceedings

Volumn , Issue , 2017, Pages

Structured attention networks

(4) Kim, Yoon a Denton, Carl a Hoang, Luong a Rush, Alexander M a

a HARVARD UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

DEEP NEURAL NETWORKS; GRAPHIC METHODS; NATURAL LANGUAGE PROCESSING SYSTEMS; NETWORK LAYERS;

ATTENTION MODEL; CONDITIONAL RANDOM FIELD; DIFFERENT CLASS; EFFECTIVE APPROACHES; GRAPHICAL MODEL; MACHINE TRANSLATIONS; NATURAL LANGUAGES; QUESTION ANSWERING;

MULTILAYER NEURAL NETWORKS;

EID: 85088226886 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (292)

References (57)

1
- 85011954517
- Globally normalized transition-based neural networks
- Daniel Andor, Chris Alberti, David Weiss, Aliaksei Severyn, Alessandro Presta, Kuzman Ganchev, Slav Petrov, and Michael Collins. Globally Normalized Transition-Based Neural Networks. In Proceedings of ACL, 2016.
- (2016) Proceedings of ACL
- Andor, D.¹ Alberti, C.² Weiss, D.³ Severyn, A.⁴ Presta, A.⁵ Ganchev, K.⁶ Petrov, S.⁷ Collins, M.⁸

2
- 85083953689
- Neural machine translation by jointly learning to align and translate
- Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. Neural Machine Translation by Jointly Learning to Align and Translate. In Proceedings of ICLR, 2015.
- (2015) Proceedings of ICLR
- Bahdanau, D.¹ Cho, K.² Bengio, Y.³

3
- 0002703873
- Trainable grammars for speech recognition
- James K. Baker. Trainable Grammars for Speech Recognition. Speech Communication Papers for the 97th Meeting of the Acoustical Society, 1979.
- (1979) Speech Communication Papers for the 97th Meeting of the Acoustical Society
- Baker, J.K.¹

4
- 84998953764
- Structured prediction energy networks
- David Belanger and Andrew McCallum. Structured Prediction Energy Networks. In Proceedings of ICML, 2016.
- (2016) Proceedings of ICML
- Belanger, D.¹ McCallum, A.²

5
- 84977505088
- Tree-structured composition in neural networks without tree-structured architectures
- Samuel R. Bowman, Christopher D. Manning, and Christopher Potts. Tree-Structured Composition in Neural Networks without Tree-Structured Architectures. In Proceedings of the NIPS workshop on Cognitive Computation: Integrating Neural and Symbolic Approaches, 2015.
- (2015) Proceedings of the NIPS Workshop on Cognitive Computation: Integrating Neural and Symbolic Approaches
- Bowman, S.R.¹ Manning, C.D.² Potts, C.³

6
- 85021645849
- A fast unified model for parsing and sentence understanding
- Samuel R. Bowman, Jon Gauthier, Abhinav Rastogi, Raghav Gupta, Christopher D. Manning, and Christopher Potts. A Fast Unified Model for Parsing and Sentence Understanding. In Proceedings of ACL, 2016.
- (2016) Proceedings of ACL
- Bowman, S.R.¹ Gauthier, J.² Rastogi, A.³ Gupta, R.⁴ Manning, C.D.⁵ Potts, C.⁶

7
- 84994328213
- William Chan, Navdeep Jaitly, Quoc Le, and Oriol Vinyals. Listen, Attend and Spell. arXiv:1508.01211, 2015.
- (2015) Listen, Attend and Spell
- Chan, W.¹ Jaitly, N.² Le, Q.³ Vinyals, O.⁴

8
- 84969930631
- Learning deep structured models
- Liang-Chieh Chen, Alexander G. Schwing, Alan L. Yuille, and Raquel Urtasun. Learning Deep Structured Models. In Proceedings of ICML, 2015.
- (2015) Proceedings of ICML
- Chen, L.-C.¹ Schwing, A.G.² Yuille, A.L.³ Urtasun, R.⁴

9
- 85031930724
- Qian Chen, Xiaodan Zhu, Zhenhua Ling, Si Wei, and Hui Jiang. Enhancing and Combining Sequential and Tree LSTM for Natural Language Inference. arXiv:1609.06038, 2016.
- (2016) Enhancing and Combining Sequential and Tree LSTM for Natural Language Inference
- Chen, Q.¹ Zhu, X.² Ling, Z.³ Wei, S.⁴ Jiang, H.⁵

10
- 84946763507
- Describing Multimedia Content using Attention-based Encoder-Decoder Networks
- Kyunghyun Cho, Aaron Courville, and Yoshua Bengio. Describing Multimedia Content using Attention-based Encoder-Decoder Networks. In IEEE Transactions on Multimedia, 2015.
- (2015) IEEE Transactions on Multimedia
- Cho, K.¹ Courville, A.² Bengio, Y.³

11
- 84965139600
- Attention-based models for speech recognition
- Jan Chorowski, Dzmitry Bahdanau, Dmitriy Serdyuk, Kyunghyun Cho, and Yoshua Bengio. Attention-Based Models for Speech Recognition. In Proceedings of NIPS, 2015.
- (2015) Proceedings of NIPS
- Chorowski, J.¹ Bahdanau, D.² Serdyuk, D.³ Cho, K.⁴ Bengio, Y.⁵

12
- 80053558787
- Natural language processing (almost) from scratch
- Ronan Collobert, Jason Weston, Leon Bottou, Michael Karlen, Koray Kavukcuoglu, and Pavel Kuksa. Natural Language Processing (almost) from Scratch. Journal of Machine Learning Research, 12:2493-2537, 2011.
- (2011) Journal of Machine Learning Research , vol.12 , pp. 2493-2537
- Collobert, R.¹ Weston, J.² Bottou, L.³ Karlen, M.⁴ Kavukcuoglu, K.⁵ Kuksa, P.⁶

13
- 79951759981
- Neural conditional random fields
- Trinh-Minh-Tri Do and Thierry Artiéres. Neural Conditional Random Fields. In Proceedings of AISTATS, 2010.
- (2010) Proceedings of AISTATS
- Do, T.-M.-T.¹ Artiéres, T.²

14
- 80052913933
- Parameter learning with truncated message-passing
- Justin Domke. Parameter Learning with Truncated Message-Passing. In Proceedings of CVPR, 2011.
- (2011) Proceedings of CVPR
- Domke, J.¹

15
- 84869036002
- Generic methods for optimization-based modeling
- Justin Domke. Generic methods for optimization-based modeling. In AISTATS, pp. 318-326, 2012.
- (2012) AISTATS , pp. 318-326
- Domke, J.¹

16
- 80052250414
- Adaptive subgradient methods for online learning and stochastic optimization
- John Duchi, Elad Hazan, and Yoram Singer. Adaptive Subgradient Methods for Online Learning and Stochastic Optimization. Journal of Machine Learning Research, 12:2021-2159, 2011.
- (2011) Journal of Machine Learning Research , vol.12 , pp. 2021-2159
- Duchi, J.¹ Hazan, E.² Singer, Y.³

17
- 84943788164
- Neural CRF parsing
- Greg Durrett and Dan Klein. Neural CRF Parsing. In Proceedings of ACL, 2015.
- (2015) Proceedings of ACL
- Durrett, G.¹ Klein, D.²

18
- 4043070633
- Three new probabilistic models for dependency parsing: An exploration
- Jason M. Eisner. Three New Probabilistic Models for Dependency Parsing: An Exploration. In Proceedings of ACL, 1996.
- (1996) Proceedings of ACL
- Eisner, J.M.¹

19
- 85070978095
- Inside-Outside and Forward-Backward Algorithms are just Backprop
- Jason M. Eisner. Inside-Outside and Forward-Backward Algorithms are just Backprop. In Proceedings of Structured Prediction Workshop at EMNLP, 2016.
- (2016) Proceedings of Structured Prediction Workshop at EMNLP
- Eisner, J.M.¹

20
- 85070943017
- Approximation-aware dependency parsing by belief propagation
- Matthew R. Gormley, Mark Dredze, and Jason Eisner. Approximation-Aware Dependency Parsing by Belief Propagation. In Proceedings of TACL, 2015.
- (2015) Proceedings of TACL
- Gormley, M.R.¹ Dredze, M.² Eisner, J.³

21
- 84930616355
- Alex Graves, Greg Wayne, and Ivo Danihelka. Neural Turing Machines. arXiv:1410.5401, 2014.
- (2014) Neural Turing Machines
- Graves, A.¹ Wayne, G.² Danihelka, I.³

22
- 84993949467
- Hybrid computing using a neural network with dynamic external memory
- October
- Alex Graves, Greg Wayne, Malcolm Reynolds, Tim Harley, Ivo Danihelka, Agnieszka Grabska-Barwinska, Sergio Gomez Colmenarejo, Edward Grefenstette, Tiago Ramalho, John Agapiou, Adria Puigdomenech Badia, Karl Moritz Hermann, Yori Zwols, Georg Ostrovski, Adam Cain, Helen King, Christopher Summerfield, Phil Blunsom, Koray Kavukcuoglu, and Demis Hassabis. Hybrid Computing Using a Neural Network with Dynamic External Memory. Nature, October 2016.
- (2016) Nature
- Graves, A.¹ Wayne, G.² Reynolds, M.³ Harley, T.⁴ Danihelka, I.⁵ Grabska-Barwinska, A.⁶ Colmenarejo, S.G.⁷ Grefenstette, E.⁸ Ramalho, T.⁹ Agapiou, J.¹⁰ Badia, A.P.¹¹ Hermann, K.M.¹² Zwols, Y.¹³ Ostrovski, G.¹⁴ Cain, A.¹⁵ King, H.¹⁶ Summerfield, C.¹⁷ Blunsom, P.¹⁸ Kavukcuoglu, K.¹⁹ Hassabis, D.²⁰ more..

23
- 84965153738
- Learning to transduce with unbounded memory
- Edward Grefenstette, Karl Moritz Hermann, Mustafa Suleyman, and Phil Blunsom. Learning to Transduce with Unbounded Memory. In Proceedings of NIPS, 2015.
- (2015) Proceedings of NIPS
- Grefenstette, E.¹ Hermann, K.M.² Suleyman, M.³ Blunsom, P.⁴

24
- 84965139942
- Teaching machines to read and comprehend
- Karl Moritz Hermann, Tomas Kocisky, Edward Grefenstette, Lasse Espeholt, Will Kay, Mustafa Suleyman, and Phil Blunsom. Teaching Machines to Read and Comprehend. In Proceedings of NIPS, 2015.
- (2015) Proceedings of NIPS
- Hermann, K.M.¹ Kocisky, T.² Grefenstette, E.³ Espeholt, L.⁴ Kay, W.⁵ Suleyman, M.⁶ Blunsom, P.⁷

25
- 84952628296
- Deep structured output learning for unconstrained text recognition
- Max Jaderberg, Karen Simonyan, Andrea Vedaldi, and Andrew Zisserman. Deep Structured Output Learning for Unconstrained Text Recognition. In Proceedings of ICLR, 2014.
- (2014) Proceedings of ICLR
- Jaderberg, M.¹ Simonyan, K.² Vedaldi, A.³ Zisserman, A.⁴

26
- 85083951076
- ADaM: A method for stochastic optimization
- Diederik Kingma and Jimmy Ba. Adam: A Method for Stochastic Optimization. In Proceedings of ICLR, 2015.
- (2015) Proceedings of ICLR
- Kingma, D.¹ Ba, J.²

27
- 85019553882
- Simple and Accurate Dependency Parsing using Bidirectional LSTM Feature Representations
- Eliyahu Kipperwasser and Yoav Goldberg. Simple and Accurate Dependency Parsing using Bidirectional LSTM Feature Representations. In TACL, 2016.
- (2016) TACL
- Kipperwasser, E.¹ Goldberg, Y.²

28
- 85083953994
- Segmental recurrent neural networks
- Lingpeng Kong, Chris Dyer, and Noah A. Smith. Segmental Recurrent Neural Networks. In Proceedings of ICLR, 2016.
- (2016) Proceedings of ICLR
- Kong, L.¹ Dyer, C.² Smith, N.A.³

29
- 0142192295
- Conditional random fields: Probabilistic models for segmenting and labeling sequence data
- John Lafferty, Andrew McCallum, and Fernando Pereira. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. In Proceedings of ICML, 2001.
- (2001) Proceedings of ICML
- Lafferty, J.¹ McCallum, A.² Pereira, F.³

30
- 84994130883
- Neural architectures for named entity recognition
- Guillaume Lample, Miguel Ballesteros, Sandeep Subramanian, Kazuya Kawakami, and Chris Dyer. Neural Architectures for Named Entity Recognition. In Proceedings of NAACL, 2016.
- (2016) Proceedings of NAACL
- Lample, G.¹ Ballesteros, M.² Subramanian, S.³ Kawakami, K.⁴ Dyer, C.⁵

31
- 0032203257
- Gradient-based Learning Applied to Document Recognition
- Yann LeCun, Leon Bottou, Yoshua Bengio, and Patrick Haffner. Gradient-based Learning Applied to Document Recognition. In Proceedings of IEEE, 1998.
- (1998) Proceedings of IEEE
- LeCun, Y.¹ Bottou, L.² Bengio, Y.³ Haffner, P.⁴

32
- 80053275857
- First- And second-order expectation semirings with applications to minimum-risk training on translation forests
- Zhifei Li and Jason Eisner. First- and Second-Order Expectation Semirings with Applications to Minimum-Risk Training on Translation Forests. In Proceedings of EMNLP 2009, 2009.
- (2009) Proceedings of EMNLP 2009
- Li, Z.¹ Eisner, J.²

33
- 84994242299
- Segmental recurrent neural networks for end-to-end speech recognition
- Liang Lu, Lingpeng Kong, Chris Dyer, Noah A. Smith, and Steve Renals. Segmental Recurrent Neural Networks for End-to-End Speech Recognition. In Proceedings of INTERSPEECH, 2016.
- (2016) Proceedings of INTERSPEECH
- Lu, L.¹ Kong, L.² Dyer, C.³ Smith, N.A.⁴ Renals, S.⁵

34
- 84959874994
- Effective Approaches to Attention-based Neural Machine Translation
- Minh-Thang Luong, Hieu Pham, and Christopher D. Manning. Effective Approaches to Attention-based Neural Machine Translation. In Proceedings of EMNLP, 2015.
- (2015) Proceedings of EMNLP
- Luong, M.-T.¹ Pham, H.² Manning, C.D.³

35
- 84989338543
- Gradient-based Hyperparameter Optimization through Reversible Learning
- Dougal Maclaurin, David Duvenaud, and Ryan P. Adams. Gradient-based Hyperparameter Optimization through Reversible Learning. In Proceedings of ICML, 2015.
- (2015) Proceedings of ICML
- Maclaurin, D.¹ Duvenaud, D.² Adams, R.P.³

36
- 85011819504
- Natural language inference by tree-based convolution and heuristic matching
- Lili Mou, Rui Men, Ge Li, Yan Xu, Lu Zhang, Rui Yan, and Zhi Jin. Natural language inference by tree-based convolution and heuristic matching. In Proceedings of ACL, 2016.
- (2016) Proceedings of ACL
- Mou, L.¹ Men, R.² Li, G.³ Xu, Y.⁴ Zhang, L.⁵ Yan, R.⁶ Jin, Z.⁷

37
- 85023613243
- Tsendsuren Munkhdalai and Hong Yu. Neural Tree Indexers for Text Understanding. arxiv:1607.04492, 2016.
- (2016) Neural Tree Indexers for Text Understanding
- Munkhdalai, T.¹ Yu, H.²

38
- 85031942138
- ASPEC: Asian scientific paper excerpt corpus
- Nicoletta Calzo-lari (Conference Chair), Khalid Choukri, Thierry Declerck, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis (eds, Portoro, Slovenia, may European Language Resources Association (ELRA
- Toshiaki Nakazawa, Manabu Yaguchi, Kiyotaka Uchimoto, Masao Utiyama, Eiichiro Sumita, Sadao Kurohashi, and Hitoshi Isahara. Aspec: Asian scientific paper excerpt corpus. In Nicoletta Calzo-lari (Conference Chair), Khalid Choukri, Thierry Declerck, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis (eds.), Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2016), pp. 2204-2208, Portoro, Slovenia, may 2016. European Language Resources Association (ELRA). ISBN 978-2-9517408-9-1.
- (2016) Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2016) , pp. 2204-2208
- Nakazawa, T.¹ Yaguchi, M.² Uchimoto, K.³ Utiyama, M.⁴ Sumita, E.⁵ Kurohashi, S.⁶ Isahara, H.⁷

39
- 84859011635
- Pointwise prediction for robust, adaptable Japanese morphological analysis
- Graham Neubig, Yosuke Nakata, and Shinsuke Mori. Pointwise Prediction for Robust, Adaptable Japanese Morphological Analysis. In Proceedings of ACL, 2011.
- (2011) Proceedings of ACL
- Neubig, G.¹ Nakata, Y.² Mori, S.³

40
- 85072820995
- A decomposable attention model for natural language inference
- Ankur P. Parikh, Oscar Tackstrom, Dipanjan Das, and Jakob Uszkoreit. A Decomposable Attention Model for Natural Language Inference. In Proceedings of EMNLP, 2016.
- (2016) Proceedings of EMNLP
- Parikh, A.P.¹ Tackstrom, O.² Das, D.³ Uszkoreit, J.⁴

41
- 84863373241
- Conditional neural fields
- Jian Peng, Liefeng Bo, and Jinbo Xu. Conditional Neural Fields. In Proceedings of NIPS, 2009.
- (2009) Proceedings of NIPS
- Peng, J.¹ Bo, L.² Xu, J.³

42
- 84961289992
- Glove: Global vectors for word representation
- Jeffrey Pennington, Richard Socher, and Christopher D. Manning. GloVe: Global Vectors for Word Representation. In Proceedings of EMNLP, 2014.
- (2014) Proceedings of EMNLP
- Pennington, J.¹ Socher, R.² Manning, C.D.³

43
- 85083950860
- Reasoning about entailment with neural attention
- Tim Rocktäschel, Edward Grefenstette, Karl Moritz Hermann, Tomas Kocisky, and Phil Blunsom. Reasoning about Entailment with Neural Attention. In Proceedings of ICLR, 2016.
- (2016) Proceedings of ICLR
- Rocktäschel, T.¹ Grefenstette, E.² Hermann, K.M.³ Kocisky, T.⁴ Blunsom, P.⁵

44
- 84965157716
- Gradient estimation using stochastic computation graphs
- John Schulman, Nicolas Heess, Theophane Weber, and Pieter Abbeel. Gradient estimation using stochastic computation graphs. In Advances in Neural Information Processing Systems, pp. 3528-3536, 2015.
- (2015) Advances in Neural Information Processing Systems , pp. 3528-3536
- Schulman, J.¹ Heess, N.² Weber, T.³ Abbeel, P.⁴

45
- 80053375671
- Dependency parsing as belief propagation
- David A. Smith and Jason Eisner. Dependency Parsing as Belief Propagation. In Proceedings of EMNLP, 2008.
- (2008) Proceedings of EMNLP
- Smith, D.A.¹ Eisner, J.²

46
- 84926143199
- Minimum-Risk Training of Approximate CRF-based NLP Systems
- Veselin Stoyanov and Jason Eisner. Minimum-Risk Training of Approximate CRF-based NLP Systems. In Proceedings of NAACL, 2012.
- (2012) Proceedings of NAACL
- Stoyanov, V.¹ Eisner, J.²

47
- 84883148756
- Empirical risk minimization of graphical model parameters given approximate inference, decoding, and model structure
- Veselin Stoyanov, Alexander Ropson, and Jason Eisner. Empirical Risk Minimization of Graphical Model Parameters Given Approximate Inference, Decoding, and Model Structure. In Proceedings of AISTATS, 2011.
- (2011) Proceedings of AISTATS
- Stoyanov, V.¹ Ropson, A.² Eisner, J.³

48
- 84965143740
- End-to-end memory networks
- Sainbayar Sukhbaatar, Arthur Szlam, Jason Weston, and Rob Fergus. End-To-End Memory Networks. In Proceedings of NIPS, 2015.
- (2015) Proceedings of NIPS
- Sukhbaatar, S.¹ Szlam, A.² Weston, J.³ Fergus, R.⁴

49
- 84965173945
- Pointer networks
- Oriol Vinyals, Meire Fortunato, and Navdeep Jaitly. Pointer Networks. In Proceedings of NIPS, 2015.
- (2015) Proceedings of NIPS
- Vinyals, O.¹ Fortunato, M.² Jaitly, N.³

50
- 85019203971
- Proximal deep structured models
- Shenlong Wang, Sanja Fidler, and Raquel Urtasun. Proximal Deep Structured Models. In Proceedings of NIPS, 2016.
- (2016) Proceedings of NIPS
- Wang, S.¹ Fidler, S.² Urtasun, R.³

51
- 84994156970
- Learning natural language inference with LSTM
- Shuohang Wang and Jing Jiang. Learning Natural Language Inference with LSTM. In Proceedings of NAACL, 2016.
- (2016) Proceedings of NAACL
- Wang, S.¹ Jiang, J.²

52
- 84930635225
- Jason Weston, Sumit Chopra, and Antoine Bordes. Memory Networks. arXiv:1410.3916, 2014.
- (2014) Memory Networks
- Weston, J.¹ Chopra, S.² Bordes, A.³

53
- 84930622674
- arXiv preprint
- Jason Weston, Antoine Bordes, Sumit Chopra, Alexander M Rush, Bart van Merriënboer, Armand Joulin, and Tomas Mikolov. Towards Ai-complete Question Answering: A Set of Prerequisite Toy Tasks. arXiv preprint arXiv:1502.05698, 2015.
- (2015) Towards Ai-Complete Question Answering: A Set of Prerequisite Toy Tasks
- Weston, J.¹ Bordes, A.² Chopra, S.³ Rush, A.M.⁴ Van Merriënboer, B.⁵ Joulin, A.⁶ Mikolov, T.⁷

54
- 84970002232
- Show, attend and tell: Neural image caption generation with visual attention
- Kelvin Xu, Jimma Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhutdinov, Richard Zemel, and Yoshua Bengio. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention. In Proceedings of ICML, 2015.
- (2015) Proceedings of ICML
- Xu, K.¹ Ba, J.² Kiros, R.³ Cho, K.⁴ Courville, A.⁵ Salakhutdinov, R.⁶ Zemel, R.⁷ Bengio, Y.⁸

55
- 85072831387
- Online segment to segment neural transduction
- Lei Yu, Jan Buys, and Phil Blunsom. Online Segment to Segment Neural Transduction. In Proceedings of EMNLP, 2016.
- (2016) Proceedings of EMNLP
- Yu, L.¹ Buys, J.² Blunsom, P.³

56
- 85088231409
- The neural noisy channel
- Lei Yu, Phil Blunsom, Chris Dyer, Edward Grefenstette, and Tomas Kocisky. The Neural Noisy Channel. In Proceedings of ICLR, 2017.
- (2017) Proceedings of ICLR
- Yu, L.¹ Blunsom, P.² Dyer, C.³ Grefenstette, E.⁴ Kocisky, T.⁵

57
- 85050983881
- Textual entailment with structured attentions and composition
- Kai Zhao, Liang Huang, and Minbo Ma. Textual Entailment with Structured Attentions and Composition. In Proceedings of COLING, 2016.
- (2016) Proceedings of COLING
- Zhao, K.¹ Huang, L.² Ma, M.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.