SCOPUS 정보 검색 플랫폼

EMNLP 2016 - Conference on Empirical Methods in Natural Language Processing, Proceedings

Volumn , Issue , 2016, Pages 1192-1202

Deep reinforcement learning for dialogue generation

(6) Li, Jiwei a Monroe, Will a Ritter, Alan b Galley, Michel c Gao, Jianfeng c Jurafsky, Dan a

a STANFORD UNIVERSITY (United States)

b OHIO STATE UNIVERSITY (United States)

c MICROSOFT RESEARCH (United States)

Author keywords

[No Author keywords available]

Indexed keywords

GRADIENT METHODS; LEARNING ALGORITHMS; MACHINE LEARNING; NATURAL LANGUAGE PROCESSING SYSTEMS; REINFORCEMENT LEARNING;

CHATBOT; CONVERSATIONAL AGENTS; CONVERSATIONAL MODEL; DIALOGUE GENERATIONS; FORWARD LOOKING; NEURAL MODELS; POLICY GRADIENT METHODS; VIRTUAL AGENT;

DEEP LEARNING;

EID: 85029377314 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.18653/v1/d16-1127 Document Type: Conference Paper

Times cited : (968)

References (51)

1
- 0002686204
- Stochastic optimization
- V. M. Aleksandrov, V. I. Sysoyev, and V. V. Shemeneva. 1968. Stochastic optimization. Engineering Cybernetics, 5:11-16.
- (1968) Engineering Cybernetics , vol.5 , pp. 11-16
- Aleksandrov, V.M.¹ Sysoyev, V.I.² Shemeneva, V.V.³

2
- 0002137296
- On the semantics and pragmatics of linguistic feedback
- Jens Allwood, Joakim Nivre, and Elisabeth Ahlsén. 1992. On the semantics and pragmatics of linguistic feedback. Journal of Semantics, 9:1-26.
- (1992) Journal of Semantics , vol.9 , pp. 1-26
- Allwood, J.¹ Nivre, J.² Ahlsén, E.³

3
- 85083953689
- Neural machine translation by jointly learning to align and translate
- Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2015. Neural machine translation by jointly learning to align and translate. In Proc. of ICLR.
- (2015) Proc. Of ICLR.
- Bahdanau, D.¹ Cho, K.² Bengio, Y.³

4
- 85119962881
- Iris: A chat-oriented dialogue system based on the vector space model
- Rafael E Banchs and Haizhou Li. 2012. IRIS: a chat-oriented dialogue system based on the vector space model. In Proceedings of the ACL 2012 System Demonstrations, pages 37-42.
- (2012) Proceedings of the ACL 2012 System Demonstrations , pp. 37-42
- Banchs, R.E.¹ Li, H.²

5
- 70049091039
- Curriculum learning
- Yoshua Bengio, Jérôme Louradour, Ronan Collobert, and Jason Weston. 2009. Curriculum learning. In Proceedings of the 26th annual international conference on machine learning, pages 41-48. ACM.
- (2009) Proceedings of the 26th Annual International Conference on Machine Learning , pp. 41-48
- Bengio, Y.¹ Louradour, J.² Collobert, R.³ Weston, J.⁴

6
- 84859015643
- Learning to win by reading manuals in a monte-carlo framework
- SRK Branavan, David Silver, and Regina Barzilay. 2011. Learning to win by reading manuals in a monte-carlo framework. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1, pages 268-277.
- (2011) Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies- , vol.1 , pp. 268-277
- Branavan, S.R.K.¹ Silver, D.² Barzilay, R.³

7
- 84944036795
- Deltableu: A discriminative metric for generation tasks with intrinsically diverse targets
- Beijing, China, July
- Michel Galley, Chris Brockett, Alessandro Sordoni, Yangfeng Ji, Michael Auli, Chris Quirk, Margaret Mitchell, Jianfeng Gao, and Bill Dolan. 2015. deltaBLEU: A discriminative metric for generation tasks with intrinsically diverse targets. In Proc. of ACL-IJCNLP, pages 445-450, Beijing, China, July.
- (2015) Proc. Of ACL-IJCNLP , pp. 445-450
- Galley, M.¹ Brockett, C.² Sordoni, A.³ Ji, Y.⁴ Auli, M.⁵ Quirk, C.⁶ Mitchell, M.⁷ Gao, J.⁸ Dolan, B.⁹

8
- 84987858757
- Pomdp-based dialogue manager adaptation to extended domains
- Milica Gašic, Catherine Breslin, Matthew Henderson, Dongho Kim, Martin Szummer, Blaise Thomson, Pirros Tsiakoulis, and Steve Young. 2013a. Pomdp-based dialogue manager adaptation to extended domains. In Proceedings of SIGDIAL.
- (2013) Proceedings of SIGDIAL
- Gašic, M.¹ Breslin, C.² Henderson, M.³ Kim, D.⁴ Szummer, M.⁵ Thomson, B.⁶ Tsiakoulis, P.⁷ Young, S.⁸

9
- 84890501838
- On-line policy optimisation of Bayesian spoken dialogue systems via human interaction
- Milica Gasic, Catherine Breslin, Mike Henderson, Dongkyu Kim, Martin Szummer, Blaise Thomson, Pirros Tsiakoulis, and Steve Young. 2013b. On-line policy optimisation of bayesian spoken dialogue systems via human interaction. In Proceedings of ICASSP 2013, pages 8367-8371. IEEE.
- (2013) Proceedings of ICASSP 2013 , pp. 8367-8371
- Gasic, M.¹ Breslin, C.² Henderson, M.³ Kim, D.⁴ Szummer, M.⁵ Thomson, B.⁶ Tsiakoulis, P.⁷ Young, S.⁸

10
- 84910048103
- Incremental online adaptation of pomdp-based dialogue managers to extended domains
- Milica Gašic, Dongho Kim, Pirros Tsiakoulis, Catherine Breslin, Matthew Henderson, Martin Szummer, Blaise Thomson, and Steve Young. 2014. Incremental online adaptation of pomdp-based dialogue managers to extended domains. In Proceedings on InterSpeech.
- (2014) Proceedings on InterSpeech
- Gašic, M.¹ Kim, D.² Tsiakoulis, P.³ Breslin, C.⁴ Henderson, M.⁵ Szummer, M.⁶ Thomson, B.⁷ Young, S.⁸

11
- 84976859194
- Likelihood ratio gradient estimation for stochastic systems
- Peter W Glynn. 1990. Likelihood ratio gradient estimation for stochastic systems. Communications of the ACM, 33(10):75-84.
- (1990) Communications of the ACM , vol.33 , Issue.10 , pp. 75-84
- Glynn, P.W.¹

12
- 85011842203
- Deep reinforcement learning with a natural language action space
- Berlin, Germany, August
- Ji He, Jianshu Chen, Xiaodong He, Jianfeng Gao, Lihong Li, Li Deng, and Mari Ostendorf. 2016. Deep reinforcement learning with a natural language action space. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1621-1630, Berlin, Germany, August.
- (2016) Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pp. 1621-1630
- He, J.¹ Chen, J.² He, X.³ Gao, J.⁴ Li, L.⁵ Deng, L.⁶ Ostendorf, M.⁷

13
- 0030635367
- Learning dialogue strategies within the markov decision process framework
- Esther Levin, Roberto Pieraccini, and Wieland Eckert. 1997. Learning dialogue strategies within the markov decision process framework. In Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on, pages 72-79. IEEE.
- (1997) Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on , pp. 72-79
- Levin, E.¹ Pieraccini, R.² Eckert, W.³

14
- 0033894474
- A stochastic model of human-machine interaction for learning dialog strategies
- Esther Levin, Roberto Pieraccini, and Wieland Eckert. 2000. A stochastic model of human-machine interaction for learning dialog strategies. IEEE Transactions on Speech and Audio Processing, 8(1):11-23.
- (2000) IEEE Transactions on Speech and Audio Processing , vol.8 , Issue.1 , pp. 11-23
- Levin, E.¹ Pieraccini, R.² Eckert, W.³

15
- 84994184277
- A diversity-promoting objective function for neural conversation models
- Jiwei Li, Michel Galley, Chris Brockett, Jianfeng Gao, and Bill Dolan. 2016a. A diversity-promoting objective function for neural conversation models. In Proc. of NAACL-HLT.
- (2016) Proc. Of NAACL-HLT.
- Li, J.¹ Galley, M.² Brockett, C.³ Gao, J.⁴ Dolan, B.⁵

16
- 85011829523
- A persona-based neural conversation model
- Berlin, Germany, August
- Jiwei Li, Michel Galley, Chris Brockett, Georgios Spithourakis, Jianfeng Gao, and Bill Dolan. 2016b. A persona-based neural conversation model. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 994-1003, Berlin, Germany, August.
- (2016) Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pp. 994-1003
- Li, J.¹ Galley, M.² Brockett, C.³ Spithourakis, G.⁴ Gao, J.⁵ Dolan, B.⁶

17
- 85072827450
- arXiv preprint
- Chia-Wei Liu, Ryan Lowe, Iulian V Serban, Michael Nose-worthy, Laurent Charlin, and Joelle Pineau. 2016. How not to evaluate your dialogue system: An empirical study of unsupervised evaluation metrics for dialogue response generation. arXiv preprint arXiv:1603.08023.
- (2016) How Not to Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation
- Liu, C.-W.¹ Lowe, R.² Serban, I.V.³ Nose-Worthy, M.⁴ Charlin, L.⁵ Pineau, J.⁶

18
- 84988871055
- arXiv preprint
- Yi Luan, Yangfeng Ji, and Mari Ostendorf. 2016. LSTM based conversation models. arXiv preprint arXiv:1603.09457.
- (2016) LSTM Based Conversation Models
- Luan, Y.¹ Ji, Y.² Ostendorf, M.³

19
- 84904867557
- Playing Atari with deep reinforcement learning
- Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. 2013. Playing Atari with deep reinforcement learning. NIPS Deep Learning Workshop.
- (2013) NIPS Deep Learning Workshop
- Mnih, V.¹ Kavukcuoglu, K.² Silver, D.³ Graves, A.⁴ Antonoglou, I.⁵ Wierstra, D.⁶ Riedmiller, M.⁷

20
- 84959861546
- arXiv preprint
- Karthik Narasimhan, Tejas Kulkarni, and Regina Barzilay. 2015. Language understanding for text-based games using deep reinforcement learning. arXiv preprint arXiv:1506.08941.
- (2015) Language Understanding for Text-Based Games Using Deep Reinforcement Learning
- Narasimhan, K.¹ Kulkarni, T.² Barzilay, R.³

21
- 84983186145
- Developing non-goal dialog system based on examples of drama television
- Springer
- Lasguido Nio, Sakriani Sakti, Graham Neubig, Tomoki Toda, Mirna Adriani, and Satoshi Nakamura. 2014. Developing non-goal dialog system based on examples of drama television. In Natural Interaction with Robots, Knowbots and Smartphones, pages 355-361. Springer.
- (2014) Natural Interaction with Robots, Knowbots and Smartphones , pp. 355-361
- Nio, L.¹ Sakti, S.² Neubig, G.³ Toda, T.⁴ Adriani, M.⁵ Nakamura, S.⁶

22
- 0003062804
- Stochastic language generation for spoken dialogue systems
- Alice H Oh and Alexander I Rudnicky. 2000. Stochastic language generation for spoken dialogue systems. In Proceedings of the 2000 ANLP/NAACL Workshop on Conversational systems-Volume 3, pages 27-32.
- (2000) Proceedings of the 2000 ANLP/NAACL Workshop on Conversational Systems- , vol.3 , pp. 27-32
- Oh, A.H.¹ Rudnicky, A.I.²

23
- 85133336275
- BLEU: A method for automatic evaluation of machine translation
- Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. BLEU: a method for automatic evaluation of machine translation. In Proceedings of the 40th annual meeting on association for computational linguistics, pages 311-318.
- (2002) Proceedings of the 40th Annual Meeting on Association for Computational Linguistics , pp. 311-318
- Papineni, K.¹ Roukos, S.² Ward, T.³ Zhu, W.-J.⁴

24
- 70349849676
- Are we there yet? Research in commercial spoken dialog systems
- Springer
- Roberto Pieraccini, David Suendermann, Krishna Dayanidhi, and Jackson Liscombe. 2009. Are we there yet? Research in commercial spoken dialog systems. In Text, Speech and Dialogue, pages 3-13. Springer.
- (2009) Text, Speech and Dialogue , pp. 3-13
- Pieraccini, R.¹ Suendermann, D.² Dayanidhi, K.³ Liscombe, J.⁴

25
- 84994149921
- arXiv preprint
- Marc'Aurelio Ranzato, Sumit Chopra, Michael Auli, and Wojciech Zaremba. 2015. Sequence level training with recurrent neural networks. arXiv preprint arXiv:1511.06732.
- (2015) Sequence Level Training with Recurrent Neural Networks
- Marc'Aurelio, R.¹ Chopra, S.² Auli, M.³ Zaremba, W.⁴

26
- 0036663624
- Trainable approaches to surface natural language generation and their application to conversational dialog systems
- Adwait Ratnaparkhi. 2002. Trainable approaches to surface natural language generation and their application to conversational dialog systems. Computer Speech & Language, 16(3):435-455.
- (2002) Computer Speech & Language , vol.16 , Issue.3 , pp. 435-455
- Ratnaparkhi, A.¹

27
- 80053292690
- Data-driven response generation in social media
- Alan Ritter, Colin Cherry, and William B Dolan. 2011. Data-driven response generation in social media. In Proceedings of EMNLP 2011, pages 583-593.
- (2011) Proceedings of EMNLP 2011 , pp. 583-593
- Ritter, A.¹ Cherry, C.² Dolan, W.B.³

28
- 33747607273
- A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies
- 02
- Jost Schatzmann, Karl Weilhammer, Matt Stuttle, and Steve Young. 2006. A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies. The knowledge engineering review, 21(02):97-126.
- (2006) The Knowledge Engineering Review , vol.21 , pp. 97-126
- Schatzmann, J.¹ Weilhammer, K.² Stuttle, M.³ Young, S.⁴

29
- 34147141410
- Opening up closings
- Emanuel A. Schegloff and Harvey Sacks. 1973. Opening up closings. Semiotica, 8(4):289-327.
- (1973) Semiotica , vol.8 , Issue.4 , pp. 289-327
- Schegloff, E.A.¹ Sacks, H.²

30
- 84980367197
- Building end-to-end dialogue systems using generative hierarchical neural network models
- February
- Iulian V Serban, Alessandro Sordoni, Yoshua Bengio, Aaron Courville, and Joelle Pineau. 2016. Building end-to-end dialogue systems using generative hierarchical neural network models. In Proceedings of AAAI, February.
- (2016) Proceedings of AAAI
- Serban, I.V.¹ Sordoni, A.² Bengio, Y.³ Courville, A.⁴ Pineau, J.⁵

31
- 84943801401
- Neural responding machine for short-text conversation
- Lifeng Shang, Zhengdong Lu, and Hang Li. 2015. Neural responding machine for short-text conversation. In Proceedings of ACL-IJCNLP, pages 1577-1586.
- (2015) Proceedings of ACL-IJCNLP , pp. 1577-1586
- Shang, L.¹ Lu, Z.² Li, H.³

32
- 84963949906
- Mastering the game of go with deep neural networks and tree search
- David Silver, Aja Huang, Chris J Maddison, Arthur Guez, Laurent Sifre, George Van Den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, et al. 2016. Mastering the game of go with deep neural networks and tree search. Nature, 529(7587):484-489.
- (2016) Nature , vol.529 , Issue.7587 , pp. 484-489
- Silver, D.¹ Huang, A.² Maddison, C.J.³ Guez, A.⁴ Sifre, L.⁵ Van Den Driessche, G.⁶ Schrittwieser, J.⁷ Antonoglou, I.⁸ Panneershelvam, V.⁹ Lanctot, M.¹⁰

33
- 84898955256
- Reinforcement learning for spoken dialogue systems
- Satinder P Singh, Michael J Kearns, Diane J Litman, and Marilyn A Walker. 1999. Reinforcement learning for spoken dialogue systems. In Nips, pages 956-962.
- (1999) Nips , pp. 956-962
- Singh, S.P.¹ Kearns, M.J.² Litman, D.J.³ Walker, M.A.⁴

34
- 85158142417
- Empirical evaluation of a reinforcement learning spoken dialogue system
- Satinder Singh, Michael Kearns, Diane J Litman, Marilyn A Walker, et al. 2000. Empirical evaluation of a reinforcement learning spoken dialogue system. In AAAI/IAAI, pages 645-651.
- (2000) AAAI/IAAI , pp. 645-651
- Singh, S.¹ Kearns, M.² Litman, D.J.³ Walker, M.A.⁴

35
- 0037841376
- Optimizing dialogue management with reinforcement learning: Experiments with the njfun system
- Satinder Singh, Diane Litman, Michael Kearns, and Marilyn Walker. 2002. Optimizing dialogue management with reinforcement learning: Experiments with the njfun system. Journal of Artificial Intelligence Research, pages 105-133.
- (2002) Journal of Artificial Intelligence Research , pp. 105-133
- Singh, S.¹ Litman, D.² Kearns, M.³ Walker, M.⁴

36
- 84960121226
- A neural network approach to context-sensitive generation of conversational responses
- Alessandro Sordoni, Michel Galley, Michael Auli, Chris Brockett, Yangfeng Ji, Meg Mitchell, Jian-Yun Nie, Jianfeng Gao, and Bill Dolan. 2015. A neural network approach to context-sensitive generation of conversational responses. In Proceedings of NAACL-HLT.
- (2015) Proceedings of NAACL-HLT
- Sordoni, A.¹ Galley, M.² Auli, M.³ Brockett, C.⁴ Ji, Y.⁵ Mitchell, M.⁶ Nie, J.-Y.⁷ Gao, J.⁸ Dolan, B.⁹

37
- 85021681141
- arxiv
- Pei-Hao Su, Milica Gasic, Nikola Mrksic, Lina Rojas-Barahona, Stefan Ultes, David Vandyke, Tsung-Hsien Wen, and Steve Young. 2016. Continuously learning neural dialogue management. arxiv.
- (2016) Continuously Learning Neural Dialogue Management
- Su, P.-H.¹ Gasic, M.² Mrksic, N.³ Rojas-Barahona, L.⁴ Ultes, S.⁵ Vandyke, D.⁶ Wen, T.-H.⁷ Young, S.⁸

38
- 84928547704
- Sequence to sequence learning with neural networks
- Ilya Sutskever, Oriol Vinyals, and Quoc V Le. 2014. Sequence to sequence learning with neural networks. In Advances in neural information processing systems, pages 3104-3112.
- (2014) Advances in Neural Information Processing Systems , pp. 3104-3112
- Sutskever, I.¹ Vinyals, O.² Le, Q.V.³

39
- 84898939480
- Policy gradient methods for reinforcement learning with function approximation
- Richard S Sutton, David A McAllester, Satinder P Singh, Yishay Mansour, et al. 1999. Policy gradient methods for reinforcement learning with function approximation. In NIPS, volume 99, pages 1057-1063.
- (1999) NIPS , vol.99 , pp. 1057-1063
- Sutton, R.S.¹ McAllester, D.A.² Singh, S.P.³ Mansour, Y.⁴

40
- 84980377939
- A neural conversational model
- Oriol Vinyals and Quoc Le. 2015. A neural conversational model. In Proceedings of ICML Deep Learning Workshop.
- (2015) Proceedings of ICML Deep Learning Workshop
- Vinyals, O.¹ Le, Q.²

41
- 84859945237
- Learning to follow navigational directions
- Adam Vogel and Dan Jurafsky. 2010. Learning to follow navigational directions. In Proceedings of ACL 2010, pages 806-814.
- (2010) Proceedings of ACL 2010 , pp. 806-814
- Vogel, A.¹ Jurafsky, D.²

42
- 84994101051
- A trainable generator for recommendations in multimodal dialog
- Marilyn A Walker, Rashmi Prasad, and Amanda Stent. 2003. A trainable generator for recommendations in multimodal dialog. In Proceeedings of INTERSPEECH 2003.
- (2003) Proceeedings of INTERSPEECH 2003
- Walker, M.A.¹ Prasad, R.² Stent, A.³

43
- 14344279109
- An application of reinforcement learning to dialogue strategy selection in a spoken dialogue system for email
- Marilyn A. Walker. 2000. An application of reinforcement learning to dialogue strategy selection in a spoken dialogue system for email. Journal of Artificial Intelligence Research, pages 387-416.
- (2000) Journal of Artificial Intelligence Research , pp. 387-416
- Walker, M.A.¹

44
- 84959897734
- Semantically conditioned LSTM-based natural language generation for spoken dialogue systems
- Lisbon, Portugal
- Tsung-Hsien Wen, Milica Gasic, Nikola Mrkšic, Pei-Hao Su, David Vandyke, and Steve Young. 2015. Semantically conditioned LSTM-based natural language generation for spoken dialogue systems. In Proceedings of EMNLP, pages 1711-1721, Lisbon, Portugal.
- (2015) Proceedings of EMNLP , pp. 1711-1721
- Wen, T.-H.¹ Gasic, M.² Mrkšic, N.³ Su, P.-H.⁴ Vandyke, D.⁵ Young, S.⁶

45
- 85018716106
- arXiv preprint
- Tsung-Hsien Wen, Milica Gasic, Nikola Mrksic, Lina M Rojas-Barahona, Pei-Hao Su, Stefan Ultes, David Vandyke, and Steve Young. 2016. A network-based end-to-end trainable task-oriented dialogue system. arXiv preprint arXiv:1604.04562.
- (2016) A Network-Based End-to-End Trainable Task-Oriented Dialogue System
- Wen, T.-H.¹ Gasic, M.² Mrksic, N.³ Rojas-Barahona, L.M.⁴ Su, P.-H.⁵ Ultes, S.⁶ Vandyke, D.⁷ Young, S.⁸

46
- 0000337576
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- Ronald J Williams. 1992. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine learning, 8(3-4):229-256.
- (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 229-256
- Williams, R.J.¹

47
- 85028330084
- arXiv preprint
- Zhen Xu, Bingquan Liu, Baoxun Wang, Chengjie Sun, and Xiaolong Wang. 2016. Incorporating loose-structured knowledge into LSTM with recall gate for conversation modeling. arXiv preprint arXiv:1605.05110.
- (2016) Incorporating Loose-Structured Knowledge into LSTM with Recall Gate for Conversation Modeling
- Xu, Z.¹ Liu, B.² Wang, B.³ Sun, C.⁴ Wang, X.⁵

48
- 84996560804
- Attention with intention for a neural network conversation model
- Kaisheng Yao, Geoffrey Zweig, and Baolin Peng. 2015. Attention with intention for a neural network conversation model. In NIPS workshop on Machine Learning for Spoken Language Understanding and Interaction.
- (2015) NIPS Workshop on Machine Learning for Spoken Language Understanding and Interaction
- Yao, K.¹ Zweig, G.² Peng, B.³

49
- 70349231178
- The hidden information state model: A practical framework for pomdp-based spoken dialogue management
- Steve Young, Milica Gašic, Simon Keizer, François Mairesse, Jost Schatzmann, Blaise Thomson, and Kai Yu. 2010. The hidden information state model: A practical framework for pomdp-based spoken dialogue management. Computer Speech & Language, 24(2):150-174.
- (2010) Computer Speech & Language , vol.24 , Issue.2 , pp. 150-174
- Young, S.¹ Gašic, M.² Keizer, S.³ Mairesse, F.⁴ Schatzmann, J.⁵ Thomson, B.⁶ Yu, K.⁷

50
- 84876682878
- Pomdp-based statistical spoken dialog systems: A review
- Steve Young, Milica Gasic, Blaise Thomson, and Jason D Williams. 2013. Pomdp-based statistical spoken dialog systems: A review. Proceedings of the IEEE, 101(5):1160-1179.
- (2013) Proceedings of the IEEE , vol.101 , Issue.5 , pp. 1160-1179
- Young, S.¹ Gasic, M.² Thomson, B.³ Williams, J.D.⁴

51
- 84943750581
- arXiv preprint
- Wojciech Zaremba and Ilya Sutskever. 2015. Reinforcement learning neural Turing machines. arXiv preprint arXiv:1505.00521.
- (2015) Reinforcement Learning Neural Turing Machines
- Zaremba, W.¹ Sutskever, I.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.