메뉴 건너뛰기




Volumn , Issue , 2016, Pages 1192-1202

Deep reinforcement learning for dialogue generation

Author keywords

[No Author keywords available]

Indexed keywords

GRADIENT METHODS; LEARNING ALGORITHMS; MACHINE LEARNING; NATURAL LANGUAGE PROCESSING SYSTEMS; REINFORCEMENT LEARNING;

EID: 85029377314     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.18653/v1/d16-1127     Document Type: Conference Paper
Times cited : (968)

References (51)
  • 2
    • 0002137296 scopus 로고
    • On the semantics and pragmatics of linguistic feedback
    • Jens Allwood, Joakim Nivre, and Elisabeth Ahlsén. 1992. On the semantics and pragmatics of linguistic feedback. Journal of Semantics, 9:1-26.
    • (1992) Journal of Semantics , vol.9 , pp. 1-26
    • Allwood, J.1    Nivre, J.2    Ahlsén, E.3
  • 3
    • 85083953689 scopus 로고    scopus 로고
    • Neural machine translation by jointly learning to align and translate
    • Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2015. Neural machine translation by jointly learning to align and translate. In Proc. of ICLR.
    • (2015) Proc. Of ICLR.
    • Bahdanau, D.1    Cho, K.2    Bengio, Y.3
  • 4
    • 85119962881 scopus 로고    scopus 로고
    • Iris: A chat-oriented dialogue system based on the vector space model
    • Rafael E Banchs and Haizhou Li. 2012. IRIS: a chat-oriented dialogue system based on the vector space model. In Proceedings of the ACL 2012 System Demonstrations, pages 37-42.
    • (2012) Proceedings of the ACL 2012 System Demonstrations , pp. 37-42
    • Banchs, R.E.1    Li, H.2
  • 7
    • 84944036795 scopus 로고    scopus 로고
    • Deltableu: A discriminative metric for generation tasks with intrinsically diverse targets
    • Beijing, China, July
    • Michel Galley, Chris Brockett, Alessandro Sordoni, Yangfeng Ji, Michael Auli, Chris Quirk, Margaret Mitchell, Jianfeng Gao, and Bill Dolan. 2015. deltaBLEU: A discriminative metric for generation tasks with intrinsically diverse targets. In Proc. of ACL-IJCNLP, pages 445-450, Beijing, China, July.
    • (2015) Proc. Of ACL-IJCNLP , pp. 445-450
    • Galley, M.1    Brockett, C.2    Sordoni, A.3    Ji, Y.4    Auli, M.5    Quirk, C.6    Mitchell, M.7    Gao, J.8    Dolan, B.9
  • 11
    • 84976859194 scopus 로고
    • Likelihood ratio gradient estimation for stochastic systems
    • Peter W Glynn. 1990. Likelihood ratio gradient estimation for stochastic systems. Communications of the ACM, 33(10):75-84.
    • (1990) Communications of the ACM , vol.33 , Issue.10 , pp. 75-84
    • Glynn, P.W.1
  • 14
    • 0033894474 scopus 로고    scopus 로고
    • A stochastic model of human-machine interaction for learning dialog strategies
    • Esther Levin, Roberto Pieraccini, and Wieland Eckert. 2000. A stochastic model of human-machine interaction for learning dialog strategies. IEEE Transactions on Speech and Audio Processing, 8(1):11-23.
    • (2000) IEEE Transactions on Speech and Audio Processing , vol.8 , Issue.1 , pp. 11-23
    • Levin, E.1    Pieraccini, R.2    Eckert, W.3
  • 15
    • 84994184277 scopus 로고    scopus 로고
    • A diversity-promoting objective function for neural conversation models
    • Jiwei Li, Michel Galley, Chris Brockett, Jianfeng Gao, and Bill Dolan. 2016a. A diversity-promoting objective function for neural conversation models. In Proc. of NAACL-HLT.
    • (2016) Proc. Of NAACL-HLT.
    • Li, J.1    Galley, M.2    Brockett, C.3    Gao, J.4    Dolan, B.5
  • 26
    • 0036663624 scopus 로고    scopus 로고
    • Trainable approaches to surface natural language generation and their application to conversational dialog systems
    • Adwait Ratnaparkhi. 2002. Trainable approaches to surface natural language generation and their application to conversational dialog systems. Computer Speech & Language, 16(3):435-455.
    • (2002) Computer Speech & Language , vol.16 , Issue.3 , pp. 435-455
    • Ratnaparkhi, A.1
  • 27
    • 80053292690 scopus 로고    scopus 로고
    • Data-driven response generation in social media
    • Alan Ritter, Colin Cherry, and William B Dolan. 2011. Data-driven response generation in social media. In Proceedings of EMNLP 2011, pages 583-593.
    • (2011) Proceedings of EMNLP 2011 , pp. 583-593
    • Ritter, A.1    Cherry, C.2    Dolan, W.B.3
  • 28
    • 33747607273 scopus 로고    scopus 로고
    • A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies
    • 02
    • Jost Schatzmann, Karl Weilhammer, Matt Stuttle, and Steve Young. 2006. A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies. The knowledge engineering review, 21(02):97-126.
    • (2006) The Knowledge Engineering Review , vol.21 , pp. 97-126
    • Schatzmann, J.1    Weilhammer, K.2    Stuttle, M.3    Young, S.4
  • 29
    • 34147141410 scopus 로고
    • Opening up closings
    • Emanuel A. Schegloff and Harvey Sacks. 1973. Opening up closings. Semiotica, 8(4):289-327.
    • (1973) Semiotica , vol.8 , Issue.4 , pp. 289-327
    • Schegloff, E.A.1    Sacks, H.2
  • 30
    • 84980367197 scopus 로고    scopus 로고
    • Building end-to-end dialogue systems using generative hierarchical neural network models
    • February
    • Iulian V Serban, Alessandro Sordoni, Yoshua Bengio, Aaron Courville, and Joelle Pineau. 2016. Building end-to-end dialogue systems using generative hierarchical neural network models. In Proceedings of AAAI, February.
    • (2016) Proceedings of AAAI
    • Serban, I.V.1    Sordoni, A.2    Bengio, Y.3    Courville, A.4    Pineau, J.5
  • 31
    • 84943801401 scopus 로고    scopus 로고
    • Neural responding machine for short-text conversation
    • Lifeng Shang, Zhengdong Lu, and Hang Li. 2015. Neural responding machine for short-text conversation. In Proceedings of ACL-IJCNLP, pages 1577-1586.
    • (2015) Proceedings of ACL-IJCNLP , pp. 1577-1586
    • Shang, L.1    Lu, Z.2    Li, H.3
  • 33
    • 84898955256 scopus 로고    scopus 로고
    • Reinforcement learning for spoken dialogue systems
    • Satinder P Singh, Michael J Kearns, Diane J Litman, and Marilyn A Walker. 1999. Reinforcement learning for spoken dialogue systems. In Nips, pages 956-962.
    • (1999) Nips , pp. 956-962
    • Singh, S.P.1    Kearns, M.J.2    Litman, D.J.3    Walker, M.A.4
  • 34
    • 85158142417 scopus 로고    scopus 로고
    • Empirical evaluation of a reinforcement learning spoken dialogue system
    • Satinder Singh, Michael Kearns, Diane J Litman, Marilyn A Walker, et al. 2000. Empirical evaluation of a reinforcement learning spoken dialogue system. In AAAI/IAAI, pages 645-651.
    • (2000) AAAI/IAAI , pp. 645-651
    • Singh, S.1    Kearns, M.2    Litman, D.J.3    Walker, M.A.4
  • 35
    • 0037841376 scopus 로고    scopus 로고
    • Optimizing dialogue management with reinforcement learning: Experiments with the njfun system
    • Satinder Singh, Diane Litman, Michael Kearns, and Marilyn Walker. 2002. Optimizing dialogue management with reinforcement learning: Experiments with the njfun system. Journal of Artificial Intelligence Research, pages 105-133.
    • (2002) Journal of Artificial Intelligence Research , pp. 105-133
    • Singh, S.1    Litman, D.2    Kearns, M.3    Walker, M.4
  • 39
    • 84898939480 scopus 로고    scopus 로고
    • Policy gradient methods for reinforcement learning with function approximation
    • Richard S Sutton, David A McAllester, Satinder P Singh, Yishay Mansour, et al. 1999. Policy gradient methods for reinforcement learning with function approximation. In NIPS, volume 99, pages 1057-1063.
    • (1999) NIPS , vol.99 , pp. 1057-1063
    • Sutton, R.S.1    McAllester, D.A.2    Singh, S.P.3    Mansour, Y.4
  • 41
    • 84859945237 scopus 로고    scopus 로고
    • Learning to follow navigational directions
    • Adam Vogel and Dan Jurafsky. 2010. Learning to follow navigational directions. In Proceedings of ACL 2010, pages 806-814.
    • (2010) Proceedings of ACL 2010 , pp. 806-814
    • Vogel, A.1    Jurafsky, D.2
  • 43
    • 14344279109 scopus 로고    scopus 로고
    • An application of reinforcement learning to dialogue strategy selection in a spoken dialogue system for email
    • Marilyn A. Walker. 2000. An application of reinforcement learning to dialogue strategy selection in a spoken dialogue system for email. Journal of Artificial Intelligence Research, pages 387-416.
    • (2000) Journal of Artificial Intelligence Research , pp. 387-416
    • Walker, M.A.1
  • 44
    • 84959897734 scopus 로고    scopus 로고
    • Semantically conditioned LSTM-based natural language generation for spoken dialogue systems
    • Lisbon, Portugal
    • Tsung-Hsien Wen, Milica Gasic, Nikola Mrkšic, Pei-Hao Su, David Vandyke, and Steve Young. 2015. Semantically conditioned LSTM-based natural language generation for spoken dialogue systems. In Proceedings of EMNLP, pages 1711-1721, Lisbon, Portugal.
    • (2015) Proceedings of EMNLP , pp. 1711-1721
    • Wen, T.-H.1    Gasic, M.2    Mrkšic, N.3    Su, P.-H.4    Vandyke, D.5    Young, S.6
  • 46
    • 0000337576 scopus 로고
    • Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • Ronald J Williams. 1992. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine learning, 8(3-4):229-256.
    • (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 229-256
    • Williams, R.J.1
  • 49
    • 70349231178 scopus 로고    scopus 로고
    • The hidden information state model: A practical framework for pomdp-based spoken dialogue management
    • Steve Young, Milica Gašic, Simon Keizer, François Mairesse, Jost Schatzmann, Blaise Thomson, and Kai Yu. 2010. The hidden information state model: A practical framework for pomdp-based spoken dialogue management. Computer Speech & Language, 24(2):150-174.
    • (2010) Computer Speech & Language , vol.24 , Issue.2 , pp. 150-174
    • Young, S.1    Gašic, M.2    Keizer, S.3    Mairesse, F.4    Schatzmann, J.5    Thomson, B.6    Yu, K.7
  • 50
    • 84876682878 scopus 로고    scopus 로고
    • Pomdp-based statistical spoken dialog systems: A review
    • Steve Young, Milica Gasic, Blaise Thomson, and Jason D Williams. 2013. Pomdp-based statistical spoken dialog systems: A review. Proceedings of the IEEE, 101(5):1160-1179.
    • (2013) Proceedings of the IEEE , vol.101 , Issue.5 , pp. 1160-1179
    • Young, S.1    Gasic, M.2    Thomson, B.3    Williams, J.D.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.