메뉴 건너뛰기




Volumn 26, Issue 3, 2012, Pages 168-192

Reinforcement learning for parameter estimation in statistical spoken dialogue systems

Author keywords

Dialogue management; POMDP; Reinforcement learning; Spoken dialogue systems

Indexed keywords

DESIGN CONSIDERATIONS; DIALOGUE MANAGEMENT; DIALOGUE MODELS; DIALOGUE SYSTEMS; INFORMATION DOMAINS; MODEL PARAMETERS; PARTIALLY OBSERVABLE MARKOV DECISION PROCESS; POMDP; REINFORCEMENT ALGORITHMS; REINFORCEMENT TECHNIQUE; REWARD FUNCTION; SPOKEN DIALOGUE SYSTEM;

EID: 84855300452     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2011.09.004     Document Type: Article
Times cited : (48)

References (40)
  • 2
    • 0000396062 scopus 로고    scopus 로고
    • Natural Gradient Works Efficiently in Learning
    • Amari S. Natural gradient works efficiently in learning Neural Computation 10 2 1998 251 276 (Pubitemid 128463152)
    • (1998) Neural Computation , vol.10 , Issue.2 , pp. 251-276
    • Amari, S.-I.1
  • 6
    • 56749163138 scopus 로고    scopus 로고
    • Spoken language interaction with model uncertainty: An adaptive human robot interaction system
    • 10.1080/09540090802413145
    • Doshi F., and Roy N. Spoken language interaction with model uncertainty: an adaptive human robot interaction system Connection Science 20 4 2008 299 318 10.1080/09540090802413145
    • (2008) Connection Science , vol.20 , Issue.4 , pp. 299-318
    • Doshi, F.1    Roy, N.2
  • 7
    • 77958539351 scopus 로고    scopus 로고
    • The infinite partially observable Markov decision process
    • Bengio Y. Schuurmans D. Lafferty J. Williams C.K.I. Culotta A.
    • Doshi-Velez F. The infinite partially observable Markov decision process Bengio Y. Schuurmans D. Lafferty J. Williams C.K.I. Culotta A. Advances in Neural Information Processing Systems, vol. 22 2009 477 485
    • (2009) Advances in Neural Information Processing Systems, Vol. 22 , pp. 477-485
    • Doshi-Velez, F.1
  • 8
    • 2942598511 scopus 로고    scopus 로고
    • Evaluation and usability of multimodal spoken language dialogue systems
    • 10.1016/j.specom.2004.02.001
    • Dybkjaer L., Bernsen N.O., and Minker W. Evaluation and usability of multimodal spoken language dialogue systems Speech Communication 43 1-2 2004 33 54 10.1016/j.specom.2004.02.001
    • (2004) Speech Communication , vol.43 , Issue.12 , pp. 33-54
    • Dybkjaer, L.1    Bernsen, N.O.2    Minker, W.3
  • 10
    • 84942484786 scopus 로고
    • Ridge regression: Biased estimation for nonorthogonal problems
    • Hoerl A.E., and Kennard R.W. Ridge regression: biased estimation for nonorthogonal problems Technometrics 12 1970 55 67
    • (1970) Technometrics , vol.12 , pp. 55-67
    • Hoerl, A.E.1    Kennard, R.W.2
  • 12
    • 79959813974 scopus 로고    scopus 로고
    • Natural Belief-Critic: A reinforcement algorithm for parameter estimation in statistical spoken dialogue systems
    • Kobayashi T. Hirose K. Nakamura S.
    • Jurčíček F., Thomson B., Keizer S., Mairesse F., Gašić M., Yu K., and Young S. Natural Belief-Critic: a reinforcement algorithm for parameter estimation in statistical spoken dialogue systems Kobayashi T. Hirose K. Nakamura S. Proc. Interspeech. ISCA 2010 90 93
    • (2010) Proc. Interspeech. ISCA , pp. 90-93
    • Jurčíček, F.1    Thomson, B.2    Keizer, S.3    Mairesse, F.4    Gašić, M.5    Yu, K.6    Young, S.7
  • 13
    • 80052051092 scopus 로고    scopus 로고
    • Natural actor and belief critic: Reinforcement algorithm for learning parameters of dialogue systems modelled as pomdps
    • JUNE 6:1-6:26.
    • Jurčíček F., Thomson B., and Young S. Natural actor and belief critic: reinforcement algorithm for learning parameters of dialogue systems modelled as pomdps ACM Transactions on Speech and Language Processing 7 June 2011 6:1-6:26. http://doi.acm.org/10.1145/1966407.1966411
    • (2011) ACM Transactions on Speech and Language Processing , vol.7
    • Jurčíček, F.1    Thomson, B.2    Young, S.3
  • 23
    • 79959834356 scopus 로고    scopus 로고
    • Using automatically transcribed dialogs to learn user models in a spoken dialog system
    • Morristown, USA
    • Syed U., and Williams J.D. Using automatically transcribed dialogs to learn user models in a spoken dialog system HLT Morristown, USA 2008 121 124
    • (2008) HLT , pp. 121-124
    • Syed, U.1    Williams, J.D.2
  • 28
    • 77950862681 scopus 로고    scopus 로고
    • Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems
    • Thomson B., and Young S. Bayesian update of dialogue state: a POMDP framework for spoken dialogue systems Computer Speech and Language 24 4 2010 562 588
    • (2010) Computer Speech and Language , vol.24 , Issue.4 , pp. 562-588
    • Thomson, B.1    Young, S.2
  • 34
    • 33750703175 scopus 로고    scopus 로고
    • Partially observable Markov decision processes for spoken dialog systems
    • DOI 10.1016/j.csl.2006.06.008, PII S0885230806000283
    • Williams J.D., and Young S. Partially observable Markov decision processes for spoken dialog systems Computer Speech and Language 21 2 2007 393 422 (Pubitemid 44709839)
    • (2007) Computer Speech and Language , vol.21 , Issue.2 , pp. 393-422
    • Williams, J.D.1    Young, S.2
  • 36
    • 0000337576 scopus 로고
    • Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • Williams R.J. Simple statistical gradient-following algorithms for connectionist reinforcement learning Machine Learning 8 1992 229 256
    • (1992) Machine Learning , vol.8 , pp. 229-256
    • Williams, R.J.1
  • 39
    • 78549277875 scopus 로고    scopus 로고
    • Tech. rep., Cambridge University Engineering Dept
    • Young, S., 2007. CUED Standard Dialogue Acts. Tech. rep., Cambridge University Engineering Dept. http://mi.eng.cam.ac.uk/research/dialogue/ LocalDocs/dastd.pdf.
    • (2007) CUED Standard Dialogue Acts
    • Young, S.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.