메뉴 건너뛰기




Volumn 22, Issue 1, 2014, Pages 28-40

Gaussian processes for POMDP-based dialogue manager optimization

Author keywords

Gaussian process; POMDP; Statistical dialog systems

Indexed keywords

GAUSSIAN DISTRIBUTION; GAUSSIAN NOISE (ELECTRONIC); MANAGEMENT; MANAGERS; MARKOV PROCESSES;

EID: 84897936325     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2013.2282190     Document Type: Article
Times cited : (130)

References (40)
  • 1
    • 85009291577 scopus 로고    scopus 로고
    • Talking to machines (Statistically speaking)
    • S. Young, "Talking to machines (Statistically speaking)," in Proc. ICSLP, 2002.
    • Proc. ICSLP, 2002
    • Young, S.1
  • 2
    • 84880707672 scopus 로고    scopus 로고
    • Spoken dialogue management using probabilistic reasoning
    • N. Roy, J. Pineau, and S. Thrun, "Spoken dialogue management using probabilistic reasoning," in Proc. ACL, 2000.
    • Proc. ACL, 2000
    • Roy, N.1    Pineau, J.2    Thrun, S.3
  • 4
    • 33750703175 scopus 로고    scopus 로고
    • Partially observable Markov decision processes for spoken dialog systems
    • DOI 10.1016/j.csl.2006.06.008, PII S0885230806000283
    • J. Williams and S. Young, "Partially observable Markov decision processes for spoken dialog systems," Comput. Speech Lang., vol. 21, no. 2, pp. 393-422, 2007. (Pubitemid 44709839)
    • (2007) Computer Speech and Language , vol.21 , Issue.2 , pp. 393-422
    • Williams, J.D.1    Young, S.2
  • 6
    • 70349231178 scopus 로고    scopus 로고
    • The hidden information state model: A practical framework for POMDP-based spoken dialogue management
    • S. Young, M. Gašić, S. Keizer, F. Mairesse, J. Schatzmann, B. Thomson, and K. Yu, "The hidden information state model: A practical framework for POMDP-based spoken dialogue management," Comput. Speech Lang., vol. 24, no. 2, pp. 150-174, 2010.
    • (2010) Comput. Speech Lang. , vol.24 , Issue.2 , pp. 150-174
    • Young, S.1    Gašić, M.2    Keizer, S.3    Mairesse, F.4    Schatzmann, J.5    Thomson, B.6    Yu, K.7
  • 8
    • 80052051092 scopus 로고    scopus 로고
    • Natural actor and belief critic: Reinforcement algorithm for learning parameters of dialogue systems modelled as POMDPs
    • F. Jurèíèek, B. Thomson, and S. Young, "Natural actor and belief critic: Reinforcement algorithm for learning parameters of dialogue systems modelled as POMDPs," ACM Trans. Speech Lang. Process., pp. 6:1-6:26, 2011.
    • (2011) ACM Trans. Speech Lang. Process.
    • Jurèíèek, F.1    Thomson, B.2    Young, S.3
  • 9
    • 0032073263 scopus 로고    scopus 로고
    • Planning and acting in partially observable stochastic domains
    • PII S000437029800023X
    • L. Kaelbling, M. Littman, and A. Cassandra, "Planning and acting in partially observable stochastic domains," Artif. Intell., vol. 101, pp. 99-134, 1998. (Pubitemid 128387390)
    • (1998) Artificial Intelligence , vol.101 , Issue.1-2 , pp. 99-134
    • Kaelbling, L.P.1    Littman, M.L.2    Cassandra, A.R.3
  • 10
    • 84880772945 scopus 로고    scopus 로고
    • Point-based value iteration: An anytime algorithm for POMDPs
    • J. Pineau, G. Gordon, and S. Thrun, "Point-based value iteration: An anytime algorithm for POMDPs," in Proc. IJCAI, 2003, pp. 1025-1032.
    • Proc. IJCAI, 2003 , pp. 1025-1032
    • Pineau, J.1    Gordon, G.2    Thrun, S.3
  • 11
    • 52949143575 scopus 로고    scopus 로고
    • Scaling POMDPs for spoken dialog management
    • Sep.
    • J. Williams and S. Young, "Scaling POMDPs for spoken dialog management," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 7, pp. 2116-2129, Sep. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.7 , pp. 2116-2129
    • Williams, J.1    Young, S.2
  • 12
    • 51449120317 scopus 로고    scopus 로고
    • Hybrid reinforcement/supervised learning for dialogue policies from fixed data sets
    • J. Henderson, O. Lemon, and K. Georgila, "Hybrid reinforcement/supervised learning for dialogue policies from fixed data sets," Comput. Linguist., vol. 34, no. 4, pp. 487-511, 2008.
    • (2008) Comput. Linguist. , vol.34 , Issue.4 , pp. 487-511
    • Henderson, J.1    Lemon, O.2    Georgila, K.3
  • 13
    • 77950862681 scopus 로고    scopus 로고
    • Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems
    • B. Thomson and S. Young, "Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems," Comput. Speech Lang., vol. 24, no. 4, pp. 562-588, 2010.
    • (2010) Comput. Speech Lang. , vol.24 , Issue.4 , pp. 562-588
    • Thomson, B.1    Young, S.2
  • 14
    • 70450186275 scopus 로고    scopus 로고
    • Reinforcement learning for dialog management using least-squares policy iteration and fast feature selection
    • L. Li, J. Williams, and S. Balakrishnan, "Reinforcement learning for dialog management using least-squares policy iteration and fast feature selection," in Proc. Interspeech, 2009.
    • Proc. Interspeech, 2009
    • Li, L.1    Williams, J.2    Balakrishnan, S.3
  • 15
    • 84865791247 scopus 로고    scopus 로고
    • Lossless value directed compression of complex user goal states for statistical spoken dialogue systems
    • P. Crook and O. Lemon, "Lossless value directed compression of complex user goal states for statistical spoken dialogue systems," in Proc. Interspeech, 2011.
    • Proc. Interspeech, 2011
    • Crook, P.1    Lemon, O.2
  • 16
    • 84867619228 scopus 로고    scopus 로고
    • Off-policy learning in large-scale POMDP-based dialogue systems
    • L. Daubigney, M. Geist, and O. Pietquin, "Off-policy learning in large-scale POMDP-based dialogue systems," in Proc. ICASSP, 2012, pp. 4989-4992.
    • Proc. ICASSP, 2012 , pp. 4989-4992
    • Daubigney, L.1    Geist, M.2    Pietquin, O.3
  • 17
    • 84897938811 scopus 로고    scopus 로고
    • [Online]. Available
    • Amazon Mechanical Turk Amazon, 2011 [Online]. Available: https://www.mturk.com/mturk/welcome
    • (2011)
  • 18
    • 1942421151 scopus 로고    scopus 로고
    • Bayes meets Bellman: The Gaussian process approach to temporal difference learning
    • Y. Engel, S. Mannor, and R. Meir, "Bayes meets Bellman: The Gaussian process approach to temporal difference learning," in Proc. ICML, 2003.
    • Proc. ICML, 2003
    • Engel, Y.1    Mannor, S.2    Meir, R.3
  • 20
    • 84899026055 scopus 로고    scopus 로고
    • Gaussian processes in reinforcement learning
    • Cambridge, MA, USA: MIT Press
    • C. E. Rasmussen and M. Kuss, "Gaussian processes in reinforcement learning," in Advances in Neural Information Processing Systems. Cambridge, MA, USA: MIT Press, 2004, vol. 16, pp. 751-759.
    • (2004) Advances in Neural Information Processing Systems , vol.16 , pp. 751-759
    • Rasmussen, C.E.1    Kuss, M.2
  • 21
    • 61849173491 scopus 로고    scopus 로고
    • Gaussian process dynamic programming
    • M. Deisenroth, C. Rasmussen, and J. Peters, "Gaussian process dynamic programming," Neurocomputing, vol. 72, no. 7-9, pp. 1508-1524, 2009.
    • (2009) Neurocomputing , vol.72 , Issue.7-9 , pp. 1508-1524
    • Deisenroth, M.1    Rasmussen, C.2    Peters, J.3
  • 23
    • 84858973294 scopus 로고    scopus 로고
    • Ph.D. dissertation, Univ. of Cambridge, Cambridge, U.K.
    • M. Gašić, "Statistical dialogue modelling," Ph.D. dissertation, Univ. of Cambridge, Cambridge, U.K., 2011.
    • (2011) Statistical Dialogue Modelling
    • Gašić, M.1
  • 25
    • 0028424239 scopus 로고
    • Improving generalization with active learning
    • D. Cohn, L. Atlas, and R. Ladner, "Improving generalization with active learning," Mach. Learn., vol. 15, pp. 201-221, 1994.
    • (1994) Mach. Learn. , vol.15 , pp. 201-221
    • Cohn, D.1    Atlas, L.2    Ladner, R.3
  • 26
    • 0000695404 scopus 로고
    • Information-based objective functions for active data selection
    • D. J. C. MacKay, "Information-based objective functions for active data selection," Neural Comput., vol. 4, no. 4, pp. 590-604, 1992.
    • (1992) Neural Comput. , vol.4 , Issue.4 , pp. 590-604
    • MacKay, D.J.C.1
  • 27
  • 29
  • 31
    • 40649106649 scopus 로고    scopus 로고
    • Natural actor-critic
    • J. Peters and S. Schaal, "Natural actor-critic," Neurocomputing, vol. 71, pp. 1180-1190, 2008.
    • (2008) Neurocomputing , vol.71 , pp. 1180-1190
    • Peters, J.1    Schaal, S.2
  • 37
    • 84858956984 scopus 로고    scopus 로고
    • On-line policy optimisation of spoken dialogue systems via live interaction with human subjects
    • M. Gašić, F. Jurèíèek, B. Thomson, K. Yu, and S. Young, "On-line policy optimisation of spoken dialogue systems via live interaction with human subjects," in Proc. ASRU, 2011.
    • Proc. ASRU, 2011
    • Gašić, M.1    Jurèíèek, F.2    Thomson, B.3    Yu, K.4    Young, S.5
  • 40
    • 84862630897 scopus 로고    scopus 로고
    • Hilbertian metrics and positive definite kernels on probability measures
    • M. Hein and O. Bousquet, "Hilbertian metrics and positive definite kernels on probability measures," in Proc. AISTATS '05, 2004.
    • Proc. AISTATS '05, 2004
    • Hein, M.1    Bousquet, O.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.