-
3
-
-
84857755225
-
Gaussian processes for fast policy optimisation of pomdp-based dialogue managers
-
Milica Gasic, Filip Jurcicek, Simon Keizer, François Mairesse, Blaise Thomson, Kai Yu, and Steve Young. Gaussian processes for fast policy optimisation of pomdp-based dialogue managers. In SIGDIAL'10, Tokyo, Japan, 2010.
-
SIGDIAL'10, Tokyo, Japan, 2010
-
-
Gasic, M.1
Jurcicek, F.2
Keizer, S.3
Mairesse, F.4
Thomson, B.5
Yu, K.6
Young, S.7
-
5
-
-
84881043838
-
Managing Uncertainty within Value Function Approximation in Reinforcement Learning
-
Matthieu Geist and Olivier Pietquin. Managing Uncertainty within Value Function Approximation in Reinforcement Learning. In Journal of Machine Learning Research, Workshop & Conference Proceedings (JMLR W& CP): Active Learning and Experimental Design, Sardinia, Italy, 2010.
-
Journal of Machine Learning Research, Workshop & Conference Proceedings (JMLR W& CP): Active Learning and Experimental Design, Sardinia, Italy, 2010
-
-
Geist, M.1
Pietquin, O.2
-
6
-
-
84880694195
-
Stable Function Approximation in Dynamic Programming
-
Geoffrey Gordon. Stable Function Approximation in Dynamic Programming. In ICML'95, 1995.
-
(1995)
ICML'95
-
-
Gordon, G.1
-
7
-
-
51449120317
-
Hybrid reinforcement/supervised learning of dialogue policies from fixed data sets
-
James Henderson, Oliver Lemon, and Kallirroi Georgila. Hybrid reinforcement/supervised learning of dialogue policies from fixed data sets. Computational Linguistics, 2008.
-
(2008)
Computational Linguistics
-
-
Henderson, J.1
Lemon, O.2
Georgila, K.3
-
8
-
-
79959813974
-
Natural Belief-Critic: A reinforcement algorithm for parameter estimation in statistical spoken dialogue systems
-
Filip Jurcicek, Blaise Thomson, Simon Keizer, Milica Gasic, François Mairesse, Kai Yu, and Steve Young. Natural Belief-Critic: a reinforcement algorithm for parameter estimation in statistical spoken dialogue systems. In Interspeech'10, Makuhari (Japan), 2010.
-
Interspeech'10, Makuhari (Japan), 2010
-
-
Jurcicek, F.1
Thomson, B.2
Keizer, S.3
Gasic, M.4
Mairesse, F.5
Yu, K.6
Young, S.7
-
9
-
-
85024429815
-
A new approach to linear filtering and prediction problems
-
Series D
-
Rudolf Kalman. A new approach to linear filtering and prediction problems. Transactions of the ASME-Journal of Basic Engineering, 82(Series D):35-45, 1960.
-
(1960)
Transactions of the ASME-Journal of Basic Engineering
, vol.82
, pp. 35-45
-
-
Kalman, R.1
-
11
-
-
85009087667
-
Information state and dialogue management in the TRINDI dialogue move engine toolkit
-
Staffan Larsson and David R. Traum. Information state and dialogue management in the TRINDI dialogue move engine toolkit. Natural Language Engineering, 2000.
-
(2000)
Natural Language Engineering
-
-
Larsson, S.1
Traum, D.R.2
-
12
-
-
84893350028
-
An ISU dialogue system exhibiting reinforcement learning of dialogue policies: Generic slot-filling in the TALK in-car system
-
Oliver Lemon, Kalliroi Georgila, James Henderson, and Matthew Stuttle. An ISU dialogue system exhibiting reinforcement learning of dialogue policies: generic slot-filling in the TALK in-car system. In EACL'06, Morristown, NJ, USA, 2006.
-
EACL'06, Morristown, NJ, USA, 2006
-
-
Lemon, O.1
Georgila, K.2
Henderson, J.3
Stuttle, M.4
-
13
-
-
0031624616
-
Using markov decision process for learning dialogue strategies
-
Esther Levin and Roberto Pieraccini. Using markov decision process for learning dialogue strategies. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP'98), Seattle, Washington, 1998.
-
Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP'98), Seattle, Washington, 1998
-
-
Levin, E.1
Pieraccini, R.2
-
14
-
-
0033894474
-
Stochastic model of human-machine interaction for learning dialog strategies
-
DOI 10.1109/89.817450
-
Esther Levin, Roberto Pieraccini, and Wieland Eckert. A stochastic model of human-machine interaction for learning dialog strategies. IEEE Transactions on Speech and Audio Processing, 8(1):11-23, 2000. (Pubitemid 30540744)
-
(2000)
IEEE Transactions on Speech and Audio Processing
, vol.8
, Issue.1
, pp. 11-23
-
-
Levin, E.1
Pieraccini, R.2
Eckert, W.3
-
15
-
-
70450186275
-
Reinforcement Learning for Dialog Management using Least-Squares Policy Iteration and Fast Feature Selection
-
Lihong Li, Suhrid Balakrishnan, and Jason Williams. Reinforcement Learning for Dialog Management using Least-Squares Policy Iteration and Fast Feature Selection. In InterSpeech'09, Brighton (UK), 2009.
-
InterSpeech'09, Brighton (UK), 2009
-
-
Li, L.1
Balakrishnan, S.2
Williams, J.3
-
16
-
-
33750253118
-
A probabilistic framework for dialog simulation and optimal strategy learning
-
DOI 10.1109/TSA.2005.855836
-
O. Pietquin and T. Dutoit. A probabilistic framework for dialog simulation and optimal strategy learning. IEEE Transactions on Audio, Speech and Language Processing, 14(2):589-599, 2006. (Pubitemid 46405357)
-
(2006)
IEEE Transactions on Audio, Speech and Language Processing
, vol.14
, Issue.2
, pp. 589-599
-
-
Pietquin, O.1
Dutoit, T.2
-
18
-
-
33846257740
-
Effects of the user model on simulation-based learning of dialogue strategies
-
Jost Schatzmann, Matthew N. Stuttle, Karl Weilhammer, and Steve Young. Effects of the user model on simulation-based learning of dialogue strategies. In ASRU'05, San Juan, Puerto Rico, 2005.
-
ASRU'05, San Juan, Puerto Rico, 2005
-
-
Schatzmann, J.1
Stuttle, M.N.2
Weilhammer, K.3
Young, S.4
-
19
-
-
33747607273
-
A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies
-
June
-
Jost Schatzmann, Karl Weilhammer, Matt Stuttle, and Steve Young. A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies. The Knowledge Engineering Review, 21(2):97-126, June 2006.
-
(2006)
The Knowledge Engineering Review
, vol.21
, Issue.2
, pp. 97-126
-
-
Schatzmann, J.1
Weilhammer, K.2
Stuttle, M.3
Young, S.4
-
20
-
-
84898955256
-
Reinforcement learning for spoken dialogue systems
-
S. Singh, M. Kearns, D. Litman, and M.Walker. Reinforcement learning for spoken dialogue systems. In NIPS'99, Denver, USA, 1999.
-
NIPS'99, Denver, USA, 1999
-
-
Singh, S.1
Kearns, M.2
Litman, D.3
Walker, M.4
-
22
-
-
85065183198
-
PARADISE: A framework for evaluating spoken dialogue agents
-
M. Walker, D. Litman, C. Kamm, and A. Abella. PARADISE: A framework for evaluating spoken dialogue agents. In ACL'97, Madrid (Spain), 1997.
-
ACL'97, Madrid (Spain), 1997
-
-
Walker, M.1
Litman, D.2
Kamm, C.3
Abella, A.4
-
23
-
-
33750703175
-
Partially observable Markov decision processes for spoken dialog systems
-
Jason Williams and Steve Young. Partially observable Markov decision processes for spoken dialog systems. Computer Speech and Language, 21(2):231-422, 2007.
-
(2007)
Computer Speech and Language
, vol.21
, Issue.2
, pp. 231-422
-
-
Williams, J.1
Young, S.2
|