메뉴 건너뛰기




Volumn , Issue , 2012, Pages 4989-4992

Off-policy learning in large-scale POMDP-based dialogue systems

Author keywords

Reinforcement Learning; Spoken Dialogue Systems

Indexed keywords

DIALOGUE SYSTEMS; GAUSSIAN PROCESSES; OPTIMAL POLICIES; OPTIMAL STRATEGIES; OPTIMISATIONS; PERCEPTRON; REAL-WORLD SYSTEM; SCALE-UP; SPOKEN DIALOGUE SYSTEM; STATE OF THE ART; VALUE FUNCTIONS;

EID: 84867619228     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2012.6289040     Document Type: Conference Paper
Times cited : (18)

References (19)
  • 3
    • 84867590498 scopus 로고    scopus 로고
    • A stochastic model of human-machine interaction for learning dialog strategies
    • E. Levin, R. Pieraccini, and W. Eckert, "A stochastic model of human-machine interaction for learning dialog strategies," IEEE TSAP, 2000.
    • (2000) IEEE TSAP
    • Levin, E.1    Pieraccini, R.2    Eckert, W.3
  • 4
    • 84867603402 scopus 로고    scopus 로고
    • A probabilistic framework for dialog simulation and optimal strategy learning
    • O. Pietquin and T. Dutoit, "A probabilistic framework for dialog simulation and optimal strategy learning," IEEE TSAP, 2006.
    • (2006) IEEE TSAP
    • Pietquin, O.1    Dutoit, T.2
  • 10
    • 79959813974 scopus 로고    scopus 로고
    • Natural Belief-Critic: A reinforcement algorithm for parameter estimation in statistical spoken dialogue systems
    • F. Jurcicek, B. Thomson, S. Keizer, M. Gasic, F. Mairesse, K. Yu, and S. Young, "Natural Belief-Critic: a reinforcement algorithm for parameter estimation in statistical spoken dialogue systems," in Interspeech' 10, 2010.
    • (2010) Interspeech' 10
    • Jurcicek, F.1    Thomson, B.2    Keizer, S.3    Gasic, M.4    Mairesse, F.5    Yu, K.6    Young, S.7
  • 11
    • 33846220727 scopus 로고    scopus 로고
    • Scaling up POMDPs for dialogue management: The summary POMDP method
    • J. Williams and S. Young, "Scaling up POMDPs for dialogue management: the summary POMDP method," in Proc. of ASRU, 2005.
    • Proc. of ASRU, 2005.
    • Williams, J.1    Young, S.2
  • 12
    • 78651465938 scopus 로고    scopus 로고
    • Kalman Temporal Differences
    • M. Geist and O. Pietquin, "Kalman Temporal Differences," JAIR, 2010.
    • (2010) JAIR
    • Geist, M.1    Pietquin, O.2
  • 13
    • 84881039547 scopus 로고    scopus 로고
    • Sample Efficient Online Learning of Optimal Dialogue Policies with Kalman Temporal Differences
    • O. Pietquin, M. Geist, and S. Chandramohan, "Sample Efficient Online Learning of Optimal Dialogue Policies with Kalman Temporal Differences," in Proc. of IJCAI 2011, 2011.
    • (2011) Proc. of IJCAI 2011
    • Pietquin, O.1    Geist, M.2    Chandramohan, S.3
  • 14
    • 33750703175 scopus 로고    scopus 로고
    • Partially observable Markov decision processes for spoken dialog systems
    • J. Williams and S. Young, "Partially observable Markov decision processes for spoken dialog systems," Comp. Speech and Language, 2007.
    • (2007) Comp. Speech and Language
    • Williams, J.1    Young, S.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.