Volume , Issue , 2017, Pages

An actor-critic algorithm for sequence prediction

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL LINGUISTICS; COMPUTER AIDED LANGUAGE TRANSLATION; MACHINE LEARNING; MODELING LANGUAGES; NATURAL LANGUAGE PROCESSING SYSTEMS;

EID: 85088229482     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (346)

References (47)
  • 1
    • 85083953689
    • Neural machine translation by jointly learning to align and translate
    • Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. Neural machine translation by jointly learning to align and translate. In Proceedings of the ICLR 2015, 2015.
    • (2015) Proceedings of the ICLR 2015
    • Bahdanau, D.1    Cho, K.2    Bengio, Y.3
  • 8
    • 84986286501
    • Attention-based models for speech recognition
    • abs/1506.07503
    • Jan Chorowski, Dzmitry Bahdanau, Dmitriy Serdyuk, KyungHyun Cho, and Yoshua Bengio. Attention-based models for speech recognition. CoRR, abs/1506.07503, 2015. URL http://arxiv.org/abs/1506.07503.
    • (2015) CoRR
    • Chorowski, J.1    Bahdanau, D.2    Serdyuk, D.3    Cho, K.4    Bengio, Y.5
  • 10
    • 67349244372
    • Search-based structured prediction
    • Hal Daumé III, John Langford, and Daniel Marcu. Search-based structured prediction. Machine learning, 75(3):297-325, 2009.
    • (2009) Machine Learning , vol.75 , Issue.3 , pp. 297-325
    • Daumé III, H.1    Langford, J.2    Marcu, D.3
  • 12
    • 0001492251
    • Minimum Bayes-risk automatic speech recognition
    • Vaibhava Goel and William J Byrne. Minimum Bayes-risk automatic speech recognition. Computer Speech & Language, 14(2):115-135, 2000.
    • (2000) Computer Speech & Language , vol.14 , Issue.2 , pp. 115-135
    • Goel, V.1    Byrne, W.J.2
  • 21
    • 72449136767
    • Structured prediction with reinforcement learning
    • Francis Maes, Ludovic Denoyer, and Patrick Gallinari. Structured prediction with reinforcement learning. Machine learning, 77(2-3):271-301, 2009.
    • (2009) Machine Learning , vol.77 , Issue.2-3 , pp. 271-301
    • Maes, F.1    Denoyer, L.2    Gallinari, P.3
  • 24
    • 0141596576
    • Policy invariance under reward transformations: Theory and application to reward shaping
    • Andrew Y Ng, Daishi Harada, and Stuart Russell. Policy invariance under reward transformations: Theory and application to reward shaping. In ICML, volume 99, pp. 278-287, 1999.
    • (1999) ICML , vol.99 , pp. 278-287
    • Ng, A.Y.1    Harada, D.2    Russell, S.3
  • 25
    • 84944098666
    • Minimum error rate training in statistical machine translation
    • Association for Computational Linguistics
    • Franz Josef Och. Minimum error rate training in statistical machine translation. In Proceedings of the 41st Annual Meeting on Association for Computational Linguistics-Volume 1, pp. 160-167. Association for Computational Linguistics, 2003.
    • (2003) Proceedings of the 41st Annual Meeting on Association for Computational Linguistics , vol.1 , pp. 160-167
    • Och, F.J.1
  • 33
    • 33847202724
    • Learning to predict by the methods of temporal differences
    • Richard S Sutton. Learning to predict by the methods of temporal differences. Machine learning, 3(1):9-44, 1988.
    • (1988) Machine Learning , vol.3 , Issue.1 , pp. 9-44
    • Sutton, R.S.1
  • 35
    • 84898939480
    • Policy gradient methods for reinforcement learning with function approximation
    • Richard S Sutton, David A McAllester, Satinder P Singh, Yishay Mansour, et al. Policy gradient methods for reinforcement learning with function approximation. In NIPS, volume 99, pp. 1057-1063, 1999.
    • (1999) NIPS , vol.99 , pp. 1057-1063
    • Sutton, R.S.1    McAllester, D.A.2    Singh, S.P.3    Mansour, Y.4
  • 37
    • 0000985504
    • TD-Gammon, a self-teaching backgammon program, achieves master-level play
    • Gerald Tesauro. TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural computation, 6(2):215-219, 1994.
    • (1994) Neural Computation , vol.6 , Issue.2 , pp. 215-219
    • Tesauro, G.1
  • 39
    • 0031143730
    • An analysis of temporal-difference learning with function approximation
    • John N Tsitsiklis and Benjamin Van Roy. An analysis of temporal-difference learning with function approximation. Automatic Control, IEEE Transactions on, 42(5):674-690, 1997.
    • (1997) Automatic Control, IEEE Transactions on , vol.42 , Issue.5 , pp. 674-690
    • Tsitsiklis, J.N.1    Van Roy, B.2
  • 42
    • 85017437235
    • An investigation of imitation learning algorithms for structured prediction
    • Citeseer
    • Andreas Vlachos. An investigation of imitation learning algorithms for structured prediction. In EWRL, pp. 143-154. Citeseer, 2012.
    • (2012) EWRL , pp. 143-154
    • Vlachos, A.1
  • 43
    • 0000337576
    • Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • Ronald J Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine learning, 8(3-4):229-256, 1992.
    • (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 229-256
    • Williams, R.J.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.