메뉴 건너뛰기




Volumn , Issue , 2014, Pages 4908-4912

Efficient lattice rescoring using recurrent neural network language models

Author keywords

language model; recurrent neural network; speech recognition

Indexed keywords

COMPUTATIONAL LINGUISTICS; RECURRENT NEURAL NETWORKS; SIGNAL PROCESSING;

EID: 84905240726     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2014.6854535     Document Type: Conference Paper
Times cited : (95)

References (28)
  • 1
    • 84890469222 scopus 로고    scopus 로고
    • Converting neural network language models into back-off language models for efficient decoding in automatic speech recognition
    • Vancouver, Canada, 2013
    • E. Arisoy, S. F. Chen, B. Ramabhadran, and A. Sethy (2013), "Converting neural network language models into back-off language models for efficient decoding in automatic speech recognition, " in Proc. ICASSP, Vancouver, Canada, 2013, pp. 8242-8246.
    • (2013) Proc. ICASSP , pp. 8242-8246
    • Arisoy, E.1    Chen, S.F.2    Ramabhadran, B.3    Sethy, A.4
  • 3
    • 44949090835 scopus 로고    scopus 로고
    • Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures
    • Edmonton, Canada, 2003
    • I. Bulyko, M. Ostendorf, and A. Stolcke (2003), "Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures, " in Proc. HLT, Edmonton, Canada, 2003.
    • (2003) Proc. HLT
    • Bulyko, I.1    Ostendorf, M.2    Stolcke, A.3
  • 4
    • 80051613991 scopus 로고    scopus 로고
    • Variational approximation of long-span language models for LVCSR
    • Prague, Czech Republic, 2011
    • A. Deoras, T. Mikolov, S. Kombrink, M. Karafiat, and S. Khudanpur (2011), "Variational approximation of long-span language models for LVCSR, " in Proc. ICASSP, Prague, Czech Republic, 2011, pp. 5532-5535.
    • (2011) Proc. ICASSP , pp. 5532-5535
    • Deoras, A.1    Mikolov, T.2    Kombrink, S.3    Karafiat, M.4    Khudanpur, S.5
  • 5
    • 80053284315 scopus 로고    scopus 로고
    • A fast re-scoring strategy to capture long-distance dependencies
    • Edinburgh, UK, 2011
    • A. Deoras, T. Mikolov and K. Church (2011), "A fast re-scoring strategy to capture long-distance dependencies", Proc. EMNLP, Edinburgh, UK, 2011, pp. 1116-1127.
    • (2011) Proc. EMNLP , pp. 1116-1127
    • Deoras, A.1    Mikolov, T.2    Church, K.3
  • 6
    • 84870293590 scopus 로고    scopus 로고
    • Approximate inference: A sampling based modeling technique to capture complex dependencies in a language model
    • January 2013
    • A. Deoras, T. Mikolov, S. Kombrink, and K. Church (2013), "Approximate inference: A sampling based modeling technique to capture complex dependencies in a language model, " Speech Communication, vol. 55, no. 1, pp. 162-177, January 2013.
    • (2013) Speech Communication , vol.55 , Issue.1 , pp. 162-177
    • Deoras, A.1    Mikolov, T.2    Kombrink, S.3    Church, K.4
  • 7
    • 44849092930 scopus 로고    scopus 로고
    • Empirical study of neural network language models for Arabic speech recognition
    • Kyoto, Japan, 2007
    • A. Emami and L. Mangu (2007), "Empirical study of neural network language models for Arabic speech recognition, " in Proc. ASRU, Kyoto, Japan, 2007, pp. 147-152.
    • (2007) Proc. ASRU , pp. 147-152
    • Emami, A.1    Mangu, L.2
  • 8
    • 4544253834 scopus 로고    scopus 로고
    • Posterior probability decoding, confidence estimation and system combination
    • College Park, MD, 2000
    • G. Evermann and P. C. Woodland (2000), "Posterior probability decoding, confidence estimation and system combination, " in Proc. Speech Transcription Workshop, College Park, MD, 2000.
    • (2000) Proc. Speech Transcription Workshop
    • Evermann, G.1    Woodland, P.C.2
  • 9
    • 33645760470 scopus 로고    scopus 로고
    • Training LVCSR systems on thousands of hours of data
    • Philadelphia, PA, 2005
    • G. Evermann, H. Y. Chan, M. J. F. Gales, B. Jia, D. Mrva, P. C.Woodland, and K. Yu (2005), "Training LVCSR systems on thousands of hours of data, " in Proc. ICASSP, Philadelphia, PA, 2005, vol. 1, pp. 209-212.
    • (2005) Proc. ICASSP , vol.1 , pp. 209-212
    • Evermann, G.1    Chan, H.Y.2    Gales, M.J.F.3    Jia, B.4    Mrva, D.5    Woodland, P.C.6    Yu, K.7
  • 10
    • 84878381641 scopus 로고    scopus 로고
    • Conversion of recurrent neural network language models to weighted finite state transducers for automatic speech recognition
    • Portland, OR, 2012
    • G. Lecorve and P. Motlicek (2012), "Conversion of recurrent neural network language models to weighted finite state transducers for automatic speech recognition, " in Proc. ISCA Interspeech, Portland, OR, 2012.
    • (2012) Proc. ISCA Interspeech
    • Lecorve, G.1    Motlicek, P.2
  • 12
    • 78049379945 scopus 로고    scopus 로고
    • Language model combination and adaptation using weighted finite state transducers
    • Dallas, TX, 2010
    • X. Liu, M. J. F. Gales, J. L. Hieronymus, and P. C. Woodland (2010), "Language model combination and adaptation using weighted finite state transducers, " in Proc. ICASSP, Dallas, TX, 2010, pp. 5390-5393.
    • (2010) Proc. ICASSP , pp. 5390-5393
    • Liu, X.1    Gales, M.J.F.2    Hieronymus, J.L.3    Woodland, P.C.4
  • 13
    • 84867332205 scopus 로고    scopus 로고
    • Use of contexts in language model interpolation and adaptation
    • January 2013
    • X. Liu, M. J. F. Gales, and P. C. Woodland (2013), "Use of contexts in language model interpolation and adaptation, " Computer Speech &Language, vol. 27, no. 1, pp. 301-321, January 2013.
    • (2013) Computer Speech &Language , vol.27 , Issue.1 , pp. 301-321
    • Liu, X.1    Gales, M.J.F.2    Woodland, P.C.3
  • 15
    • 80051643236 scopus 로고    scopus 로고
    • Extensions of recurrent neural network language model
    • Prague, Czech Republic, 2011
    • T. Mikolov, S. Kombrink, L. Burget, J. H. Cernocky, and S. Khudanpur (2011), "Extensions of recurrent neural network language model, " in Proc. ICASSP, Prague, Czech Republic, 2011, pp. 5528-5531.
    • (2011) Proc. ICASSP , pp. 5528-5531
    • Mikolov, T.1    Kombrink, S.2    Burget, L.3    Cernocky, J.H.4    Khudanpur, S.5
  • 18
    • 0348198473 scopus 로고    scopus 로고
    • Finite-state transducers in language and speech processing
    • 1997
    • M. Mohri (1997), "Finite-state transducers in language and speech processing, " Computational linguistics, vol. 23, no. 2, pp. 269-311, 1997.
    • (1997) Computational Linguistics , vol.23 , Issue.2 , pp. 269-311
    • Mohri, M.1
  • 19
    • 0029725456 scopus 로고    scopus 로고
    • A variable-length category-based n-gram language model
    • Atlanta, GA, 1996
    • T. R. Niesler and P. C. Woodland (1996), "A variable-length category-based n-gram language model, " in Proc. ICASSP, Atlanta, GA, 1996, vol. 1, pp. 164-167.
    • (1996) Proc. ICASSP , vol.1 , pp. 164-167
    • Niesler, T.R.1    Woodland, P.C.2
  • 20
    • 85032751521 scopus 로고    scopus 로고
    • Dynamic programming search for continuous speech recognition
    • 1999
    • H. Ney and S. Ortmanns (1999), "Dynamic programming search for continuous speech recognition, " IEEE Signal Processing Magazine, vol. 16, no. 5, pp. 64-83, 1999.
    • (1999) IEEE Signal Processing Magazine , vol.16 , Issue.5 , pp. 64-83
    • Ney, H.1    Ortmanns, S.2
  • 21
    • 0001889147 scopus 로고
    • A one pass decoder design for large vocabulary recognition
    • Stroudsburg, PA, 1994
    • J. J. Odell, V. Valtchev, P. C. Woodland, and S. J. Young (1994), "A one pass decoder design for large vocabulary recognition, " in Proc. HLT, Stroudsburg, PA, 1994, pp. 405-410.
    • (1994) Proc. HLT , pp. 405-410
    • Odell, J.J.1    Valtchev, V.2    Woodland, P.C.3    Young, S.J.4
  • 22
    • 79959850026 scopus 로고    scopus 로고
    • Improved neural network based language modelling and adaptation
    • Makuhari, Japan, 2010
    • J. Park, X. Liu, M. J. F. Gales, and P. C. Woodland (2010), "Improved neural network based language modelling and adaptation, " in Proc. ISCA Interspeech, Makuhari, Japan, 2010, pp. 1041-1044.
    • (2010) Proc. ISCA Interspeech , pp. 1041-1044
    • Park, J.1    Liu, X.2    Gales, M.J.F.3    Woodland, P.C.4
  • 23
    • 0022471098 scopus 로고
    • Learning representations by back-propagating errors
    • D. E. Rumelhart, G. E. Hintont, and R. J. Williams (1986), "Learning representations by back-propagating errors, " Nature, vol. 323, no. 6088, pp. 533-536, 1986.
    • (1986) Nature , vol.323 , Issue.6088 , pp. 533-536
    • Rumelhart, D.E.1    Hintont, G.E.2    Williams, R.J.3
  • 24
    • 33847610331 scopus 로고    scopus 로고
    • Continuous space language models
    • H. Schwenk (2007), "Continuous space language models, " Computer Speech &Language, vol. 21, no. 3, pp. 492-518, 2007.
    • (2007) Computer Speech &Language , vol.21 , Issue.3 , pp. 492-518
    • Schwenk, H.1
  • 25
    • 84906240855 scopus 로고    scopus 로고
    • Prefix tree based n-best list re-scoring for recurrent neural network language model used in speech recognition system
    • Lyon, France, 2013
    • Y. Si, Q. Zhang, T. Li, J. Pan, and Y. Yan (2013), "Prefix tree based n-best list re-scoring for recurrent neural network language model used in speech recognition system, " in Proc. ISCA Interspeech, Lyon, France, 2013, pp. 3419-3423.
    • (2013) Proc. ISCA Interspeech , pp. 3419-3423
    • Si, Y.1    Zhang, Q.2    Li, T.3    Pan, J.4    Yan, Y.5
  • 27
  • 28
    • 84890480734 scopus 로고    scopus 로고
    • Comparison of feedforward and recurrent neural network language models
    • Vancouver, Canada, 2013
    • M. Sundermeyer, I. Oparin, J. L. Gauvain, B. Freiberg, R. Schluter, and H. Ney (2013), "Comparison of feedforward and recurrent neural network language models, " in Proc. ICASSP, Vancouver, Canada, 2013, pp. 8430-8434
    • (2013) Proc. ICASSP , pp. 8430-8434
    • Sundermeyer, M.1    Oparin, I.2    Gauvain, J.L.3    Freiberg, B.4    Schluter, R.5    Ney, H.6


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.