SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2014, Pages 4908-4912

Efficient lattice rescoring using recurrent neural network language models

(5) Liu, X a Wang, Y a Chen, X a Gales, M J F a Woodland, P C a

a UNIVERSITY OF CAMBRIDGE (United Kingdom)

Author keywords

language model; recurrent neural network; speech recognition

Indexed keywords

COMPUTATIONAL LINGUISTICS; RECURRENT NEURAL NETWORKS; SIGNAL PROCESSING;

CONFUSION NETWORKS; CONVERSATIONAL TELEPHONE SPEECH RECOGNITION; GENERALIZATION PERFORMANCE; INTRINSIC CHARACTERISTICS; LANGUAGE MODEL; RESCORING APPROACH; SPEECH RECOGNITION SYSTEMS; VECTOR REPRESENTATIONS;

SPEECH RECOGNITION;

EID: 84905240726 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2014.6854535 Document Type: Conference Paper

Times cited : (95)

References (28)

1
- 84890469222
- Converting neural network language models into back-off language models for efficient decoding in automatic speech recognition
- Vancouver, Canada, 2013
- E. Arisoy, S. F. Chen, B. Ramabhadran, and A. Sethy (2013), "Converting neural network language models into back-off language models for efficient decoding in automatic speech recognition, " in Proc. ICASSP, Vancouver, Canada, 2013, pp. 8242-8246.
- (2013) Proc. ICASSP , pp. 8242-8246
- Arisoy, E.¹ Chen, S.F.² Ramabhadran, B.³ Sethy, A.⁴

2
- 0142166851
- A neural probabilistic language model
- 2003
- Y. Bengio and R. Ducharme (2003), "A neural probabilistic language model, " Journal of Machine Learning Research, vol. 3, pp. 1137-1155, 2003.
- (2003) Journal of Machine Learning Research , vol.3 , pp. 1137-1155
- Bengio, Y.¹ Ducharme, R.²

3
- 44949090835
- Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures
- Edmonton, Canada, 2003
- I. Bulyko, M. Ostendorf, and A. Stolcke (2003), "Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures, " in Proc. HLT, Edmonton, Canada, 2003.
- (2003) Proc. HLT
- Bulyko, I.¹ Ostendorf, M.² Stolcke, A.³

4
- 80051613991
- Variational approximation of long-span language models for LVCSR
- Prague, Czech Republic, 2011
- A. Deoras, T. Mikolov, S. Kombrink, M. Karafiat, and S. Khudanpur (2011), "Variational approximation of long-span language models for LVCSR, " in Proc. ICASSP, Prague, Czech Republic, 2011, pp. 5532-5535.
- (2011) Proc. ICASSP , pp. 5532-5535
- Deoras, A.¹ Mikolov, T.² Kombrink, S.³ Karafiat, M.⁴ Khudanpur, S.⁵

5
- 80053284315
- A fast re-scoring strategy to capture long-distance dependencies
- Edinburgh, UK, 2011
- A. Deoras, T. Mikolov and K. Church (2011), "A fast re-scoring strategy to capture long-distance dependencies", Proc. EMNLP, Edinburgh, UK, 2011, pp. 1116-1127.
- (2011) Proc. EMNLP , pp. 1116-1127
- Deoras, A.¹ Mikolov, T.² Church, K.³

6
- 84870293590
- Approximate inference: A sampling based modeling technique to capture complex dependencies in a language model
- January 2013
- A. Deoras, T. Mikolov, S. Kombrink, and K. Church (2013), "Approximate inference: A sampling based modeling technique to capture complex dependencies in a language model, " Speech Communication, vol. 55, no. 1, pp. 162-177, January 2013.
- (2013) Speech Communication , vol.55 , Issue.1 , pp. 162-177
- Deoras, A.¹ Mikolov, T.² Kombrink, S.³ Church, K.⁴

7
- 44849092930
- Empirical study of neural network language models for Arabic speech recognition
- Kyoto, Japan, 2007
- A. Emami and L. Mangu (2007), "Empirical study of neural network language models for Arabic speech recognition, " in Proc. ASRU, Kyoto, Japan, 2007, pp. 147-152.
- (2007) Proc. ASRU , pp. 147-152
- Emami, A.¹ Mangu, L.²

8
- 4544253834
- Posterior probability decoding, confidence estimation and system combination
- College Park, MD, 2000
- G. Evermann and P. C. Woodland (2000), "Posterior probability decoding, confidence estimation and system combination, " in Proc. Speech Transcription Workshop, College Park, MD, 2000.
- (2000) Proc. Speech Transcription Workshop
- Evermann, G.¹ Woodland, P.C.²

9
- 33645760470
- Training LVCSR systems on thousands of hours of data
- Philadelphia, PA, 2005
- G. Evermann, H. Y. Chan, M. J. F. Gales, B. Jia, D. Mrva, P. C.Woodland, and K. Yu (2005), "Training LVCSR systems on thousands of hours of data, " in Proc. ICASSP, Philadelphia, PA, 2005, vol. 1, pp. 209-212.
- (2005) Proc. ICASSP , vol.1 , pp. 209-212
- Evermann, G.¹ Chan, H.Y.² Gales, M.J.F.³ Jia, B.⁴ Mrva, D.⁵ Woodland, P.C.⁶ Yu, K.⁷

10
- 84878381641
- Conversion of recurrent neural network language models to weighted finite state transducers for automatic speech recognition
- Portland, OR, 2012
- G. Lecorve and P. Motlicek (2012), "Conversion of recurrent neural network language models to weighted finite state transducers for automatic speech recognition, " in Proc. ISCA Interspeech, Portland, OR, 2012.
- (2012) Proc. ISCA Interspeech
- Lecorve, G.¹ Motlicek, P.²

11
- 84869479578
- Structured output layer neural network language models for speech recognition
- 2013
- H.-S. Le, I. Oparin, A. Allauzen, J. Gauvain, and F. Yvon (2013), "Structured output layer neural network language models for speech recognition, " IEEE Transactions on Audio, Speech and Language Processing, vol. 21, no. 1, pp. 197-206, 2013.
- (2013) IEEE Transactions on Audio, Speech and Language Processing , vol.21 , Issue.1 , pp. 197-206
- Le, H.-S.¹ Oparin, I.² Allauzen, A.³ Gauvain, J.⁴ Yvon, F.⁵

12
- 78049379945
- Language model combination and adaptation using weighted finite state transducers
- Dallas, TX, 2010
- X. Liu, M. J. F. Gales, J. L. Hieronymus, and P. C. Woodland (2010), "Language model combination and adaptation using weighted finite state transducers, " in Proc. ICASSP, Dallas, TX, 2010, pp. 5390-5393.
- (2010) Proc. ICASSP , pp. 5390-5393
- Liu, X.¹ Gales, M.J.F.² Hieronymus, J.L.³ Woodland, P.C.⁴

13
- 84867332205
- Use of contexts in language model interpolation and adaptation
- January 2013
- X. Liu, M. J. F. Gales, and P. C. Woodland (2013), "Use of contexts in language model interpolation and adaptation, " Computer Speech &Language, vol. 27, no. 1, pp. 301-321, January 2013.
- (2013) Computer Speech &Language , vol.27 , Issue.1 , pp. 301-321
- Liu, X.¹ Gales, M.J.F.² Woodland, P.C.³

14
- 79959829092
- Recurrent neural network based language model
- Makuhari, Japan, 2010
- T. Mikolov, M. Karafiat, L. Burget, J. Cernocky, and S. Khudanpur (2010), "Recurrent neural network based language model, " in Proc. ISCA Interspeech, Makuhari, Japan, 2010, pp. 1045-1048.
- (2010) Proc. ISCA Interspeech , pp. 1045-1048
- Mikolov, T.¹ Karafiat, M.² Burget, L.³ Cernocky, J.⁴ Khudanpur, S.⁵

15
- 80051643236
- Extensions of recurrent neural network language model
- Prague, Czech Republic, 2011
- T. Mikolov, S. Kombrink, L. Burget, J. H. Cernocky, and S. Khudanpur (2011), "Extensions of recurrent neural network language model, " in Proc. ICASSP, Prague, Czech Republic, 2011, pp. 5528-5531.
- (2011) Proc. ICASSP , pp. 5528-5531
- Mikolov, T.¹ Kombrink, S.² Burget, L.³ Cernocky, J.H.⁴ Khudanpur, S.⁵

16
- 84901784231
- RNNLM-Recurrent neural network language modeling toolkit
- Hawaii
- T. Mikolov, S. Kombrink, L. Burget, J. H. Cernocky and S. Khudanpur (2011), "RNNLM-Recurrent neural network language modeling toolkit", in demo session of IEEE ASRU2011, Hawaii.
- (2011) Demo Session of IEEE ASRU2011
- Mikolov, T.¹ Kombrink, S.² Burget, L.³ Cernocky, J.H.⁴ Khudanpur, S.⁵

17
- 34547997987
- Hierarchical probabilistic neural network language model
- Barbados, 2005
- F. Morin and Y. Bengio (2005), "Hierarchical probabilistic neural network language model, " in Proc. International workshop on artificial intelligence and statistics, Barbados, 2005, pp. 246-252.
- (2005) Proc. International Workshop on Artificial Intelligence and Statistics , pp. 246-252
- Morin, F.¹ Bengio, Y.²

18
- 0348198473
- Finite-state transducers in language and speech processing
- 1997
- M. Mohri (1997), "Finite-state transducers in language and speech processing, " Computational linguistics, vol. 23, no. 2, pp. 269-311, 1997.
- (1997) Computational Linguistics , vol.23 , Issue.2 , pp. 269-311
- Mohri, M.¹

19
- 0029725456
- A variable-length category-based n-gram language model
- Atlanta, GA, 1996
- T. R. Niesler and P. C. Woodland (1996), "A variable-length category-based n-gram language model, " in Proc. ICASSP, Atlanta, GA, 1996, vol. 1, pp. 164-167.
- (1996) Proc. ICASSP , vol.1 , pp. 164-167
- Niesler, T.R.¹ Woodland, P.C.²

20
- 85032751521
- Dynamic programming search for continuous speech recognition
- 1999
- H. Ney and S. Ortmanns (1999), "Dynamic programming search for continuous speech recognition, " IEEE Signal Processing Magazine, vol. 16, no. 5, pp. 64-83, 1999.
- (1999) IEEE Signal Processing Magazine , vol.16 , Issue.5 , pp. 64-83
- Ney, H.¹ Ortmanns, S.²

21
- 0001889147
- A one pass decoder design for large vocabulary recognition
- Stroudsburg, PA, 1994
- J. J. Odell, V. Valtchev, P. C. Woodland, and S. J. Young (1994), "A one pass decoder design for large vocabulary recognition, " in Proc. HLT, Stroudsburg, PA, 1994, pp. 405-410.
- (1994) Proc. HLT , pp. 405-410
- Odell, J.J.¹ Valtchev, V.² Woodland, P.C.³ Young, S.J.⁴

22
- 79959850026
- Improved neural network based language modelling and adaptation
- Makuhari, Japan, 2010
- J. Park, X. Liu, M. J. F. Gales, and P. C. Woodland (2010), "Improved neural network based language modelling and adaptation, " in Proc. ISCA Interspeech, Makuhari, Japan, 2010, pp. 1041-1044.
- (2010) Proc. ISCA Interspeech , pp. 1041-1044
- Park, J.¹ Liu, X.² Gales, M.J.F.³ Woodland, P.C.⁴

23
- 0022471098
- Learning representations by back-propagating errors
- D. E. Rumelhart, G. E. Hintont, and R. J. Williams (1986), "Learning representations by back-propagating errors, " Nature, vol. 323, no. 6088, pp. 533-536, 1986.
- (1986) Nature , vol.323 , Issue.6088 , pp. 533-536
- Rumelhart, D.E.¹ Hintont, G.E.² Williams, R.J.³

24
- 33847610331
- Continuous space language models
- H. Schwenk (2007), "Continuous space language models, " Computer Speech &Language, vol. 21, no. 3, pp. 492-518, 2007.
- (2007) Computer Speech &Language , vol.21 , Issue.3 , pp. 492-518
- Schwenk, H.¹

25
- 84906240855
- Prefix tree based n-best list re-scoring for recurrent neural network language model used in speech recognition system
- Lyon, France, 2013
- Y. Si, Q. Zhang, T. Li, J. Pan, and Y. Yan (2013), "Prefix tree based n-best list re-scoring for recurrent neural network language model used in speech recognition system, " in Proc. ISCA Interspeech, Lyon, France, 2013, pp. 3419-3423.
- (2013) Proc. ISCA Interspeech , pp. 3419-3423
- Si, Y.¹ Zhang, Q.² Li, T.³ Pan, J.⁴ Yan, Y.⁵

26
- 0012611072
- Entropy-based pruning of backoff language models
- Landsdowne, VA, 1998
- A. Stolcke (1998), "Entropy-based pruning of backoff language models, " in Proc. DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne, VA, 1998, pp. 270-274.
- (1998) Proc. DARPA Broadcast News Transcription and Understanding Workshop , pp. 270-274
- Stolcke, A.¹

27
- 84878402147
- LSTM neural networks for language modeling
- Portland, OR, 2012
- M. Sundermeyer, R. Schluter, and H. Ney (2012), "LSTM neural networks for language modeling, " in Proc. ISCA Interspeech, Portland, OR, 2012.
- (2012) Proc. ISCA Interspeech
- Sundermeyer, M.¹ Schluter, R.² Ney, H.³

28
- 84890480734
- Comparison of feedforward and recurrent neural network language models
- Vancouver, Canada, 2013
- M. Sundermeyer, I. Oparin, J. L. Gauvain, B. Freiberg, R. Schluter, and H. Ney (2013), "Comparison of feedforward and recurrent neural network language models, " in Proc. ICASSP, Vancouver, Canada, 2013, pp. 8430-8434
- (2013) Proc. ICASSP , pp. 8430-8434
- Sundermeyer, M.¹ Oparin, I.² Gauvain, J.L.³ Freiberg, B.⁴ Schluter, R.⁵ Ney, H.⁶

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.