SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2014, Pages 2635-2639

One billion word benchmark for measuring progress in statistical language modeling

(7) Chelba, Ciprian a Mikolov, Tomas a Schuster, Mike a Ge, Qi a Brants, Thorsten a Koehn, Phillipp b Robinson, Tony c

a GOOGLE INC (United States)

b UNIVERSITY OF EDINBURGH (United Kingdom)

c Cantab Research Ltd (United Kingdom)

Author keywords

Benchmark; Language modeling; Reproducible research

Indexed keywords

BENCHMARKING; NATURAL LANGUAGE PROCESSING SYSTEMS; RECURRENT NEURAL NETWORKS; SPEECH COMMUNICATION;

CROSS ENTROPY; LANGUAGE MODEL; N-GRAM MODELS; REPRODUCIBLE RESEARCH; STATISTICAL LANGUAGE MODELING; TRAINING DATA;

COMPUTATIONAL LINGUISTICS;

EID: 84910091099 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (283)

References (32)

1
- 0142166851
- A neural probabilistic language model
- Bengio et al. 2003
- [Bengio et al., 2003] Y. Bengio, R. Ducharme, and P. Vincent. 2003. A neural probabilistic language model. Journal of Machine Learning Research, 3:1137-1155.
- (2003) Journal of Machine Learning Research , vol.3 , pp. 1137-1155
- Bengio, Y.¹ Ducharme, R.² Vincent, P.³

2
- 80053375619
- Large language models in machine translation
- Brants et al. 2007
- [Brants et al., 2007] T. Brants, A. C. Popat, P. Xu, F. J. Och, and J. Dean. 2007. Large language models in machine translation. In Proceedings of EMNLP.
- (2007) Proceedings of EMNLP
- Brants, T.¹ Popat, A.C.² Xu, P.³ Och, F.J.⁴ Dean, J.⁵

3
- 85022919385
- Class-based n-gram models of natural language
- Brown et al. 1992
- [Brown et al., 1992] P. F. Brown, P. V. deSouza, R. L. Mercer, V. J. Della Pietra, and J. C. Lai. 1992. Class-Based N-gram Models of Natural Language. Computational Linguistics, 18, 467-479.
- (1992) Computational Linguistics , vol.18 , pp. 467-479
- Brown, P.F.¹ Desouza, P.V.² Mercer, R.L.³ Della Pietra, V.J.⁴ Lai, J.C.⁵

4
- 26444565569
- Finding structure in time
- Elman, 1990
- [Elman, 1990] J. Elman. 1990. Finding Structure in Time. Cognitive Science, 14, 179-211.
- (1990) Cognitive Science , vol.14 , pp. 179-211
- Elman, J.¹

5
- 85055309630
- Emami, 2006 Ph.D. thesis, Johns Hopkins University
- [Emami, 2006] A. Emami. 2006. A Neural Syntactic Language Model. Ph.D. thesis, Johns Hopkins University.
- (2006) A Neural Syntactic Language Model
- Emami, A.¹

6
- 0012356157
- A bit of progress in language modeling, extended version
- Goodman, 2001a
- [Goodman, 2001a] J. T. Goodman. 2001a. A bit of progress in language modeling, extended version. Technical report MSR-TR- 2001-72.
- (2001) Technical Report MSR-TR
- Goodman, J.T.¹

7
- 0034856455
- Classes for fast maximum entropy training
- Goodman, 2001b
- [Goodman, 2001b] J. T. Goodman. 2001b. Classes for fast maximum entropy training. In Proceedings of ICASSP.
- (2001) Proceedings of ICASSP
- Goodman, J.T.¹

8
- 0012357341
- A dynamic language model for speech recognition
- Jelinek et al. 1991
- [Jelinek et al., 1991] F. Jelinek, B. Merialdo, S. Roukos, and M. Strauss. 1991. A Dynamic Language Model for Speech Recognition. In Proceedings of the DARPA Workshop on Speech and Natural Language.
- (1991) Proceedings of the DARPA Workshop on Speech and Natural Language
- Jelinek, F.¹ Merialdo, B.² Roukos, S.³ Strauss, M.⁴

9
- 0034295822
- Structured language modeling
- Chelba and Jelinek, 2000
- [Chelba and Jelinek, 2000] C. Chelba and F. Jelinek. 2000. Structured language modeling. Computer Speech & Language.
- (2000) Computer Speech & Language
- Chelba, C.¹ Jelinek, F.²

10
- 79959818347
- Study on interaction between entropy pruning and kneser-ney smoothing
- Chelba et al. 2010
- [Chelba et al., 2010] C. Chelba, T. Brants, W. Neveitt, and P. Xu. 2010. Study on Interaction between Entropy Pruning and Kneser-Ney Smoothing. In Proceedings of Interspeech.
- (2010) Proceedings of Interspeech
- Chelba, C.¹ Brants, T.² Neveitt, W.³ Xu, P.⁴

11
- 85024115120
- An empirical study of smoothing techniques for language modeling
- Chen and Goodman, 1996
- [Chen and Goodman, 1996] S. F. Chen and J. T. Goodman. 1996. An empirical study of smoothing techniques for language modeling. In Proceedings of ACL.
- (1996) Proceedings of ACL
- Chen, S.F.¹ Goodman, J.T.²

12
- 84863387613
- Shrinking exponential language models
- Chen, 2009
- [Chen, 2009] S. F. Chen. 2009. Shrinking exponential language models. In Proceedings of NAACL-HLT.
- (2009) Proceedings of NAACL-HLT
- Chen, S.F.¹

13
- 0023312404
- Estimation of probabilities from sparse data for the language model component of a speech recognizer
- Katz, 1995
- [Katz, 1995] S. Katz. 1987. Estimation of probabilities from sparse data for the language model component of a speech recognizer. In IEEE Transactions on Acoustics, Speech and Signal Processing.
- (1987) IEEE Transactions on Acoustics, Speech and Signal Processing
- Katz, S.¹

14
- 0028996876
- Improved backing-off for M-gram language modeling
- Kneser and Ney, 1995
- [Kneser and Ney, 1995] R. Kneser and H. Ney. 1995. Improved Backing-Off For M-Gram Language Modeling. In Proceedings of ICASSP.
- (1995) Proceedings of ICASSP
- Kneser, R.¹ Ney, H.²

15
- 79959829092
- Recurrent neural network based language model
- Mikolov et al. 2010
- [Mikolov et al., 2010] T. Mikolov, M. Karafiát, L. Burget, J. Cě rnocký, and S. Khudanpur. 2010. Recurrent neural network based language model. In Proceedings of Interspeech.
- (2010) Proceedings of Interspeech
- Mikolov, T.¹ Karafiát, M.² Burget, L.³ Rnocký, J.C.⁴ Khudanpur, S.⁵

16
- 80051643236
- Extensions of recurrent neural network language model
- Mikolov et al. 2011a
- [Mikolov et al., 2011a] T. Mikolov, S. Kombrink, L. Burget, J. Cěrnocky, and S. Khudanpur. 2011. Extensions of Recurrent Neural Network Language Model. In Proceedings of ICASSP.
- (2011) Proceedings of ICASSP
- Mikolov, T.¹ Kombrink, S.² Burget, L.³ Cěrnocky, J.⁴ Khudanpur, S.⁵

17
- 84865803833
- Empirical evaluation and combination of advanced language modeling techniques
- Mikolov et al. 2011b
- [Mikolov et al., 2011b] T. Mikolov, A. Deoras, S. Kombrink, L. Burget, and J. Cěrnocky. 2011a. Empirical Evaluation and Combination of Advanced Language Modeling Techniques. In Proceedings of Interspeech.
- (2011) Proceedings of Interspeech
- Mikolov, T.¹ Deoras, A.² Kombrink, S.³ Burget, L.⁴ Cěrnocky, J.⁵

18
- 84858966958
- Strategies for training large scale neural network language models
- Mikolov et al. 2011c
- [Mikolov et al., 2011c] T. Mikolov, A. Deoras, D. Povey, L. Burget, and J. Cěrnocky. 2011b. Strategies for Training Large Scale Neural Network Language Models. In Proceedings of ASRU.
- (2011) Proceedings of ASRU
- Mikolov, T.¹ Deoras, A.² Povey, D.³ Burget, L.⁴ Cěrnocky, J.⁵

19
- 84874250121
- Mikolov, 2012], . Ph.D. thesis, Brno University of Technology
- [Mikolov, 2012] T. Mikolov. 2012. Statistical Language Models based on Neural Networks. Ph.D. thesis, Brno University of Technology.
- (2012) Statistical Language Models Based on Neural Networks
- Mikolov, T.¹

20
- 34547970628
- Three new graphical models for statistical language modelling
- Mnih and Hinton, 2007
- [Mnih and Hinton, 2007] A. Mnih and G. Hinton. 2007. Three new graphical models for statistical language modelling. In Proceedings of ICML.
- (2007) Proceedings of ICML
- Mnih, A.¹ Hinton, G.²

21
- 34547997987
- Hierarchical probabilistic neural network language model
- Morin and Bengio, 2005
- [Morin and Bengio, 2005] F. Morin and Y. Bengio. 2005. Hierarchical Probabilistic Neural Network Language Model. In Proceedings of AISTATS.
- (2005) Proceedings of AISTATS
- Morin, F.¹ Bengio, Y.²

22
- 85149106909
- Discriminative language modeling with conditional random fields and the perceptron algorithm
- Roark et al. 2004
- [Roark et al., 2004] B. Roark, M. Saralar, M. Collins, and M. Johnson. 2004. Discriminative language modeling with conditional random fields and the perceptron algorithm. In Proceedings of ACL.
- (2004) Proceedings of ACL
- Roark, B.¹ Saralar, M.² Collins, M.³ Johnson, M.⁴

23
- 0003904645
- [Rosenfeld, 1994], . Ph.D. thesis, Carnegie Mellon University
- [Rosenfeld, 1994] R. Rosenfeld. 1994. Adaptive Statistical Language Modeling: A Maximum Entropy Approach. Ph.D. thesis, Carnegie Mellon University.
- (1994) Adaptive Statistical Language Modeling: A Maximum Entropy Approach
- Rosenfeld, R.¹

24
- 0022471098
- Learning internal representations by backpropagating errors
- Rumelhart et al. 1986
- [Rumelhart et al., 1986] D. E. Rumelhart, G. E. Hinton, and R. J. Williams. 1986. Learning internal representations by backpropagating errors. Nature, 323:533-536.
- (1986) Nature , vol.323 , pp. 533-536
- Rumelhart, D.E.¹ Hinton, G.E.² Williams, R.J.³

25
- 33847610331
- Continuous space language models
- Schwenk, 2007
- [Schwenk, 2007] H. Schwenk. 2007. Continuous space language models. Computer Speech and Language, vol. 21.
- (2007) Computer Speech and Language , vol.21
- Schwenk, H.¹

26
- 0012611072
- Entropy-based pruning of backoff language models
- Stolcke, 1998
- [Stolcke, 1998] A. Stolcke. 1998. Entropy-based Pruning of Backoff Language Models. In Proceedings of News Transcription and Understanding Workshop.
- (1998) Proceedings of News Transcription and Understanding Workshop
- Stolcke, A.¹

27
- 84878402147
- LSTM neural networks for language modeling
- Sundermeyer et al. 2012
- [Sundermeyer et al., 2012] M. Sundermeyer, R. Schluter, and H. Ney. 2012. LSTM Neural Networks for Language Modeling. In Proceedings of Interspeech.
- (2012) Proceedings of Interspeech
- Sundermeyer, M.¹ Schluter, R.² Ney, H.³

28
- 38049151407
- A hierarchical bayesian language model based on pitman yor processes
- Teh, 2006
- [Teh, 2006] Y.W. Teh. 2006. A hierarchical Bayesian language model based on Pitman Yor processes. In Proceedings of Coling/ACL.
- (2006) Proceedings of Coling/ACL
- Teh, Y.W.¹

29
- 84910084714
- Factored recurrent neural network language model in TED lecture transcription
- Wu et al. 2012
- [Wu et al., 2012] Y. Wu, H. Yamamoto, X. Lu, S. Matsuda, C. Hori, and H. Kashioka. 2012. Factored Recurrent Neural Network Language Model in TED Lecture Transcription. In Proceedings of IWSLT.
- (2012) Proceedings of IWSLT
- Wu, Y.¹ Yamamoto, H.² Lu, X.³ Matsuda, S.⁴ Hori, C.⁵ Kashioka, H.⁶

30
- 85055307672
- [Xu, 2005], .eling. Ph.D. thesis, Johns Hopkins University
- [Xu, 2005] Peng Xu. 2005. Random forests and the data sparseness problem in language modeling. Ph.D. thesis, Johns Hopkins University.
- (2005) Random Forests and the Data Sparseness Problem in Language Mod
- Xu, P.¹

31
- 80053246370
- Efficient subsampling for training complex language models
- Xu et al. 2011
- [Xu et al., 2011] Puyang Xu, A. Gunawardana, and S. Khudanpur. 2011. Efficient Subsampling for Training Complex Language Models. In Proceedings of EMNLP.
- (2011) Proceedings of EMNLP
- Xu, P.¹ Gunawardana, A.² Khudanpur, S.³

32
- 84890477112
- Speed regularization and optimality in word classing
- Zweig and Makarychev, 2013
- [Zweig and Makarychev, 2013] G. Zweig and K. Makarychev. 2013. Speed Regularization and Optimality in Word Classing. In Proceedings of ICASSP.
- (2013) Proceedings of ICASSP
- Zweig, G.¹ Makarychev, K.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.