메뉴 건너뛰기




Volumn 19, Issue 4, 2008, Pages 713-722

Adaptive importance sampling to accelerate training of a neural probabilistic language model

Author keywords

Energy based models; Fast training; Importance sampling; Language modeling; Monte Carlo methods; Probabilistic neural networks

Indexed keywords

APPROXIMATION THEORY; COMPUTER SIMULATION; MAXIMUM LIKELIHOOD; PROBABILITY; STATISTICAL METHODS;

EID: 42549142788     PISSN: 10459227     EISSN: None     Source Type: Journal    
DOI: 10.1109/TNN.2007.912312     Document Type: Article
Times cited : (223)

References (34)
  • 2
    • 0036293862 scopus 로고    scopus 로고
    • Connectionist language modeling for large vocabulary continuous speech recognition
    • Orlando, FL
    • H. Schwenk and J.-L. Gauvain, "Connectionist language modeling for large vocabulary continuous speech recognition," in Proc. Int. Conf. Acoust. Speech Signal Process., Orlando, FL, 2002, pp. 765-768.
    • (2002) Proc. Int. Conf. Acoust. Speech Signal Process , pp. 765-768
    • Schwenk, H.1    Gauvain, J.-L.2
  • 3
    • 10944267136 scopus 로고    scopus 로고
    • Efficient training of large neural networks for language modeling
    • Jul
    • H. Schwenk, "Efficient training of large neural networks for language modeling," in Proc. IEEE Int. Joint Conf. Neural Netw., Jul. 2004, vol. 4, pp. 3059-3064.
    • (2004) Proc. IEEE Int. Joint Conf. Neural Netw , vol.4 , pp. 3059-3064
    • Schwenk, H.1
  • 6
    • 0002553443 scopus 로고
    • Interpolated estimation of Markov source parameters from sparse data
    • F. Jelinek and R. L. Mercer E. S. Gelsema and L. N. Kanal, Eds, Amsterdam, The Netherlands: North-Holland
    • F. Jelinek and R. L. Mercer" E. S. Gelsema and L. N. Kanal, Eds., "Interpolated estimation of Markov source parameters from sparse data," in Pattern Recognition in Practice. Amsterdam, The Netherlands: North-Holland, 1980.
    • (1980) Pattern Recognition in Practice
  • 7
    • 0023312404 scopus 로고
    • Estimation of probabilities from sparse data for the language model component of a speech recognizer
    • Mar
    • S. M. Katz, "Estimation of probabilities from sparse data for the language model component of a speech recognizer," IEEE Trans. Acoust. Speech Signal Process., vol. ASSP-35, no. 3, pp. 400-401, Mar. 1987.
    • (1987) IEEE Trans. Acoust. Speech Signal Process , vol.ASSP-35 , Issue.3 , pp. 400-401
    • Katz, S.M.1
  • 9
    • 84899005563 scopus 로고    scopus 로고
    • A neural probabilistic language model
    • T. K. Leen, T. G. Dietterich, and V. Tresp, Eds. Cambridge, MA: MIT Press
    • Y. Bengio, R. Ducharme, and P. Vincent, "A neural probabilistic language model," in Advances in Neural Information Processing Systems 13, T. K. Leen, T. G. Dietterich, and V. Tresp, Eds. Cambridge, MA: MIT Press, 2001, pp. 932-938.
    • (2001) Advances in Neural Information Processing Systems 13 , pp. 932-938
    • Bengio, Y.1    Ducharme, R.2    Vincent, P.3
  • 10
    • 0002623785 scopus 로고
    • Learning distributed representations of concerts
    • Amherst, Hillsdale
    • G. Hinton, "Learning distributed representations of concerts," in Proc. 8th Annu. Conf. Cogn. Sci. Soc., Amherst, Hillsdale, 1986, pp. 1-12.
    • (1986) Proc. 8th Annu. Conf. Cogn. Sci. Soc , pp. 1-12
    • Hinton, G.1
  • 11
    • 84988402904 scopus 로고    scopus 로고
    • Can artificial neural network learn language models
    • Beijing, China
    • W. Xu and A. Rudnicky, "Can artificial neural network learn language models," in Proc. Int. Conf. Statist. Lang. Process., Beijing, China, 2000, pp. M1-13.
    • (2000) Proc. Int. Conf. Statist. Lang. Process
    • Xu, W.1    Rudnicky, A.2
  • 12
    • 0006273786 scopus 로고    scopus 로고
    • A latent semantic analysis framework for large-span language modeling
    • Rhodes, Greece
    • J. Bellegarda, "A latent semantic analysis framework for large-span language modeling," in Proc. Eurospeech, Rhodes, Greece, 1997, pp. 1451-1454.
    • (1997) Proc. Eurospeech , pp. 1451-1454
    • Bellegarda, J.1
  • 13
    • 0029984070 scopus 로고    scopus 로고
    • Improving protein secondary structure prediction using structured neural networks and multiple sequence profiles
    • S. Riis and A. Krogh, "Improving protein secondary structure prediction using structured neural networks and multiple sequence profiles," J. Comput. Biol., pp. 163-183, 1996.
    • (1996) J. Comput. Biol , pp. 163-183
    • Riis, S.1    Krogh, A.2
  • 14
    • 85009143810 scopus 로고    scopus 로고
    • Self organizing letter code-book for text-to-phoneme neural network model
    • K. Jensen and S. Riis, "Self organizing letter code-book for text-to-phoneme neural network model," in Proc. Int. Conf. Spoken Lang. Process., 2000, vol. 3, pp. 318-321.
    • (2000) Proc. Int. Conf. Spoken Lang. Process , vol.3 , pp. 318-321
    • Jensen, K.1    Riis, S.2
  • 16
    • 8344290493 scopus 로고    scopus 로고
    • Energy-based models for sparse overcomplete representations
    • Y.-W. Teh, M. Welling, S. Osindero, and G. E. Hinton, "Energy-based models for sparse overcomplete representations," J. Mach. Learn. Res., vol. 4, pp. 1235-1260, 2003.
    • (2003) J. Mach. Learn. Res , vol.4 , pp. 1235-1260
    • Teh, Y.-W.1    Welling, M.2    Osindero, S.3    Hinton, G.E.4
  • 17
    • 0003757760 scopus 로고
    • Fundamentals of stafisfical exponential families
    • Bethesda, MD: Institute of Mathematical Statistics
    • L. D. Brown, "Fundamentals of stafisfical exponential families," in Lecture Notes Monograph Series. Bethesda, MD: Institute of Mathematical Statistics, 1986, vol. 9.
    • (1986) Lecture Notes Monograph Series , vol.9
    • Brown, L.D.1
  • 18
    • 42549093328 scopus 로고    scopus 로고
    • G. E. Hinton and T. J. Sejnowski, Learning and releasing in Boltzmann machines, in Parallel Distributed Processing: Explorations in the Microstructure of Cognition. 1: Foundations, D. E. Rumelhart and J. L. McClelland, Eds. Cambridge, MA: MIT Press, 1986.
    • G. E. Hinton and T. J. Sejnowski, "Learning and releasing in Boltzmann machines," in Parallel Distributed Processing: Explorations in the Microstructure of Cognition. Volume 1: Foundations, D. E. Rumelhart and J. L. McClelland, Eds. Cambridge, MA: MIT Press, 1986.
  • 19
    • 0035059194 scopus 로고    scopus 로고
    • Whole-sentence exponential language models: A vehicle for linguistic-statistical integration
    • Online, Available
    • R. Rosenfeld, S. F. Chen, and X. Zhu, "Whole-sentence exponential language models: A vehicle for linguistic-statistical integration," Comput. Speech Lang. vol. 15, no. 1, 2001 [Online]. Available: citeseer.nj.nec.com/448532.html
    • (2001) Comput. Speech Lang , vol.15 , Issue.1
    • Rosenfeld, R.1    Chen, S.F.2    Zhu, X.3
  • 20
    • 0002652285 scopus 로고    scopus 로고
    • A maximum entropy approach to natural language processing
    • A. Berger, S. Della Pietra, and V. Della Pietra, "A maximum entropy approach to natural language processing," Comput. Linguist., vol. 22, pp. 39-71, 1996.
    • (1996) Comput. Linguist , vol.22 , pp. 39-71
    • Berger, A.1    Della Pietra, S.2    Della Pietra, V.3
  • 22
    • 84950943564 scopus 로고
    • Sequential imputations and Bayesian missing data problems
    • A. Kong, J. S. Liu, and W. H. Wong, "Sequential imputations and Bayesian missing data problems," J. Amer. Statist. Assoc., vol. 89, pp. 278-288, 1994.
    • (1994) J. Amer. Statist. Assoc , vol.89 , pp. 278-288
    • Kong, A.1    Liu, J.S.2    Wong, W.H.3
  • 27
    • 0012356157 scopus 로고    scopus 로고
    • Mach. Learn. Appl. Statist. Group, Microsoft Res, Redmond, WA, Tech. Rep. MSR-TR-2001-72
    • J. Goodman, "A bit of progress in language modeling-extended version," Mach. Learn. Appl. Statist. Group, Microsoft Res., Redmond, WA, Tech. Rep. MSR-TR-2001-72, 2003.
    • (2003) A bit of progress in language modeling-extended version
    • Goodman, J.1
  • 28
    • 0001249662 scopus 로고    scopus 로고
    • Ais-bn: An adaptive importance sampling algorithm for evidential reasoning in large Bayesian networks
    • J. Cheng and M. J. Druzdzel, "Ais-bn: An adaptive importance sampling algorithm for evidential reasoning in large Bayesian networks," J. Artif. Intell. Res., vol. 13, pp. 155-188, 2000.
    • (2000) J. Artif. Intell. Res , vol.13 , pp. 155-188
    • Cheng, J.1    Druzdzel, M.J.2
  • 29
    • 33646907991 scopus 로고    scopus 로고
    • Two decades of statistical language modeling: Where do we go from here?
    • Aug
    • R. Rosenfeld, "Two decades of statistical language modeling: Where do we go from here?," Proc. IEEE, vol. 88, no. 8, pp. 1270-1278, Aug. 2000.
    • (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1270-1278
    • Rosenfeld, R.1
  • 30
    • 0012356157 scopus 로고    scopus 로고
    • A bit of progress in language modeling
    • Microsoft Res, Tech. Rep. MSR -TR-2001-72
    • J. Goodman, "A bit of progress in language modeling," Microsoft Res., Tech. Rep. MSR -TR-2001-72, 2001.
    • (2001)
    • Goodman, J.1
  • 31
    • 4544358964 scopus 로고    scopus 로고
    • The super ARV language model: Investigating th effectiveness of tightly integrating multiple knowledge sources
    • Morristown, NJ
    • W. Wang and M. P. Harper, "The super ARV language model: Investigating th effectiveness of tightly integrating multiple knowledge sources," in Proc. ACL-02 Conf. Empirical Methods Natural Lang. Proress. Morristown, NJ, 2002, pp. 238-247.
    • (2002) Proc. ACL-02 Conf. Empirical Methods Natural Lang. Proress , pp. 238-247
    • Wang, W.1    Harper, M.P.2
  • 32
    • 0036293862 scopus 로고    scopus 로고
    • Connectionist language modeling for large vocabulary continuous speech recognition
    • Orlando, FL
    • H. Schwenk and J.-L. Gauvain, "Connectionist language modeling for large vocabulary continuous speech recognition," in Proc. Int. Conf. Acoust. Speach Signal Process., Orlando, FL, 2002, pp. 765-768.
    • (2002) Proc. Int. Conf. Acoust. Speach Signal Process , pp. 765-768
    • Schwenk, H.1    Gauvain, J.-L.2
  • 34
    • 0142192256 scopus 로고    scopus 로고
    • Dept. IRO, Université de Montr_aveal, Montréal, QC, Canada, Tech Rep. 1215
    • Y. Bengio, "New distributed probabilistic language models," Dept. IRO, Université de Montr_aveal, Montréal, QC, Canada, Tech Rep. 1215, 2002.
    • (2002) New distributed probabilistic language models
    • Bengio, Y.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.