메뉴 건너뛰기




Volumn 15, Issue 5, 2007, Pages 1617-1624

On growing and pruning Kneser-Ney smoothed N-gram models

Author keywords

Modeling; Natural languages; Smoothing methods; Speech recognition

Indexed keywords

BACK OFFS; FINNISH; GOOD-TURING; GROWING AND PRUNING; LANGUAGE MODELS; LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION; MODEL ORDERS; MODELING; N-GRAM MODELS; NATURAL LANGUAGES; PRUNING ALGORITHMS; PRUNING METHODS; SMOOTHING METHODS; TEXT CORPORA; TRAINING DATUM;

EID: 58349107420     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2007.896666     Document Type: Article
Times cited : (69)

References (28)
  • 1
    • 0030361237 scopus 로고    scopus 로고
    • Scalable backoff language models
    • K. Seymore and R. Rosenfeld, "Scalable backoff language models," in Proc. ICSLP, 1996, pp. 232-235.
    • (1996) Proc. ICSLP , pp. 232-235
    • Seymore, K.1    Rosenfeld, R.2
  • 2
    • 0030375251 scopus 로고    scopus 로고
    • Statistical language modeling using a variable context length
    • R. Kneser, "Statistical language modeling using a variable context length," in Proc. ICSLP, 1996, pp. 494-497.
    • (1996) Proc. ICSLP , pp. 494-497
    • Kneser, R.1
  • 4
    • 0033329799 scopus 로고    scopus 로고
    • An empirical study of smoothing techniques for language modeling
    • Oct
    • S. Chen and J. Goodman, "An empirical study of smoothing techniques for language modeling," Comput. Speech Lang., vol. 13, no. 4, pp. 359-393, Oct. 1999.
    • (1999) Comput. Speech Lang , vol.13 , Issue.4 , pp. 359-393
    • Chen, S.1    Goodman, J.2
  • 5
    • 0035497388 scopus 로고    scopus 로고
    • Abit of progress in language modeling
    • Oct
    • J. Goodman, "Abit of progress in language modeling," Comput. Speech Lang., vol. 15, no. 4, pp. 403-434, Oct. 2001.
    • (2001) Comput. Speech Lang , vol.15 , Issue.4 , pp. 403-434
    • Goodman, J.1
  • 6
    • 0028996876 scopus 로고
    • Improved backing-off for m-gram language modeling
    • R. Kneser and H. Ney, "Improved backing-off for m-gram language modeling," in Proc. ICASSP, 1995, pp. 181-184.
    • (1995) Proc. ICASSP , pp. 181-184
    • Kneser, R.1    Ney, H.2
  • 7
    • 84945903856 scopus 로고    scopus 로고
    • Language model size reduction by pruning and clustering
    • J. Goodman and J. Gao, "Language model size reduction by pruning and clustering," in Proc. ICSLP, 2000, pp. 110-113.
    • (2000) Proc. ICSLP , pp. 110-113
    • Goodman, J.1    Gao, J.2
  • 9
    • 0033873049 scopus 로고    scopus 로고
    • Variable N-grams and extensions for conversational speech language modeling
    • Jan
    • M. Siu and M. Ostendorf, "Variable N-grams and extensions for conversational speech language modeling," IEEE Trans. Speech Audio Process., vol. 8, no. 1, pp. 63-75, Jan. 2000.
    • (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.1 , pp. 63-75
    • Siu, M.1    Ostendorf, M.2
  • 10
    • 0032650074 scopus 로고    scopus 로고
    • Variable-length category n-gram language models
    • Jan
    • T. R. Niesler and P. C. Woodland, "Variable-length category n-gram language models," Comput. Speech Lang., vol. 13, no. 1, pp. 99-124, Jan. 1999.
    • (1999) Comput. Speech Lang , vol.13 , Issue.1 , pp. 99-124
    • Niesler, T.R.1    Woodland, P.C.2
  • 11
    • 33745217822 scopus 로고    scopus 로고
    • Growing an N-gram model
    • V. Siivola and B. Pellom, "Growing an N-gram model," in Proc. Interspeech, 2005, pp. 1309-1312.
    • (2005) Proc. Interspeech , pp. 1309-1312
    • Siivola, V.1    Pellom, B.2
  • 12
    • 0038373395 scopus 로고    scopus 로고
    • Multi-class composite n-gram language model
    • Oct
    • H. Yamamoto, S. Isogai, and Y. Sagisaka, "Multi-class composite n-gram language model," Speech Commun., vol. 41, no. 2-3, pp. 369-379, Oct. 2003.
    • (2003) Speech Commun , vol.41 , Issue.2-3 , pp. 369-379
    • Yamamoto, H.1    Isogai, S.2    Sagisaka, Y.3
  • 13
    • 0031273765 scopus 로고    scopus 로고
    • Inference of variable-length linguistic and acoustic units by multigrams
    • S. Deligne and F. Bimbot, "Inference of variable-length linguistic and acoustic units by multigrams," Speech Commun., vol. 23, no. 3, pp. 223-241, 1997.
    • (1997) Speech Commun , vol.23 , Issue.3 , pp. 223-241
    • Deligne, S.1    Bimbot, F.2
  • 15
    • 44949140825 scopus 로고    scopus 로고
    • Compact n-gram models by incremental growing and clustering of histories
    • S. Virpioja and M. Kurimo, "Compact n-gram models by incremental growing and clustering of histories," in Proc. Interspeech, 2006, pp. 1037-1040.
    • (2006) Proc. Interspeech , pp. 1037-1040
    • Virpioja, S.1    Kurimo, M.2
  • 16
    • 0030366442 scopus 로고    scopus 로고
    • Language modeling using x-grams
    • A. Bonafonte and J. Mariño, "Language modeling using x-grams," in Proc. ICSLP, 1996, pp. 394-397.
    • (1996) Proc. ICSLP , pp. 394-397
    • Bonafonte, A.1    Mariño, J.2
  • 17
    • 0033887568 scopus 로고    scopus 로고
    • A survey of smoothing techniques for ME models
    • Jan
    • S. Chen and R. Rosenfeld, "A survey of smoothing techniques for ME models," IEEE Trans. Speech Audio Process., vol. 8, no. 1, pp. 37-50, Jan. 2000.
    • (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.1 , pp. 37-50
    • Chen, S.1    Rosenfeld, R.2
  • 18
    • 84860520072 scopus 로고    scopus 로고
    • Modified Kneser-Ney Smoothing of N-Gram Models Res. Inst. Adv. Comput. Sci
    • Tech. Rep. 00.07, Oct
    • F. James, Modified Kneser-Ney Smoothing of N-Gram Models Res. Inst. Adv. Comput. Sci., Tech. Rep. 00.07, Oct. 2000.
    • (2000)
    • James, F.1
  • 19
    • 85009115852 scopus 로고    scopus 로고
    • Quantization-based language model compression
    • E. W. D. Whittaker and B. Raj, "Quantization-based language model compression," in Proc. Eurospeech, 2001, pp. 33-36.
    • (2001) Proc. Eurospeech , pp. 33-36
    • Whittaker, E.W.D.1    Raj, B.2
  • 20
    • 0141703229 scopus 로고    scopus 로고
    • Lossless compression of language model structure and word identifiers
    • B. Raj and E. W. D. Whittaker, "Lossless compression of language model structure and word identifiers," in Proc. ICASSP, 2003, pp. 388-391.
    • (2003) Proc. ICASSP , pp. 388-391
    • Raj, B.1    Whittaker, E.W.D.2
  • 21
    • 64149109545 scopus 로고    scopus 로고
    • quot;Finnish Text Collection, 2004, collection of Finnish text documents from years 1990-2000. Compiled by Department of General Linguistics, University of Helsinki, Linguistics and Language Technology Department, University of Joensuu, Research Institute for the Languages of Finland, and CSC.[Online]. Available: http://www.csc.fi/kielipankki/
    • quot;Finnish Text Collection," 2004, collection of Finnish text documents from years 1990-2000. Compiled by Department of General Linguistics, University of Helsinki, Linguistics and Language Technology Department, University of Joensuu, Research Institute for the Languages of Finland, and CSC.[Online]. Available: http://www.csc.fi/kielipankki/
  • 22
    • 33746524944 scopus 로고    scopus 로고
    • Unlimited vocabulary speech recognition with morph language models applied to Finnish
    • Oct
    • T. Hirsimäki, M. Creutz, V. Siivola, M. Kurimo, S. Virpioja, and J. Pylkkönen, "Unlimited vocabulary speech recognition with morph language models applied to Finnish," Comput. Speech Lang., vol. 20, no. 4, pp. 515-541, Oct. 2006.
    • (2006) Comput. Speech Lang , vol.20 , Issue.4 , pp. 515-541
    • Hirsimäki, T.1    Creutz, M.2    Siivola, V.3    Kurimo, M.4    Virpioja, S.5    Pylkkönen, J.6
  • 24
    • 64149097895 scopus 로고    scopus 로고
    • M. Creutz and K. Lagus, Unsupervised morpheme segmentation and morphology induction from text corpora Using Morfessor 1.0, Publications Comput. Inf. Sci., Helsinki Univ.Technol., Tech. Rep. A81, 2005.
    • M. Creutz and K. Lagus, "Unsupervised morpheme segmentation and morphology induction from text corpora Using Morfessor 1.0," Publications Comput. Inf. Sci., Helsinki Univ.Technol., Tech. Rep. A81, 2005.
  • 25
    • 84891308106 scopus 로고    scopus 로고
    • SRILM-An extensible language modeling toolkit
    • A. Stolcke, "SRILM-An extensible language modeling toolkit," in Proc. ICSLP, 2002, pp. 901-904.
    • (2002) Proc. ICSLP , pp. 901-904
    • Stolcke, A.1
  • 26
    • 19944407315 scopus 로고    scopus 로고
    • Second Edition. Philadelphia, PA: Linguistic Data Consortium
    • D. Graff, J. Kong, K. Chen, and K. Maeda, English Gigaword Second Edition. Philadelphia, PA: Linguistic Data Consortium, 2005.
    • (2005) English Gigaword
    • Graff, D.1    Kong, J.2    Chen, K.3    Maeda, K.4
  • 28
    • 33745200763 scopus 로고    scopus 로고
    • New pruning criteria for efficient decoding
    • J. Pylkkönen, "New pruning criteria for efficient decoding," in Proc. Interspeech, 2005, pp. 581-584.
    • (2005) Proc. Interspeech , pp. 581-584
    • Pylkkönen, J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.