메뉴 건너뛰기




Volumn 22, Issue 2, 2004, Pages 179-214

A study of smoothing methods for language models applied to information retrieval

Author keywords

Absolute discounting smoothing; Backoff smoothing; Dirichlet prior smoothing; EM algorithm; Interpolation smoothing; Jelinek Mercer smoothing; Leave one out; Risk minimization; Statistical language models; Term weighting; TF IDF weighting; Two stage smoothing

Indexed keywords

LANGUAGE MODELS; SENSITIVITY PATTERNS; SMOOTHING PARAMETERS; STATISTICAL LANGUAGE MODELING;

EID: 3042824043     PISSN: 10468188     EISSN: None     Source Type: Journal    
DOI: 10.1145/984321.984322     Document Type: Article
Times cited : (1020)

References (30)
  • 2
    • 0003396042 scopus 로고    scopus 로고
    • An empirical study of smoothing techniques for language modeling
    • Harvard University
    • CHEN, S. F. AND GOODMAN, J. 1998. An empirical study of smoothing techniques for language modeling. Tech. Rep. TR-10-98, Harvard University.
    • (1998) Tech. Rep. , vol.TR-10-98
    • Chen, S.F.1    Goodman, J.2
  • 3
    • 77957175435 scopus 로고
    • Probabilistic models in information retrieval
    • FUHR, N. 1992. Probabilistic models in information retrieval. Comput J. 35, 3, 243-255.
    • (1992) Comput J. , vol.35 , Issue.3 , pp. 243-255
    • Fuhr, N.1
  • 4
    • 0000803388 scopus 로고
    • The population frequencies of species and the estimation of population parameters
    • GOOD, I. J. 1953. The population frequencies of species and the estimation of population parameters. Biometrika 40, parts 3, 4, 237-264.
    • (1953) Biometrika , vol.40 , Issue.PARTS 3 AND 4 , pp. 237-264
    • Good, I.J.1
  • 6
    • 0019114666 scopus 로고
    • Interpolated estimation of markov sourceparameters from sparse data
    • E. S. Gelsema and L. N. Kanal, Eds.
    • JELINEK, F. AND MERCER, R. 1980. Interpolated estimation of markov sourceparameters from sparse data. In Pattern Recognition in Practice, E. S. Gelsema and L. N. Kanal, Eds. 381-402.
    • (1980) Pattern Recognition in Practice , pp. 381-402
    • Jelinek, F.1    Mercer, R.2
  • 7
    • 0023312404 scopus 로고
    • Estimation of probabilities from sparse data for the language model component of a speech recognizer
    • KATZ, S. M. 1987. Estimation of probabilities from sparse data for the language model component of a speech recognizer. IEEE Trans. Acoustics, Speech and Signal Processing (ASSP) 35 400-401.
    • (1987) IEEE Trans. Acoustics, Speech and Signal Processing (ASSP) , vol.35 , pp. 400-401
    • Katz, S.M.1
  • 9
    • 0036993292 scopus 로고    scopus 로고
    • The importance of prior probabilities for entry-page search
    • ACM, New York
    • KRAAIJ, W., WESTERVELD, T., AND HIEMSTRA, D. 2002. The importance of prior probabilities for entry-page search. In Proceedings of SIGIR'02. ACM, New York, 27-34.
    • (2002) Proceedings of SIGIR'02 , pp. 27-34
    • Kraaij, W.1    Westerveld, T.2    Hiemstra, D.3
  • 10
    • 0032286297 scopus 로고    scopus 로고
    • Improving two-stage ad-hoc retrieval for short queries
    • ACM, New York
    • KWOK, K. AND CHAN, M. 1998. Improving two-stage ad-hoc retrieval for short queries. In Proceedings of SIGIR'98. ACM, New York, 250-256.
    • (1998) Proceedings of SIGIR'98 , pp. 250-256
    • Kwok, K.1    Chan, M.2
  • 11
    • 0034790672 scopus 로고    scopus 로고
    • Document language models, query models, and risk minimization for information retrieval
    • ACM, New York
    • LAFFBRTY, J. AND ZHAI, C. 2001. Document language models, query models, and risk minimization for information retrieval. In Proceedings of SIGIR'01, ACM, New York, 111-119.
    • (2001) Proceedings of SIGIR'01 , pp. 111-119
    • Laffbrty, J.1    Zhai, C.2
  • 12
    • 0034785304 scopus 로고    scopus 로고
    • Relevance-based language models
    • ACM, New York
    • LAVBENKO, V. AND CROFT, B. 2001. Relevance-based language models. In Proceedings of SIGIR'01. ACM, New York, 120-127.
    • (2001) Proceedings of SIGIR'01 , pp. 120-127
    • Lavbenko, V.1    Croft, B.2
  • 13
    • 84974288913 scopus 로고
    • A hierarchical Dirichlet language model
    • MACKAY, D. AND PETO, L. 1995. A hierarchical Dirichlet language model. Nat. Lang. Eng. 1, 3, 289-307.
    • (1995) Nat. Lang. Eng. , vol.1 , Issue.3 , pp. 289-307
    • Mackay, D.1    Peto, L.2
  • 15
    • 0027929445 scopus 로고
    • On structuring probabilistic dependencies in stochastic language modeling
    • NEY, H., ESSEN, U., AND KNESER, R. 1994. On structuring probabilistic dependencies in stochastic language modeling. Comput. Speech Lang. 8, 1-38.
    • (1994) Comput. Speech Lang. , vol.8 , pp. 1-38
    • Ney, H.1    Essen, U.2    Kneser, R.3
  • 16
    • 0029492893 scopus 로고
    • On the estimation of 'small' probabilities by leaving-one-out
    • NEY, H., ESSEN, U., AND KNESER, R. 1995. On the estimation of 'small' probabilities by leaving-one-out. IEEE Trans. Pattern Anal. Machine Intel. 17, 12, 1202-1212.
    • (1995) IEEE Trans. Pattern Anal. Machine Intel. , vol.17 , Issue.12 , pp. 1202-1212
    • Ney, H.1    Essen, U.2    Kneser, R.3
  • 18
    • 0032268440 scopus 로고    scopus 로고
    • A language modeling approach to information retrieval
    • ACM, New York
    • PONTE, J. AND CROFT, W. B. 1998. A language modeling approach to information retrieval. In Proceedings of the ACM SIGIR'98. ACM, New York, 275-281.
    • (1998) Proceedings of the ACM SIGIR'98 , pp. 275-281
    • Ponte, J.1    Croft, W.B.2
  • 19
    • 0019693672 scopus 로고
    • Probabilistic models of indexing and searching
    • R. N. Oddy et al., Eds. Butterworths
    • ROBERTSON, S. E., VAN RUSBERGEN, C. J., AND PORTER, M. F. 1981. Probabilistic models of indexing and searching. In Information Retrieval Research, R. N. Oddy et al., Eds. Butterworths, 35-56.
    • (1981) Information Retrieval Research , pp. 35-56
    • Robertson, S.E.1    Van Rusbergen, C.J.2    Porter, M.F.3
  • 21
    • 45549117987 scopus 로고
    • Term-weighting approaches in automatic text retrieval
    • SALTON, G. AND BUCKLEY, C. 1988. Term-weighting approaches in automatic text retrieval. Inf. Proc. Manage. 24, 513-523.
    • (1988) Inf. Proc. Manage. , vol.24 , pp. 513-523
    • Salton, G.1    Buckley, C.2
  • 22
    • 84989552705 scopus 로고
    • Improving retrieval performance by relevance feedback
    • SALTON, G. AND BUCKLEY, C. 1990. Improving retrieval performance by relevance feedback. J. Amer. Soc. Inf. Sci. 44, 4, 288-297.
    • (1990) J. Amer. Soc. Inf. Sci. , vol.44 , Issue.4 , pp. 288-297
    • Salton, G.1    Buckley, C.2
  • 23
    • 0016572913 scopus 로고
    • A vector space model for automatic indexing
    • SALTON, G., WONG, A., AND YANG, C. S. 1975. A vector space model for automatic indexing. Commun. ACM 18, 11, 613-620.
    • (1975) Commun. ACM , vol.18 , Issue.11 , pp. 613-620
    • Salton, G.1    Wong, A.2    Yang, C.S.3
  • 27
    • 0023014685 scopus 로고
    • A non-classical logic for information retrieval
    • VAN RIJSBERGEN, C. J. 1986. A non-classical logic for information retrieval. Comput. J. 29, 6, 481-485.
    • (1986) Comput. J. , vol.29 , Issue.6 , pp. 481-485
    • Van Rijsbergen, C.J.1
  • 29
    • 0029214138 scopus 로고
    • On modeling information retrieval with probabilistic inference
    • WONG, S. K. M. AND YAO, Y. Y. 1995. On modeling information retrieval with probabilistic inference. ACM Trans. Inf. Syst. 13, 1, 69-99.
    • (1995) ACM Trans. Inf. Syst. , vol.13 , Issue.1 , pp. 69-99
    • Wong, S.K.M.1    Yao, Y.Y.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.