메뉴 건너뛰기




Volumn , Issue , 2013, Pages 129-177

Topic-based Generative Models for Text Information Access

Author keywords

Fisher kernels; Generative models; Generative vs. discriminative models; Language models; Latent Dirichlet Allocation (LDA); Probabilistic latent semantic indexing (PLSI); Term models; Text information access; Text models; Topic based models

Indexed keywords


EID: 84870290379     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1002/9781118562796.ch5     Document Type: Chapter
Times cited : (1)

References (109)
  • 1
    • 77955658320 scopus 로고    scopus 로고
    • A geometric view of conjugate priors
    • Springer
    • Agarwal A., Daumé H., "A geometric view of conjugate priors", Machine Learning, vol. 81, no. 1, p. 99-113, Springer, 2010.
    • (2010) Machine Learning , vol.81 , Issue.1 , pp. 99-113
    • Agarwal, A.1    Daumé, H.2
  • 3
    • 0000396062 scopus 로고    scopus 로고
    • Natural gradient works efficiently in learning
    • Amari S.I., "Natural gradient works efficiently in learning", Neural Computation, vol. 10, p. 251-276, 1998.
    • (1998) Neural Computation , vol.10 , pp. 251-276
    • Amari, S.I.1
  • 4
    • 0003530945 scopus 로고    scopus 로고
    • Translations of Mathematical Monographs, American Mathematical Society
    • Amari S.I., Nagaoka H., Methods of Information Geometry, vol. 191 of Translations of Mathematical Monographs, American Mathematical Society, 2000.
    • (2000) Methods of Information Geometry , vol.191
    • Amari, S.I.1    Nagaoka, H.2
  • 6
    • 78649698593 scopus 로고    scopus 로고
    • Asynchronous distributed estimation of topic models for document analysis
    • Elsevier, January
    • Asuncion A.U., Smyth P., Welling M., "Asynchronous distributed estimation of topic models for document analysis", Statistical Methodology, vol. 8, no. 1, p. 3-17, Elsevier, January 2011.
    • (2011) Statistical Methodology , vol.8 , Issue.1 , pp. 3-17
    • Asuncion, A.U.1    Smyth, P.2    Welling, M.3
  • 16
    • 52449116403 scopus 로고    scopus 로고
    • A correlated topic model of science
    • Blei D.M., Lafferty J.D., "A correlated topic model of science", Annals of Applied Statistics, vol. 1, no. 1, p. 17-35, 2007.
    • (2007) Annals of Applied Statistics , vol.1 , Issue.1 , pp. 17-35
    • Blei, D.M.1    Lafferty, J.D.2
  • 17
    • 74549185029 scopus 로고    scopus 로고
    • Supervised topic models
    • Platt J., Koller D., Singer Y., Roweis S. (eds), Advances in Neural Information Processing Systems 20 (NIPS '07), MIT Press
    • Blei D.M., McAuliffe J.D., "Supervised topic models", in Platt J., Koller D., Singer Y., Roweis S. (eds), Advances in Neural Information Processing Systems 20 (NIPS '07), MIT Press, p. 121-128, 2008.
    • (2008) , pp. 121-128
    • Blei, D.M.1    McAuliffe, J.D.2
  • 19
    • 76849117578 scopus 로고    scopus 로고
    • The nested Chinese restaurant process and Bayesian nonparametric inference of topic hierarchies
    • Blei D., Griffiths T., Jordan M., "The nested Chinese restaurant process and Bayesian nonparametric inference of topic hierarchies", Journal of the Association for Computing Machinery, vol. 57, no. 2, p. 1-30, 2010.
    • (2010) Journal of the Association for Computing Machinery , vol.57 , Issue.2 , pp. 1-30
    • Blei, D.1    Griffiths, T.2    Jordan, M.3
  • 21
    • 84945258136 scopus 로고    scopus 로고
    • Variational extensions to EM and multinomial PCA
    • vol. 2430 of LNAI
    • Buntine W., "Variational extensions to EM and multinomial PCA", in Proceedings of ECML '02, vol. 2430 of LNAI, p. 23-34, 2002.
    • (2002) Proceedings of ECML '02 , pp. 23-34
    • Buntine, W.1
  • 23
    • 33745846804 scopus 로고    scopus 로고
    • Discrete component analysis
    • vol. 3940 of LNCS
    • Buntine W., Jakulin A., "Discrete component analysis", in Proceedings of SLSFS '05, vol. 3940 of LNCS, p. 1-33, 2006.
    • (2006) Proceedings of SLSFS '05 , pp. 1-33
    • Buntine, W.1    Jakulin, A.2
  • 24
    • 70549102400 scopus 로고    scopus 로고
    • Estimating likelihoods for topic models
    • vol. 5828 of LNAI
    • Buntine W., "Estimating likelihoods for topic models", in Proceedings of ACML '09, vol. 5828 of LNAI, p. 51-64, 2009.
    • (2009) Proceedings of ACML '09 , pp. 51-64
    • Buntine, W.1
  • 28
    • 70350589160 scopus 로고    scopus 로고
    • An ad hoc information retrieval perspective on PLSI through language model identification
    • Azzopardi L., et al. (eds), Advances in Information Retrieval Theory (Proceedings of ICTIR '09), vol. 5766 of Lecture Notes in Computer Science, Springer-Verlag
    • Chappelier J.C., Eckard E., "An ad hoc information retrieval perspective on PLSI through language model identification", in Azzopardi L., et al. (eds), Advances in Information Retrieval Theory (Proceedings of ICTIR '09), vol. 5766 of Lecture Notes in Computer Science, Springer-Verlag, p. 346-349, 2009.
    • (2009) , pp. 346-349
    • Chappelier, J.C.1    Eckard, E.2
  • 29
    • 70350648690 scopus 로고    scopus 로고
    • PLSI: The true Fisher kernel and beyond - IID processes, information matrix and model identification in PLSI
    • Buntine W., et al. (eds), Machine Learning and Knowledge Discovery in Databases (Proceedings ECMLPKDD '09), vol. 5781 of Lecture Notes in Computer Science, Springer-Verlag
    • Chappelier J.C., Eckard E., "PLSI: The true Fisher kernel and beyond - IID processes, information matrix and model identification in PLSI", in Buntine W., et al. (eds), Machine Learning and Knowledge Discovery in Databases (Proceedings ECMLPKDD '09), vol. 5781 of Lecture Notes in Computer Science, Springer-Verlag, p. 195-210, 2009.
    • (2009) , pp. 195-210
    • Chappelier, J.C.1    Eckard, E.2
  • 34
    • 84915676562 scopus 로고
    • Multiple hypergeometric functions: probabilistic interpretations and statistical uses
    • Dickey J., "Multiple hypergeometric functions: probabilistic interpretations and statistical uses", Journal of the American Statistical Association, vol. 78, p. 628-637, 1983.
    • (1983) Journal of the American Statistical Association , vol.78 , pp. 628-637
    • Dickey, J.1
  • 36
    • 33646725912 scopus 로고    scopus 로고
    • Deriving TF-IDF as a Fisher kernel
    • Navarro G. (eds), Proceedings of String Processing and Information Retrieval (SPIRE '05), vol. 3772 of Lecture Notes in Computer Science, Chapter 33, Springer
    • Elkan C., "Deriving TF-IDF as a Fisher kernel", in Consens M., Navarro G. (eds), Proceedings of String Processing and Information Retrieval (SPIRE '05), vol. 3772 of Lecture Notes in Computer Science, Chapter 33, p. 295-300, Springer, 2005.
    • (2005) Consens M. , pp. 295-300
    • Elkan, C.1
  • 37
    • 33749257142 scopus 로고    scopus 로고
    • Clustering documents with an exponential-family approximation of the Dirichlet compound multinomial distribution
    • Elkan C., "Clustering documents with an exponential-family approximation of the Dirichlet compound multinomial distribution", in Proceedings of the 23rd International Conference on Machine Learning (ICML '06), p. 289-296, 2006.
    • (2006) Proceedings of the 23rd International Conference on Machine Learning (ICML '06) , pp. 289-296
    • Elkan, C.1
  • 39
    • 25144479614 scopus 로고    scopus 로고
    • A latent variable model for chemogenomic profiling
    • Oxford University Press, August
    • Flaherty P., Giaever G., Kumm J., Jordan M.I., Arkin A.P., "A latent variable model for chemogenomic profiling", Bioinformatics, vol. 21, no. 15, p. 3286-3293, Oxford University Press, August 2005.
    • (2005) Bioinformatics , vol.21 , Issue.15 , pp. 3286-3293
    • Flaherty, P.1    Giaever, G.2    Kumm, J.3    Jordan, M.I.4    Arkin, A.P.5
  • 43
    • 56449095616 scopus 로고    scopus 로고
    • Memory bounded inference in topic models
    • McCallum A., Roweis S. (eds), Proceedings 25th International Conference on Machine Learning (ICML '08), Omnipress
    • Gomes R., Welling M., Perona P., "Memory bounded inference in topic models", in McCallum A., Roweis S. (eds), Proceedings 25th International Conference on Machine Learning (ICML '08), Omnipress, p. 344-351, 2008.
    • (2008) , pp. 344-351
    • Gomes, R.1    Welling, M.2    Perona, P.3
  • 49
    • 70350641332 scopus 로고    scopus 로고
    • A generic approach to topic models
    • Buntine W., et al. (eds), Machine Learning and Knowledge Discovery in Databases (Proceedings ECML-PKDD '09), vol. 5781 of Lecture Notes in Computer Science, Springer-Verlag
    • Heinrich G., "A generic approach to topic models", in Buntine W., et al. (eds), Machine Learning and Knowledge Discovery in Databases (Proceedings ECML-PKDD '09), vol. 5781 of Lecture Notes in Computer Science, Springer-Verlag, p. 517-532, 2009.
    • (2009) , pp. 517-532
    • Heinrich, G.1
  • 50
    • 50249144225 scopus 로고    scopus 로고
    • Report, Fraunhofer Institute for Computer Graphics Research, Darmstadt, Germany
    • Heinrich G., Parameter Estimation for Text Analysis (v. 2.9), Report, Fraunhofer Institute for Computer Graphics Research, Darmstadt, Germany, 2009.
    • (2009) Parameter Estimation for Text Analysis (v. 2.9)
    • Heinrich, G.1
  • 53
    • 84898996741 scopus 로고    scopus 로고
    • Learning the similarity of documents: an information-geometric approach to document retrieval and categorization
    • Hofmann T., "Learning the similarity of documents: an information-geometric approach to document retrieval and categorization", in Advances in Neural Information Processing Systems, vol. 12, p. 914-920, 2000.
    • (2000) Advances in Neural Information Processing Systems , vol.12 , pp. 914-920
    • Hofmann, T.1
  • 54
    • 0034818212 scopus 로고    scopus 로고
    • Unsupervised learning by probabilistic latent semantic analysis
    • Hofmann T., "Unsupervised learning by probabilistic latent semantic analysis", Machine Learning, vol. 42, no. 1, p. 177-196, 2001.
    • (2001) Machine Learning , vol.42 , Issue.1 , pp. 177-196
    • Hofmann, T.1
  • 59
    • 0033225865 scopus 로고    scopus 로고
    • Introduction to variational methods for graphical models
    • Jordan M., Ghahramani Y., Jaakkola T., Saul L., "Introduction to variational methods for graphical models", Machine Learning, vol. 37, p. 183-233, 1999.
    • (1999) Machine Learning , vol.37 , pp. 183-233
    • Jordan, M.1    Ghahramani, Y.2    Jaakkola, T.3    Saul, L.4
  • 60
    • 0030092468 scopus 로고    scopus 로고
    • Distribution of content words and phrases in text and language modelling
    • Katz S.M., "Distribution of content words and phrases in text and language modelling", Natural Language Engineering, vol. 2, p. 15-59, 1996.
    • (1996) Natural Language Engineering , vol.2 , pp. 15-59
    • Katz, S.M.1
  • 62
    • 79957489009 scopus 로고    scopus 로고
    • DiscLDA: discriminative learning for dimensionality reduction and classification
    • Koller D., Schuurmans D., Bengio Y., Bottou L. (eds)
    • Lacoste-Julien S., Sha F., Jordan M., "DiscLDA: discriminative learning for dimensionality reduction and classification", in Koller D., Schuurmans D., Bengio Y., Bottou L. (eds), Advances in Neural Information Processing Systems 21 (NIPS '08), p. 897-904, 2009.
    • (2009) Advances in Neural Information Processing Systems 21 (NIPS '08) , pp. 897-904
    • Lacoste-Julien, S.1    Sha, F.2    Jordan, M.3
  • 64
    • 0033592606 scopus 로고    scopus 로고
    • Algorithms for non-negative matrix factorization
    • Lee D.D., Seung H.S., "Algorithms for non-negative matrix factorization", Nature, vol. 401, p. 788-791, 1999.
    • (1999) Nature , vol.401 , pp. 788-791
    • Lee, D.D.1    Seung, H.S.2
  • 65
    • 84957069091 scopus 로고    scopus 로고
    • Naive (bayes) at forty: the independence assumption in information retrieval
    • vol. 1398 of Lecture Notes in Computer Science, Springer
    • Lewis D., "Naive (bayes) at forty: the independence assumption in information retrieval", in Proceedings of 10th European Conference on Machine Learning (ECML '98), vol. 1398 of Lecture Notes in Computer Science, Springer, p. 4-15, 1998.
    • (1998) Proceedings of 10th European Conference on Machine Learning (ECML '98) , pp. 4-15
    • Lewis, D.1
  • 68
    • 0025952277 scopus 로고
    • Divergence measures based on Shannon entropy
    • Lin J., "Divergence measures based on Shannon entropy", IEEE Transactions on Information Theory, vol. 37, no. 14, p. 145-151, 1991.
    • (1991) IEEE Transactions on Information Theory , vol.37 , Issue.14 , pp. 145-151
    • Lin, J.1
  • 71
    • 33750699291 scopus 로고    scopus 로고
    • Multi-conditional learning: generative/discriminative training for clustering and classification
    • McCallum A., Pal C., Druck G., Wang X., "Multi-conditional learning: generative/discriminative training for clustering and classification", in Proceedings of AAAI '06, 2006.
    • (2006) Proceedings of AAAI '06
    • McCallum, A.1    Pal, C.2    Druck, G.3    Wang, X.4
  • 72
    • 38349172091 scopus 로고    scopus 로고
    • Topic and role discovery in social networks with experiments on enron and academic email
    • AI Access Foundation
    • McCallum A., Wang X., Corrada-Emmanuel A., "Topic and role discovery in social networks with experiments on enron and academic email", Journal of Artificial Intelligence Research, vol. 30, no. 1, p. 249-272, AI Access Foundation, 2007.
    • (2007) Journal of Artificial Intelligence Research , vol.30 , Issue.1 , pp. 249-272
    • McCallum, A.1    Wang, X.2    Corrada-Emmanuel, A.3
  • 74
    • 33947156744 scopus 로고    scopus 로고
    • Comparing clusterings-an information based distance
    • Meilǎ M., "Comparing clusterings-an information based distance", Journal of Multivariate Analysis, vol. 98, no. 5, p. 873-895, 2007.
    • (2007) Journal of Multivariate Analysis , vol.98 , Issue.5 , pp. 873-895
    • Meilǎ, M.1
  • 75
    • 0141596527 scopus 로고    scopus 로고
    • Estimating a Dirichlet distribution
    • available on the Internet (revision 2009)
    • Minka T., "Estimating a Dirichlet distribution", available on the Internet (revision 2009), 2000.
    • (2000)
    • Minka, T.1
  • 82
    • 0033886806 scopus 로고    scopus 로고
    • Text classification from labeled and unlabeled documents using EM
    • Nigam K., McCallum A.K., Thrun S., Mitchell T., "Text classification from labeled and unlabeled documents using EM", Machine Learning, vol. 39, p. 103-134, 2000.
    • (2000) Machine Learning , vol.39 , pp. 103-134
    • Nigam, K.1    McCallum, A.K.2    Thrun, S.3    Mitchell, T.4
  • 86
    • 0034118493 scopus 로고    scopus 로고
    • Inference of population structure unsing multilocus genotype data
    • Pritchard J.K., Stephens M., Donnelly P., "Inference of population structure unsing multilocus genotype data", Genetics, vol. 155, p. 945-959, 2000.
    • (2000) Genetics , vol.155 , pp. 945-959
    • Pritchard, J.K.1    Stephens, M.2    Donnelly, P.3
  • 92
    • 0002442796 scopus 로고    scopus 로고
    • Machine learning in automated text categorization
    • Sebastiani F., "Machine learning in automated text categorization", ACM Computing Surveys, vol. 34, no. 1, p. 1-47, 2002.
    • (2002) ACM Computing Surveys , vol.34 , Issue.1 , pp. 1-47
    • Sebastiani, F.1
  • 94
    • 85145541793 scopus 로고    scopus 로고
    • Probabilistic topic models
    • McNamara D., Dennis S., Kintsch W. (eds), Handbook of Latent Semantic Analysis, Chapter 21, Laurence Erlbaum
    • Steyvers M., Griffiths T., "Probabilistic topic models", in Landauer T., McNamara D., Dennis S., Kintsch W. (eds), Handbook of Latent Semantic Analysis, Chapter 21, p. 427-448, Laurence Erlbaum, 2007.
    • (2007) Landauer T. , pp. 427-448
    • Steyvers, M.1    Griffiths, T.2
  • 98
    • 0036498205 scopus 로고    scopus 로고
    • A probabilistic framework for the hierarchic organisation and classification of document collections
    • Vinokourov A., Girolami M., "A probabilistic framework for the hierarchic organisation and classification of document collections", Journal of Intelligent Information Systems, vol. 18, no. 2-3, p. 153-172, 2002.
    • (2002) Journal of Intelligent Information Systems , vol.18 , Issue.2-3 , pp. 153-172
    • Vinokourov, A.1    Girolami, M.2
  • 105
    • 3042824043 scopus 로고    scopus 로고
    • A study of smoothing methods for language models applied to information retrieval
    • Zhai C., Lafferty J., "A study of smoothing methods for language models applied to information retrieval", ACM Transactions on Information Systems, vol. 22, no. 2, p. 179-214, 2004.
    • (2004) ACM Transactions on Information Systems , vol.22 , Issue.2 , pp. 179-214
    • Zhai, C.1    Lafferty, J.2
  • 106
    • 58149265101 scopus 로고    scopus 로고
    • Statistical language models for information retrieval a critical review
    • Zhai C., "Statistical language models for information retrieval a critical review", Foundations and Trends in Information Retrieval, vol. 2, no. 3, p. 137-213, 2008.
    • (2008) Foundations and Trends in Information Retrieval , vol.2 , Issue.3 , pp. 137-213
    • Zhai, C.1
  • 108
    • 24944501423 scopus 로고    scopus 로고
    • Generative model-based document clustering: a comparative study
    • Springer-Verlag
    • Zhong S., Ghosh J., "Generative model-based document clustering: a comparative study", Knowledge and Information Systems, vol. 8, no. 3, p. 374-384, Springer-Verlag, 2005.
    • (2005) Knowledge and Information Systems , vol.8 , Issue.3 , pp. 374-384
    • Zhong, S.1    Ghosh, J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.