메뉴 건너뛰기




Volumn 31, Issue 8, 2004, Pages 885-924

Pattern discovery and detection: A unified statistical methodology

Author keywords

Association analysis; Bioinformatics; Configural frequency analysis; Data mining; Market basket analysis; Pattern discovery; Patterns; Scan statistics; Spatial epidemiology; Technical analysis

Indexed keywords


EID: 8744299420     PISSN: 02664763     EISSN: None     Source Type: Journal    
DOI: 10.1080/0266476042000270518     Document Type: Article
Times cited : (21)

References (102)
  • 2
    • 84942446106 scopus 로고    scopus 로고
    • Discovery of actionable patterns in databases: The action hierarchy approach
    • D. Heckerman, H. Mannila, D. Pregibon & R. Uthurusamy (Eds) (Menlo Park, CA: AAAI Press)
    • Adomavicius, G. & Tuzhilin, A. (1997) Discovery of actionable patterns in databases: the action hierarchy approach, in: D. Heckerman, H. Mannila, D. Pregibon & R. Uthurusamy (Eds) Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, pp. 111-114 (Menlo Park, CA: AAAI Press).
    • (1997) Proceedings of the Third International Conference on Knowledge Discovery and Data Mining , pp. 111-114
    • Adomavicius, G.1    Tuzhilin, A.2
  • 4
    • 84937677595 scopus 로고    scopus 로고
    • Discovery of frequent word sequences in text
    • D. J. Hand, N. M. Adams & R. J. Bolton (Eds) London, UK, Proceedings, LNAI 2447, (Berlin: Springer)
    • Ahonen-Myka, H. (2002) Discovery of frequent word sequences in text, in: D. J. Hand, N. M. Adams & R. J. Bolton (Eds) Pattern Detection and Discovery, ESF Exploratory Workshop, London, UK, Proceedings, LNAI 2447, pp. 180-189 (Berlin: Springer).
    • (2002) Pattern Detection and Discovery, ESF Exploratory Workshop , pp. 180-189
    • Ahonen-Myka, H.1
  • 7
    • 0001677717 scopus 로고
    • Controlling the false discovery rate: A practical and powerful approach to multiple testing
    • Benjamini, Y. & Hochberg, Y. (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing, Journal of the Royal Statistical Society, Series B, 57, pp. 289-300.
    • (1995) Journal of the Royal Statistical Society, Series B , vol.57 , pp. 289-300
    • Benjamini, Y.1    Hochberg, Y.2
  • 10
    • 0025308637 scopus 로고
    • An explanation of density estimation to geographical epidemiology
    • Bithell, J. F. (1990) An explanation of density estimation to geographical epidemiology, Statistics of Medicine, 9, pp. 691-701.
    • (1990) Statistics of Medicine , vol.9 , pp. 691-701
    • Bithell, J.F.1
  • 11
    • 78149302119 scopus 로고    scopus 로고
    • Significance tests for patterns in continuous data
    • N. Cercone, T. Y. Lin & X. Wu (Eds) (Los Alamitos, CA: IEEE Computer Society)
    • Bolton, R. J. & Hand, D. J. (2001) Significance tests for patterns in continuous data, in: N. Cercone, T. Y. Lin & X. Wu (Eds) Proceedings of the 2001 IEEE International Conference on Data Mining, pp. 67-74 (Los Alamitos, CA: IEEE Computer Society).
    • (2001) Proceedings of the 2001 IEEE International Conference on Data Mining , pp. 67-74
    • Bolton, R.J.1    Hand, D.J.2
  • 12
    • 0042421807 scopus 로고    scopus 로고
    • Statistical fraud detection: A review
    • (with discussion)
    • Bolton, R. J. & Hand, D. J. (2002) Statistical fraud detection: a review (with discussion), Statistical Science, 17, pp. 235-255.
    • (2002) Statistical Science , vol.17 , pp. 235-255
    • Bolton, R.J.1    Hand, D.J.2
  • 13
    • 41549157049 scopus 로고    scopus 로고
    • Determining hit rate in pattern search
    • D. J. Hand, N. M. Adams & R. J. Bolton (Eds) London, UK, Proceedings, (Berlin: Springer)
    • Bolton, R. J., Hand, D. J. & Adams, N. M. (2002) Determining hit rate in pattern search, in: D. J. Hand, N. M. Adams & R. J. Bolton (Eds) Pattern Detection and Discovery, ESF Exploratory Workshop, London, UK, Proceedings, pp. 36-48 (Berlin: Springer).
    • (2002) Pattern Detection and Discovery, ESF Exploratory Workshop , pp. 36-48
    • Bolton, R.J.1    Hand, D.J.2    Adams, N.M.3
  • 16
    • 0025090128 scopus 로고
    • Some sources of error in the coding of birth weight
    • Brunskill, A. J. (1990) Some sources of error in the coding of birth weight, American Journal of Public Health, 80, pp. 72-73.
    • (1990) American Journal of Public Health , vol.80 , pp. 72-73
    • Brunskill, A.J.1
  • 18
    • 0008643229 scopus 로고    scopus 로고
    • Approximations for the distribution and the moments of discrete scan statistics
    • J. Glaz & N. Balakrishnan (Eds) (Boston, MA: Birkhäuser)
    • Chen, J. & Glaz, J. (1999) Approximations for the distribution and the moments of discrete scan statistics, in: J. Glaz & N. Balakrishnan (Eds) Scan Statistics and Applications, pp. 27-66 (Boston, MA: Birkhäuser).
    • (1999) Scan Statistics and Applications , pp. 27-66
    • Chen, J.1    Glaz, J.2
  • 22
    • 0022662794 scopus 로고
    • Extreme value distributions for the largest cube in random lattice
    • Darling, R. W. R. & Waterman, M. S. (1986) Extreme value distributions for the largest cube in random lattice, SIAM Journal of Applied Mathematics, 46, pp. 118-132.
    • (1986) SIAM Journal of Applied Mathematics , vol.46 , pp. 118-132
    • Darling, R.W.R.1    Waterman, M.S.2
  • 23
    • 0018962004 scopus 로고
    • The quality and completeness of birthweight and gestational age data in computerized birth files
    • David, R. J. (1980) The quality and completeness of birthweight and gestational age data in computerized birth files, American Journal of Public Health, 70, pp. 964-973.
    • (1980) American Journal of Public Health , vol.70 , pp. 964-973
    • David, R.J.1
  • 24
    • 0039747575 scopus 로고    scopus 로고
    • A comparative study of perinatal mortality using a two-component normal mixture
    • D. A. Berry & D. K. Stangl (Eds) (New York: Marcel Dekker)
    • Dellaportas, P., Stephens, D. A., Smith, A. F. M. & Guttman, I. (1996) A comparative study of perinatal mortality using a two-component normal mixture, in: D. A. Berry & D. K. Stangl (Eds) Bayesian Biostatistics, pp. 601-615 (New York: Marcel Dekker).
    • (1996) Bayesian Biostatistics , pp. 601-615
    • Dellaportas, P.1    Stephens, D.A.2    Smith, A.F.M.3    Guttman, I.4
  • 29
    • 0038931405 scopus 로고    scopus 로고
    • Bayesian data mining in large frequency tables, with an application to the FDA spontaneous reporting system
    • (with discussion)
    • DuMouchel, W. (1999) Bayesian data mining in large frequency tables, with an application to the FDA spontaneous reporting system (with discussion), The American Statistician, 53, pp. 177-202.
    • (1999) The American Statistician , vol.53 , pp. 177-202
    • DuMouchel, W.1
  • 42
    • 0041111360 scopus 로고    scopus 로고
    • Statistics and the theory of measurement
    • (with discussion)
    • Hand, D. J. (1996) Statistics and the theory of measurement (with discussion), Journal of the Royal Statistical Society, Series A 159, pp. 445-492.
    • (1996) Journal of the Royal Statistical Society, Series A , vol.159 , pp. 445-492
    • Hand, D.J.1
  • 44
    • 0041299747 scopus 로고    scopus 로고
    • Data mining - Reaching beyond statistics
    • Hand, D. J. (1998a) Data mining - reaching beyond statistics, Research in Official Statistics, 2, pp. 5-17.
    • (1998) Research in Official Statistics , vol.2 , pp. 5-17
    • Hand, D.J.1
  • 45
    • 0032367976 scopus 로고    scopus 로고
    • Data mining: Statistics and more?
    • Hand, D. J. (1998b) Data mining: statistics and more?, The American Statistician, 52, pp. 112-118.
    • (1998) The American Statistician , vol.52 , pp. 112-118
    • Hand, D.J.1
  • 46
    • 0040625530 scopus 로고    scopus 로고
    • Breaking misconceptions - Statistics and its relationship to mathematics
    • (with discussion) 245-250 and
    • Hand, D. J. (1998c) Breaking misconceptions - statistics and its relationship to mathematics (with discussion), Journal of the Royal Statistical Society, Series D, 47, pp. 245-250 and 284-286.
    • (1998) Journal of the Royal Statistical Society, Series D , vol.47 , pp. 284-286
    • Hand, D.J.1
  • 47
    • 0008593610 scopus 로고    scopus 로고
    • Methodological issues in data mining
    • J. G. Bethlehem & P. G. M. van der Heijden (Eds) (Berlin: Physica-Verlag)
    • Hand, D. J. (2000) Methodological issues in data mining, in: J. G. Bethlehem & P. G. M. van der Heijden (Eds) COMPSTAT 2000: Proceedings in Computational Statistics, pp. 77-85 (Berlin: Physica-Verlag).
    • (2000) COMPSTAT 2000: Proceedings in Computational Statistics , pp. 77-85
    • Hand, D.J.1
  • 48
    • 0000264058 scopus 로고    scopus 로고
    • Defining attributes for scorecard construction
    • Hand, D. J. & Adams, N. M. (2000) Defining attributes for scorecard construction, Journal of Applied Statistics, 27, pp. 527-540.
    • (2000) Journal of Applied Statistics , vol.27 , pp. 527-540
    • Hand, D.J.1    Adams, N.M.2
  • 50
    • 0035528674 scopus 로고    scopus 로고
    • Idiot's Bayes - Not so stupid after all?
    • Hand, D. J. & Yu, K. (2001) Idiot's Bayes - not so stupid after all? International Statistical Review, 69, pp. 385-398.
    • (2001) International Statistical Review , vol.69 , pp. 385-398
    • Hand, D.J.1    Yu, K.2
  • 52
    • 8744230140 scopus 로고    scopus 로고
    • A note on confidence and support
    • Technical Report, Department of Mathematics, Imperial College, London
    • Hand, D. J., Blunt, G. & Bolton, R. J. (2001a) A note on confidence and support, Technical Report, Department of Mathematics, Imperial College, London.
    • (2001)
    • Hand, D.J.1    Blunt, G.2    Bolton, R.J.3
  • 56
    • 0002784345 scopus 로고    scopus 로고
    • Algorithms for association rule mining - A general survey and comparison
    • Hipp, J., Guntzer, U. & Nakhaeizadeh, G. (2000) Algorithms for association rule mining - a general survey and comparison, SIGKDD Explorations, 2, pp. 58-64.
    • (2000) SIGKDD Explorations , vol.2 , pp. 58-64
    • Hipp, J.1    Guntzer, U.2    Nakhaeizadeh, G.3
  • 57
    • 0000314341 scopus 로고
    • A simpler expression for kth nearest neighbour coincidence probabilities
    • Huntingdon, R. & Naus, J. (1975) A simpler expression for kth nearest neighbour coincidence probabilities, Annals of Probability, 3, pp. 894-896.
    • (1975) Annals of Probability , vol.3 , pp. 894-896
    • Huntingdon, R.1    Naus, J.2
  • 60
    • 0345872001 scopus 로고    scopus 로고
    • Cluster detection in databases: The adaptive matched filter algorithm and implementation
    • Kepner, J. & Kim, R. (2003) Cluster detection in databases: the adaptive matched filter algorithm and implementation, Data Mining and Knowledge Discovery, 7, 57-79.
    • (2003) Data Mining and Knowledge Discovery , vol.7 , pp. 57-79
    • Kepner, J.1    Kim, R.2
  • 62
    • 0029055093 scopus 로고
    • Spatial disease clusters: Detection and inference
    • Kulldorff, M. & Nagarwalla, N. (1995) Spatial disease clusters: detection and inference, Statistics in Medicine, 14, pp. 799-810.
    • (1995) Statistics in Medicine , vol.14 , pp. 799-810
    • Kulldorff, M.1    Nagarwalla, N.2
  • 63
    • 0027912333 scopus 로고
    • Discovering subtle sequence signals: A Gibbs sampling strategy for multiple alignment
    • Lawrence, C. E., Altschul, S. F., Boguski, M. S., Liu, J. S., Neuwald, A. F. & Wootton, J. C. (1993) Discovering subtle sequence signals: a Gibbs sampling strategy for multiple alignment, Science, 262, pp. 208-214.
    • (1993) Science , vol.262 , pp. 208-214
    • Lawrence, C.E.1    Altschul, S.F.2    Boguski, M.S.3    Liu, J.S.4    Neuwald, A.F.5    Wootton, J.C.6
  • 64
    • 0040470867 scopus 로고
    • Die 'Konfigurationsfrequnzanalyse' als Klassifikationsmethode in der klinischen Psychologie
    • Presented at the 26th Kongress der Deutschen Gesellschaft für Psychologie in Tubingen, 1968
    • Lienert, G. A. (1969) Die 'Konfigurationsfrequnzanalyse' als Klassifikationsmethode in der klinischen Psychologie. Presented at the 26th Kongress der Deutschen Gesellschaft für Psychologie in Tubingen, 1968.
    • (1969)
    • Lienert, G.A.1
  • 66
    • 0002484986 scopus 로고
    • Stock market prices do not follow random walks: Evidence from a simple specification test
    • Lo, A. W. & MacKinlay, A. C. (1988) Stock market prices do not follow random walks: evidence from a simple specification test, Review of Financial Studies, 1, pp. 41-66.
    • (1988) Review of Financial Studies , vol.1 , pp. 41-66
    • Lo, A.W.1    MacKinlay, A.C.2
  • 68
    • 0031236412 scopus 로고    scopus 로고
    • A suboptimal lossy data compression based on approximate pattern matching
    • Luczak, T. & Szpankowski, W. (1997) A suboptimal lossy data compression based on approximate pattern matching, IEEE Transactions on Information Theory, 43, pp. 1439-1451.
    • (1997) IEEE Transactions on Information Theory , vol.43 , pp. 1439-1451
    • Luczak, T.1    Szpankowski, W.2
  • 71
    • 0001259364 scopus 로고
    • A review of methods for the statistical analysis of spatial patterns of disease
    • Marshall, R. J. (1991) A review of methods for the statistical analysis of spatial patterns of disease, Journal of the Royal Statistical Society, 154, pp. 421-441.
    • (1991) Journal of the Royal Statistical Society , vol.154 , pp. 421-441
    • Marshall, R.J.1
  • 75
    • 0012369154 scopus 로고
    • A community study of the relationship between birth weight and gestational age
    • Clinics in Developmental Medicine Spastics Society Medical Education Unit
    • Neligan, G. (1965) A community study of the relationship between birth weight and gestational age, in Gestational Age, Size and Maturity. Clinics in Developmental Medicine Spastics Society Medical Education Unit, 19, pp. 28-32.
    • (1965) Gestational Age, Size and Maturity , vol.19 , pp. 28-32
    • Neligan, G.1
  • 76
    • 0033345672 scopus 로고    scopus 로고
    • Unexpectedness as a measure of interestingness in knowledge discovery
    • Padmanabhan, B. & Tuzhilin, A. (1999) Unexpectedness as a measure of interestingness in knowledge discovery, Decision Support Systems, 27, pp. 303-318.
    • (1999) Decision Support Systems , vol.27 , pp. 303-318
    • Padmanabhan, B.1    Tuzhilin, A.2
  • 79
    • 0039813920 scopus 로고    scopus 로고
    • Basic concepts of multiple tests - A survey
    • Pigeot, I. (2000) Basic concepts of multiple tests - a survey, Statistical Papers, 41, pp. 3-36.
    • (2000) Statistical Papers , vol.41 , pp. 3-36
    • Pigeot, I.1
  • 81
    • 0031684427 scopus 로고    scopus 로고
    • Combinatorial pattern discovery in biological sequences
    • Rigoutsos, I. & Floratos, A. (1998) Combinatorial pattern discovery in biological sequences, Bioinformatics, 14, pp. 55-67.
    • (1998) Bioinformatics , vol.14 , pp. 55-67
    • Rigoutsos, I.1    Floratos, A.2
  • 82
    • 2542482294 scopus 로고    scopus 로고
    • Measures and tests of heaping in discrete quantitative distributions
    • Roberts, J. M. Jr & Brewer, D. D. (2001) Measures and tests of heaping in discrete quantitative distributions, Journal of Applied Statistics, 28, pp. 887-896.
    • (2001) Journal of Applied Statistics , vol.28 , pp. 887-896
    • Roberts Jr., J.M.1    Brewer, D.D.2
  • 89
    • 84956974614 scopus 로고    scopus 로고
    • WUM: A tool for Web Utilization Analysis
    • (Berlin: Springer-Verlag)
    • Spiliopoulou, M. & Faulstich, L. C. (1999) WUM: a tool for Web Utilization Analysis, Lecture Notes in Computer Science, 1590, pp. 184-203 (Berlin: Springer-Verlag).
    • (1999) Lecture Notes in Computer Science , vol.1590 , pp. 184-203
    • Spiliopoulou, M.1    Faulstich, L.C.2
  • 94
    • 8744313151 scopus 로고
    • The electroencephalogram
    • P. McGuffin, M. F. Shanks & R. J. Hodgson (Eds) (London: Grune and Stratton)
    • Toone, B. (1984) The electroencephalogram, in: P. McGuffin, M. F. Shanks & R. J. Hodgson (Eds) The Scientific Principles of Psychopathology, pp. 36-55 (London: Grune and Stratton).
    • (1984) The Scientific Principles of Psychopathology , pp. 36-55
    • Toone, B.1
  • 95
    • 0034651804 scopus 로고    scopus 로고
    • Statistical analysis of yeast genomic downstream sequences reveals putative polyadenylation signals
    • Van Helden, J., del Olmo, M. & Perez-Ortin, J. E. (2000) Statistical analysis of yeast genomic downstream sequences reveals putative polyadenylation signals, Nucleic Acids Research, 28, pp. 1000-1010.
    • (2000) Nucleic Acids Research , vol.28 , pp. 1000-1010
    • Van Helden, J.1    del Olmo, M.2    Perez-Ortin, J.E.3
  • 97
    • 0345856472 scopus 로고
    • Probabilities for the size of the largest clusters and smallest intervals
    • Wallenstein, S. & Naus, J. (1974) Probabilities for the size of the largest clusters and smallest intervals, Journal of the American Statistical Association, 69, pp. 690-697.
    • (1974) Journal of the American Statistical Association , vol.69 , pp. 690-697
    • Wallenstein, S.1    Naus, J.2
  • 101
    • 84958521275 scopus 로고    scopus 로고
    • Investigating temporal patterns of fault behaviour within large telephony networks
    • F. Hoffmann, D. J. Hand, N. Adams, D. Fisher & G. Guimaraes (Eds) (Portugal: Cascais)
    • Yearling, D. & Hand, D. J. (2001) Investigating temporal patterns of fault behaviour within large telephony networks, in: F. Hoffmann, D. J. Hand, N. Adams, D. Fisher & G. Guimaraes (Eds) Advances in Intelligent Data Analysis, pp. 340-349 (Portugal: Cascais).
    • (2001) Advances in Intelligent Data Analysis , pp. 340-349
    • Yearling, D.1    Hand, D.J.2
  • 102
    • 0002598545 scopus 로고    scopus 로고
    • From contingency tables to various forms of knowledge in databases
    • U. M. Fayyad, G. Piatetsky-Shapiro, P. Smyth & R. Uthurusamy (Eds) (Menlo Park, CA: AAAI Press)
    • Zembowicz, R. & Zytkow, J. (1996) From contingency tables to various forms of knowledge in databases, in: U. M. Fayyad, G. Piatetsky-Shapiro, P. Smyth & R. Uthurusamy (Eds) Advances in Knowledge Discovery and Data Mining, pp. 329-349 (Menlo Park, CA: AAAI Press).
    • (1996) Advances in Knowledge Discovery and Data Mining , pp. 329-349
    • Zembowicz, R.1    Zytkow, J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.