SCOPUS Information Retrieval Platform

Proceedings of the VLDB Endowment

Volume 10, Issue 7, 2017, Pages 757-768

Local search methods for k-means with outliers

(5) Gupta, Shalmoli a; Kumar, Ravi b; Lu, Kefu c; Moseley, Benjamin c; Vassilvitskii, Sergei b

a UNIVERSITY OF ILLINOIS AT URBANA CHAMPAIGN (United States)

b GOOGLE INC (United States)

c WASHINGTON UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

DATA HANDLING; HEURISTIC METHODS; LOCAL SEARCH (OPTIMIZATION); STATISTICS;

APPROXIMATE SOLUTION; CONSTANT FACTORS; EMPIRICAL EVALUATIONS; HEURISTIC APPROACH; K-MEANS CLUSTERING; LARGE DATASETS; LOCAL SEARCH; LOCAL SEARCH METHOD;

CLUSTERING ALGORITHMS;

EID: 85026321311
PISSN: None
EISSN: 21508097
Source Type: Conference Proceeding
DOI: 10.14778/3067421.3067425
Document Type: Conference Paper

Times cited : (130)

References (39)

1
- 70449722914
- Adaptive sampling for k-means clustering
- A. Aggarwal, A. Deshpande, and R. Kannan. Adaptive sampling for k-means clustering. In RANDOM, pages 15-28, 2009.
- (2009) RANDOM , pp. 15-28
- Aggarwal, A.¹ Deshpande, A.² Kannan, R.³

2
- 0034832620
- Outlier detection for high dimensional data
- C. Aggarwal and P. Yu. Outlier detection for high dimensional data. In SIGMOD, pages 37-46, 2001.
- (2001) SIGMOD , pp. 37-46
- Aggarwal, C.¹ Yu, P.²

3
- 84872905210
- Data Clustering: Algorithms and Applications
- C. C. Aggarwal and C. K. Reddy. Data Clustering: Algorithms and Applications. Chapman and Hall/CRC, 2013.
- (2013) Data Clustering: Algorithms and Applications
- Aggarwal, C.C.¹ Reddy, C.K.²

4
- 85039571873
- A linear method for deviation detection in large databases
- A. Arning, R. Agrawal, and P. Raghavan. A linear method for deviation detection in large databases. In KDD, pages 164-169, 1996.
- (1996) KDD , pp. 164-169
- Arning, A.¹ Agrawal, R.² Raghavan, P.³

5
- 84969135721
- k-means++: the advantages of careful seeding
- D. Arthur and S. Vassilvitskii. k-means++: the advantages of careful seeding. In SODA, pages 1027-1035, 2007.
- (2007) SODA , pp. 1027-1035
- Arthur, D.¹ Vassilvitskii, S.²

6
- 84863760691
- Scalable k-means++
- B. Bahmani, B. Moseley, A. Vattani, R. Kumar, and S. Vassilvitskii. Scalable k-means++. PVLDB, 5(7):622-633, 2012.
- (2012) PVLDB , vol.5 , Issue.7 , pp. 622-633
- Bahmani, B.¹ Moseley, B.² Vattani, A.³ Kumar, R.⁴ Vassilvitskii, S.⁵

7
- 77952380096
- Mining distance-based outliers in near linear time with randomization and a simple pruning rule
- S. Bay and M. Schwabacher. Mining distance-based outliers in near linear time with randomization and a simple pruning rule. In KDD, pages 29-38, 2003.
- (2003) KDD , pp. 29-38
- Bay, S.¹ Schwabacher, M.²

8
- 84892062680
- Survey of clustering data mining techniques
- J. Kogan, C. K. Nicholas, and M. Teboulle, editors. Springer
- P. Berkhin. Survey of clustering data mining techniques. In J. Kogan, C. K. Nicholas, and M. Teboulle, editors, Grouping Multidimensional Data: Recent Advances in Clustering. Springer, 2006.
- (2006) Grouping Multidimensional Data: Recent Advances in Clustering
- Berkhin, P.¹

9
- 57149146298
- Outlier-robust clustering using independent components
- C. Bohm, C. Faloutsos, and C. Plant. Outlier-robust clustering using independent components. In SIGMOD, pages 185-198, 2008.
- (2008) SIGMOD , pp. 185-198
- Bohm, C.¹ Faloutsos, C.² Plant, C.³

10
- 0039253819
- LOF: Identifying density-based local outliers
- M. M. Breunig, H.-P. Kriegel, R. T. Ng, and J. Sander. LOF: Identifying density-based local outliers. SIGMOD Record, 29(2):93-104, 2000.
- (2000) SIGMOD Record , vol.29 , Issue.2 , pp. 93-104
- Breunig, M.M.¹ Kriegel, H.-P.² Ng, R.T.³ Sander, J.⁴

11
- 26944440987
- Algorithms for facility location problems with outliers
- M. Charikar, S. Khuller, D. M. Mount, and G. Narasimhan. Algorithms for facility location problems with outliers. In SODA, pages 642-651, 2001.
- (2001) SODA , pp. 642-651
- Charikar, M.¹ Khuller, S.² Mount, D.M.³ Narasimhan, G.⁴

12
- 84960498671
- k-means-: A unified approach to clustering and outlier detection
- S. Chawla and A. Gionis. k-means-: A unified approach to clustering and outlier detection. In ICDM, pages 189-197, 2013.
- (2013) ICDM , pp. 189-197
- Chawla, S.¹ Gionis, A.²

13
- 51849153520
- A constant factor approximation algorithm for k-median clustering with outliers
- K. Chen. A constant factor approximation algorithm for k-median clustering with outliers. In SODA, pages 826-835, 2008.
- (2008) SODA , pp. 826-835
- Chen, K.¹

14
- 85170282443
- A density-based algorithm for discovering clusters in large spatial databases with noise
- M. Ester, H.-P. Kriegel, J. Sander, and X. Xu. A density-based algorithm for discovering clusters in large spatial databases with noise. In AAAI, pages 226-231, 1996.
- (1996) AAAI , pp. 226-231
- Ester, M.¹ Kriegel, H.-P.² Sander, J.³ Xu, X.⁴

15
- 0002679222
- Scalability for clustering algorithms revisited
- F. Farnstrom, J. Lewis, and C. Elkan. Scalability for clustering algorithms revisited. SIGKDD Explor. Newsl., 2:51-57, 2000.
- (2000) SIGKDD Explor. Newsl. , vol.2 , pp. 51-57
- Farnstrom, F.¹ Lewis, J.² Elkan, C.³

16
- 35348830377
- A PTAS for k-means clustering based on weak coresets
- D. Feldman, M. Monemizadeh, and C. Sohler. A PTAS for k-means clustering based on weak coresets. In SOCG, pages 11-18, 2007.
- (2007) SOCG , pp. 11-18
- Feldman, D.¹ Monemizadeh, M.² Sohler, C.³

17
- 0038633423
- Clustering data streams: Theory and practice
- S. Guha, A. Meyerson, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams: Theory and practice. TKDE, 15(3):515-528, 2003.
- (2003) TKDE , vol.15 , Issue.3 , pp. 515-528
- Guha, S.¹ Meyerson, A.² Mishra, N.³ Motwani, R.⁴ O'Callaghan, L.⁵

18
- 0032091595
- CURE: An efficient clustering algorithm for large databases
- S. Guha, R. Rastogi, and K. Shim. CURE: An efficient clustering algorithm for large databases. In SIGMOD, pages 73-84, 1998.
- (1998) SIGMOD , pp. 73-84
- Guha, S.¹ Rastogi, R.² Shim, K.³

19
- 19944363201
- Research issues in automatic database clustering
- S. Guinepain and L. Gruenwald. Research issues in automatic database clustering. SIGMOD Record, 34(1):33-38, 2005.
- (2005) SIGMOD Record , vol.34 , Issue.1 , pp. 33-38
- Guinepain, S.¹ Gruenwald, L.²

20
- 77954912225
- Simpler analyses of local search algorithms for facility location
- 0809.2554
- A. Gupta and K. Tangwongsan. Simpler analyses of local search algorithms for facility location. CoRR, abs/0809.2554, 2008.
- (2008) CoRR
- Gupta, A.¹ Tangwongsan, K.²

21
- 33846811413
- Smaller coresets for k-median and k-means clustering
- S. Har-Peled and A. Kushal. Smaller coresets for k-median and k-means clustering. Discrete & Computational Geometry, 37(1):3-19, 2007.
- (2007) Discrete & Computational Geometry , vol.37 , Issue.1 , pp. 3-19
- Har-Peled, S.¹ Kushal, A.²

22
- 85140527321
- An efficient approach to clustering in large multimedia databases with noise
- A. Hinneburg and D. A. Keim. An efficient approach to clustering in large multimedia databases with noise. In KDD, pages 58-65, 1998.
- (1998) KDD , pp. 58-65
- Hinneburg, A.¹ Keim, D.A.²

23
- 84893405732
- Data clustering: A review
- A. K. Jain, M. N. Murty, and P. J. Flynn. Data clustering: A review. ACM Computing Surveys, 31:264-323, 1999.
- (1999) ACM Computing Surveys , vol.31 , pp. 264-323
- Jain, A.K.¹ Murty, M.N.² Flynn, P.J.³

24
- 2442683961
- A local search approximation algorithm for k-means clustering
- T. Kanungo, D. M. Mount, N. S. Netanyahu, C. D. Piatko, R. Silverman, and A. Y. Wu. A local search approximation algorithm for k-means clustering. Comput. Geom., 28(2-3):89-112, 2004.
- (2004) Comput. Geom. , vol.28 , Issue.2-3 , pp. 89-112
- Kanungo, T.¹ Mount, D.M.² Netanyahu, N.S.³ Piatko, C.D.⁴ Silverman, R.⁵ Wu, A.Y.⁶

25
- 0003858566
- Algorithms for mining distance-based outliers in large datasets
- E. Knorr and R. Ng. Algorithms for mining distance-based outliers in large datasets. In VLDB, pages 392-403, 1998.
- (1998) VLDB , pp. 392-403
- Knorr, E.¹ Ng, R.²

26
- 0003203996
- A unified notion of outliers: Properties and computation
- E. M. Knorr and R. T. Ng. A unified notion of outliers: Properties and computation. In KDD, pages 19-22, 1997.
- (1997) KDD , pp. 19-22
- Knorr, E.M.¹ Ng, R.T.²

27
- 11244288693
- A simple linear time (1+ε)-approximation algorithm for k-means clustering in any dimensions
- A. Kumar, Y. Sabharwal, and S. Sen. A simple linear time (1+ε)-approximation algorithm for k-means clustering in any dimensions. In FOCS, pages 454-462, 2004.
- (2004) FOCS , pp. 454-462
- Kumar, A.¹ Sabharwal, Y.² Sen, S.³

28
- 84886567160
- M. Lichman. UCI machine learning repository, 2013.
- (2013) UCI machine learning repository
- Lichman, M.¹

29
- 0020102027
- Least squares quantization in PCM
- S. P. Lloyd. Least squares quantization in PCM. IEEE Transactions on Information Theory, 28(2):129-136, 1982.
- (1982) IEEE Transactions on Information Theory , vol.28 , Issue.2 , pp. 129-136
- Lloyd, S.P.¹

30
- 0012905555
- Finding intensional knowledge of distance-based outliers
- E. M. Knorr and R. T. Ng. Finding intensional knowledge of distance-based outliers. In VLDB, pages 211-222, 1999.
- (1999) VLDB , pp. 211-222
- Knorr, E.M.¹ Ng, R.T.²

31
- 84965161129
- Fast distributed k-center clustering with outliers on massive data
- G. Malkomes, M. Kusner, W. Chen, K. Weinberger, and B. Moseley. Fast distributed k-center clustering with outliers on massive data. In NIPS, pages 1063-1071, 2015.
- (2015) NIPS , pp. 1063-1071
- Malkomes, G.¹ Kusner, M.² Chen, W.³ Weinberger, K.⁴ Moseley, B.⁵

32
- 51849117754
- Streaming algorithms for k-center clustering with outliers and with anonymity
- R. M. McCutchen and S. Khuller. Streaming algorithms for k-center clustering with outliers and with anonymity. In APPROX, pages 165-178, 2008.
- (2008) APPROX , pp. 165-178
- McCutchen, R.M.¹ Khuller, S.²

33
- 85008009667
- Integrating k-means clustering with a relational DBMS using SQL
- C. Ordonez. Integrating k-means clustering with a relational DBMS using SQL. TKDE, 18:188-201, 2006.
- (2006) TKDE , vol.18 , pp. 188-201
- Ordonez, C.¹

34
- 0039845361
- SQLEM: Fast clustering in SQL using the EM algorithm
- C. Ordonez and P. Cereghini. SQLEM: Fast clustering in SQL using the EM algorithm. In SIGMOD, pages 559-570, 2000.
- (2000) SIGMOD , pp. 559-570
- Ordonez, C.¹ Cereghini, P.²

35
- 4344647570
- Efficient disk-based k-means clustering for relational databases
- C. Ordonez and E. Omiecinski. Efficient disk-based k-means clustering for relational databases. TKDE, 16:909-921, 2004.
- (2004) TKDE , vol.16 , pp. 909-921
- Ordonez, C.¹ Omiecinski, E.²

36
- 84937938726
- On integrated clustering and outlier detection
- L. Ott, L. Pang, F. T. Ramos, and S. Chawla. On integrated clustering and outlier detection. In NIPS, pages 1359-1367, 2014.
- (2014) NIPS , pp. 1359-1367
- Ott, L.¹ Pang, L.² Ramos, F.T.³ Chawla, S.⁴

37
- 0345359208
- LOCI: Fast outlier detection using the local correlation integral
- S. Papadimitriou, H. Kitagawa, P. Gibbons, and C. Faloutsos. LOCI: Fast outlier detection using the local correlation integral. In ICDE, pages 315-326, 2003.
- (2003) ICDE , pp. 315-326
- Papadimitriou, S.¹ Kitagawa, H.² Gibbons, P.³ Faloutsos, C.⁴

38
- 0039845384
- Efficient algorithms for mining outliers from large data sets
- S. Ramaswamy, R. Rastogi, and K. Shim. Efficient algorithms for mining outliers from large data sets. In SIGMOD, pages 427-438, 2000.
- (2000) SIGMOD , pp. 427-438
- Ramaswamy, S.¹ Rastogi, R.² Shim, K.³

39
- 37549018049
- Top 10 algorithms in data mining
- X. Wu, V. Kumar, J. Ross Quinlan, J. Ghosh, Q. Yang, H. Motoda, G. J. McLachlan, A. Ng, B. Liu, P. S. Yu, Z.-H. Zhou, M. Steinbach, D. J. Hand, and D. Steinberg. Top 10 algorithms in data mining. Knowl. Inf. Syst., 14:1-37, 2007.
- (2007) Knowl. Inf. Syst. , vol.14 , pp. 1-37
- Wu, X.¹ Kumar, V.² Ross Quinlan, J.³ Ghosh, J.⁴ Yang, Q.⁵ Motoda, H.⁶ McLachlan, G.J.⁷ Ng, A.⁸ Liu, B.⁹ Yu, P.S.¹⁰ Zhou, Z.-H.¹¹ Steinbach, M.¹² Hand, D.J.¹³ Steinberg, D.¹⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.