SCOPUS information retrieval platform

Volume 35, Issue 1, 2013, Pages

An optimized sparse approximate matrix multiply for matrices with decay

a LOS ALAMOS NATIONAL LABORATORY (United States)

Author keywords

Matrices with decay; N body; Quantum chemistry; Reduced complexity algorithm; SpAMM; Sparse approximate matrix multiply; Sparse linear algebra

Indexed keywords

COMPUTATIONAL COMPLEXITY; QUANTUM CHEMISTRY;

COMPLEXITY ALGORITHMS; MATRIX; MATRIX MULTIPLY; MATRIX WITH DECAY; N-BODY; REDUCED COMPLEXITY ALGORITHM; REDUCED-COMPLEXITY; SPARSE APPROXIMATE MATRIX MULTIPLY; SPARSE LINEAR ALGEBRA;

MATRIX ALGEBRA;

EID: 84876211167 PISSN: 10648275 EISSN: 10957200 Source Type: Journal
DOI: 10.1137/120870761 Document Type: Article

Times cited : (31)

References (154)

1
- 9344227702
- Experiments with quadtree representation of matrices
- P. Gianni, ed., Lecture Notes in Comput. Sci., Springer, Berlin
- S. K. Abdali and D. S. Wise, Experiments with quadtree representation of matrices, in Symbolic and Algebraic Computation, P. Gianni, ed., Lecture Notes in Comput. Sci. 358, Springer, Berlin, 1989, pp. 96-108.
- (1989) Symbolic and Algebraic Computation , vol.358 , pp. 96-108
- Abdali, S.K.¹ Wise, D.S.²

2
- 34547509407
- Seven at one stroke: Results from a cache-oblivious paradigm for scalable matrix algorithms
- New York, ACM
- M. D. Adams and D. S. Wise, Seven at one stroke: Results from a cache-oblivious paradigm for scalable matrix algorithms, in Proceedings of the 2006 Workshop on Memory System Performance and Correctness, MSPC '06, New York, 2006, ACM, pp. 41-50.
- (2006) Proceedings of the 2006 Workshop on Memory System Performance and Correctness, MSPC '06 , pp. 41-50
- Adams, M.D.¹ Wise, D.S.²

3
- 0031379633
- Parallel domain decomposition and load balancing using space-filling curves
- S. Aluru and F. E. Sevilgen, Parallel domain decomposition and load balancing using space-filling curves, in Proceedings of the 4th IEEE Conference on High Performance Computing, 1997, pp. 230-235.
- (1997) Proceedings of the 4th IEEE Conference on High Performance Computing , pp. 230-235
- Aluru, S.¹ Sevilgen, F.E.²

4
- 70349090119
- Faster join-projects and sparse matrix multiplications
- ACM
- R. R. Amossen and R. Pagh, Faster join-projects and sparse matrix multiplications, in Proceedings of the 12th International Conference on Database Theory, ACM, 2009, pp. 121-126.
- (2009) Proceedings of the 12th International Conference on Database Theory , pp. 121-126
- Amossen, R.R.¹ Pagh, R.²

5
- 84876235136
- New trends in collision detection performance
- Laval, France, S. Richir and A. Shirai, ed.
- Q. Avril, V. Gouranton, and B. Arnaldi, New trends in collision detection performance, in VRIC'09 Proceedings, vol. 11, Laval, France, S. Richir and A. Shirai, ed., 2009, pp. 53-62.
- (2009) VRIC'09 Proceedings , vol.11 , pp. 53-62
- Avril, Q.¹ Gouranton, V.² Arnaldi, B.³

6
- 33747416213
- The efficacy of software prefetching and locality optimizations on future memory systems
- A. H. Badawy, A. Aggarwal, D. Yeung, and C. W. Tseng, The efficacy of software prefetching and locality optimizations on future memory systems, J. Instruction-Level Parallelism, 6 (2004).
- (2004) J. Instruction-Level Parallelism , vol.6
- Badawy, A.H.¹ Aggarwal, A.² Yeung, D.³ Tseng, C.W.⁴

7
- 45449120592
- Hardware-oriented implementation of cache oblivious matrix operations based on space-filling curves
- R. Wyrzykowski, J. Dongarra, K. Karczewski, and J. Wasniewski, eds., Lecture Notes in Comput. Sci., Springer, Berlin
- M. Bader, R. Franz, S. Guenther, and A. Heinecke, Hardware-oriented implementation of cache oblivious matrix operations based on space-filling curves, in Parallel Processing and Applied Mathematics, 7th International Conference, PPAM 2007, R. Wyrzykowski, J. Dongarra, K. Karczewski, and J. Wasniewski, eds., Lecture Notes in Comput. Sci. 4967, Springer, Berlin, 2008, p. 628.
- (2008) Parallel Processing and Applied Mathematics, 7th International Conference, PPAM 2007 , vol.4967 , pp. 628
- Bader, M.¹ Franz, R.² Guenther, S.³ Heinecke, A.⁴

8
- 79960683667
- Performance analysis on multicore system using PAPI
- E. H. Barragan and J. J. Steves, Performance analysis on multicore system using PAPI, in Proceedings of the 6th Colombian Computing Congress (CCC), 2011, pp. 1-5.
- (2011) Proceedings of the 6th Colombian Computing Congress (CCC) , pp. 1-5
- Barragan, E.H.¹ Steves, J.J.²

9
- 0037252817
- A new error estimate of the fast Gauss transform
- B. J. C. Baxter and G. Roussos, A new error estimate of the fast Gauss transform, SIAM J. Sci. Comput., 24 (2002), pp. 257-259.
- (2002) SIAM J. Sci. Comput. , vol.24 , pp. 257-259
- Baxter, B.J.C.¹ Roussos, G.²

10
- 9344228838
- Ph.D. thesis, Indiana University
- P. Beckman, Parallel LU Decomposition for Sparse Matrices Using Quadtrees on a Shared-Heap Multiprocessor, Ph.D. thesis, Indiana University, 1993.
- (1993) Parallel LU Decomposition for Sparse Matrices Using Quadtrees on a Shared-Heap Multiprocessor
- Beckman, P.¹

11
- 0000843403
- Three-dimensional adaptive mesh refinement for hyperbolic conservation laws
- J. Bell, M. J. Berger, J. Saltzman, and M. Welcome, Three-dimensional adaptive mesh refinement for hyperbolic conservation laws, SIAM J. Sci. Comput., 15 (1994), pp. 127-138.
- (1994) SIAM J. Sci. Comput. , vol.15 , pp. 127-138
- Bell, J.¹ Berger, M.J.² Saltzman, J.³ Welcome, M.⁴

12
- 84880559267
- arXiv:1203.3953 [math.NA]
- M. Benzi, P. Boito, and N. Razouk, Decay Properties of Spectral Projectors with Applications to Electronic Structure, arXiv:1203.3953 [math.NA], 2012.
- (2012) Decay Properties of Spectral Projectors with Applications to Electronic Structure
- Benzi, M.¹ Boito, P.² Razouk, N.³

13
- 0043044851
- Bounds for the entries of matrix functions with applications to preconditioning
- M. Benzi and G. H. Golub, Bounds for the entries of matrix functions with applications to preconditioning, BIT, 39 (1999), pp. 417-438.
- (1999) BIT , vol.39 , pp. 417-438
- Benzi, M.¹ Golub, G.H.²

14
- 54549085632
- Decay bounds and O(n) algorithms for approximating functions of sparse matrices
- M. Benzi and N. Razouk, Decay bounds and O(n) algorithms for approximating functions of sparse matrices, Electron. Trans. Numer. Anal., 28 (2007), pp. 16-39.
- (2007) Electron. Trans. Numer. Anal. , vol.28 , pp. 16-39
- Benzi, M.¹ Razouk, N.²

15
- 18844471650
- Orderings for factorized sparse approximate inverse preconditioners
- M. Benzi and M. Tuma, Orderings for factorized sparse approximate inverse preconditioners, SIAM J. Sci. Comput., 21 (2000), pp. 1851-1868.
- (2000) SIAM J. Sci. Comput. , vol.21 , pp. 1851-1868
- Benzi, M.¹ Tuma, M.²

16
- 11744289966
- Local adaptive mesh refinement for shock hydrodynamics
- M. J. Berger and P. Colella, Local adaptive mesh refinement for shock hydrodynamics, J. Comput. Phys., 82 (1989), pp. 64-84.
- (1989) J. Comput. Phys. , vol.82 , pp. 64-84
- Berger, M.J.¹ Colella, P.²

17
- 48749141209
- Adaptive mesh refinement for hyperbolic partial differential equations
- M. J. Berger and J. Oliger, Adaptive mesh refinement for hyperbolic partial differential equations, J. Comput. Phys., 53 (1984), pp. 484-512.
- (1984) J. Comput. Phys. , vol.53 , pp. 484-512
- Berger, M.J.¹ Oliger, J.²

18
- 0003018688
- Stability of fast algorithms for matrix multiplication
- D. Bini and G. Lotti, Stability of fast algorithms for matrix multiplication, Numer. Math., 36 (1980), pp. 63-72.
- (1980) Numer. Math. , vol.36 , pp. 63-72
- Bini, D.¹ Lotti, G.²

19
- 85022109307
- Hierarchical visibility culling with occlusion trees
- J. Bittner, V. Havran, and P. Slavik, Hierarchical visibility culling with occlusion trees, in Proceedings, Computer Graphics International, 1998, pp. 207-219.
- (1998) Proceedings, Computer Graphics International , pp. 207-219
- Bittner, J.¹ Havran, V.² Slavik, P.³

20
- 84876241972
- N. Bock, M. Challacombe, C. K. Gan, G. Henkelman, K. Nemeth, A. M. N. Niklasson, A. Odell, E. Schwegler, C. J. Tymczak, and V. Weber, FreeON: A Suite of Programs for Linear Scaling Quantum Chemistry, http://www.freeon.org (2011).
- (2011) FreeON: A Suite of Programs for Linear Scaling Quantum Chemistry
- Bock, N.¹ Challacombe, M.² Gan, C.K.³ Henkelman, G.⁴ Nemeth, K.⁵ Niklasson, A.M.N.⁶ Odell, A.⁷ Schwegler, E.⁸ Tymczak, C.J.⁹ Weber, V.¹⁰

21
- 0037171094
- Recent progress in linear scaling ab initio electronic structure techniques
- D. R. Bowler, T. Miyazaki, and M. J. Gillan, Recent progress in linear scaling ab initio electronic structure techniques, J. Phys. Condensed Matter, 14 (2002), pp. 2781-2798.
- (2002) J. Phys. Condensed Matter , vol.14 , pp. 2781-2798
- Bowler, D.R.¹ Miyazaki, T.² Gillan, M.J.³

22
- 84876225062
- arXiv:1108.5976 [cond-mat.mtrl-sci]
- D. R. Bowler and T. Miyazaki, O(N) Methods in Electronic Structure Calculations, arXiv:1108.5976 [cond-mat.mtrl-sci], 2011.
- (2011) O(N) Methods in Electronic Structure Calculations
- Bowler, D.R.¹ Miyazaki, T.²

23
- 55849139091
- Challenges and advances in parallel sparse matrix-matrix multiplication
- Washington, DC, IEEE Computer Society
- A. Buluç and J. R. Gilbert, Challenges and advances in parallel sparse matrix-matrix multiplication, in ICPP '08: Proceedings of the 2008 37th International Conference on Parallel Processing, Washington, DC, IEEE Computer Society, 2008, pp. 503-510.
- (2008) ICPP '08: Proceedings of the 2008 37th International Conference on Parallel Processing , pp. 503-510
- Buluç, A.¹ Gilbert, J.R.²

24
- 51049121603
- On the representation and multiplication of hypersparse matrices
- A. Buluç and J. R. Gilbert, On the representation and multiplication of hypersparse matrices, in IEEE International Symposium on Parallel and Distributed Processing, 2008, pp. 1-11.
- (2008) IEEE International Symposium on Parallel and Distributed Processing , pp. 1-11
- Buluç, A.¹ Gilbert, J.R.²

25
- 84876222280
- arXiv:1109.3739
- A. Buluç and J. R. Gilbert, Parallel Sparse Matrix-Matrix Multiplication and Indexing: Implementation and Experiments, arXiv:1109.3739, 2011.
- (2011) Parallel Sparse Matrix-Matrix Multiplication and Indexing: Implementation and Experiments
- Buluç, A.¹ Gilbert, J.R.²

26
- 0001222111
- A linear scaling method for Hartree-Fock exchange calculations of large molecules
- J. C. Burant, G. E. Scuseria, and M. J. Frisch, A linear scaling method for Hartree-Fock exchange calculations of large molecules, J. Chem. Phys., 105 (1996), pp. 8969-8972.
- (1996) J. Chem. Phys. , vol.105 , pp. 8969-8972
- Burant, J.C.¹ Scuseria, G.E.² Frisch, M.J.³

27
- 18144424983
- Department of Computer Science, Williams College
- P. M. Campbell, K. D. Devine, J. E. Flaherty, L. G. Gervasio, and J. D. Teresco, Dynamic Octree Load Balancing Using Space-Filling Curves, Technical report, Department of Computer Science, Williams College, 2003.
- (2003) Dynamic Octree Load Balancing Using Space-Filling Curves, Technical Report
- Campbell, P.M.¹ Devine, K.D.² Flaherty, J.E.³ Gervasio, L.G.⁴ Teresco, J.D.⁵

28
- 0003712293
- Ph.D. dissertation, Montana State University
- L. E. Cannon, A Cellular Computer to Implement the Kalman Filter Algorithm, Ph.D. dissertation, Montana State University, 1969.
- (1969) A Cellular Computer to Implement the Kalman Filter Algorithm
- Cannon, L.E.¹

29
- 84876234179
- arXiv:1011.3534 [cs.DS]
- M. Challacombe and N. Bock, Fast multiplication of matrices with decay, arXiv:1011.3534 [cs.DS], 2010.
- (2010) Fast Multiplication of Matrices with Decay
- Challacombe, M.¹ Bock, N.²

30
- 0037747633
- World Scientific, Singapore
- M. Challacombe, E. Schwegler, and J. Almlöf, Computational Chemistry: Review of Current Trends, World Scientific, Singapore, 1996, pp. 53-107.
- (1996) Computational Chemistry: Review of Current Trends , pp. 53-107
- Challacombe, M.¹ Schwegler, E.² Almlöf, J.³

31
- 0001027297
- Fast assembly of the Coulomb matrix: A quantum chemical tree code
- M. Challacombe, E. Schwegler, and J. Almlöf, Fast assembly of the Coulomb matrix: A quantum chemical tree code, J. Chem. Phys., 104 (1996), pp. 4685-4698.
- (1996) J. Chem. Phys. , vol.104 , pp. 4685-4698
- Challacombe, M.¹ Schwegler, E.² Almlöf, J.³

32
- 0039758422
- Modern developments in Hartree-Fock theory: Fast methods for computing the Coulomb matrix
- J. Leszczynski, ed., Computational Chemistry: Reviews of Current Trends, World Scientific, Singapore
- M. Challacombe, E. Schwegler, and J. Almlöf, Modern Developments in Hartree-Fock Theory: Fast Methods for Computing the Coulomb Matrix, in Computational Chemistry: Reviews of Current Trends, J. Leszczynski, ed., Computational Chemistry: Reviews of Current Trends, vol. 1, World Scientific, Singapore, 1996, pp. 53-107.
- (1996) Computational Chemistry: Reviews of Current Trends , vol.1 , pp. 53-107
- Challacombe, M.¹ Schwegler, E.² Almlöf, J.³

33
- 0010259404
- Linear scaling computation of the Fock matrix
- M. Challacombe and E. Schwegler, Linear scaling computation of the Fock matrix, J. Chem. Phys., 106 (1997), pp. 5526-5536.
- (1997) J. Chem. Phys. , vol.106 , pp. 5526-5536
- Challacombe, M.¹ Schwegler, E.²

34
- 0001514496
- A simplified density matrix minimization for linear scaling self-consistent field theory
- M. Challacombe, A simplified density matrix minimization for linear scaling self-consistent field theory, J. Chem. Phys., 110 (1999), pp. 2332-2342.
- (1999) J. Chem. Phys. , vol.110 , pp. 2332-2342
- Challacombe, M.¹

35
- 0034625292
- A general parallel sparse-blocked matrix multiply for linear scaling SCF theory
- M. Challacombe, A general parallel sparse-blocked matrix multiply for linear scaling SCF theory, Comput. Phys. Comm., 128 (2000), pp. 93-107.
- (2000) Comput. Phys. Comm. , vol.128 , pp. 93-107
- Challacombe, M.¹

36
- 0034506257
- Linear scaling computation of the Fock matrix. V. Hierarchical cubature for numerical integration of the exchange-correlation matrix
- M. Challacombe, Linear scaling computation of the Fock matrix. V. Hierarchical cubature for numerical integration of the exchange-correlation matrix, J. Chem. Phys., 113 (2000), p. 10037.
- (2000) J. Chem. Phys. , vol.113 , pp. 10037
- Challacombe, M.¹

37
- 33845283552
- A fast adaptive solver for hierarchically semiseparable representations
- S. Chandrasekaran, M. Gu, and W. Lyons, A fast adaptive solver for hierarchically semiseparable representations, Calcolo, 42 (2005), pp. 171-185.
- (2005) Calcolo , vol.42 , pp. 171-185
- Chandrasekaran, S.¹ Gu, M.² Lyons, W.³

38
- 34547225006
- A fast ULV decomposition solver for hierarchically semiseparable representations
- S. Chandrasekaran, M. Gu, and T. Pals, A Fast ULV Decomposition Solver for Hierarchically Semiseparable Representations, SIAM J. Matrix Anal. Appl., 28 (2006), pp. 603-622.
- (2006) SIAM J. Matrix Anal. Appl. , vol.28 , pp. 603-622
- Chandrasekaran, S.¹ Gu, M.² Pals, T.³

39
- 0000362580
- A fast adaptive multipole algorithm in three dimensions
- H. Cheng, L. Greengard, and V. Rokhlin, A fast adaptive multipole algorithm in three dimensions, J. Comput. Phys., 155 (1999), pp. 468-498.
- (1999) J. Comput. Phys. , vol.155 , pp. 468-498
- Cheng, H.¹ Greengard, L.² Rokhlin, V.³

40
- 34548432523
- Improving hash join performance through prefetching
- S. Chen, A. Ailamaki, P. B. Gibbons, and T. C. Mowry, Improving hash join performance through prefetching, ACM Trans. Database Syst., 32 (2007), pp. 1-36.
- (2007) ACM Trans. Database Syst. , vol.32 , pp. 1-36
- Chen, S.¹ Ailamaki, A.² Gibbons, P.B.³ Mowry, T.C.⁴

41
- 0034501522
- Making pointer-based data structures cache conscious
- T. M. Chilimbi, M. D. Hill, and J. R. Larus, Making pointer-based data structures cache conscious, Computer, 33 (2000), pp. 67-74.
- (2000) Computer , vol.33 , pp. 67-74
- Chilimbi, T.M.¹ Hill, M.D.² Larus, J.R.³

42
- 71949116983
- Linkless octree using multi-level perfect hashing
- M. G. Choi, E. Ju, J.-W. Chang, J. Lee, and Y. J. Kim, Linkless octree using multi-level perfect hashing, Computer Graphics Forum, 28 (2009), pp. 1773-1780.
- (2009) Computer Graphics Forum , vol.28 , pp. 1773-1780
- Choi, M.G.¹ Ju, E.² Chang, J.-W.³ Lee, J.⁴ Kim, Y.J.⁵

43
- 85023205150
- Matrix multiplication via arithmetic progressions
- D. Coppersmith and S. Winograd, Matrix multiplication via arithmetic progressions, J. Symbolic Comput., 9 (1990), pp. 251-280.
- (1990) J. Symbolic Comput. , vol.9 , pp. 251-280
- Coppersmith, D.¹ Winograd, S.²

44
- 0000876466
- Semiempirical methods with conjugate gradient density matrix search to replace diagonalization for molecular systems containing thousands of atoms
- A. D. Daniels, J. M. Millam, and G. E. Scuseria, Semiempirical methods with conjugate gradient density matrix search to replace diagonalization for molecular systems containing thousands of atoms, J. Chem. Phys., 107 (1997), pp. 425-431.
- (1997) J. Chem. Phys. , vol.107 , pp. 425-431
- Daniels, A.D.¹ Millam, J.M.² Scuseria, G.E.³

45
- 84966252352
- Decay rates for inverses of band matrices
- S. Demko, W. F. Moss, and P. W. Smith, Decay rates for inverses of band matrices, Math. Comp., 43 (1984), pp. 491-499.
- (1984) Math. Comp. , vol.43 , pp. 491-499
- Demko, S.¹ Moss, W.F.² Smith, P.W.³

46
- 34248224610
- Fast matrix multiplication is stable
- J. W. Demmel, I. Dumitriu, O. Holtz, and R. Kleinberg, Fast matrix multiplication is stable, Numer. Math., 106 (2007), pp. 199-224.
- (2007) Numer. Math. , vol.106 , pp. 199-224
- Demmel, J.W.¹ Dumitriu, I.² Holtz, O.³ Kleinberg, R.⁴

47
- 0026913668
- Stability of block algorithms with fast level-3 BLAS
- J. W. Demmel and N. J. Higham, Stability of block algorithms with fast level-3 BLAS, ACM Trans. Math. Software, 18 (1992), pp. 274-291.
- (1992) ACM Trans. Math. Software , vol.18 , pp. 274-291
- Demmel, J.W.¹ Higham, N.J.²

48
- 10444272489
- New challenges in dynamic load balancing
- K. D. Devine, E. G. Boman, R. T. Heaphy, B. A. Hendrickson, J. D. Teresco, J. Faik, J. E. Flaherty, and L. G. Gervasio, New challenges in dynamic load balancing, Appl. Numer. Math., 52 (2005), pp. 133-152.
- (2005) Appl. Numer. Math. , vol.52 , pp. 133-152
- Devine, K.D.¹ Boman, E.G.² Heaphy, R.T.³ Hendrickson, B.A.⁴ Teresco, J.D.⁵ Faik, J.⁶ Flaherty, J.E.⁷ Gervasio, L.G.⁸

49
- 0016353777
- Quad trees a data structure for retrieval on composite keys
- R. A. Finkel and J. L. Bentley, Quad trees a data structure for retrieval on composite keys, Acta Inform., 4 (1974), pp. 1-9.
- (1974) Acta Inform. , vol.4 , pp. 1-9
- Finkel, R.A.¹ Bentley, J.L.²

50
- 0347771690
- Auto-blocking matrix-multiplication or tracking BLAS3 performance from source code
- J. D. Frens and D. S. Wise, Auto-blocking matrix-multiplication or tracking BLAS3 performance from source code, SIGPLAN Not., 32 (1997), pp. 206-216.
- (1997) SIGPLAN Not. , vol.32 , pp. 206-216
- Frens, J.D.¹ Wise, D.S.²

51
- 0033350255
- Cache-oblivious algorithms
- Washington, DC, IEEE Computer Society
- M. Frigo, C. E. Leiserson, H. Prokop, and S. Ramachandran, Cache-oblivious algorithms, in Proceedings of the 40th Annual Symposium on Foundations of Computer Science, FOCS '99, Washington, DC, IEEE Computer Society, 1999, pp. 285-297.
- (1999) Proceedings of the 40th Annual Symposium on Foundations of Computer Science, FOCS '99 , pp. 285-297
- Frigo, M.¹ Leiserson, C.E.² Prokop, H.³ Ramachandran, S.⁴

52
- 77954450405
- The Opie compiler from row-major source to Morton-ordered matrices
- New York, ACM
- S. T. Gabriel and D. S. Wise, The Opie compiler from row-major source to Morton-ordered matrices, in Proceedings of the 3rd Workshop on Memory Performance Issues: In Conjunction with the 31st International Symposium on Computer Architecture, WMPI '04, New York, ACM, 2004, pp. 136-144.
- (2004) Proceedings of the 3rd Workshop on Memory Performance Issues: In Conjunction with the 31st International Symposium on Computer Architecture, WMPI '04 , pp. 136-144
- Gabriel, S.T.¹ Wise, D.S.²

53
- 0000639195
- Linear scaling methods for electronic structure calculations and quantum molecular-dynamics simulations
- G. Galli, Linear scaling methods for electronic structure calculations and quantum molecular-dynamics simulations, Current Opinion in Solid State & Materials Science, 1 (1996), pp. 864-874.
- (1996) Current Opinion in Solid State & Materials Science , vol.1 , pp. 864-874
- Galli, G.¹

54
- 0037603314
- Linear scaling computation of the Fock matrix. VI. Data parallel computation of the exchange-correlation matrix
- C. K. Gan and M. Challacombe, Linear scaling computation of the Fock matrix. VI. Data parallel computation of the exchange-correlation matrix, J. Chem. Phys., 118 (2003), pp. 9128-9135.
- (2003) J. Chem. Phys. , vol.118 , pp. 9128-9135
- Gan, C.K.¹ Challacombe, M.²

55
- 0020249952
- An effective way to represent quadtrees
- I. Gargantini, An effective way to represent quadtrees, Commun. ACM, 25 (1982), pp. 905-910.
- (1982) Commun. ACM , vol.25 , pp. 905-910
- Gargantini, I.¹

56
- 33746701980
- SAD prefetching for MPEG4 using flux caches
- Springer, Berlin
- G. Gaydadjiev and S. Vassiliadis, SAD prefetching for MPEG4 using flux caches, in Embedded Computer Systems: Architectures, Modeling, and Simulation, Lecture Notes in Comput. Sci. 4017, Springer, Berlin, 2006, pp. 248-258.
- (2006) Embedded Computer Systems: Architectures, Modeling, and Simulation, Lecture Notes in Comput. Sci. , vol.4017 , pp. 248-258
- Gaydadjiev, G.¹ Vassiliadis, S.²

57
- 84876265888
- GNU compiler collection, http://gcc.gnu.org/.

58
- 84876220567
- A special flavor of Linux that can be automatically optimized and customized for just about any application or need, http://www.gentoo.org.
- A Special Flavor of Linux That Can Be Automatically Optimized and Customized for Just about Any Application or Need

59
- 0038450200
- Linear scaling electronic structure methods in chemistry and physics
- S. Goedecker and G. Scuseria, Linear scaling electronic structure methods in chemistry and physics, Comput. Sci. Engrg., 5 (2003), pp. 14-21.
- (2003) Comput. Sci. Engrg. , vol.5 , pp. 14-21
- Goedecker, S.¹ Scuseria, G.²

60
- 84876213977
- arXiv:9806073 [cond-mat]
- S. Goedecker, Electronic Structure Methods Exhibiting Linear Scaling of the Computational Effort with Respect to the Size of the System, arXiv:9806073 [cond-mat], 1998.
- (1998) Electronic Structure Methods Exhibiting Linear Scaling of the Computational Effort with Respect to the Size of the System
- Goedecker, S.¹

61
- 0033246389
- Linear scaling electronic structure methods
- S. Goedecker, Linear scaling electronic structure methods, Rev. Mod. Phys., 71 (1999), pp. 1085-1123.
- (1999) Rev. Mod. Phys. , vol.71 , pp. 1085-1123
- Goedecker, S.¹

62
- 1542392269
- Department of Computer Sciences, University of Texas at Austin
- K. Goto and R. A. van de Geijn, On Reducing TLB Misses in Matrix Multiplication, Technical report CS-TR-02-55, Department of Computer Sciences, University of Texas at Austin, 2002.
- (2002) On Reducing TLB Misses in Matrix Multiplication, Technical Report CS-TR-02-55
- Goto, K.¹ van de Geijn, R.A.²

63
- 44249094647
- Anatomy of high-performance matrix multiplication
- K. Goto and R. A. van de Geijn, Anatomy of high-performance matrix multiplication, ACM Trans. Math. Software, 34 (2008), pp. 12:1-12:25.
- (2008) ACM Trans. Math. Software , vol.34 , pp. 12:1-12:25
- Goto, K.¹ van de Geijn, R.A.²

64
- 48849089104
- High-performance implementation of the level-3 BLAS
- K. Goto and R. A. van de Geijn, High-performance implementation of the level-3 BLAS, ACM Trans. Math. Software, 35 (2008), pp. 4:1-4:14.
- (2008) ACM Trans. Math. Software , vol.35 , pp. 4:1-4:14
- Goto, K.¹ van de Geijn, R.A.²

65
- 34548020763
- Representation-transparent matrix algorithms with scalable performance
- New York, ACM
- P. Gottschling, D. S. Wise, and M. D. Adams, Representation-transparent matrix algorithms with scalable performance, in Proceedings of the 21st Annual International Conference on Supercomputing, ICS '07, New York, ACM, 2007, pp. 116-125.
- (2007) Proceedings of the 21st Annual International Conference on Supercomputing, ICS '07 , pp. 116-125
- Gottschling, P.¹ Wise, D.S.² Adams, M.D.³

66
- 0141829814
- Construction and arithmetics of H-matrices
- L. Grasedyck and W. Hackbusch, Construction and arithmetics of H-matrices, Computing, 70 (2003), pp. 295-334.
- (2003) Computing , vol.70 , pp. 295-334
- Grasedyck, L.¹ Hackbusch, W.²

67
- 84899027721
- 'N-body' problems in statistical learning
- MIT Press, Cambridge, MA
- A. G. Gray and A. W. Moore, 'N-Body' Problems in Statistical Learning, in Advances in Neural Information Processing Systems, vol. 4, MIT Press, Cambridge, MA, 2001, pp. 521-527.
- (2001) Advances in Neural Information Processing Systems , vol.4 , pp. 521-527
- Gray, A.G.¹ Moore, A.W.²

68
- 84864048782
- Ph.D. thesis, CMU-CS-04-189, School of Computer Science, Carnegie Mellon University
- A. Gray, Bringing Tractability to Generalized N-Body Problems in Statistical and Scientific Computation, Ph.D. thesis, CMU-CS-04-189, School of Computer Science, Carnegie Mellon University, 2003.
- (2003) Bringing Tractability to Generalized N-Body Problems in Statistical and Scientific Computation
- Gray, A.¹

69
- 0000396658
- A fast algorithm for particle simulations
- L. Greengard and V. Rokhlin, A fast algorithm for particle simulations, J. Comput. Phys., 73 (1987), pp. 325-348.
- (1987) J. Comput. Phys. , vol.73 , pp. 325-348
- Greengard, L.¹ Rokhlin, V.²

70
- 0000468568
- The rapid evaluation of potential fields in three dimensions
- C. Anderson and C. Greengard, eds., Lecture Notes in Math., Springer, Berlin
- L. Greengard and V. Rokhlin, The rapid evaluation of potential fields in three dimensions, in Vortex Methods, C. Anderson and C. Greengard, eds., Lecture Notes in Math. 1360, Springer, Berlin, 1988, pp. 121-141.
- (1988) Vortex Methods , vol.1360 , pp. 121-141
- Greengard, L.¹ Rokhlin, V.²

71
- 85011484616
- A new version of the fast multipole method for the Laplace equation in three dimensions
- L. Greengard and V. Rokhlin, A new version of the fast multipole method for the Laplace equation in three dimensions, Acta Numer., (1997), pp. 229-269.
- (1997) Acta Numer. , pp. 229-269
- Greengard, L.¹ Rokhlin, V.²

72
- 0001896329
- The fast Gauss transform
- L. Greengard and J. Strain, The fast Gauss transform, SIAM J. Sci. Statist. Comput., 12 (1991), pp. 79-94.
- (1991) SIAM J. Sci. Statist. Comput. , vol.12 , pp. 79-94
- Greengard, L.¹ Strain, J.²

73
- 38049046758
- Is cache-oblivious dgemm viable?
- Springer-Verlag, Berlin
- J. Gunnels, F. G. Gustavson, K. Pingali, and K. Yotov, Is cache-oblivious dgemm viable?, in Proceedings of the 8th International Conference on Applied Parallel Computing: State of the Art in Scientific Computing, Springer-Verlag, Berlin, 2007, pp. 919-928.
- (2007) Proceedings of the 8th International Conference on Applied Parallel Computing: State of the Art in Scientific Computing , pp. 919-928
- Gunnels, J.¹ Gustavson, F.G.² Pingali, K.³ Yotov, K.⁴

74
- 84862107202
- Parallel and cache-efficient in-place matrix storage format conversion
- F. G. Gustavson, L. Karlsson, and B. Kågström, Parallel and cache-efficient in-place matrix storage format conversion, ACM Trans. Math. Software, 38 (2012), pp. 1-32.
- (2012) ACM Trans. Math. Software , vol.38 , pp. 1-32
- Gustavson, F.G.¹ Karlsson, L.² Kågström, B.³

75
- 0036387334
- Data-sparse approximation by adaptive H2-matrices
- W. Hackbusch and S. Börm, Data-sparse approximation by adaptive H2-matrices, Computing, 69 (2002), pp. 1-35.
- (2002) Computing , vol.69 , pp. 1-35
- Hackbusch, W.¹ Börm, S.²

76
- 0035416223
- The combinatorics of cache misses during matrix multiplication
- P. J. Hanlon, D. Chung, S. Chatterjee, D. Genius, A. R. Lebeck, and E. Parker, The combinatorics of cache misses during matrix multiplication, J. Comput. Systems Sci., 63 (2001), pp. 80-126.
- (2001) J. Comput. Systems Sci. , vol.63 , pp. 80-126
- Hanlon, P.J.¹ Chung, D.² Chatterjee, S.³ Genius, D.⁴ Lebeck, A.R.⁵ Parker, E.⁶

77
- 0023165505
- Hierarchical n-body methods
- L. Hernquist, Hierarchical n-body methods, Comput. Phys. Commun., 48 (1988), pp. 107-115.
- (1988) Comput. Phys. Commun. , vol.48 , pp. 107-115
- Hernquist, L.¹

78
- 33846316938
- Über die stetige Abbildung einer Linie auf ein Flächenstück
- D. Hilbert, Über die stetige Abbildung einer Linie auf ein Flächenstück, Math. Ann. (1891), p. 459.
- (1891) Math. Ann. , pp. 459
- Hilbert, D.¹

79
- 0141599220
- Data-parallel spatial join algorithms
- E. G. Hoel and H. Samet, Data-parallel spatial join algorithms, in International Conference on Parallel Processing, vol. 3, 1994, pp. 227-234.
- (1994) International Conference on Parallel Processing , vol.3 , pp. 227-234
- Hoel, E.G.¹ Samet, H.²

80
- 77952749689
- Performance comparison of data prefetching for pointer-chasing applications
- IEEE
- Y. Huang and Z. Gu, Performance comparison of data prefetching for pointer-chasing applications, in 1st International Conference on Information Science and Engineering (ICISE), IEEE, 2009, pp. 307-310.
- (2009) 1st International Conference on Information Science and Engineering (ICISE) , pp. 307-310
- Huang, Y.¹ Gu, Z.²

81
- 70449098063
- available online from Intel
- Intel 64 and IA-32 Architectures Optimization Reference Manual, 2009; available online from Intel, http://www.intel.com/content/www/us/en/architecture-and-technology/64-ia-32-architectures-optimization-manual.html.
- (2009) Intel 64 and IA-32 Architectures Optimization Reference Manual

82
- 0007295634
- How large is the exponential of a banded matrix?
- A. Iserles, How large is the exponential of a banded matrix?, J. New Zealand Math. Soc., 29 (2000), pp. 177-192.
- (2000) J. New Zealand Math. Soc. , vol.29 , pp. 177-192
- Iserles, A.¹

83
- 19544371805
- Iterative spatial join
- E. H. Jacox and H. Samet, Iterative spatial join, ACM Trans. Database Syst., 28 (2003), pp. 230-256.
- (2003) ACM Trans. Database Syst. , vol.28 , pp. 230-256
- Jacox, E.H.¹ Samet, H.²

84
- 84876266645
- B. Jenkins, 32-Bit Hashes for Hash Table Lookup, http://burtleburtle.net/bob/c/lookup3.c (2006).
- (2006) 32-Bit Hashes for Hash Table Lookup
- Jenkins, B.¹

85
- 58549084590
- Using space-filling curves for computation reordering
- G. Jin and J. Mellor-Crummey, Using space-filling curves for computation reordering, in Proceedings of the Los Alamos Computer Science Institute, 2005.
- (2005) Proceedings of the Los Alamos Computer Science Institute
- Jin, G.¹ Mellor-Crummey, J.²

86
- 0002479236
- Charm++: Parallel programming with message-driven objects
- G. V. Wilson and P. Lu, eds., MIT Press, Cambridge, MA
- L. V. Kale and S. Krishnan, Charm++: Parallel programming with message-driven objects, in Parallel Programming Using C++, G. V. Wilson and P. Lu, eds., MIT Press, Cambridge, MA, 1996, pp. 175-213.
- (1996) Parallel Programming Using C++ , pp. 175-213
- Kale, L.V.¹ Krishnan, S.²

87
- 0034581346
- A prefetching technique for irregular accesses to linked data structures
- IEEE
- M. Karlsson, F. Dahlgren, and P. Stenström, A prefetching technique for irregular accesses to linked data structures, in Proceedings of the Sixth International Symposium on High-Performance Computer Architecture, IEEE, 2000, pp. 206-217.
- (2000) Proceedings of the Sixth International Symposium on High-Performance Computer Architecture , pp. 206-217
- Karlsson, M.¹ Dahlgren, F.² Stenström, P.³

88
- 77954705147
- Sort vs. hash revisited: Fast join implementation on modern multi-core CPUs
- C. Kim, T. Kaldewey, V. W. Lee, E. Sedlar, A. D. Nguyen, N. Satish, J. Chhugani, A. Di Blas, and P. Dubey, Sort vs. hash revisited: Fast join implementation on modern multi-core CPUs, Proc. VLDB Endow., 2 (2009), pp. 1378-1389.
- (2009) Proc. VLDB Endow. , vol.2 , pp. 1378-1389
- Kim, C.¹ Kaldewey, T.² Lee, V.W.³ Sedlar, E.⁴ Nguyen, A.D.⁵ Satish, N.⁶ Chhugani, J.⁷ Di Blas, A.⁸ Dubey, P.⁹

89
- 0000990502
- Linear-scaling quantum mechanical calculations of biological molecules: The divide-and-conquer approach
- T. S. Lee, J. P. Lewis, and W. Yang, Linear-scaling quantum mechanical calculations of biological molecules: The divide-and-conquer approach, Comput. Materials Sci., 12 (1998), pp. 259-277.
- (1998) Comput. Materials Sci. , vol.12 , pp. 259-277
- Lee, T.S.¹ Lewis, J.P.² Yang, W.³

90
- 50949096788
- Perfect spatial hashing
- S. Lefebvre and H. Hoppe, Perfect spatial hashing, ACM Trans. Graph., 25 (2006), pp. 579-588.
- (2006) ACM Trans. Graph. , vol.25 , pp. 579-588
- Lefebvre, S.¹ Hoppe, H.²

91
- 50949096788
- Perfect spatial hashing
- New York, ACM
- S. Lefebvre and H. Hoppe, Perfect spatial hashing, in Proceedings of SIGGRAPH '06, New York, ACM, 2006, p. 579.
- (2006) Proceedings of SIGGRAPH '06 , pp. 579
- Lefebvre, S.¹ Hoppe, H.²

92
- 79953092482
- Intel
- D. Levinthal, Performance Analysis Guide for Intel Core i7 Processor and Intel Xeon 5500 Processors, Intel, 2009.
- (2009) Performance Analysis Guide for Intel Core i7 Processor and Intel Xeon 5500 Processors
- Levinthal, D.¹

93
- 78650491505
- Fast generation of pointerless octree duals
- T. Lewiner, V. Mello, A. Peixoto, S. Pesco, and H. Lopes, Fast generation of pointerless octree duals, Computer Graphics Forum, 29 (2010), p. 1661.
- (2010) Computer Graphics Forum , vol.29 , pp. 1661
- Lewiner, T.¹ Mello, V.² Peixoto, A.³ Pesco, S.⁴ Lopes, H.⁵

94
- 52649161208
- A fast similarity join algorithm using graphics processing units
- Washington, DC, IEEE Computer Society
- M. D. Lieberman, J. Sankaranarayanan, and H. Samet, A fast similarity join algorithm using graphics processing units, in Proceedings of the 2008 IEEE 24th International Conference on Data Engineering, Washington, DC, IEEE Computer Society, 2008, pp. 1111-1120.
- (2008) Proceedings of the 2008 IEEE 24th International Conference on Data Engineering , pp. 1111-1120
- Lieberman, M.D.¹ Sankaranarayanan, J.² Samet, H.³

95
- 84863959775
- Work stealing and persistence-based load balancers for iterative overdecomposed applications
- ACM
- J. Lifflander, S. Krishnamoorthy, and L. Kale, Work stealing and persistence-based load balancers for iterative overdecomposed applications, in Proceedings of the 21st International Symposium on High-Performance Parallel and Distributed Computing, ACM, 2012, pp. 137-148.
- (2012) Proceedings of the 21st International Symposium on High-Performance Parallel and Distributed Computing , pp. 137-148
- Lifflander, J.¹ Krishnamoorthy, S.² Kale, L.³

96
- 0034818343
- Reducing dram latencies with an integrated memory hierarchy design
- IEEE
- W. F. Lin, S. K. Reinhardt, and D. Burger, Reducing dram latencies with an integrated memory hierarchy design, in Proceedings of the Seventh International Symposium on High-Performance Computer Architecture, IEEE, 2001, pp. 301-312.
- (2001) Proceedings of the Seventh International Symposium on High-Performance Computer Architecture , pp. 301-312
- Lin, W.F.¹ Reinhardt, S.K.² Burger, D.³

97
- 84947725835
- Experiences with parallel n-body simulation
- ACM
- P. Liu and S. Bhatt, Experiences with parallel n-body simulation, in Proceedings of the Sixth Annual ACM Symposium on Parallel Algorithms and Architectures, ACM, 1994, pp. 122-131.
- (1994) Proceedings of the Sixth Annual ACM Symposium on Parallel Algorithms and Architectures , pp. 122-131
- Liu, P.¹ Bhatt, S.²

98
- 16244396071
- Recent progress in density functional theory and its numerical methods
- Z. Y. Li, W. He, and J. L. Yang, Recent progress in density functional theory and its numerical methods, Progress in Chemistry, 17 (2005), pp. 192-202.
- (2005) Progress in Chemistry , vol.17 , pp. 192-202
- Li, Z.Y.¹ He, W.² Yang, J.L.³

99
- 34248336283
- Analyzing block locality in morton-order and morton-hybrid matrices
- New York, ACM
- K. P. Lorton and D. S. Wise, Analyzing block locality in morton-order and morton-hybrid matrices, in Proceedings of the 2006 Workshop on Memory Performance: Dealing with Applications, Systems and Architectures, MEDEA '06, New York, ACM, 2006, pp. 5-12.
- (2006) Proceedings of the 2006 Workshop on Memory Performance: Dealing with Applications, Systems and Architectures, MEDEA '06 , pp. 5-12
- Lorton, K.P.¹ Wise, D.S.²

100
- 84864069036
- Data-driven fault tolerance for work stealing computations
- ACM
- W. Ma and S. Krishnamoorthy, Data-driven fault tolerance for work stealing computations, in Proceedings of the 26th ACM International Conference on Supercomputing, ACM, 2012, pp. 79-90.
- (2012) Proceedings of the 26th ACM International Conference on Supercomputing , pp. 79-90
- Ma, W.¹ Krishnamoorthy, S.²

101
- 0001332894
- The density matrix in self-consistent field theory. I. Iterative construction of the density matrix
- R. McWeeny, The density matrix in self-consistent field theory. I. Iterative construction of the density matrix, Proc. Roy. Soc. London A Mat., 235 (1956), pp. 496-509.
- (1956) Proc. Roy. Soc. London A Mat. , vol.235 , pp. 496-509
- McWeeny, R.¹

102
- 1542601822
- Improving memory hierarchy performance for irregular applications using data and computation reorderings
- J. Mellor-Crummey, D. Whalley, and K. Kennedy, Improving memory hierarchy performance for irregular applications using data and computation reorderings, Internat. J. Parallel Programming, 29 (2001), pp. 217-247.
- (2001) Internat. J. Parallel Programming , vol.29 , pp. 217-247
- Mellor-Crummey, J.¹ Whalley, D.² Kennedy, K.³

103
- 5244240756
- Linear scaling conjugate gradient density matrix search as an alternative to diagonalization for first principles electronic structure calculations
- J. M. Millam and G. E. Scuseria, Linear scaling conjugate gradient density matrix search as an alternative to diagonalization for first principles electronic structure calculations, J. Chem. Phys., 106 (1997), pp. 5569-5577.
- (1997) J. Chem. Phys. , vol.106 , pp. 5569-5577
- Millam, J.M.¹ Scuseria, G.E.²

104
- 0026826969
- Join processing in relational databases
- P. Mishra and M. H. Eich, Join processing in relational databases, ACM Comput. Surv., 24 (1992), pp. 63-113.
- (1992) ACM Comput. Surv. , vol.24 , pp. 63-113
- Mishra, P.¹ Eich, M.H.²

105
- 0003460690
- IBM, Ottawa
- G. M. Morton, A Computer Oriented Geodetic Data Base and a New Technique in File Sequencing, Technical report, IBM, Ottawa, 1966.
- (1966) A Computer Oriented Geodetic Data Base and A New Technique in File Sequencing, Technical Report
- Morton, G.M.¹

106
- 33646222013
- PAPI: A portable interface to hardware performance counters
- P. J. Mucci, S. Browne, C. Deane, and G. Ho, PAPI: A portable interface to hardware performance counters, in Proceedings of the Department of Defense HPCMP Users Group Conference, 1999, pp. 7-10.
- (1999) Proceedings of the Department of Defense HPCMP Users Group Conference , pp. 7-10
- Mucci, P.J.¹ Browne, S.² Deane, C.³ Ho, G.⁴

107
- 33846340428
- A divide-and-conquer/cellular-decomposition framework for million-to-billion atom simulations of chemical reactions
- A. Nakano, R. K. Kalia, K.-I. Nomura, A. Sharma, P. Vashishta, F. Shimojo, A. C. T. van Duin, W. A. Goddard, R. Biswas, and D. Srivastava, A divide-and-conquer/cellular-decomposition framework for million-to-billion atom simulations of chemical reactions, Comput. Materials Sci., 38 (2007), pp. 642-652.
- (2007) Comput. Materials Sci. , vol.38 , pp. 642-652
- Nakano, A.¹ Kalia, R.K.² Nomura, K.-I.³ Sharma, A.⁴ Vashishta, P.⁵ Shimojo, F.⁶ Van Duin, A.C.T.⁷ Goddard, W.A.⁸ Biswas, R.⁹ Srivastava, D.¹⁰

108
- 17044459517
- Density matrix perturbation theory
- A. M. N. Niklasson and M. Challacombe, Density matrix perturbation theory, Phys. Rev. Lett., 92 (2004), 193001.
- (2004) Phys. Rev. Lett. , vol.92 , pp. 193001
- Niklasson, A.M.N.¹ Challacombe, M.²

109
- 0037863744
- Trace resetting density matrix purification in O(N) self-consistent-field theory
- A. M. N. Niklasson, C. J. Tymczak, and M. Challacombe, Trace resetting density matrix purification in O(N) self-consistent-field theory, J. Chem. Phys., 118 (2003), pp. 8611-8620.
- (2003) J. Chem. Phys. , vol.118 , pp. 8611-8620
- Niklasson, A.M.N.¹ Tymczak, C.J.² Challacombe, M.³

110
- 0000247202
- Linear and sublinear scaling formation of Hartree-Fock-type exchange matrices
- C. Ochsenfeld, C. A. White, and M. Head-Gordon, Linear and sublinear scaling formation of Hartree-Fock-type exchange matrices, J. Chem. Phys., 109 (1998), pp. 1663-1669.
- (1998) J. Chem. Phys. , vol.109 , pp. 1663-1669
- Ochsenfeld, C.¹ White, C.A.² Head-Gordon, M.³

111
- 25944474315
- Master's thesis, Department of Computer Science, University of Copenhagen, Denmark
- J. H. Olsen and S. C. Skov, Cache-Oblivious Algorithms in Practice, Master's thesis, Department of Computer Science, University of Copenhagen, Denmark, 2002.
- (2002) Cache-Oblivious Algorithms in Practice
- Olsen, J.H.¹ Skov, S.C.²

112
- 0001241557
- Canonical purification of the density matrix in electronic-structure theory
- A. H. R. Palser and D. E. Manolopoulos, Canonical purification of the density matrix in electronic-structure theory, Phys. Rev. B, 58 (1998), pp. 12704-12711.
- (1998) Phys. Rev. B , vol.58 , pp. 12704-12711
- Palser, A.H.R.¹ Manolopoulos, D.E.²

113
- 34547953706
- Algorithms to take advantage of hardware prefetching
- S. Pan, C. Cherng, K. Dick, and R. E. Ladner, Algorithms to take advantage of hardware prefetching, in Proceedings of the 9th Workshop on Algorithm Engineering and Experiments (ALENEX), 2007, pp. 91-98.
- (2007) Proceedings of the 9th Workshop on Algorithm Engineering and Experiments (ALENEX) , pp. 91-98
- Pan, S.¹ Cherng, C.² Dick, K.³ Ladner, R.E.⁴

114
- 84870671761
- Performance Application Programming Interface, http://icl.cs.utk.edu/papi/.
- Performance Application Programming Interface

115
- 0005083863
- Sur une courbe, qui remplit toute une aire plane
- G. Peano, Sur une courbe, qui remplit toute une aire plane, Math. Ann., 36 (1890), pp. 157-160.
- (1890) Math. Ann. , vol.36 , pp. 157-160
- Peano, G.¹

116
- 85040889046
- Addison-Wesley Longman, Boston, MA
- H. Samet, The Design and Analysis of Spatial Data Structures, Addison-Wesley Longman, Boston, MA, 1990.
- (1990) The Design and Analysis of Spatial Data Structures
- Samet, H.¹

117
- 33244479933
- Morgan Kaufmann, San Francisco
- H. Samet, Foundations of Multidimensional and Metric Data Structures, Morgan Kaufmann, San Francisco, 2006.
- (2006) Foundations of Multidimensional and Metric Data Structures
- Samet, H.¹

118
- 0000343198
- Tradeoffs in processing complex join queries via hashing in multiprocessor database machines
- San Francisco, Morgan Kaufmann
- D. A. Schneider and D. J. DeWitt, Tradeoffs in processing complex join queries via hashing in multiprocessor database machines, in Proceedings of the 16th International Conference on Very Large Databases, San Francisco, Morgan Kaufmann, 1990, pp. 469-480.
- (1990) Proceedings of the 16th International Conference on Very Large Databases , pp. 469-480
- Schneider, D.A.¹ Dewitt, D.J.²

119
- 0000484587
- Linear scaling computation of the Fock matrix. II. Rigorous bounds on exchange integrals and incremental Fock build
- E. Schwegler, M. Challacombe, and M. Head-Gordon, Linear scaling computation of the Fock matrix. II. Rigorous bounds on exchange integrals and incremental Fock build, J. Chem. Phys., 106 (1997), pp. 9708-9717.
- (1997) J. Chem. Phys. , vol.106 , pp. 9708-9717
- Schwegler, E.¹ Challacombe, M.² Head-Gordon, M.³

120
- 0007132664
- A multipole acceptability criterion for electronic structure theory
- E. Schwegler, M. Challacombe, and M. Head-Gordon, A multipole acceptability criterion for electronic structure theory, J. Chem. Phys., 109 (1998), pp. 8764-8769.
- (1998) J. Chem. Phys. , vol.109 , pp. 8764-8769
- Schwegler, E.¹ Challacombe, M.² Head-Gordon, M.³

121
- 78650423051
- Traversal caches: A first step towards FPGA acceleration of pointer-based data structures
- ACM
- G. Stitt, G. Chaudhari, and J. Coole, Traversal caches: A first step towards FPGA acceleration of pointer-based data structures, in Proceedings of the 6th IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis, ACM, 2008, pp. 61-66.
- (2008) Proceedings of the 6th IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis , pp. 61-66
- Stitt, G.¹ Chaudhari, G.² Coole, J.³

122
- 56749083221
- Recursion flattening
- New York, ACM
- G. Stitt and J. Villarreal, Recursion flattening, in Proceedings of the 18th ACM Great Lakes Symposium on VLSI, GLSVLSI '08, New York, ACM, 2008, pp. 131-134.
- (2008) Proceedings of the 18th ACM Great Lakes Symposium on VLSI, GLSVLSI '08 , pp. 131-134
- Stitt, G.¹ Villarreal, J.²

123
- 0007898394
- The fast Gauss transform with variable scales
- J. Strain, The fast Gauss transform with variable scales, SIAM J. Sci. Statist. Comput., 12 (1991), pp. 1131-1139.
- (1991) SIAM J. Sci. Statist. Comput. , vol.12 , pp. 1131-1139
- Strain, J.¹

124
- 34250487811
- Gaussian elimination is not optimal
- V. Strassen, Gaussian elimination is not optimal, Numer. Math., 13 (1969), pp. 354-356.
- (1969) Numer. Math. , vol.13 , pp. 354-356
- Strassen, V.¹

125
- 0003468774
- Dover, New York
- A. Szabo and N. Ostlund, Modern Quantum Chemistry: Introduction to Advanced Electronic Structure Theory, Dover, New York, 1996.
- (1996) Modern Quantum Chemistry: Introduction to Advanced Electronic Structure Theory
- Szabo, A.¹ Ostlund, N.²

126
- 34248387649
- Complete inlining of recursive calls: Beyond tail-recursion elimination
- New York, ACM
- P. Tang, Complete inlining of recursive calls: Beyond tail-recursion elimination, in Proceedings of the 44th Annual Southeast Regional Conference, ACM-SE 44, New York, ACM, 2006, pp. 579-584.
- (2006) Proceedings of the 44th Annual Southeast Regional Conference, ACM-SE , vol.44 , pp. 579-584
- Tang, P.¹

127
- 35048894812
- Improving the performance of Morton layout by array alignment and loop unrolling: Reducing the price of naivety
- Springer-Verlag, Berlin
- J. Thiyagalingam, O. Beckmann, and P. H. J. Kelly, Improving the performance of Morton layout by array alignment and loop unrolling: Reducing the price of naivety, in Proceedings of 16th International Workshop on Languages and Compilers for Parallel Computing, Lecture Notes in Comput. Sci. 2958, Springer-Verlag, Berlin, 2003, pp. 241-257.
- (2003) Proceedings of 16th International Workshop on Languages and Compilers for Parallel Computing, Lecture Notes in Comput. Sci. , vol.2958 , pp. 241-257
- Thiyagalingam, J.¹ Beckmann, O.² Kelly, P.H.J.³

128
- 84867384854
- Linear scaling self-consistent field calculations with millions of atoms in the condensed phase
- J. VandeVondele, U. Borštnik, and J. Hutter, Linear scaling self-consistent field calculations with millions of atoms in the condensed phase, J. Chem. Theory Comput., (2012).
- (2012) J. Chem. Theory Comput.
- Vandevondele, J.¹ Borštnik, U.² Hutter, J.³

129
- 0031123769
- Summa: Scalable universal matrix multiplication algorithm
- R. A. van de Geijn and J. Watts, Summa: Scalable universal matrix multiplication algorithm, Concurrency Practice and Experience, 9 (1997), pp. 255-274.
- (1997) Concurrency Practice and Experience , vol.9 , pp. 255-274
- Van De Geijn, R.A.¹ Watts, J.²

130
- 84862595166
- Multiplying matrices faster than Coppersmith-Winograd
- New York, ACM
- V. Vassilevska Williams, Multiplying matrices faster than Coppersmith-Winograd, in Proceedings of the 44th Symposium on Theory of Computing, STOC '12, New York, ACM, 2012, pp. 887-898.
- (2012) Proceedings of the 44th Symposium on Theory of Computing, STOC '12 , pp. 887-898
- Vassilevska Williams, V.¹

131
- 0038345683
- Guided region prefetching: A cooperative hardware/software approach
- ACM, New York
- Z. Wang, D. Burger, K. S. McKinley, S. K. Reinhardt, and C. C. Weems, Guided region prefetching: A cooperative hardware/software approach, in ACM SIGARCH Computer Architecture News 31, ACM, New York, 2003, pp. 388-398.
- (2003) ACM SIGARCH Computer Architecture News , vol.31 , pp. 388-398
- Wang, Z.¹ Burger, D.² McKinley, K.S.³ Reinhardt, S.K.⁴ Weems, C.C.⁵

132
- 33750378674
- A sharp error estimate for the fast Gauss transform
- X. Wan and G. E. Karniadakis, A sharp error estimate for the fast Gauss transform, J. Comput. Phys., 219 (2006), pp. 7-12.
- (2006) J. Comput. Phys. , vol.219 , pp. 7-12
- Wan, X.¹ Karniadakis, G.E.²

133
- 84967045101
- Astrophysical N-body simulations using hierarchical tree data structures
- Los Alamitos, IEEE
- M. S. Warren and J. K. Salmon, Astrophysical N-body simulations using hierarchical tree data structures, in Supercomputing '92, Los Alamitos, IEEE, 1992, pp. 570-576.
- (1992) Supercomputing '92 , pp. 570-576
- Warren, M.S.¹ Salmon, J.K.²

134
- 84967045101
- Astrophysical n-body simulations using hierarchical tree data structures
- M. S. Warren and J. K. Salmon, Astrophysical n-body simulations using hierarchical tree data structures, in Proceedings of SC Conference, 1992, pp. 570-576.
- (1992) Proceedings of SC Conference , pp. 570-576
- Warren, M.S.¹ Salmon, J.K.²

135
- 0027747808
- A parallel hashed oct-tree n-body algorithm
- New York, ACM
- M. S. Warren and J. K. Salmon, A parallel hashed oct-tree n-body algorithm, in Proceedings of the ACM/IEEE Conference on Supercomputing, New York, ACM, 1993, pp. 12-21.
- (1993) Proceedings of the ACM/IEEE Conference on Supercomputing , pp. 12-21
- Warren, M.S.¹ Salmon, J.K.²

136
- 33750503563
- SIAM, Philadelphia
- M. S. Warren and J. K. Salmon, A Parallel, Portable and Versatile Treecode, SIAM, Philadelphia, 1995.
- (1995) A Parallel, Portable and Versatile Treecode
- Warren, M.S.¹ Salmon, J.K.²

137
- 1842471081
- A portable parallel particle program
- M. S. Warren and J. K. Salmon, A portable parallel particle program, Comput. Phys. Comm., 87 (1995), p. 266.
- (1995) Comput. Phys. Comm. , vol.87 , pp. 266
- Warren, M.S.¹ Salmon, J.K.²

138
- 56449095414
- Can hardware performance counters be trusted?
- V. M. Weaver and S. A. McKee, Can hardware performance counters be trusted?, in Proceedings of the IEEE International Symposium on Workload Characterization, 2008, p. 141.
- (2008) Proceedings of the IEEE International Symposium on Workload Characterization , pp. 141
- Weaver, V.M.¹ McKee, S.A.²

139
- 0025460758
- Costs of quadtree representation of nondense matrices
- D. S. Wise and J. Franco, Costs of quadtree representation of nondense matrices, J. Parallel Distributed Computing, 9 (1990), pp. 282-296.
- (1990) J. Parallel Distributed Computing , vol.9 , pp. 282-296
- Wise, D.S.¹ Franco, J.²

140
- 0034819362
- Language support for Morton-order matrices
- D. S. Wise, J. D. Frens, Y. Gu, and G. A. Alexander, Language support for Morton-order matrices, SIGPLAN Not., 36 (2001), pp. 24-33.
- (2001) SIGPLAN Not. , vol.36 , pp. 24-33
- Wise, D.S.¹ Frens, J.D.² Gu, Y.³ Alexander, G.A.⁴

141
- 0347738087
- Representing matrices as quadtrees for parallel processors: Extended abstract
- D. S. Wise, Representing matrices as quadtrees for parallel processors: Extended abstract, SIGSAM Bull., 18 (1984), pp. 24-25.
- (1984) SIGSAM Bull. , vol.18 , pp. 24-25
- Wise, D.S.¹

142
- 84937431996
- Ahnentafel indexing into Morton-ordered arrays, or matrix locality for free
- A. Bode, T. Ludwig, W. Karl, and R. Wismüller, eds., Lecture Notes in Comput. Sci., Springer, Berlin
- D. S. Wise, Ahnentafel indexing into Morton-ordered arrays, or matrix locality for free, in Euro-Par 2000 Parallel Processing, A. Bode, T. Ludwig, W. Karl, and R. Wismüller, eds., Lecture Notes in Comput. Sci. 1900, Springer, Berlin, 2000, pp. 774-783.
- (2000) Euro-Par 2000 Parallel Processing , vol.1900 , pp. 774-783
- Wise, D.S.¹

143
- 0033705677
- Push vs. pull: Data movement for linked data structures
- ACM
- C. L. Yang and A. R. Lebeck, Push vs. pull: Data movement for linked data structures, in Proceedings of the 14th International Conference on Supercomputing, ACM, 2000, pp. 176-186.
- (2000) Proceedings of the 14th International Conference on Supercomputing , pp. 176-186
- Yang, C.L.¹ Lebeck, A.R.²

144
- 67549121236
- Department of Computer Science, University of Maryland, College Park
- C. Yang, R. Duraiswami, and N. Gumerov, Improved Fast Gauss Transform, Technical rep. CS-TR-4495, Department of Computer Science, University of Maryland, College Park, 2003.
- (2003) Improved Fast Gauss Transform, Technical Rep. CS-TR-4495
- Yang, C.¹ Duraiswami, R.² Gumerov, N.³

145
- 0043144732
- A density-matrix divide-and-conquer approach for electronic structure calculations of large molecules
- W. Yang and T. S. Lee, A density-matrix divide-and-conquer approach for electronic structure calculations of large molecules, J. Chem. Phys., 103 (1995), p. 5674.
- (1995) J. Chem. Phys. , vol.103 , pp. 5674
- Yang, W.¹ Lee, T.S.²

146
- 0011621942
- Direct calculation of electron density in density-functional theory
- W. Yang, Direct calculation of electron density in density-functional theory, Phys. Rev. Lett, 66 (1991), pp. 1438-1441.
- (1991) Phys. Rev. Lett , vol.66 , pp. 1438-1441
- Yang, W.¹

147
- 35248846531
- An experimental comparison of cache-oblivious and cache-conscious programs
- ACM
- K. Yotov, T. Roeder, K. Pingali, J. Gunnels, and F. Gustavson, An experimental comparison of cache-oblivious and cache-conscious programs, in Proceedings of the 19th Annual ACM Symposium on Parallel Algorithms and Architectures, ACM, 2007, pp. 93-104.
- (2007) Proceedings of the 19th Annual ACM Symposium on Parallel Algorithms and Architectures , pp. 93-104
- Yotov, K.¹ Roeder, T.² Pingali, K.³ Gunnels, J.⁴ Gustavson, F.⁵

148
- 34250883179
- Fast sparse matrix multiplication
- R. Yuster and U. Zwick, Fast sparse matrix multiplication, ACM Trans. Algorithms, 1 (2005), pp. 2-13.
- (2005) ACM Trans. Algorithms , vol.1 , pp. 2-13
- Yuster, R.¹ Zwick, U.²

149
- 70350780227
- Survey on real-time collision detection algorithms
- Y.-S. Zou, G.-F. Ding, M.-H. Xu, and Y. He, Survey on real-time collision detection algorithms, Appl. Res. Comput., 25 (2008), pp. 8-12.
- (2008) Appl. Res. Comput. , vol.25 , pp. 8-12
- Zou, Y.-S.¹ Ding, G.-F.² Xu, M.-H.³ He, Y.⁴

150
- 84872028027
- AMD Core Math Library (ACML), http://developer.amd.com/libraries/acml/.
- AMD Core Math Library (ACML)

151
- 84870430661
- Intel Cilk Plus, http://software.intel.com/en-us/articles/intel-cilk-plus/.
- Intel Cilk Plus

152
- 77957918414
- Intel Math Kernel Library (MKL), http://www.intel.com/software/products/mkl/.
- Intel Math Kernel Library (MKL)

153
- 84876264103
- Intel SPMD Program Compiler, http://ispc.github.com/.

154
- 84876211927
- OpenMP, http://openmp.org/.

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.