-
1
-
-
0032179944
-
Distribution-independent hierarchical algorithms for the n-body problem
-
October
-
S. Aluru, J. Gustafson, G. M. Prabhu, and F. E. Sevilgen. Distribution-independent hierarchical algorithms for the n-body problem. J. Supercomput., 12:303-323, October 1998.
-
(1998)
J. Supercomput.
, vol.12
, pp. 303-323
-
-
Aluru, S.1
Gustafson, J.2
Prabhu, G.M.3
Sevilgen, F.E.4
-
2
-
-
84957021854
-
A data parallel formulation of the Barnes-Hut method for n-body simulations
-
Applied Parallel Computing New Paradigms for HPC in Industry and Academia 5th International Workshop, PARA 2000 Bergen, Norway, June 18-20, 2000 Proceedings
-
M. Amor, F. Argüello, J. López, O. G. Plata, and E. L. Zapata. A data parallel formulation of the Barnes-Hut method for n-body simulations. In Proceedings of the 5th International Workshop on Applied Parallel Computing, New Paradigms for HPC in Industry and Academia, PARA '00, pages 342-349, London, UK, 2001. Springer-Verlag. (Pubitemid 33239315)
-
(2001)
Lecture Notes in Computer Science
, Issue.1947
, pp. 342-349
-
-
Amor, M.1
Arguello, F.2
Lopez, J.3
Plata, O.4
Zapata, E.L.5
-
3
-
-
33846349887
-
A hierarchical O(N log N) force-calculation algorithm
-
December
-
J. Barnes and P. Hut. A hierarchical O(N log N) force-calculation algorithm. Nature, 324(4):446-449, December 1986.
-
(1986)
Nature
, vol.324
, Issue.4
, pp. 446-449
-
-
Barnes, J.1
Hut, P.2
-
4
-
-
17244376579
-
Cache-conscious structure definition
-
New York, NY, USA. ACM
-
T. M. Chilimbi, B. Davidson, and J. R. Larus. Cache-conscious structure definition. In Proceedings of the ACM SIGPLAN 1999 conference on Programming language design and implementation, PLDI '99, pages 13-24, New York, NY, USA, 1999. ACM.
-
(1999)
Proceedings of the ACM SIGPLAN 1999 Conference on Programming Language Design and Implementation, PLDI '99
, pp. 13-24
-
-
Chilimbi, T.M.1
Davidson, B.2
Larus, J.R.3
-
5
-
-
0032667164
-
Cache-conscious structure layout
-
New York, NY, USA. ACM
-
T. M. Chilimbi, M. D. Hill, and J. R. Larus. Cache-conscious structure layout. In Proceedings of the ACM SIGPLAN 1999 conference on Programming language design and implementation, PLDI '99, pages 1-12, New York, NY, USA, 1999. ACM.
-
Proceedings of the ACM SIGPLAN 1999 Conference on Programming Language Design and Implementation, PLDI '99
, vol.1999
, pp. 1-12
-
-
Chilimbi, T.M.1
Hill, M.D.2
Larus, J.R.3
-
6
-
-
0008572520
-
Using generational garbage collection to implement cache-conscious data placement
-
T. M. Chilimbi and J. R. Larus. Using generational garbage collection to implement cache-conscious data placement. In Proceedings of the 1st international symposium on Memory management, ISMM '98, pages 37-48, New York, NY, USA, 1998. ACM. (Pubitemid 129686058)
-
(1999)
SIGPLAN Notices (ACM Special Interest Group on Programming Languages)
, vol.34
, Issue.3
, pp. 37-48
-
-
Chilimbi, T.M.1
Larus, J.R.2
-
7
-
-
42149093282
-
The JastAdd extensible Java compiler
-
New York, NY, USA. ACM
-
T. Ekman and G. Hedin. The JastAdd extensible Java compiler. In Proceedings of the 22nd annual ACM SIGPLAN conference on Object-oriented programming systems and applications, OOPSLA '07, pages 1-18, New York, NY, USA, 2007. ACM.
-
(2007)
Proceedings of the 22nd Annual ACM SIGPLAN Conference on Object-oriented Programming Systems and Applications, OOPSLA '07
, pp. 1-18
-
-
Ekman, T.1
Hedin, G.2
-
8
-
-
0003861399
-
A graphical approach to load balancing and sparse matrix vector multiplication on the hypercube
-
G. C. Fox. A graphical approach to load balancing and sparse matrix vector multiplication on the hypercube. Institute for Mathematics and Its Applications, 13:37-+, 1988.
-
(1988)
Institute for Mathematics and Its Applications
, vol.13
-
-
Fox, G.C.1
-
9
-
-
0347507496
-
The implementation of the Cilk-5 multithreaded language
-
M. Frigo, C. E. Leiserson, and K. H. Randall. The implementation of the Cilk-5 multithreaded language. SIGPLAN Not., 33(5):212-223, 1998. (Pubitemid 128454798)
-
(1998)
SIGPLAN Notices (ACM Special Interest Group on Programming Languages)
, vol.33
, Issue.5
, pp. 212-223
-
-
Frigo, M.1
Leiserson, C.E.2
Randall, K.H.3
-
11
-
-
0029719687
-
Is it a tree, a DAG, or a cyclic graph? A shape analysis for heap-directed pointers in C
-
New York, NY, USA. ACM
-
R. Ghiya and L. J. Hendren. Is it a tree, a DAG, or a cyclic graph? A shape analysis for heap-directed pointers in C. In POPL '96: Proceedings of the 23rd ACM SIGPLAN-SIGACT symposium on Principles of programming languages, pages 1-15, New York, NY, USA, 1996. ACM.
-
(1996)
POPL '96: Proceedings of the 23rd ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages
, pp. 1-15
-
-
Ghiya, R.1
Hendren, L.J.2
-
12
-
-
0042482650
-
N-body problems in statistical learning
-
In T. K. Leen, T. G. Dietterich, and V. Tresp, editors. Dec, MIT Press
-
A. G. Gray and A. W. Moore. N-Body Problems in Statistical Learning. In T. K. Leen, T. G. Dietterich, and V. Tresp, editors, Advances in Neural Information Processing Systems (NIPS) 13 (Dec 2000). MIT Press, 2001.
-
(2000)
Advances in Neural Information Processing Systems (NIPS)
, vol.13
-
-
Gray, A.G.1
Moore, A.W.2
-
13
-
-
74049152899
-
42 TFlops hierarchical N-body simulations on GPUs with applications in both astrophysics and turbulence
-
New York, NY, USA, ACM
-
T. Hamada, T. Narumi, R. Yokota, K. Yasuoka, K. Nitadori, and M. Taiji. 42 TFlops hierarchical N-body simulations on GPUs with applications in both astrophysics and turbulence. In SC '09: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, pages 1-12, New York, NY, USA, 2009. ACM.
-
(2009)
SC '09: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
, pp. 1-12
-
-
Hamada, T.1
Narumi, T.2
Yokota, R.3
Yasuoka, K.4
Nitadori, K.5
Taiji, M.6
-
15
-
-
0141427127
-
Vectorization of tree traversals
-
March
-
L. Hernquist. Vectorization of tree traversals. J. Comput. Phys., 87:137-147, March 1990.
-
(1990)
J. Comput. Phys.
, vol.87
, pp. 137-147
-
-
Hernquist, L.1
-
17
-
-
70349191933
-
Lonestar: A suite of parallel irregular programs
-
April 2009
-
M. Kulkarni, M. Burtscher, K. Pingali, and C. Cascaval. Lonestar: A suite of parallel irregular programs. In 2009 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pages 65-76, April 2009.
-
(2009)
IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)
, pp. 65-76
-
-
Kulkarni, M.1
Burtscher, M.2
Pingali, K.3
Cascaval, C.4
-
18
-
-
35448941890
-
Optimistic parallelism requires abstractions
-
DOI 10.1145/1250734.1250759, PLDI'07: Proceedings of the 2007 ACM SIGPLAN Conference on Programming Language Design and Implementation
-
M. Kulkarni, K. Pingali, B. Walter, G. Ramanarayanan, K. Bala, and L. P. Chew. Optimistic parallelism requires abstractions. SIGPLAN Not. (Proceedings of PLDI 2007), 42(6):211-222, 2007. (Pubitemid 47630689)
-
(2007)
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI)
, pp. 211-222
-
-
Kulkarni, M.1
Pingali, K.2
Walter, B.3
Ramanarayanan, G.4
Bala, K.5
Chew, L.P.6
-
19
-
-
44849094749
-
Fast n-body simulation with CUDA
-
M. H. L. Nyland and J. Prins. Fast n-body simulation with CUDA. GPU Gems, (3):677-695, 2007.
-
(2007)
GPU Gems
, vol.3
, pp. 677-695
-
-
Nyland, M.H.L.1
Prins, J.2
-
20
-
-
31844446709
-
Automatic pool allocation: Improving performance by controlling data structure layout in the heap
-
New York, NY, USA. ACM
-
C. Lattner and V. Adve. Automatic pool allocation: improving performance by controlling data structure layout in the heap. In Proceedings of the 2005 ACM SIGPLAN conference on Programming language design and implementation, PLDI '05, pages 129-142, New York, NY, USA, 2005. ACM.
-
(2005)
Proceedings of the 2005 ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI '05
, pp. 129-142
-
-
Lattner, C.1
Adve, V.2
-
21
-
-
0009157947
-
Vectorization of a treecode
-
March
-
J. Makino. Vectorization of a treecode. J. Comput. Phys., 87:148-160, March 1990.
-
(1990)
J. Comput. Phys.
, vol.87
, pp. 148-160
-
-
Makino, J.1
-
22
-
-
47249159442
-
Deep coherent ray tracing
-
Washington, DC, USA. IEEE Computer Society
-
E. Mansson, J. Munkberg, and T. Akenine-Moller. Deep coherent ray tracing. In Proceedings of the 2007 IEEE Symposium on Interactive Ray Tracing, pages 79-85, Washington, DC, USA, 2007. IEEE Computer Society.
-
(2007)
Proceedings of the 2007 IEEE Symposium on Interactive Ray Tracing
, pp. 79-85
-
-
Mansson, E.1
Munkberg, J.2
Akenine-Moller, T.3
-
23
-
-
1542601822
-
Improving memory hierarchy performance for irregular applications using data and computation reorderings
-
J. Mellor-Crummey, D. Whalley, and K. Kennedy. Improving memory hierarchy performance for irregular applications using data and computation reorderings. Int. J. Parallel Program., 29(3):217-247, 2001. (Pubitemid 33818435)
-
(2001)
International Journal of Parallel Programming
, vol.29
, Issue.3
, pp. 217-247
-
-
Mellor-Crummey, J.1
Whalley, D.2
Kennedy, K.3
-
24
-
-
0030720468
-
Rendering complex scenes with memory-coherent ray tracing
-
New York, NY, USA. ACM Press/Addison-Wesley Publishing Co
-
M. Pharr, C. Kolb, R. Gershbein, and P. Hanrahan. Rendering complex scenes with memory-coherent ray tracing. In Proceedings of the 24th annual conference on Computer graphics and interactive techniques, SIGGRAPH '97, pages 101-108, New York, NY, USA, 1997. ACM Press/Addison-Wesley Publishing Co.
-
(1997)
Proceedings of the 24th Annual Conference on Computer Graphics and Interactive Techniques, SIGGRAPH '97
, pp. 101-108
-
-
Pharr, M.1
Kolb, C.2
Gershbein, R.3
Hanrahan, P.4
-
25
-
-
77956254799
-
Amorphous data-parallelism in irregular algorithms
-
The University of Texas at Austin, February
-
K. Pingali, M. Kulkarni, D. Nguyen, M. Burtscher, M. Mendez-Lojo, D. Prountzos, X. Sui, and Z. Zhong. Amorphous data-parallelism in irregular algorithms. Technical Report TR-09-05, Department of Computer Science, The University of Texas at Austin, February 2009.
-
(2009)
Technical Report TR-09-05, Department of Computer Science
-
-
Pingali, K.1
Kulkarni, M.2
Nguyen, D.3
Burtscher, M.4
Mendez-Lojo, M.5
Prountzos, D.6
Sui, X.7
Zhong, Z.8
-
29
-
-
0043005053
-
Load balancing and data locality in adaptive hierarchical n-body methods: Barnes-Hut, fast multipole, and radiosity
-
J. P. Singh, C. Holt, T. Totsuka, A. Gupta, and J. Hennessy. Load balancing and data locality in adaptive hierarchical n-body methods: Barnes-Hut, fast multipole, and radiosity. J. Parallel Distrib. Comput., 27(2):118-141, 1995.
-
(1995)
J. Parallel Distrib. Comput.
, vol.27
, Issue.2
, pp. 118-141
-
-
Singh, J.P.1
Holt, C.2
Totsuka, T.3
Gupta, A.4
Hennessy, J.5
-
31
-
-
70449844310
-
A scalable auto-tuning framework for compiler optimization
-
Washington DC, USA, 2009. IEEE Computer Society
-
A. Tiwari, C. Chen, J. Chame, M. Hall, and J. K. Hollingsworth. A scalable auto-tuning framework for compiler optimization. In IPDPS '09: Proceedings of the 2009 IEEE International Symposium on Parallel & Distributed Processing, pages 1-12, Washington, DC, USA, 2009. IEEE Computer Society.
-
IPDPS '09: Proceedings of the 2009 IEEE International Symposium on Parallel & Distributed Processing
, pp. 1-12
-
-
Tiwari, A.1
Chen, C.2
Chame, J.3
Hall, M.4
Hollingsworth, J.K.5
-
32
-
-
85006879958
-
Improving cache behavior of dynamically allocated data structures
-
Washington, DC, USA, IEEE Computer Society
-
D. N. Truong, F. Bodin, and A. Seznec. Improving cache behavior of dynamically allocated data structures. In Proceedings of the 1998 International Conference on Parallel Architectures and Compilation Techniques, PACT '98, pages 322-, Washington, DC, USA, 1998. IEEE Computer Society.
-
(1998)
Proceedings of the 1998 International Conference on Parallel Architectures and Compilation Techniques, PACT '98
, pp. 322
-
-
Truong, D.N.1
Bodin, F.2
Seznec, A.3
-
34
-
-
47249090497
-
On fast construction of SAH-based bounding volume hierarchies
-
Washington, DC, USA, 2007. IEEE Computer Society
-
I. Wald. On fast construction of SAH-based bounding volume hierarchies. In RT '07: Proceedings of the 2007 IEEE Symposium on Interactive Ray Tracing, pages 33-40, Washington, DC, USA, 2007. IEEE Computer Society.
-
RT '07: Proceedings of the 2007 IEEE Symposium on Interactive Ray Tracing
, pp. 33-40
-
-
Wald, I.1
-
35
-
-
56649096133
-
Fast agglomerative clustering for rendering
-
August
-
B. Walter, K. Bala, M. Kulkarni, and K. Pingali. Fast agglomerative clustering for rendering. In IEEE Symposium on Interactive Ray Tracing (RT), pages 81-86, August 2008.
-
(2008)
IEEE Symposium on Interactive Ray Tracing (RT)
, pp. 81-86
-
-
Walter, B.1
Bala, K.2
Kulkarni, M.3
Pingali, K.4
-
36
-
-
33645154106
-
Lightcuts: A scalable approach to illumination
-
July
-
B. Walter, S. Fernandez, A. Arbree, K. Bala, M. Donikian, and D. Greenberg. Lightcuts: a scalable approach to illumination. ACM Transactions on Graphics (SIGGRAPH), 24(3):1098-1107, July 2005.
-
(2005)
ACM Transactions on Graphics (SIGGRAPH)
, vol.24
, Issue.3
, pp. 1098-1107
-
-
Walter, B.1
Fernandez, S.2
Arbree, A.3
Bala, K.4
Donikian, M.5
Greenberg, D.6
-
37
-
-
0343462141
-
Automated empirical optimization of software and the ATLAS project
-
C. Whaley, A. Petitet, and J. J. Dongarra. Automated empirical optimization of software and the ATLAS project. Parallel Computing, 27:2001, 2000.
-
(2001)
Parallel Computing
, vol.27
, pp. 2000
-
-
Whaley, C.1
Petitet, A.2
Dongarra, J.J.3