메뉴 건너뛰기




Volumn 46, Issue 10, 2011, Pages 463-482

Enhancing locality for recursive traversals of recursive structures

Author keywords

Irregular programs; Locality transformations; Tree traversals

Indexed keywords

AS GRAPH; AUTOTUNING TECHNIQUES; DATA REUSE; DENSE MATRICES; LOOP TRANSFORMATION; OPTIMIZATION FRAMEWORK; PERFORMANCE IMPROVEMENTS; POINTER-BASED DATA STRUCTURES; REAL-WORLD APPLICATION; RECURSIVE STRUCTURE; TEMPORAL LOCALITY; TRAVERSAL ALGORITHMS; TREE TRAVERSAL;

EID: 84858310773     PISSN: 15232867     EISSN: None     Source Type: Journal    
DOI: 10.1145/2076021.2048104     Document Type: Conference Paper
Times cited : (14)

References (37)
  • 1
    • 0032179944 scopus 로고    scopus 로고
    • Distribution-independent hierarchical algorithms for the nbody problem
    • October
    • S. Aluru, J. Gustafson, G. M. Prabhu, and F. E. Sevilgen. Distribution-independent hierarchical algorithms for the nbody problem. J. Supercomput., 12:303-323, October 1998.
    • (1998) J. Supercomput. , vol.12 , pp. 303-323
    • Aluru, S.1    Gustafson, J.2    Prabhu, G.M.3    Sevilgen, F.E.4
  • 2
    • 84957021854 scopus 로고    scopus 로고
    • A data parallel formulation of the barnes-hut method for n-body simulations
    • Applied Parallel Computing New Paradigms for HPC in Industry and Academia 5th International Workshop, PARA 2000 Bergen, Norway, June 18-20, 2000 Proceedings
    • M. Amor, F. Argüello, J. López, O. G. Plata, and E. L. Zapata. A data parallel formulation of the barnes-hut method for n -body simulations. In Proceedings of the 5th International Workshop on Applied Parallel Computing, New Paradigms for HPC in Industry and Academia, PARA '00, pages 342-349, London, UK, 2001. Springer-Verlag. (Pubitemid 33239315)
    • (2001) Lecture Notes in Computer Science , Issue.1947 , pp. 342-349
    • Amor, M.1    Arguello, F.2    Lopez, J.3    Plata, O.4    Zapata, E.L.5
  • 3
    • 33846349887 scopus 로고
    • A hierarchical o(nlogn) forcecalculation algorithm
    • December
    • J. Barnes and P. Hut. A hierarchical o(nlogn) forcecalculation algorithm. Nature, 324(4):446-449, December 1986.
    • (1986) Nature , vol.324 , Issue.4 , pp. 446-449
    • Barnes, J.1    Hut, P.2
  • 6
    • 0008572520 scopus 로고    scopus 로고
    • Using generational garbage collection to implement cache-conscious data placement
    • T. M. Chilimbi and J. R. Larus. Using generational garbage collection to implement cache-conscious data placement. In Proceedings of the 1st international symposium on Memory management, ISMM '98, pages 37-48, New York, NY, USA, 1998. ACM. (Pubitemid 129686058)
    • (1999) SIGPLAN Notices (ACM Special Interest Group on Programming Languages) , vol.34 , Issue.3 , pp. 37-48
    • Chilimbi, T.M.1    Larus, J.R.2
  • 8
    • 0003861399 scopus 로고
    • A graphical approach to load balancing and sparse matrix vector multiplication on the hypercube
    • G. C. Fox. A graphical approach to load balancing and sparse matrix vector multiplication on the hypercube. Institute for Mathematics and Its Applications, 13:37-+, 1988.
    • (1988) Institute for Mathematics and Its Applications , vol.13
    • Fox, G.C.1
  • 12
    • 0042482650 scopus 로고    scopus 로고
    • N-body problems in statistical learning
    • In T. K. Leen, T. G. Dietterich, and V. Tresp, editors. Dec, MIT Press
    • A. G. Gray and A.W. Moore. N-Body Problems in Statistical Learning. In T. K. Leen, T. G. Dietterich, and V. Tresp, editors, Advances in Neural Information Processing Systems (NIPS) 13 (Dec 2000). MIT Press, 2001.
    • (2000) Advances in Neural Information Processing Systems (NIPS) , vol.13
    • Gray, A.G.1    Moore, A.W.2
  • 15
    • 0141427127 scopus 로고
    • Vectorization of tree traversals
    • March
    • L. Hernquist. Vectorization of tree traversals. J. Comput. Phys., 87:137-147, March 1990.
    • (1990) J. Comput. Phys. , vol.87 , pp. 137-147
    • Hernquist, L.1
  • 19
    • 44849094749 scopus 로고    scopus 로고
    • Fast n-body simulation with cuda
    • M. H. L. Nyland and J. Prins. Fast n-body simulation with cuda. GPU Gems, (3):677-695, 2007.
    • (2007) GPU Gems , vol.3 , pp. 677-695
    • Nyland, M.H.L.1    Prins, J.2
  • 21
    • 0009157947 scopus 로고
    • Vectorization of a treecode
    • March
    • J. Makino. Vectorization of a treecode. J. Comput. Phys., 87:148-160, March 1990.
    • (1990) J. Comput. Phys. , vol.87 , pp. 148-160
    • Makino, J.1
  • 23
    • 1542601822 scopus 로고    scopus 로고
    • Improving memory hierarchy performance for irregular applications using data and computation reorderings
    • J. Mellor-Crummey, D. Whalley, and K. Kennedy. Improving memory hierarchy performance for irregular applications using data and computation reorderings. Int. J. Parallel Program., 29(3):217-247, 2001. (Pubitemid 33818435)
    • (2001) International Journal of Parallel Programming , vol.29 , Issue.3 , pp. 217-247
    • Mellor-Crummey, J.1    Whalley, D.2    Kennedy, K.3
  • 26
    • 0031274872 scopus 로고    scopus 로고
    • Commutativity analysis: A new analysis technique for parallelizing compilers
    • M. Rinard and P. C. Diniz. Commutativity analysis: a new analysis technique for parallelizing compilers. ACM Trans. Program. Lang. Syst., 19(6):942-991, 1997. (Pubitemid 127455640)
    • (1997) ACM Transactions on Programming Languages and Systems , vol.19 , Issue.6 , pp. 942-991
    • Rinard, M.C.1    Diniz, P.C.2
  • 29
    • 0043005053 scopus 로고
    • Load balancing and data locality in adaptive hierarchical nbody methods: Barnes-hut fast multipole, and radiosity
    • J. P. Singh, C. Holt, T. Totsuka, A. Gupta, and J. Hennessy. Load balancing and data locality in adaptive hierarchical nbody methods: Barnes-hut, fast multipole, and radiosity. J. Parallel Distrib. Comput., 27(2):118-141, 1995.
    • (1995) J. Parallel Distrib. Comput. , vol.27 , Issue.2 , pp. 118-141
    • Singh, J.P.1    Holt, C.2    Totsuka, T.3    Gupta, A.4    Hennessy, J.5
  • 34
    • 47249090497 scopus 로고    scopus 로고
    • On fast construction of sah-based bounding volume hierarchies
    • Washington, DC, USA, 2007. IEEE Computer Society
    • I. Wald. On fast construction of sah-based bounding volume hierarchies. In RT '07: Proceedings of the 2007 IEEE Symposium on Interactive Ray Tracing, pages 33-40, Washington, DC, USA, 2007. IEEE Computer Society.
    • RT ' 07: Proceedings of the 2007 IEEE Symposium on Interactive Ray Tracing , pp. 33-40
    • Wald, I.1
  • 37
    • 0343462141 scopus 로고    scopus 로고
    • Automated empirical optimization of software and the atlas project
    • C. Whaley, A. Petitet, and J. J. Dongarra. Automated empirical optimization of software and the atlas project. Parallel Computing, 27:2001, 2000.
    • (2001) Parallel Computing , vol.27 , pp. 2000
    • Whaley, C.1    Petitet, A.2    Dongarra, J.J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.