-
1
-
-
0030382365
-
Shared memory consistency models: A tutorial
-
S. V. Adve and K. Gharachorloo. Shared memory consistency models: A tutorial. Computer, 29(12):66-76,December 1996. (Pubitemid 126517873)
-
(1996)
Computer
, vol.29
, Issue.12
, pp. 66-76
-
-
Adve, S.V.1
Gharachorloo, K.2
-
2
-
-
0023563093
-
A model for hierarchical memory
-
New York, NY, USA, ACM
-
A. Aggarwal, B. Alpern, A. Chandra, and M. Snir. A model for hierarchical memory. In Proceedings of the nineteenth annual ACM symposium on Theory of computing, STOC '87, pages 305-314,New York, NY, USA, 1987. ACM.
-
(1987)
Proceedings of the Nineteenth Annual ACM Symposium on Theory of Computing, STOC '87
, pp. 305-314
-
-
Aggarwal, A.1
Alpern, B.2
Chandra, A.3
Snir, M.4
-
3
-
-
0028483922
-
The uniform memory hierarchy model of computation
-
10.1007/BF01185206
-
B. Alpern, L. Carter, E. Feig, and T. Selker. The uniform memory hierarchy model of computation. Algorithmica, 12:72-109, 1994. 10.1007/BF01185206.
-
(1994)
Algorithmica
, vol.12
, pp. 72-109
-
-
Alpern, B.1
Carter, L.2
Feig, E.3
Selker, T.4
-
5
-
-
85060036181
-
Validity of the single processor approach to achieving large scale computing capabilities
-
New York, NY, USA, ACM
-
G.M. Amdahl. Validity of the single processor approach to achieving large scale computing capabilities. In Proceedings of the April 18-20, 1967, spring joint computer conference, AFIPS '67 (Spring), pages 483-485, New York, NY, USA, 1967. ACM.
-
(1967)
Proceedings of the April 18-20, 1967, Spring Joint Computer Conference, AFIPS '67 (Spring)
, pp. 483-485
-
-
Amdahl, G.M.1
-
6
-
-
33846349887
-
A hierarchical O(N log N) force-calculation algorithm
-
December
-
J. Barnes and P. Hut. A hierarchical O(N log N) force-calculation algorithm. Nature, 324(6096):446-449,December 1986.
-
(1986)
Nature
, vol.324
, Issue.6096
, pp. 446-449
-
-
Barnes, J.1
Hut, P.2
-
7
-
-
85015899515
-
The price of performance
-
September
-
L. A. Barroso. The price of performance. Queue, 3(7):48-53, September 2005.
-
(2005)
Queue
, vol.3
, Issue.7
, pp. 48-53
-
-
Barroso, L.A.1
-
8
-
-
84929524862
-
Cellular automata in triangular, pentagonal and hexagonal tessellations
-
Springer New York
-
C. Bays. Cellular automata in triangular, pentagonal and hexagonal tessellations. In Robert A. Meyers, editor, Computational Complexity, pages 434-442. Springer New York, 2012.
-
(2012)
Robert A. Meyers, Editor, Computational Complexity
, pp. 434-442
-
-
Bays, C.1
-
10
-
-
84856569900
-
A sparse octree gravitational n-body code that runs entirely on the GPU processor
-
April
-
J. Bédorf, E. Gaburov, and S. P. Zwart. A sparse octree gravitational n-body code that runs entirely on the GPU processor. J. Comput. Phys., 231(7):2825-2839,April 2012.
-
(2012)
J. Comput. Phys.
, vol.231
, Issue.7
, pp. 2825-2839
-
-
Bédorf, J.1
Gaburov, E.2
Zwart, S.P.3
-
11
-
-
84857176549
-
Real-time terrain modeling using cpu-GPU coupled computation
-
Washington, DC, USA, IEEE Computer Society
-
A. Bernhardt, A. Maximo, L. Velho, H. Hnaidi, andM.-P. Cani. Real-time terrain modeling using cpu-GPU coupled computation. In Proceedings of the 2011 24th SIBGRAPI Conference on Graphics, Patterns and Images, SIBGRAPI '11, pages 64-71,Washington, DC, USA, 2011. IEEE Computer Society.
-
(2011)
Proceedings of the 2011 24th SIBGRAPI Conference on Graphics, Patterns and Images, SIBGRAPI '11
, pp. 64-71
-
-
Bernhardt, A.1
Maximo, A.2
Velho, L.3
Hnaidi, H.4
Cani, M.-P.5
-
12
-
-
84887763139
-
Civil and structural engineering computing: 2001
-
Saxe-Coburg Publications
-
Z. Bittnar, J. Kruis, J. Němeček, B. Patzák, and D. Rypl. Civil and structural engineering computing: 2001. chapter Parallel and distributed computations for structural mechanics: a review, pages 211-233. Saxe-Coburg Publications, 2001.
-
(2001)
Chapter Parallel and Distributed Computations for Structural Mechanics: A Review
, pp. 211-233
-
-
Bittnar, Z.1
Kruis, J.2
Němeček, J.3
Patzák, B.4
Rypl, D.5
-
15
-
-
10644248153
-
Brook for GPUs: Stream computing on graphics hardware
-
August
-
I. Buck, T. Foley, D. Horn, J. Sugerman, K. Fatahalian,M. Houston, and P. Hanrahan. Brook for GPUs: stream computing on graphics hardware. ACM Trans. Graph., 23(3):777-786, August 2004.
-
(2004)
ACM Trans. Graph.
, vol.23
, Issue.3
, pp. 777-786
-
-
Buck, I.1
Foley, T.2
Horn, D.3
Sugerman, J.4
Fatahalian, K.5
Houston, M.6
Hanrahan, P.7
-
16
-
-
78149352231
-
K-model: A new computational model for stream processors
-
Washington, DC, USA, IEEE Computer Society
-
G. Capannini, F. Silvestri, and R. Baraglia. K-model: A new computational model for stream processors. In Proceedings of the 2010 IEEE 12th International Conference on High Performance Computing and Communications, HPCC '10, pages 239-246, Washington, DC, USA, 2010. IEEE Computer Society.
-
(2010)
Proceedings of the 2010 IEEE 12th International Conference on High Performance Computing and Communications, HPCC '10
, pp. 239-246
-
-
Capannini, G.1
Silvestri, F.2
Baraglia, R.3
-
19
-
-
0025431398
-
The impact of synchronization and granularity on parallel systems
-
May
-
D.-K. Chen, H.-M. Su, and P.-C. Yew. The impact of synchronization and granularity on parallel systems. SIGARCH Comput. Archit. News, 18(3a):239-248,May 1990.
-
(1990)
SIGARCH Comput. Archit. News
, vol.18
, Issue.3 A
, pp. 239-248
-
-
Chen, D.-K.1
Su, H.-M.2
Yew, P.-C.3
-
20
-
-
34248201490
-
A parallel implementation of the Cellular Potts Model for simulation of cell-based morphogenesis
-
DOI 10.1016/j.cpc.2007.03.007, PII S0010465507002044
-
N. Chen, J. A. Glazier, J. A. Izaguirre, and M. S. Alber. A parallel implementation of the cellular potts model for simulation of cell-based morphogenesis. Computer Physics Communications, 176(11-12):670-681, 2007. (Pubitemid 46722711)
-
(2007)
Computer Physics Communications
, vol.176
, Issue.11-12
, pp. 670-681
-
-
Chen, N.1
Glazier, J.A.2
Izaguirre, J.A.3
Alber, M.S.4
-
22
-
-
79957635333
-
GPU-based lighting and shadowing of complex natural scenes
-
August, Los Angeles, USA
-
F. Cohen, P. Decaudin, and F. Neyret. GPU-based lighting and shadowing of complex natural scenes. In Siggraph'04 Conf. DVD-ROM (Poster), August 2004. Los Angeles, USA.
-
(2004)
Siggraph'04 Conf. DVD-ROM (Poster)
-
-
Cohen, F.1
Decaudin, P.2
Neyret, F.3
-
25
-
-
77951291942
-
Exploring nvidia-cuda for video coding
-
New York, NY, USA, ACM
-
A. Colic, H. Kalva, and B. Furht. Exploring nvidia-cuda for video coding. In Proceedings of the first annual ACM SIGMM conference on Multimedia systems, MMSys '10, pages 13-22, New York, NY, USA, 2010. ACM.
-
(2010)
Proceedings of the First Annual ACM SIGMM Conference on Multimedia Systems, MMSys '10
, pp. 13-22
-
-
Colic, A.1
Kalva, H.2
Furht, B.3
-
26
-
-
8744267587
-
Universality in elementary cellular automata
-
M. Cook. Universality in Elementary Cellular Automata. Complex Systems, 15(1):1-40, 2004. (Pubitemid 39203440)
-
(2004)
Complex Systems
, vol.15
, Issue.1
, pp. 1-40
-
-
Cook, M.1
-
27
-
-
0004116989
-
-
McGraw-Hill Higher Education, 2nd edition
-
T. H. Cormen, C. Stein, R. L. Rivest, and C. E. Leiserson. Introduction to Algorithms. McGraw-Hill Higher Education, 2nd edition, 2001.
-
(2001)
Introduction to Algorithms
-
-
Cormen, T.H.1
Stein, C.2
Rivest, R.L.3
Leiserson, C.E.4
-
30
-
-
84878137471
-
A GPU-based method for generating quasi-delaunay triangulations based on edge-flips
-
GRAPP 2013, February
-
E. Scheihing, C. A. Navarro, N. Hitschfeld-Kahler. A GPU-based method for generating quasi-delaunay triangulations based on edge-flips. In Proceedings of the 8th International on Computer Graphics, Theory and Applications, GRAPP 2013, pages 27-34, February 2013.
-
(2013)
Proceedings of the 8th International on Computer Graphics, Theory and Applications
, pp. 27-34
-
-
Scheihing, E.1
Navarro, C.A.2
Hitschfeld-Kahler, N.3
-
31
-
-
0009346826
-
Logp: Towards a realistic model of parallel computation
-
July
-
D. Culler, R. Karp, D. Patterson, A. Sahay, K. E. Schauser, E. Santos, R. Subramonian, and T. von Eicken. Logp: towards a realistic model of parallel computation. SIGPLAN Not., 28(7):1-12, July 1993.
-
(1993)
SIGPLAN Not.
, vol.28
, Issue.7
, pp. 1-12
-
-
Culler, D.1
Karp, R.2
Patterson, D.3
Sahay, A.4
Schauser, K.E.5
Santos, E.6
Subramonian, R.7
Von Eicken, T.8
-
32
-
-
37549003336
-
Mapreduce: Simplified data processing on large clusters
-
January
-
J. Dean and S. Ghemawat. Mapreduce: simplified data processing on large clusters. Commun. ACM, 51(1):107-113, January 2008.
-
(2008)
Commun. ACM
, vol.51
, Issue.1
, pp. 107-113
-
-
Dean, J.1
Ghemawat, S.2
-
33
-
-
84945709358
-
Solution of a problemin concurrent programming control
-
September
-
E.W. Dijkstra. Solution of a problemin concurrent programming control. Commun. ACM, 8(9):569-, September 1965.
-
(1965)
Commun. ACM
, vol.8
, Issue.9
, pp. 569
-
-
Dijkstra, E.W.1
-
34
-
-
60649084618
-
Semaphores for fair scheduling monitor conditions
-
May
-
N. Dunstan. Semaphores for fair scheduling monitor conditions. SIGOPS Oper. Syst. Rev., 25(3):27-31,May 1991.
-
(1991)
SIGOPS Oper. Syst. Rev.
, vol.25
, Issue.3
, pp. 27-31
-
-
Dunstan, N.1
-
35
-
-
38249040751
-
Superlinear speedup of an efficient sequential algorithm is not possible
-
July
-
V. Faber, O. M. Lubeck, and A. B. White, Jr. Superlinear speedup of an efficient sequential algorithm is not possible. Parallel Comput., 3(3):259-260, July 1986.
-
(1986)
Parallel Comput.
, vol.3
, Issue.3
, pp. 259-260
-
-
Faber, V.1
Lubeck, O.M.2
White Jr., A.B.3
-
36
-
-
78650745600
-
Octree-based, GPU implementation of a continuous cellular automaton for the simulation of complex, evolving surfaces
-
N. Ferrando, M. A. Gosalvez, J. Cerda, R. G. Girones, and K. Sato. Octree-based, GPU implementation of a continuous cellular automaton for the simulation of complex, evolving surfaces. Computer Physics Communications, pages 628-640, 2011.
-
(2011)
Computer Physics Communications
, pp. 628-640
-
-
Ferrando, N.1
Gosalvez, M.A.2
Cerda, J.3
Girones, R.G.4
Sato, K.5
-
37
-
-
84860312760
-
Q-state pottsmodelmetastability study using optimized GPU-based monte carlo algorithms
-
E. E. Ferrero, J. P. De Francesco,N.Wolovick, and S.A. Cannas. q-state pottsmodelmetastability study using optimized GPU-based monte carlo algorithms. Computer Physics Communications, 183(8):1578 - 1587, 2012.
-
(2012)
Computer Physics Communications
, vol.183
, Issue.8
, pp. 1578-1587
-
-
Ferrero, E.E.1
De Francesco, J.P.2
Wolovick, N.3
Cannas, S.A.4
-
38
-
-
0015401565
-
Some computer organizations and their effectiveness
-
September
-
M. J. Flynn. Some computer organizations and their effectiveness. IEEE Trans. Comput., 21(9):948-960, September 1972.
-
(1972)
IEEE Trans. Comput.
, vol.21
, Issue.9
, pp. 948-960
-
-
Flynn, M.J.1
-
39
-
-
0018052202
-
Parallelism in random access machines
-
New York, NY, USA, ACM
-
S. Fortune and J. Wyllie. Parallelism in random access machines. In Proceedings of the tenth annual ACM symposium on Theory of computing, STOC '78, pages 114-118, New York, NY, USA, 1978. ACM.
-
(1978)
Proceedings of the Tenth Annual ACM Symposium on Theory of Computing, STOC '78
, pp. 114-118
-
-
Fortune, S.1
Wyllie, J.2
-
40
-
-
0004146408
-
-
Addison-Wesley Longman Publishing Co., Inc., Boston,MA, USA
-
I. Foster. Designing and building parallel programs: Concepts and tools for parallel software engineering. Addison-Wesley Longman Publishing Co., Inc., Boston,MA, USA, 1995.
-
(1995)
Designing and Building Parallel Programs: Concepts and Tools for Parallel Software Engineering
-
-
Foster, I.1
-
41
-
-
35048884271
-
Open MPI: Goals, concept, and design of a next generation MPI implementation
-
Budapest, Hungary, September
-
E. Gabriel, G. E. Fagg, G. Bosilca, T. Angskun, J. J. Dongarra, J. M. Squyres, V. Sahay, P. Kambadur, B. Barrett, A. Lumsdaine, R. H. Castain, D. J. Daniel, R. L. Graham, and T. S. Woodall. Open MPI: Goals, concept, and design of a next generation MPI implementation. In Proceedings, 11th European PVM/MPI Users' Group Meeting, pages 97-104, Budapest, Hungary, September 2004.
-
(2004)
Proceedings, 11th European PVM/MPI Users' Group Meeting
, pp. 97-104
-
-
Gabriel, E.1
Fagg, G.E.2
Bosilca, G.3
Angskun, T.4
Dongarra, J.J.5
Squyres, J.M.6
Sahay, V.7
Kambadur, P.8
Barrett, B.9
Lumsdaine, A.10
Castain, R.H.11
Daniel, D.J.12
Graham, R.L.13
Woodall, T.S.14
-
42
-
-
0000870032
-
The fantastic combinations of John Conway's new solitaire game "life
-
October
-
M. Gardner. The fantastic combinations of John Conway's new solitaire game "life". Scientific American, 223:120-123,October 1970.
-
(1970)
Scientific American
, vol.223
, pp. 120-123
-
-
Gardner, M.1
-
43
-
-
53049083461
-
GPU accelerated computation and visualization of hexagonal cellular automata
-
Berlin, Heidelberg, Springer-Verlag
-
S. Gobron, H. Bonafos, and D. Mestre. GPU accelerated computation and visualization of hexagonal cellular automata. In Proceedings of the 8th international conference on Cellular Automata for Reseach and Industry, ACRI '08, pages 512-521, Berlin, Heidelberg, 2008. Springer-Verlag.
-
(2008)
Proceedings of the 8th International Conference on Cellular Automata for Reseach and Industry, ACRI '08
, pp. 512-521
-
-
Gobron, S.1
Bonafos, H.2
Mestre, D.3
-
44
-
-
84865249616
-
GPGPU computation and visualization of three-dimensional cellular automata
-
S. Gobron, A. Çöltekin, H. Bonafos, and D. Thalmann. GPGPU computation and visualization of three-dimensional cellular automata. The Visual Computer, 27(1):67-81, 2011.
-
(2011)
The Visual Computer
, vol.27
, Issue.1
, pp. 67-81
-
-
Gobron, S.1
Çöltekin, A.2
Bonafos, H.3
Thalmann, D.4
-
45
-
-
36549080465
-
Retina simulation using cellular automata and GPU programming
-
DOI 10.1007/s00138-006-0065-8
-
S. Gobron, F. Devillard, and B. Heit. Retina simulation using cellular automata and GPU programming. Mach. Vision Appl., 18(6):331-342,November 2007. (Pubitemid 350178426)
-
(2007)
Machine Vision and Applications
, vol.18
, Issue.6
, pp. 331-342
-
-
Gobron, S.1
Devillard, F.2
Heit, B.3
-
48
-
-
0034513885
-
Automatic parallelization of recursive procedures
-
DOI 10.1023/A:1007560600904
-
M. Gupta, S. Mukhopadhyay, and N. Sinha. Automatic parallelization of recursive procedures. Int. J. Parallel Program., 28(6):537-562,December 2000. (Pubitemid 32076092)
-
(2000)
International Journal of Parallel Programming
, vol.28
, Issue.6
, pp. 537-562
-
-
Gupta, M.1
Mukhopadhyay, S.2
Sinha, N.3
-
52
-
-
74049152899
-
42 tflops hierarchical n-body simulations on GPUs with applications in both astrophysics and turbulence
-
T. Hamada, T. Narumi, R. Yokota, K. Yasuoka, K. Nitadori, and M. Taiji. 42 tflops hierarchical n-body simulations on GPUs with applications in both astrophysics and turbulence. In SC, 2009.
-
(2009)
SC
-
-
Hamada, T.1
Narumi, T.2
Yokota, R.3
Yasuoka, K.4
Nitadori, K.5
Taiji, M.6
-
53
-
-
65249114041
-
Real-time rigid body simulation on GPUs
-
editor, GPU Gems 3, Addison-Wesley
-
T. Harada. Real-time rigid body simulation on GPUs. In Hubert Nguyen, editor, GPU Gems 3, pages 611-632. Addison-Wesley, 2008.
-
(2008)
Hubert Nguyen
, pp. 611-632
-
-
Harada, T.1
-
54
-
-
0016114085
-
Monitors: An operating system structuring concept
-
October
-
C. A. R. Hoare. Monitors: an operating system structuring concept. Commun. ACM, 17(10):549-557,October 1974.
-
(1974)
Commun. ACM
, vol.17
, Issue.10
, pp. 549-557
-
-
Hoare, C.A.R.1
-
55
-
-
78149231331
-
Mapcg: Writing parallel program portable between cpu and GPU
-
New York, NY, USA, ACM
-
C. Hong, D. Chen, W. Chen, W. Zheng, and H. Lin. Mapcg: writing parallel program portable between cpu and GPU. In Proceedings of the 19th international conference on Parallel architectures and compilation techniques, PACT '10, pages 217-226, New York, NY, USA, 2010. ACM.
-
(2010)
Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques, PACT '10
, pp. 217-226
-
-
Hong, C.1
Chen, D.2
Chen, W.3
Zheng, W.4
Lin, H.5
-
56
-
-
49249134204
-
Interactive k-d tree GPU raytracing
-
New York, NY, USA, ACM
-
D. R. Horn, J. Sugerman, M. Houston, and P. Hanrahan. Interactive k-d tree GPU raytracing. In Proceedings of the 2007 symposium on Interactive 3D graphics and games, I3D '07, pages 167-174,New York, NY, USA, 2007. ACM.
-
(2007)
Proceedings of the 2007 Symposium on Interactive 3D Graphics and Games, I3D '07
, pp. 167-174
-
-
Horn, D.R.1
Sugerman, J.2
Houston, M.3
Hanrahan, P.4
-
57
-
-
84869451864
-
An energy efficient 32nm 20 MB L3 cache for IntelR XeonR processor E5 family
-
IEEE
-
M. Huang, M. Mehalel, R. Arvapalli, and S. He. An energy efficient 32nm 20 MB L3 cache for IntelR XeonR processor E5 family. In CICC, pages 1-4. IEEE, 2012.
-
(2012)
CICC
, pp. 1-4
-
-
Huang, M.1
Mehalel, M.2
Arvapalli, R.3
He, S.4
-
58
-
-
84887667003
-
The n-body problem throughout the computer science curriculum
-
June
-
L. Ivanov. The n-body problem throughout the computer science curriculum. J. Comput. Sci. Coll., 22(6):43-52, June 2007.
-
(2007)
J. Comput. Sci. Coll.
, vol.22
, Issue.6
, pp. 43-52
-
-
Ivanov, L.1
-
60
-
-
0035311963
-
3D collision detection: A survey
-
DOI 10.1016/S0097-8493(00)00130-8, PII S0097849300001308
-
P. Jimenez, F. Thomas, and C. Torras. 3d collision detection: A survey. Computers and Graphics, 25:269-285, 2000. (Pubitemid 32272850)
-
(2001)
Computers and Graphics (Pergamon)
, vol.25
, Issue.2
, pp. 269-285
-
-
Jimenez, P.1
Thomas, F.2
Torras, C.3
-
61
-
-
75149160554
-
Latticemethods for fluid animation in games
-
January
-
S. F. Judice, B. Barcellos, S. Coutinho, and G. A. Giraldi. Latticemethods for fluid animation in games. Comput. Entertain., 7(4):56:1-56:29, January 2010.
-
(2010)
Comput. Entertain.
, vol.7
, Issue.4
, pp. 561-5629
-
-
Judice, S.F.1
Barcellos, B.2
Coutinho, S.3
Giraldi, G.A.4
-
62
-
-
79952161329
-
Implicit surface octrees for ray tracing point models
-
New York, NY, USA, ACM
-
S. Kashyap, R. Goradia, P. Chaudhuri, and S. Chandran. Implicit surface octrees for ray tracing point models. In Proceedings of the Seventh Indian Conference on Computer Vision, Graphics and Image Processing, ICVGIP '10, pages 227-234, New York, NY, USA, 2010. ACM.
-
(2010)
Proceedings of the Seventh Indian Conference on Computer Vision, Graphics and Image Processing, ICVGIP '10
, pp. 227-234
-
-
Kashyap, S.1
Goradia, R.2
Chaudhuri, P.3
Chandran, S.4
-
64
-
-
0034764291
-
Real-time bump map synthesis
-
J. Kautz, W. Heidrich, and H.-P. Seidel. Real-time bump map synthesis. In Proceedings of the ACMSIGGRAPH/EUROGRAPHICS workshop on Graphics hardware,HWWS '01, pages 109-114,New York, NY, USA, 2001. ACM. (Pubitemid 33046565)
-
(2001)
Proceedings of the ACM SIGGRAPH Conference on Computer Graphics
, Issue.WORKSHOP
, pp. 109-114
-
-
Kautz, J.1
Heidrich, W.2
Seidel, H.-P.3
-
66
-
-
0031399546
-
Parallel processing for terrain analysis in GIS: Visibility as a case study
-
August
-
D. B. Kidner, P. J. Rallings, and J. A. Ware. Parallel processing for terrain analysis in GIS: Visibility as a case study. Geoinformatica, 1(2):183-207, August 1997.
-
(1997)
Geoinformatica
, vol.1
, Issue.2
, pp. 183-207
-
-
Kidner, D.B.1
Rallings, P.J.2
Ware, J.A.3
-
68
-
-
35248867363
-
The structure of a compiler for explicit and implicit parallelism
-
Berlin, Heidelberg, Springer-Verlag
-
S. W. Kim and R. Eigenmann. The structure of a compiler for explicit and implicit parallelism. In Proceedings of the 14th international conference on Languages and compilers for parallel computing, LCPC'01, pages 336-351, Berlin, Heidelberg, 2003. Springer-Verlag.
-
(2003)
Proceedings of the 14th International Conference on Languages and Compilers for Parallel Computing, LCPC'01
, pp. 336-351
-
-
Kim, S.W.1
Eigenmann, R.2
-
69
-
-
70449559873
-
LCP algorithms for collision detection using CUDA
-
editor, GPUGems 3, Addison-Wesley
-
P. Kipfer. LCP algorithms for collision detection using CUDA. In Hubert Nguyen, editor, GPUGems 3, pages 723-739. Addison-Wesley, 2007.
-
(2007)
Hubert Nguyen
, pp. 723-739
-
-
Kipfer, P.1
-
70
-
-
0016318617
-
Computer programming as an art
-
December
-
D. E. Knuth. Computer programming as an art. Commun. ACM, 17(12):667-673,December 1974.
-
(1974)
Commun. ACM
, vol.17
, Issue.12
, pp. 667-673
-
-
Knuth, D.E.1
-
71
-
-
84855206756
-
GPU-based single-cluster algorithm for the simulation of the ising model
-
February
-
Y. Komura and Y. Okabe. GPU-based single-cluster algorithm for the simulation of the ising model. J. Comput. Phys., 231(4):1209-1215, February 2012.
-
(2012)
J. Comput. Phys.
, vol.231
, Issue.4
, pp. 1209-1215
-
-
Komura, Y.1
Okabe, Y.2
-
72
-
-
84867582560
-
Multi-GPU-based swendsenVwang multi-cluster algorithm for the simulation of two-dimensional -state pottsmodel
-
Y. Komura and Y. Okabe. Multi-GPU-based swendsenVwang multi-cluster algorithm for the simulation of two-dimensional -state pottsmodel. Computer Physics Communications, 184(1):40 - 44, 2013.
-
(2013)
Computer Physics Communications
, vol.184
, Issue.1
, pp. 40-44
-
-
Komura, Y.1
Okabe, Y.2
-
74
-
-
67650035436
-
Effective automatic parallelization of stencil computations
-
DOI 10.1145/1250734.1250761, PLDI'07: Proceedings of the 2007 ACM SIGPLAN Conference on Programming Language Design and Implementation
-
S. Krishnamoorthy, M. Baskaran, U. Bondhugula, J. Ramanujam, A. Rountev, and P. Sadayappan. Effective automatic parallelization of stencil computations. SIGPLAN Not., 42(6):235-244, June 2007. (Pubitemid 47630691)
-
(2007)
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI)
, pp. 235-244
-
-
Krishnamoorthy, S.1
Baskaran, M.2
Bondhugula, U.3
Ramanujam, J.4
Rountev, A.5
Sadayappan, P.6
-
75
-
-
77954995885
-
-
June
-
V. W. Lee, C. Kim, J. Chhugani, M. Deisher, D. Kim, A. D. Nguyen, N. Satish, M. Smelyanskiy, S. Chennupaty, P. Hammarlund, R. Singhal, and P. Dubey. Debunking the 100x GPU vs. cpu myth: an evaluation of throughput computing on cpu and GPU. SIGARCH Comput. Archit. News, 38(3):451-460, June 2010.
-
(2010)
Debunking the 100x GPU Vs. Cpu Myth: An Evaluation of Throughput Computing on Cpu and GPU. SIGARCH Comput. Archit. News
, vol.38
, Issue.3
, pp. 451-460
-
-
Lee, V.W.1
Kim, C.2
Chhugani, J.3
Deisher, M.4
Kim, D.5
Nguyen, A.D.6
Satish, N.7
Smelyanskiy, M.8
Chennupaty, S.9
Hammarlund, P.10
Singhal, R.11
Dubey, P.12
-
76
-
-
0003819663
-
-
Morgan Kaufmann Publishers Inc., San Francisco, CA, USA
-
F. T. Leighton. Introduction to parallel algorithms and architectures: array, trees, hypercubes. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 1992.
-
(1992)
Introduction to Parallel Algorithms and Architectures: Array, Trees, Hypercubes
-
-
Leighton, F.T.1
-
78
-
-
77956874347
-
Orders-of-magnitude performance increases in GPU-accelerated correlation of images from the international space station
-
10.1007/s11554-009-0133-1
-
P. Lu, H. Oki, C. Frey, G. Chamitoff, L. Chiao, E. Fincke, C. Foale, S.Magnus,W. McArthur, D. Tani, P. Whitson, J. Williams, W. Meyer, R. Sicker, B. Au, M. Christiansen, A. Schofield, and D. Weitz. Orders-of-magnitude performance increases in GPU-accelerated correlation of images from the international space station. Journal of Real-Time Image Processing, 5:179-193, 2010. 10.1007/s11554-009-0133-1.
-
(2010)
Journal of Real-Time Image Processing
, vol.5
, pp. 179-193
-
-
Lu, P.1
Oki, H.2
Frey, C.3
Chamitoff, G.4
Chiao, L.5
Fincke, E.6
Foale, C.7
McArthur, S.MagnusW.8
Tani, D.9
Whitson, P.10
Williams, J.11
Meyer, W.12
Sicker, R.13
Au, B.14
Christiansen, M.15
Schofield, A.16
Weitz, D.17
-
79
-
-
34548757823
-
Automatic parallelization of scripting languages: Toward transparent desktop parallel computing
-
IPDPS 2007. IEEE International
-
X. Ma, J. Li, and N. F. Samatova. Automatic parallelization of scripting languages: Toward transparent desktop parallel computing. In Parallel and Distributed Processing Symposium, 2007. IPDPS 2007. IEEE International, pages 1-6, 2007.
-
(2007)
Parallel and Distributed Processing Symposium, 2007
, pp. 1-6
-
-
Ma, X.1
Li, J.2
Samatova, N.F.3
-
80
-
-
0142103318
-
The GPU enters computing's mainstream
-
M.Macedonia. The GPU enters computing's mainstream. Computer, 36(10):106-108, 2003.
-
(2003)
Computer
, vol.36
, Issue.10
, pp. 106-108
-
-
Macedonia, M.1
-
82
-
-
33646031235
-
Cg: A system for programming graphics hardware in a c-like language
-
July
-
W. R. Mark, R. S. Glanville, K. Akeley, and M. J. Kilgard. Cg: a system for programming graphics hardware in a c-like language. ACMTrans. Graph., 22(3):896-907, July 2003.
-
(2003)
ACMTrans. Graph.
, vol.22
, Issue.3
, pp. 896-907
-
-
Mark, W.R.1
Glanville, R.S.2
Akeley, K.3
Kilgard, M.J.4
-
83
-
-
77949643305
-
Introduction to GPU programming with glsl
-
Washington, DC, USA, IEEE Computer Society
-
R. Marroquim and A. Maximo. Introduction to GPU programming with glsl. In Proceedings of the 2009 Tutorials of the XXII Brazilian Symposium on Computer Graphics and Image Processing, SIBGRAPI-TUTORIALS '09, pages 3-16, Washington, DC, USA, 2009. IEEE Computer Society.
-
(2009)
Proceedings of the 2009 Tutorials of the XXII Brazilian Symposium on Computer Graphics and Image Processing, SIBGRAPI-TUTORIALS '09
, pp. 3-16
-
-
Marroquim, R.1
Maximo, A.2
-
84
-
-
85031896513
-
On parallel hashing and integer sorting
-
Springer Berlin / Heidelberg, 10.1007/BFb0032070
-
Y. Matias and U. Vishkin. On parallel hashing and integer sorting. In Michael Paterson, editor, Automata, Languages and Programming, volume 443 of LectureNotes in Computer Science, pages 729-743. Springer Berlin / Heidelberg, 1990. 10.1007/BFb0032070.
-
(1990)
Michael Paterson, Editor, Automata, Languages and Programming, Volume 443 of LectureNotes in Computer Science
, pp. 729-743
-
-
Matias, Y.1
Vishkin, U.2
-
85
-
-
0036954153
-
Shader metaprogramming
-
Aire-la-Ville, Switzerland, Switzerland, Eurographics Association
-
M. D. McCool, Z. Qin, and T. S. Popa. Shader metaprogramming. In Proceedings of the ACMSIGGRAPH/EUROGRAPHICS conference on Graphics hardware,HWWS '02, pages 57-68, Aire-la-Ville, Switzerland, Switzerland, 2002. Eurographics Association.
-
(2002)
Proceedings of the ACMSIGGRAPH/EUROGRAPHICS Conference on Graphics Hardware, HWWS '02
, pp. 57-68
-
-
McCool, M.D.1
Qin, Z.2
Popa, T.S.3
-
86
-
-
5744249209
-
Equation of state calculations by fast computing machines
-
N. Metropolis, A. Rosenbluth, M. Rosenbluth, A. Teller, and E. Teller. Equation of state calculations by fast computing machines. J. Chem. Phys., 21:1087, 1953.
-
(1953)
J. Chem. Phys.
, vol.21
, pp. 1087
-
-
Metropolis, N.1
Rosenbluth, A.2
Rosenbluth, M.3
Teller, A.4
Teller, E.5
-
89
-
-
55649109070
-
-
Addison-Wesley Professional, first edition
-
H. Nguyen. GPU gems 3. Addison-Wesley Professional, first edition, 2007.
-
(2007)
GPU Gems 3
-
-
Nguyen, H.1
-
90
-
-
0003500941
-
-
O'Reilly, 101Morris Street, Sebastopol, CA 95472
-
B. Nichols, D. Buttlar, and J. P. Farrell. Pthreads Programming. O'Reilly, 101Morris Street, Sebastopol, CA 95472, 1998.
-
(1998)
Pthreads Programming
-
-
Nichols, B.1
Buttlar, D.2
Farrell, J.P.3
-
94
-
-
85086423044
-
Hlsl shader model 4.0
-
New York, NY, USA, ACM
-
M. Oneppo. Hlsl shader model 4.0. In ACM SIGGRAPH 2007 courses, SIGGRAPH '07, pages 112-152,New York, NY, USA, 2007. ACM.
-
(2007)
ACM SIGGRAPH 2007 Courses, SIGGRAPH '07
, pp. 112-152
-
-
Oneppo, M.1
-
95
-
-
0141463137
-
-
Routledge, New York, NY
-
S. Openshaw and I. Turton. High Performance Computing and the Art of Parallel Programming: An Introduction for Geographers, Social Scientists, and Engineers. Routledge, New York, NY, 10001, 1999.
-
(1999)
High Performance Computing and the Art of Parallel Programming: An Introduction for Geographers, Social Scientists, and Engineers
, pp. 10001
-
-
Openshaw, S.1
Turton, I.2
-
96
-
-
78650496873
-
Fast and scalable CPU/GPU collision detection for rigid and deformable surfaces
-
S. Pabst, A. Koch, andW. Straßer. Fast and scalable CPU/GPU collision detection for rigid and deformable surfaces. Computer Graphics Forum, 29(5):1605-1612, 2010.
-
(2010)
Computer Graphics Forum
, vol.29
, Issue.5
, pp. 1605-1612
-
-
Pabst, S.1
Koch, A.2
Straßer, W.3
-
98
-
-
72449147182
-
Parallel reduction in resource lambda-calculus
-
M. Pagani and P. Tranquilli. Parallel reduction in resource lambda-calculus. In APLAS, pages 226-242, 2009.
-
(2009)
APLAS
, pp. 226-242
-
-
Pagani, M.1
Tranquilli, P.2
-
99
-
-
38249040569
-
Parallel efficiency can be greater than unity
-
D. Parkinson. Parallel efficiency can be greater than unity. Parallel Computing, 3(3):261-262, 1986.
-
(1986)
Parallel Computing
, vol.3
, Issue.3
, pp. 261-262
-
-
Parkinson, D.1
-
100
-
-
84887799548
-
To teach Newton's square root algorithm
-
December
-
H. A. Peelle. To teach Newton's square root algorithm. SIGAPL APL Quote Quad, 5(4):48-50, December 1974.
-
(1974)
SIGAPL APL Quote Quad
, vol.5
, Issue.4
, pp. 48-50
-
-
Peelle, H.A.1
-
101
-
-
0035417514
-
Locating and computing in parallel all the simple roots of special functions using PVM
-
DOI 10.1016/S0377-0427(00)00675-0, PII S0377042700006750, Special Issue: Orthogonal Polynomials Special Functions and their Applications
-
V. P. Plagianakos, N. K. Nousis, andM. N. Vrahatis. Locating and computing in parallel all the simple roots of special functions using pvm. J. Comput. Appl.Math., 133(1-2):545-554, August 2001. (Pubitemid 32826690)
-
(2001)
Journal of Computational and Applied Mathematics
, vol.133
, Issue.1-2
, pp. 545-554
-
-
Plagianakos, V.P.1
Nousis, N.K.2
Vrahatis, M.N.3
-
102
-
-
67349267818
-
GPU accelerated monte carlo simulation of the 2d and 3d ising model
-
July
-
T. Preis, P. Virnau, W. Paul, and J. J. Schneider. GPU accelerated monte carlo simulation of the 2d and 3d ising model. J. Comput. Phys., 228(12):4468-4477, July 2009.
-
(2009)
J. Comput. Phys.
, vol.228
, Issue.12
, pp. 4468-4477
-
-
Preis, T.1
Virnau, P.2
Paul, W.3
Schneider, J.J.4
-
103
-
-
85023166542
-
A work-efficient GPU algorithm for level set segmentation
-
Aire-la-Ville, Switzerland, Switzerland, Eurographics Association
-
M. Roberts, J. Packer, M. C. Sousa, and J. R. Mitchell. A work-efficient GPU algorithm for level set segmentation. In Proceedings of the Conference on High Performance Graphics, HPG '10, pages 123-132, Aire-la-Ville, Switzerland, Switzerland, 2010. Eurographics Association.
-
(2010)
Proceedings of the Conference on High Performance Graphics, HPG '10
, pp. 123-132
-
-
Roberts, M.1
Packer, J.2
Sousa, M.C.3
Mitchell, J.R.4
-
104
-
-
85008065154
-
Why cpu frequency stalled
-
April
-
P. E. Ross. Why cpu frequency stalled. IEEE Spectr., 45(4):72-72, April 2008.
-
(2008)
IEEE Spectr.
, vol.45
, Issue.4
, pp. 72-72
-
-
Ross, P.E.1
-
106
-
-
74349129727
-
Experiments with single core, multicore, and GPU based computation of cellular automata
-
Washington, DC, USA
-
S. Rybacki, J. Himmelspach, and A. M. Uhrmacher. Experiments with single core, multicore, and GPU based computation of cellular automata. In Proceedings of the 2009 First International Conference on Advances in System Simulation, SIMUL '09, pages 62-67,Washington, DC, USA, 2009. IEEE Computer Society.
-
(2009)
Proceedings of the 2009 First International Conference on Advances in System Simulation, SIMUL '09
, pp. 62-67
-
-
Rybacki, S.1
Himmelspach, J.2
Uhrmacher, A.M.3
-
107
-
-
85122636849
-
Progressive buffers: View-dependent geometry and texture lod rendering
-
Aire-la-Ville, Switzerland, Switzerland, Eurographics Association
-
P. V. Sander and J. L. Mitchell. Progressive buffers: view-dependent geometry and texture lod rendering. In Proceedings of the third Eurographics symposium on Geometry processing, SGP '05, Aire-la-Ville, Switzerland, Switzerland, 2005. Eurographics Association.
-
(2005)
Proceedings of the Third Eurographics Symposium on Geometry Processing, SGP '05
-
-
Sander, P.V.1
Mitchell, J.L.2
-
108
-
-
84870946930
-
Evaluation of a nearest-neighbor load balancing strategy for parallel molecular simulations in mpi environment
-
A. Di Serio and M. B. Ibáñez. Evaluation of a nearest-neighbor load balancing strategy for parallel molecular simulations in mpi environment. In PVM/MPI, pages 226-233, 2002.
-
(2002)
PVM/MPI
, pp. 226-233
-
-
Di Serio, A.1
Ibáñez, M.B.2
-
109
-
-
49049132956
-
An o(log n) parallel connectivity algorithm
-
Y. Shiloach and U. Vishkin. An o(log n) parallel connectivity algorithm. J. Algorithms, 3(1):57-67, 1982.
-
(1982)
J. Algorithms
, vol.3
, Issue.1
, pp. 57-67
-
-
Shiloach, Y.1
Vishkin, U.2
-
110
-
-
0003914107
-
-
Oxford University Press, Inc., New York, NY, USA
-
J. R. Smith. The design and analysis of parallel algorithms. Oxford University Press, Inc., New York, NY, USA, 1993.
-
(1993)
The Design and Analysis of Parallel Algorithms
-
-
Smith, J.R.1
-
112
-
-
60349097423
-
Gramps: A programming model for graphics pipelines
-
February
-
J. Sugerman, K. Fatahalian, S. Boulos, K. Akeley, and P. Hanrahan. Gramps: A programming model for graphics pipelines. ACM Trans. Graph., 28(1):4:1-4:11, February 2009.
-
(2009)
ACM Trans. Graph.
, vol.28
, Issue.1
, pp. 41-411
-
-
Sugerman, J.1
Fatahalian, K.2
Boulos, S.3
Akeley, K.4
Hanrahan, P.5
-
113
-
-
33747349191
-
Nonuniversal, critical dynamics in Monte Carlo simulations
-
R. H. Swendsen and J. S. Wang. Nonuniversal, critical dynamics in Monte Carlo simulations. Phys. Rev. Lett., 58:86, 1987.
-
(1987)
Phys. Rev. Lett.
, vol.58
, pp. 86
-
-
Swendsen, R.H.1
Wang, J.S.2
-
114
-
-
84887807755
-
Preliminary evaluations for hybrid memory cube with gather functions using FPGA
-
2012-03-19
-
N. Tanabe, N. Hori, B. Nuttapon, and H. Nakajo. Preliminary evaluations for hybrid memory cube with gather functions using FPGA. IPSJ SIG Notes, 2012(6):1-10, 2012-03-19.
-
(2012)
IPSJ SIG Notes
, Issue.6
, pp. 1-10
-
-
Tanabe, N.1
Hori, N.2
Nuttapon, B.3
Nakajo, H.4
-
116
-
-
74849085797
-
Data-parallel algorithms for large-scale real-time simulation of the cellular Potts model on graphics processing units
-
J. J. Tapia and R. D'Souza. Data-parallel algorithms for large-scale real-time simulation of the cellular Potts model on graphics processing units. 2009 IEEE International Conference on Systems Man and Cybernetics, (10):1411-1418, 2009.
-
(2009)
2009 IEEE International Conference on Systems Man and Cybernetics
, Issue.10
, pp. 1411-1418
-
-
Tapia, J.J.1
D'Souza, R.2
-
117
-
-
79251598410
-
Parallelizing the cellular potts model on graphics processing units
-
J. J. Tapia and R. D'Souza. Parallelizing the cellular potts model on graphics processing units. Computer Physics Communications, 182(4):857-865, 2011.
-
(2011)
Computer Physics Communications
, vol.182
, Issue.4
, pp. 857-865
-
-
Tapia, J.J.1
D'Souza, R.2
-
118
-
-
84865201971
-
GpGPU implementation of cellular automata model of water flow
-
Berlin, Heidelberg, Springer-Verlag
-
P. Topa and P. Mlocek. GpGPU implementation of cellular automata model of water flow. In Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I, PPAM'11, pages 630-639, Berlin, Heidelberg, 2012. Springer- Verlag.
-
(2012)
Proceedings of the 9th International Conference on Parallel Processing and Applied Mathematics - Volume Part I, PPAM'11
, pp. 630-639
-
-
Topa, P.1
Mlocek, P.2
-
119
-
-
0025467711
-
A bridging model for parallel computation
-
August
-
L. G. Valiant. A bridging model for parallel computation. Commun. ACM, 33(8):103-111, August 1990.
-
(1990)
Commun. ACM
, vol.33
, Issue.8
, pp. 103-111
-
-
Valiant, L.G.1
-
120
-
-
57349106747
-
A pram-on-chip vision (invited abstract)
-
U. Vishkin. A pram-on-chip vision (invited abstract). In SPIRE, page 260, 2000.
-
(2000)
SPIRE
, pp. 260
-
-
Vishkin, U.1
-
121
-
-
0031629796
-
Explicit multi-threading (XMT) bridging models for instruction parallelism (extended abstract)
-
U. Vishkin, S. Dascal, E. Berkovich, and J. Nuzman. Explicit multi-threading (XMT) bridging models for instruction parallelism (extended abstract). In SPAA, pages 140-151, 1998.
-
(1998)
SPAA
, pp. 140-151
-
-
Vishkin, U.1
Dascal, S.2
Berkovich, E.3
Nuzman, J.4
-
123
-
-
35248898344
-
-
Springer-Verlag New York, Inc., New York, NY, USA
-
G. J. Woeginger. Combinatorial optimization - eureka, you shrink! chapter Exact algorithms for NP-hard problems: a survey, pages 185-207. Springer-Verlag New York, Inc., New York, NY, USA, 2003.
-
(2003)
Combinatorial Optimization - Eureka, You Shrink! Chapter Exact Algorithms for NP-hard Problems: A Survey
, pp. 185-207
-
-
Woeginger, G.J.1
-
124
-
-
5244336186
-
Collective Monte Carlo updating for spin systems
-
U. Wolff. Collective Monte Carlo updating for spin systems. Physical Review Letters, 62:361-364, 1989.
-
(1989)
Physical Review Letters
, vol.62
, pp. 361-364
-
-
Wolff, U.1
-
125
-
-
0042492825
-
The Potts model
-
January
-
F. Y. Wu. The Potts model. Reviews of Modern Physics, 54(1):235-268, January 1982.
-
(1982)
Reviews of Modern Physics
, vol.54
, Issue.1
, pp. 235-268
-
-
Wu, F.Y.1
-
126
-
-
85093009412
-
Scaling fast multipole methods up to 4000 GPUs
-
Singapore, Singapore, A*STAR Computational Resource Centre
-
R. Yokota, L. Barba, T. Narumi, and K. Yasuoka. Scaling fast multipole methods up to 4000 GPUs. In Proceedings of the ATIP/A CRC Workshop on Accelerator Technologies for High-Performance Computing: Does Asia Lead the Way?, ATIP '12, pages 9:1-9:6, Singapore, Singapore, 2012. A*STAR Computational Resource Centre.
-
(2012)
Proceedings of the ATIP/A CRC Workshop on Accelerator Technologies for High-Performance Computing: Does Asia Lead the Way?, ATIP '12
, pp. 91-96
-
-
Yokota, R.1
Barba, L.2
Narumi, T.3
Yasuoka, K.4
-
129
-
-
84860491027
-
Hierarchical n-body simulations with autotuning for heterogeneous systems
-
R. Yokota and L. A. Barba. Hierarchical n-body simulations with autotuning for heterogeneous systems. Computing in Science and Engineering, 14(3):30-39, 2012.
-
(2012)
Computing in Science and Engineering
, vol.14
, Issue.3
, pp. 30-39
-
-
Yokota, R.1
Barba, L.A.2
-
130
-
-
29844456426
-
Cellular automata in non-euclidean spaces
-
Stevens Point, Wisconsin, USA, World Scientific and Engineering Academy and Society (WSEAS)
-
S. Yukita. Cellular automata in non-euclidean spaces. In Proceedings of the 7thWSEAS International Conference on Mathematical Methods and Computational Techniques In Electrical Engineering, MMACTE'05, pages 200-207, Stevens Point, Wisconsin, USA, 2005. World Scientific and Engineering Academy and Society (WSEAS).
-
(2005)
Proceedings of the 7thWSEAS International Conference on Mathematical Methods and Computational Techniques in Electrical Engineering, MMACTE'05
, pp. 200-207
-
-
Yukita, S.1
-
131
-
-
57749174539
-
Real-time kd-tree construction on graphics hardware
-
December
-
K. Zhou, Q. Hou, R. Wang, and B. Guo. Real-time kd-tree construction on graphics hardware. ACM Trans. Graph., 27(5):126:1-126:11,December 2008.
-
(2008)
ACM Trans. Graph.
, vol.27
, Issue.5
, pp. 1261-12611
-
-
Zhou, K.1
Hou, Q.2
Wang, R.3
Guo, B.4
|