SCOPUS information retrieval platform

Volume 15, Issue 2, 2014, Pages 285-329

A survey on parallel computing and its applications in data-parallel problems using GPU architectures

(3) Navarro, Cristóbal A a,b Hitschfeld Kahler, Nancy a Mateu, Luis a

b CENTRO DE ESTUDIOS CIENTÍFICOS CECS (Chile)

Author keywords

Algorithms; Cellular Automata; Collision detection; Computing models; Data parallel; GPU computing; Ising Model; Massive parallelism; N body; Parallel computing; Potts model

Indexed keywords

EID: 84887710331 PISSN: 18152406 EISSN: 19917120 Source Type: Journal
DOI: 10.4208/cicp.110113.010813a Document Type: Review

Times cited : (158)

References (131)

1
- 0030382365
- Shared memory consistency models: A tutorial
- S. V. Adve and K. Gharachorloo. Shared memory consistency models: A tutorial. Computer, 29(12):66-76, December 1996. (Pubitemid 126517873)
- (1996) Computer , vol.29 , Issue.12 , pp. 66-76
- Adve, S.V.¹ Gharachorloo, K.²

2
- 0023563093
- A model for hierarchical memory
- New York, NY, USA, ACM
- A. Aggarwal, B. Alpern, A. Chandra, and M. Snir. A model for hierarchical memory. In Proceedings of the nineteenth annual ACM symposium on Theory of computing, STOC '87, pages 305-314, New York, NY, USA, 1987. ACM.
- (1987) Proceedings of the Nineteenth Annual ACM Symposium on Theory of Computing, STOC '87 , pp. 305-314
- Aggarwal, A.¹ Alpern, B.² Chandra, A.³ Snir, M.⁴

3
- 0028483922
- The uniform memory hierarchy model of computation
- 10.1007/BF01185206
- B. Alpern, L. Carter, E. Feig, and T. Selker. The uniform memory hierarchy model of computation. Algorithmica, 12:72-109, 1994. 10.1007/BF01185206.
- (1994) Algorithmica , vol.12 , pp. 72-109
- Alpern, B.¹ Carter, L.² Feig, E.³ Selker, T.⁴

4
- 84988767938
- Modeling parallel computers as memory hierarchies
- IEEE Computer Society Press
- B. Alpern, L. Carter, and J. Ferrante. Modeling parallel computers as memory hierarchies. In Proc. Programming Models for Massively Parallel Computers, pages 116-123. IEEE Computer Society Press, 1993.
- (1993) Proc. Programming Models for Massively Parallel Computers , pp. 116-123
- Alpern, B.¹ Carter, L.² Ferrante, J.³

5
- 85060036181
- Validity of the single processor approach to achieving large scale computing capabilities
- New York, NY, USA, ACM
- G.M. Amdahl. Validity of the single processor approach to achieving large scale computing capabilities. In Proceedings of the April 18-20, 1967, spring joint computer conference, AFIPS '67 (Spring), pages 483-485, New York, NY, USA, 1967. ACM.
- (1967) Proceedings of the April 18-20, 1967, Spring Joint Computer Conference, AFIPS '67 (Spring) , pp. 483-485
- Amdahl, G.M.¹

6
- 33846349887
- A hierarchical O(N log N) force-calculation algorithm
- December
- J. Barnes and P. Hut. A hierarchical O(N log N) force-calculation algorithm. Nature, 324(6096):446-449, December 1986.
- (1986) Nature , vol.324 , Issue.6096 , pp. 446-449
- Barnes, J.¹ Hut, P.²

7
- 85015899515
- The price of performance
- September
- L. A. Barroso. The price of performance. Queue, 3(7):48-53, September 2005.
- (2005) Queue , vol.3 , Issue.7 , pp. 48-53
- Barroso, L.A.¹

8
- 84929524862
- Cellular automata in triangular, pentagonal and hexagonal tessellations
- Springer New York
- C. Bays. Cellular automata in triangular, pentagonal and hexagonal tessellations. In Robert A. Meyers, editor, Computational Complexity, pages 434-442. Springer New York, 2012.
- (2012) Robert A. Meyers, Editor, Computational Complexity , pp. 434-442
- Bays, C.¹

9
- 84887775781
- Optimal bounds for decision problems on the CRCW PRAM
- New, ACM
- P. Beame and J. Hastad. Optimal bounds for decision problems on the CRCW PRAM. In Proceedings of the 19th ACM Symposium on Theory of Computing (New, pages 25-27. ACM.
- Proceedings of the 19th ACM Symposium on Theory of Computing , pp. 25-27
- Beame, P.¹ Hastad, J.²

10
- 84856569900
- A sparse octree gravitational n-body code that runs entirely on the GPU processor
- April
- J. Bédorf, E. Gaburov, and S. P. Zwart. A sparse octree gravitational n-body code that runs entirely on the GPU processor. J. Comput. Phys., 231(7):2825-2839, April 2012.
- (2012) J. Comput. Phys. , vol.231 , Issue.7 , pp. 2825-2839
- Bédorf, J.¹ Gaburov, E.² Zwart, S.P.³

11
- 84857176549
- Real-time terrain modeling using CPU-GPU coupled computation
- Washington, DC, USA, IEEE Computer Society
- A. Bernhardt, A. Maximo, L. Velho, H. Hnaidi, and M.-P. Cani. Real-time terrain modeling using CPU-GPU coupled computation. In Proceedings of the 2011 24th SIBGRAPI Conference on Graphics, Patterns and Images, SIBGRAPI '11, pages 64-71, Washington, DC, USA, 2011. IEEE Computer Society.
- (2011) Proceedings of the 2011 24th SIBGRAPI Conference on Graphics, Patterns and Images, SIBGRAPI '11 , pp. 64-71
- Bernhardt, A.¹ Maximo, A.² Velho, L.³ Hnaidi, H.⁴ Cani, M.-P.⁵

12
- 84887763139
- Civil and structural engineering computing: 2001
- Saxe-Coburg Publications
- Z. Bittnar, J. Kruis, J. Němeček, B. Patzák, and D. Rypl. Civil and structural engineering computing: 2001. chapter Parallel and distributed computations for structural mechanics: a review, pages 211-233. Saxe-Coburg Publications, 2001.
- (2001) Chapter Parallel and Distributed Computations for Structural Mechanics: A Review , pp. 211-233
- Bittnar, Z.¹ Kruis, J.² Němeček, J.³ Patzák, B.⁴ Rypl, D.⁵

13
- 84887767295
- L. Carter and B. Alpern. The RAM model considered harmful towards a science of performance programming, 1994.
- (1994) The RAM Model Considered Harmful Towards A Science of Performance Programming
- Carter, L.¹ Alpern, B.²

14
- 79956283936
- O'Reilly
- C. P. Breshears. The Art of Concurrency - A Thread Monkey's Guide to Writing Parallel Applications. O'Reilly, 2009.
- (2009) The Art of Concurrency - A Thread Monkey's Guide to Writing Parallel Applications
- Breshears, C.P.¹

15
- 10644248153
- Brook for GPUs: Stream computing on graphics hardware
- August
- I. Buck, T. Foley, D. Horn, J. Sugerman, K. Fatahalian, M. Houston, and P. Hanrahan. Brook for GPUs: stream computing on graphics hardware. ACM Trans. Graph., 23(3):777-786, August 2004.
- (2004) ACM Trans. Graph. , vol.23 , Issue.3 , pp. 777-786
- Buck, I.¹ Foley, T.² Horn, D.³ Sugerman, J.⁴ Fatahalian, K.⁵ Houston, M.⁶ Hanrahan, P.⁷

16
- 78149352231
- K-model: A new computational model for stream processors
- Washington, DC, USA, IEEE Computer Society
- G. Capannini, F. Silvestri, and R. Baraglia. K-model: A new computational model for stream processors. In Proceedings of the 2010 IEEE 12th International Conference on High Performance Computing and Communications, HPCC '10, pages 239-246, Washington, DC, USA, 2010. IEEE Computer Society.
- (2010) Proceedings of the 2010 IEEE 12th International Conference on High Performance Computing and Communications, HPCC '10 , pp. 239-246
- Capannini, G.¹ Silvestri, F.² Baraglia, R.³

17
- 84887763902
- Chapel (Cray Inc. HPCS language)
- B. L. Chamberlain. Chapel (Cray Inc. HPCS language). In Encyclopedia of Parallel Computing, pages 249-256. 2011.
- (2011) Encyclopedia of Parallel Computing , pp. 249-256
- Chamberlain, B.L.¹

18
- 54949115201
- The MIT Press
- B. Chapman, G. Jost, and R. van der Pas. Using OpenMP: Portable Shared Memory Parallel Programming (Scientific and Engineering Computation). The MIT Press, 2007.
- (2007) Using OpenMP: Portable Shared Memory Parallel Programming (Scientific and Engineering Computation)
- Chapman, B.¹ Jost, G.² van der Pas, R.³

19
- 0025431398
- The impact of synchronization and granularity on parallel systems
- May
- D.-K. Chen, H.-M. Su, and P.-C. Yew. The impact of synchronization and granularity on parallel systems. SIGARCH Comput. Archit. News, 18(3a):239-248, May 1990.
- (1990) SIGARCH Comput. Archit. News , vol.18 , Issue.3 A , pp. 239-248
- Chen, D.-K.¹ Su, H.-M.² Yew, P.-C.³

20
- 34248201490
- A parallel implementation of the Cellular Potts Model for simulation of cell-based morphogenesis
- DOI 10.1016/j.cpc.2007.03.007, PII S0010465507002044
- N. Chen, J. A. Glazier, J. A. Izaguirre, and M. S. Alber. A parallel implementation of the cellular potts model for simulation of cell-based morphogenesis. Computer Physics Communications, 176(11-12):670-681, 2007. (Pubitemid 46722711)
- (2007) Computer Physics Communications , vol.176 , Issue.11-12 , pp. 670-681
- Chen, N.¹ Glazier, J.A.² Izaguirre, J.A.³ Alber, M.S.⁴

21
- 84887793551
- online at
- P. Coddington. Visualizations of spin models of magnetism, online at http://cs.adelaide.edu.au/ paulc/physics/spinmodels.html, August 2013.
- Visualizations of Spin Models of Magnetism
- Coddington, P.¹

22
- 79957635333
- GPU-based lighting and shadowing of complex natural scenes
- August, Los Angeles, USA
- F. Cohen, P. Decaudin, and F. Neyret. GPU-based lighting and shadowing of complex natural scenes. In Siggraph'04 Conf. DVD-ROM (Poster), August 2004. Los Angeles, USA.
- (2004) Siggraph'04 Conf. DVD-ROM (Poster)
- Cohen, F.¹ Decaudin, P.² Neyret, F.³

23
- 84881052005
- Real-time dynamic shadows for image-based lighting
- Charles River Media
- M. Colbert and J. Křivánek. Real-time dynamic shadows for image-based lighting. In ShaderX 7 - Advanced Rendering Techniques. Charles River Media, 2009.
- (2009) ShaderX 7 - Advanced Rendering Techniques
- Colbert, M.¹ Křivánek, J.²

24
- 0003587629
- MIT Press, Cambridge, MA, USA
- M. Cole. Algorithmic skeletons: structured management of parallel computation. MIT Press, Cambridge, MA, USA, 1991.
- (1991) Algorithmic Skeletons: Structured Management of Parallel Computation
- Cole, M.¹

25
- 77951291942
- Exploring nvidia-cuda for video coding
- New York, NY, USA, ACM
- A. Colic, H. Kalva, and B. Furht. Exploring nvidia-cuda for video coding. In Proceedings of the first annual ACM SIGMM conference on Multimedia systems, MMSys '10, pages 13-22, New York, NY, USA, 2010. ACM.
- (2010) Proceedings of the First Annual ACM SIGMM Conference on Multimedia Systems, MMSys '10 , pp. 13-22
- Colic, A.¹ Kalva, H.² Furht, B.³

26
- 8744267587
- Universality in elementary cellular automata
- M. Cook. Universality in Elementary Cellular Automata. Complex Systems, 15(1):1-40, 2004. (Pubitemid 39203440)
- (2004) Complex Systems , vol.15 , Issue.1 , pp. 1-40
- Cook, M.¹

27
- 0004116989
- McGraw-Hill Higher Education, 2nd edition
- T. H. Cormen, C. Stein, R. L. Rivest, and C. E. Leiserson. Introduction to Algorithms. McGraw-Hill Higher Education, 2nd edition, 2001.
- (2001) Introduction to Algorithms
- Cormen, T.H.¹ Stein, C.² Rivest, R.L.³ Leiserson, C.E.⁴

28
- 84884467772
- Intel Corporation
- Intel Corporation. Intel® Xeon® Processor E5-2600 Product Family Uncore Performance Monitoring Guide, 2012.
- (2012) Intel® Xeon® Processor E5-2600 Product Family Uncore Performance Monitoring Guide

29
- 84887658704
- Nvidia Corporation
- Nvidia Corporation. Kepler Whitepaper for the GK110 architecture, 2012.
- (2012) Kepler Whitepaper for the GK110 Architecture

30
- 84878137471
- A GPU-based method for generating quasi-delaunay triangulations based on edge-flips
- GRAPP 2013, February
- E. Scheihing, C. A. Navarro, N. Hitschfeld-Kahler. A GPU-based method for generating quasi-delaunay triangulations based on edge-flips. In Proceedings of the 8th International on Computer Graphics, Theory and Applications, GRAPP 2013, pages 27-34, February 2013.
- (2013) Proceedings of the 8th International on Computer Graphics, Theory and Applications , pp. 27-34
- Scheihing, E.¹ Navarro, C.A.² Hitschfeld-Kahler, N.³

31
- 0009346826
- LogP: Towards a realistic model of parallel computation
- July
- D. Culler, R. Karp, D. Patterson, A. Sahay, K. E. Schauser, E. Santos, R. Subramonian, and T. von Eicken. LogP: towards a realistic model of parallel computation. SIGPLAN Not., 28(7):1-12, July 1993.
- (1993) SIGPLAN Not. , vol.28 , Issue.7 , pp. 1-12
- Culler, D.¹ Karp, R.² Patterson, D.³ Sahay, A.⁴ Schauser, K.E.⁵ Santos, E.⁶ Subramonian, R.⁷ Von Eicken, T.⁸

32
- 37549003336
- MapReduce: Simplified data processing on large clusters
- January
- J. Dean and S. Ghemawat. MapReduce: simplified data processing on large clusters. Commun. ACM, 51(1):107-113, January 2008.
- (2008) Commun. ACM , vol.51 , Issue.1 , pp. 107-113
- Dean, J.¹ Ghemawat, S.²

33
- 84945709358
- Solution of a problem in concurrent programming control
- September
- E. W. Dijkstra. Solution of a problem in concurrent programming control. Commun. ACM, 8(9):569-, September 1965.
- (1965) Commun. ACM , vol.8 , Issue.9 , pp. 569
- Dijkstra, E.W.¹

34
- 60649084618
- Semaphores for fair scheduling monitor conditions
- May
- N. Dunstan. Semaphores for fair scheduling monitor conditions. SIGOPS Oper. Syst. Rev., 25(3):27-31, May 1991.
- (1991) SIGOPS Oper. Syst. Rev. , vol.25 , Issue.3 , pp. 27-31
- Dunstan, N.¹

35
- 38249040751
- Superlinear speedup of an efficient sequential algorithm is not possible
- July
- V. Faber, O. M. Lubeck, and A. B. White, Jr. Superlinear speedup of an efficient sequential algorithm is not possible. Parallel Comput., 3(3):259-260, July 1986.
- (1986) Parallel Comput. , vol.3 , Issue.3 , pp. 259-260
- Faber, V.¹ Lubeck, O.M.² White Jr., A.B.³

36
- 78650745600
- Octree-based, GPU implementation of a continuous cellular automaton for the simulation of complex, evolving surfaces
- N. Ferrando, M. A. Gosalvez, J. Cerda, R. G. Girones, and K. Sato. Octree-based, GPU implementation of a continuous cellular automaton for the simulation of complex, evolving surfaces. Computer Physics Communications, pages 628-640, 2011.
- (2011) Computer Physics Communications , pp. 628-640
- Ferrando, N.¹ Gosalvez, M.A.² Cerda, J.³ Girones, R.G.⁴ Sato, K.⁵

37
- 84860312760
- Q-state Potts model metastability study using optimized GPU-based Monte Carlo algorithms
- E. E. Ferrero, J. P. De Francesco, N. Wolovick, and S. A. Cannas. q-state Potts model metastability study using optimized GPU-based Monte Carlo algorithms. Computer Physics Communications, 183(8):1578-1587, 2012.
- (2012) Computer Physics Communications , vol.183 , Issue.8 , pp. 1578-1587
- Ferrero, E.E.¹ De Francesco, J.P.² Wolovick, N.³ Cannas, S.A.⁴

38
- 0015401565
- Some computer organizations and their effectiveness
- September
- M. J. Flynn. Some computer organizations and their effectiveness. IEEE Trans. Comput., 21(9):948-960, September 1972.
- (1972) IEEE Trans. Comput. , vol.21 , Issue.9 , pp. 948-960
- Flynn, M.J.¹

39
- 0018052202
- Parallelism in random access machines
- New York, NY, USA, ACM
- S. Fortune and J. Wyllie. Parallelism in random access machines. In Proceedings of the tenth annual ACM symposium on Theory of computing, STOC '78, pages 114-118, New York, NY, USA, 1978. ACM.
- (1978) Proceedings of the Tenth Annual ACM Symposium on Theory of Computing, STOC '78 , pp. 114-118
- Fortune, S.¹ Wyllie, J.²

40
- 0004146408
- Addison-Wesley Longman Publishing Co., Inc., Boston,MA, USA
- I. Foster. Designing and building parallel programs: Concepts and tools for parallel software engineering. Addison-Wesley Longman Publishing Co., Inc., Boston,MA, USA, 1995.
- (1995) Designing and Building Parallel Programs: Concepts and Tools for Parallel Software Engineering
- Foster, I.¹

41
- 35048884271
- Open MPI: Goals, concept, and design of a next generation MPI implementation
- Budapest, Hungary, September
- E. Gabriel, G. E. Fagg, G. Bosilca, T. Angskun, J. J. Dongarra, J. M. Squyres, V. Sahay, P. Kambadur, B. Barrett, A. Lumsdaine, R. H. Castain, D. J. Daniel, R. L. Graham, and T. S. Woodall. Open MPI: Goals, concept, and design of a next generation MPI implementation. In Proceedings, 11th European PVM/MPI Users' Group Meeting, pages 97-104, Budapest, Hungary, September 2004.
- (2004) Proceedings, 11th European PVM/MPI Users' Group Meeting , pp. 97-104
- Gabriel, E.¹ Fagg, G.E.² Bosilca, G.³ Angskun, T.⁴ Dongarra, J.J.⁵ Squyres, J.M.⁶ Sahay, V.⁷ Kambadur, P.⁸ Barrett, B.⁹ Lumsdaine, A.¹⁰ Castain, R.H.¹¹ Daniel, D.J.¹² Graham, R.L.¹³ Woodall, T.S.¹⁴

42
- 0000870032
- The fantastic combinations of John Conway's new solitaire game "life"
- October
- M. Gardner. The fantastic combinations of John Conway's new solitaire game "life". Scientific American, 223:120-123, October 1970.
- (1970) Scientific American , vol.223 , pp. 120-123
- Gardner, M.¹

43
- 53049083461
- GPU accelerated computation and visualization of hexagonal cellular automata
- Berlin, Heidelberg, Springer-Verlag
- S. Gobron, H. Bonafos, and D. Mestre. GPU accelerated computation and visualization of hexagonal cellular automata. In Proceedings of the 8th international conference on Cellular Automata for Research and Industry, ACRI '08, pages 512-521, Berlin, Heidelberg, 2008. Springer-Verlag.
- (2008) Proceedings of the 8th International Conference on Cellular Automata for Research and Industry, ACRI '08 , pp. 512-521
- Gobron, S.¹ Bonafos, H.² Mestre, D.³

44
- 84865249616
- GPGPU computation and visualization of three-dimensional cellular automata
- S. Gobron, A. Çöltekin, H. Bonafos, and D. Thalmann. GPGPU computation and visualization of three-dimensional cellular automata. The Visual Computer, 27(1):67-81, 2011.
- (2011) The Visual Computer , vol.27 , Issue.1 , pp. 67-81
- Gobron, S.¹ Çöltekin, A.² Bonafos, H.³ Thalmann, D.⁴

45
- 36549080465
- Retina simulation using cellular automata and GPU programming
- DOI 10.1007/s00138-006-0065-8
- S. Gobron, F. Devillard, and B. Heit. Retina simulation using cellular automata and GPU programming. Mach. Vision Appl., 18(6):331-342, November 2007. (Pubitemid 350178426)
- (2007) Machine Vision and Applications , vol.18 , Issue.6 , pp. 331-342
- Gobron, S.¹ Devillard, F.² Heit, B.³

46
- 84887666884
- Real-time textured volume reconstruction using virtual and real video cameras
- S. Gobron, C. Marx, J. Ahn, and D. Thalmann. Real-time textured volume reconstruction using virtual and real video cameras. In proceedings of the Computer Graphics International 2010 conference, 2010.
- (2010) Proceedings of the Computer Graphics International 2010 Conference
- Gobron, S.¹ Marx, C.² Ahn, J.³ Thalmann, D.⁴

47
- 0003557427
- Oxford University Press, USA, April
- R. Greenlaw, J. H. Hoover, and W. L. Ruzzo. Limits to Parallel Computation: P-Completeness Theory. Oxford University Press, USA, April 1995.
- (1995) Limits to Parallel Computation: P-Completeness Theory
- Greenlaw, R.¹ Hoover, J.H.² Ruzzo, W.L.³

48
- 0034513885
- Automatic parallelization of recursive procedures
- DOI 10.1023/A:1007560600904
- M. Gupta, S. Mukhopadhyay, and N. Sinha. Automatic parallelization of recursive procedures. Int. J. Parallel Program., 28(6):537-562, December 2000. (Pubitemid 32076092)
- (2000) International Journal of Parallel Programming , vol.28 , Issue.6 , pp. 537-562
- Gupta, M.¹ Mukhopadhyay, S.² Sinha, N.³

49
- 0024012163
- Reevaluating Amdahl's law
- J. L. Gustafson. Reevaluating Amdahl's law. Communications of the ACM, 31:532-533, 1988.
- (1988) Communications of the ACM , vol.31 , pp. 532-533
- Gustafson, J.L.¹

50
- 33747934288
- Fixed time, tiered memory, and superlinear speedup
- DMCC5
- J. L. Gustafson. Fixed time, tiered memory, and superlinear speedup. In Proceedings of the Fifth Distributed Memory Computing Conference (DMCC5), 1990.
- (1990) Proceedings of the Fifth Distributed Memory Computing Conference
- Gustafson, J.L.¹

51
- 0040741908
- The consequences of fixed time performance measurement
- IEEE Computer Society
- J. L. Gustafson. The consequences of fixed time performance measurement. In Proceedings of the 25th Hawaii International Conference on Systems Sciences, IEEE Computer Society, 1992.
- (1992) Proceedings of the 25th Hawaii International Conference on Systems Sciences
- Gustafson, J.L.¹

52
- 74049152899
- 42 tflops hierarchical n-body simulations on GPUs with applications in both astrophysics and turbulence
- T. Hamada, T. Narumi, R. Yokota, K. Yasuoka, K. Nitadori, and M. Taiji. 42 tflops hierarchical n-body simulations on GPUs with applications in both astrophysics and turbulence. In SC, 2009.
- (2009) SC
- Hamada, T.¹ Narumi, T.² Yokota, R.³ Yasuoka, K.⁴ Nitadori, K.⁵ Taiji, M.⁶

53
- 65249114041
- Real-time rigid body simulation on GPUs
- editor, GPU Gems 3, Addison-Wesley
- T. Harada. Real-time rigid body simulation on GPUs. In Hubert Nguyen, editor, GPU Gems 3, pages 611-632. Addison-Wesley, 2008.
- (2008) Hubert Nguyen, Editor, GPU Gems 3 , pp. 611-632
- Harada, T.¹

54
- 0016114085
- Monitors: An operating system structuring concept
- October
- C. A. R. Hoare. Monitors: an operating system structuring concept. Commun. ACM, 17(10):549-557, October 1974.
- (1974) Commun. ACM , vol.17 , Issue.10 , pp. 549-557
- Hoare, C.A.R.¹

55
- 78149231331
- MapCG: Writing parallel program portable between CPU and GPU
- New York, NY, USA, ACM
- C. Hong, D. Chen, W. Chen, W. Zheng, and H. Lin. MapCG: writing parallel program portable between CPU and GPU. In Proceedings of the 19th international conference on Parallel architectures and compilation techniques, PACT '10, pages 217-226, New York, NY, USA, 2010. ACM.
- (2010) Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques, PACT '10 , pp. 217-226
- Hong, C.¹ Chen, D.² Chen, W.³ Zheng, W.⁴ Lin, H.⁵

56
- 49249134204
- Interactive k-d tree GPU raytracing
- New York, NY, USA, ACM
- D. R. Horn, J. Sugerman, M. Houston, and P. Hanrahan. Interactive k-d tree GPU raytracing. In Proceedings of the 2007 symposium on Interactive 3D graphics and games, I3D '07, pages 167-174, New York, NY, USA, 2007. ACM.
- (2007) Proceedings of the 2007 Symposium on Interactive 3D Graphics and Games, I3D '07 , pp. 167-174
- Horn, D.R.¹ Sugerman, J.² Houston, M.³ Hanrahan, P.⁴

57
- 84869451864
- An energy efficient 32nm 20 MB L3 cache for Intel® Xeon® processor E5 family
- IEEE
- M. Huang, M. Mehalel, R. Arvapalli, and S. He. An energy efficient 32nm 20 MB L3 cache for Intel® Xeon® processor E5 family. In CICC, pages 1-4. IEEE, 2012.
- (2012) CICC , pp. 1-4
- Huang, M.¹ Mehalel, M.² Arvapalli, R.³ He, S.⁴

58
- 84887667003
- The n-body problem throughout the computer science curriculum
- June
- L. Ivanov. The n-body problem throughout the computer science curriculum. J. Comput. Sci. Coll., 22(6):43-52, June 2007.
- (2007) J. Comput. Sci. Coll. , vol.22 , Issue.6 , pp. 43-52
- Ivanov, L.¹

59
- 84887682092
- Technical Report MSU-CSE-00-2, Virginia University
- D. Luebke, J. Tran, and D. Jordan. New challenges for cellular automata simulation on the GPU. Technical Report MSU-CSE-00-2, Virginia University, 2003.
- (2003) New Challenges for Cellular Automata Simulation on the GPU
- Luebke, D.¹ Tran, J.² Jordan, D.³

60
- 0035311963
- 3D collision detection: A survey
- DOI 10.1016/S0097-8493(00)00130-8, PII S0097849300001308
- P. Jimenez, F. Thomas, and C. Torras. 3d collision detection: A survey. Computers and Graphics, 25:269-285, 2000. (Pubitemid 32272850)
- (2001) Computers and Graphics (Pergamon) , vol.25 , Issue.2 , pp. 269-285
- Jimenez, P.¹ Thomas, F.² Torras, C.³

61
- 75149160554
- Lattice methods for fluid animation in games
- January
- S. F. Judice, B. Barcellos, S. Coutinho, and G. A. Giraldi. Lattice methods for fluid animation in games. Comput. Entertain., 7(4):56:1-56:29, January 2010.
- (2010) Comput. Entertain. , vol.7 , Issue.4 , pp. 56:1-56:29
- Judice, S.F.¹ Barcellos, B.² Coutinho, S.³ Giraldi, G.A.⁴

62
- 79952161329
- Implicit surface octrees for ray tracing point models
- New York, NY, USA, ACM
- S. Kashyap, R. Goradia, P. Chaudhuri, and S. Chandran. Implicit surface octrees for ray tracing point models. In Proceedings of the Seventh Indian Conference on Computer Vision, Graphics and Image Processing, ICVGIP '10, pages 227-234, New York, NY, USA, 2010. ACM.
- (2010) Proceedings of the Seventh Indian Conference on Computer Vision, Graphics and Image Processing, ICVGIP '10 , pp. 227-234
- Kashyap, S.¹ Goradia, R.² Chaudhuri, P.³ Chandran, S.⁴

63
- 77953621395
- Seeded ND medical image segmentation by cellular automaton on GPU
- C. Kauffmann and N. Piche. Seeded ND medical image segmentation by cellular automaton on GPU. Int. J. Computer Assisted Radiology and Surgery, 5(3):251-262, 2010.
- (2010) Int. J. Computer Assisted Radiology and Surgery , vol.5 , Issue.3 , pp. 251-262
- Kauffmann, C.¹ Piche, N.²

64
- 0034764291
- Real-time bump map synthesis
- J. Kautz, W. Heidrich, and H.-P. Seidel. Real-time bump map synthesis. In Proceedings of the ACM SIGGRAPH/EUROGRAPHICS workshop on Graphics hardware, HWWS '01, pages 109-114, New York, NY, USA, 2001. ACM. (Pubitemid 33046565)
- (2001) Proceedings of the ACM SIGGRAPH Conference on Computer Graphics , Issue.WORKSHOP , pp. 109-114
- Kautz, J.¹ Heidrich, W.² Seidel, H.-P.³

65
- 79951728783
- Khronos OpenCL Working Group. 8 December
- Khronos OpenCL Working Group. The OpenCL Specification, version 1.0.29, 8 December 2008.
- (2008) The OpenCL Specification, Version 1.0.29

66
- 0031399546
- Parallel processing for terrain analysis in GIS: Visibility as a case study
- August
- D. B. Kidner, P. J. Rallings, and J. A. Ware. Parallel processing for terrain analysis in GIS: Visibility as a case study. Geoinformatica, 1(2):183-207, August 1997.
- (1997) Geoinformatica , vol.1 , Issue.2 , pp. 183-207
- Kidner, D.B.¹ Rallings, P.J.² Ware, J.A.³

67
- 0002857928
- Nvidia
- M. J. Kilgard. A practical and robust bump-mapping technique for today's GPUs. Nvidia, 2000.
- (2000) A Practical and Robust Bump-mapping Technique for Today's GPUs
- Kilgard, M.J.¹

68
- 35248867363
- The structure of a compiler for explicit and implicit parallelism
- Berlin, Heidelberg, Springer-Verlag
- S. W. Kim and R. Eigenmann. The structure of a compiler for explicit and implicit parallelism. In Proceedings of the 14th international conference on Languages and compilers for parallel computing, LCPC'01, pages 336-351, Berlin, Heidelberg, 2003. Springer-Verlag.
- (2003) Proceedings of the 14th International Conference on Languages and Compilers for Parallel Computing, LCPC'01 , pp. 336-351
- Kim, S.W.¹ Eigenmann, R.²

69
- 70449559873
- LCP algorithms for collision detection using CUDA
- editor, GPU Gems 3, Addison-Wesley
- P. Kipfer. LCP algorithms for collision detection using CUDA. In Hubert Nguyen, editor, GPU Gems 3, pages 723-739. Addison-Wesley, 2007.
- (2007) Hubert Nguyen, Editor, GPU Gems 3 , pp. 723-739
- Kipfer, P.¹

70
- 0016318617
- Computer programming as an art
- December
- D. E. Knuth. Computer programming as an art. Commun. ACM, 17(12):667-673, December 1974.
- (1974) Commun. ACM , vol.17 , Issue.12 , pp. 667-673
- Knuth, D.E.¹

71
- 84855206756
- GPU-based single-cluster algorithm for the simulation of the Ising model
- February
- Y. Komura and Y. Okabe. GPU-based single-cluster algorithm for the simulation of the Ising model. J. Comput. Phys., 231(4):1209-1215, February 2012.
- (2012) J. Comput. Phys. , vol.231 , Issue.4 , pp. 1209-1215
- Komura, Y.¹ Okabe, Y.²

72
- 84867582560
- Multi-GPU-based Swendsen-Wang multi-cluster algorithm for the simulation of two-dimensional q-state Potts model
- Y. Komura and Y. Okabe. Multi-GPU-based Swendsen-Wang multi-cluster algorithm for the simulation of two-dimensional q-state Potts model. Computer Physics Communications, 184(1):40-44, 2013.
- (2013) Computer Physics Communications , vol.184 , Issue.1 , pp. 40-44
- Komura, Y.¹ Okabe, Y.²

73
- 84904159862
- Cellular automata based traffic simulation accelerated on GPU
- Institute of Automation and Computer Science FME BUT
- P. Korček, L. Sekanina, and O. Fučik. Cellular automata based traffic simulation accelerated on GPU. In Proceedings of the 17th International Conference on Soft Computing (MENDEL2011), pages 395-402. Institute of Automation and Computer Science FME BUT, 2011.
- (2011) Proceedings of the 17th International Conference on Soft Computing (MENDEL2011) , pp. 395-402
- Korček, P.¹ Sekanina, L.² Fučik, O.³

74
- 67650035436
- Effective automatic parallelization of stencil computations
- DOI 10.1145/1250734.1250761, PLDI'07: Proceedings of the 2007 ACM SIGPLAN Conference on Programming Language Design and Implementation
- S. Krishnamoorthy, M. Baskaran, U. Bondhugula, J. Ramanujam, A. Rountev, and P. Sadayappan. Effective automatic parallelization of stencil computations. SIGPLAN Not., 42(6):235-244, June 2007. (Pubitemid 47630691)
- (2007) Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI) , pp. 235-244
- Krishnamoorthy, S.¹ Baskaran, M.² Bondhugula, U.³ Ramanujam, J.⁴ Rountev, A.⁵ Sadayappan, P.⁶

75
- 77954995885
- June
- V. W. Lee, C. Kim, J. Chhugani, M. Deisher, D. Kim, A. D. Nguyen, N. Satish, M. Smelyanskiy, S. Chennupaty, P. Hammarlund, R. Singhal, and P. Dubey. Debunking the 100x GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU. SIGARCH Comput. Archit. News, 38(3):451-460, June 2010.
- (2010) SIGARCH Comput. Archit. News , vol.38 , Issue.3 , pp. 451-460
- Lee, V.W.¹ Kim, C.² Chhugani, J.³ Deisher, M.⁴ Kim, D.⁵ Nguyen, A.D.⁶ Satish, N.⁷ Smelyanskiy, M.⁸ Chennupaty, S.⁹ Hammarlund, P.¹⁰ Singhal, R.¹¹ Dubey, P.¹²

76
- 0003819663
- Morgan Kaufmann Publishers Inc., San Francisco, CA, USA
- F. T. Leighton. Introduction to parallel algorithms and architectures: array, trees, hypercubes. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 1992.
- (1992) Introduction to Parallel Algorithms and Architectures: Array, Trees, Hypercubes
- Leighton, F.T.¹

77
- 0002603030
- High performance Fortran
- D. Loveman. High performance Fortran. IEEE Parallel & Distributed Technology: Systems & Applications, 1(1):25-42, 1993.
- (1993) IEEE Parallel & Distributed Technology: Systems & Applications , vol.1 , Issue.1 , pp. 25-42
- Loveman, D.¹

78
- 77956874347
- Orders-of-magnitude performance increases in GPU-accelerated correlation of images from the international space station
- 10.1007/s11554-009-0133-1
- P. Lu, H. Oki, C. Frey, G. Chamitoff, L. Chiao, E. Fincke, C. Foale, S. Magnus, W. McArthur, D. Tani, P. Whitson, J. Williams, W. Meyer, R. Sicker, B. Au, M. Christiansen, A. Schofield, and D. Weitz. Orders-of-magnitude performance increases in GPU-accelerated correlation of images from the international space station. Journal of Real-Time Image Processing, 5:179-193, 2010. 10.1007/s11554-009-0133-1.
- (2010) Journal of Real-Time Image Processing , vol.5 , pp. 179-193
- Lu, P.¹ Oki, H.² Frey, C.³ Chamitoff, G.⁴ Chiao, L.⁵ Fincke, E.⁶ Foale, C.⁷ Magnus, S.⁸ McArthur, W.⁹ Tani, D.¹⁰ Whitson, P.¹¹ Williams, J.¹² Meyer, W.¹³ Sicker, R.¹⁴ Au, B.¹⁵ Christiansen, M.¹⁶ Schofield, A.¹⁷ Weitz, D.¹⁸

79
- 34548757823
- Automatic parallelization of scripting languages: Toward transparent desktop parallel computing
- IPDPS 2007. IEEE International
- X. Ma, J. Li, and N. F. Samatova. Automatic parallelization of scripting languages: Toward transparent desktop parallel computing. In Parallel and Distributed Processing Symposium, 2007. IPDPS 2007. IEEE International, pages 1-6, 2007.
- (2007) Parallel and Distributed Processing Symposium, 2007 , pp. 1-6
- Ma, X.¹ Li, J.² Samatova, N.F.³

80
- 0142103318
- The GPU enters computing's mainstream
- M. Macedonia. The GPU enters computing's mainstream. Computer, 36(10):106-108, 2003.
- (2003) Computer , vol.36 , Issue.10 , pp. 106-108
- Macedonia, M.¹

81
- 84947910149
- ERCW PRAMs and optical communication
- Euro-Par '96 Parallel Processing
- P. D. Mackenzie and V. Ramachandran. ERCW PRAMs and optical communication. In Proceedings of the European Conference on Parallel Processing, EUROPAR 96, pages 293-302, 1996. (Pubitemid 126116279)
- (1996) Proceedings of the European Conference on Parallel Processing, EUROPAR 96 , Issue.1124 , pp. 293-302
- MacKenzie, P.D.¹ Ramachandran, V.²

82
- 33646031235
- Cg: A system for programming graphics hardware in a C-like language
- July
- W. R. Mark, R. S. Glanville, K. Akeley, and M. J. Kilgard. Cg: a system for programming graphics hardware in a C-like language. ACM Trans. Graph., 22(3):896-907, July 2003.
- (2003) ACM Trans. Graph. , vol.22 , Issue.3 , pp. 896-907
- Mark, W.R.¹ Glanville, R.S.² Akeley, K.³ Kilgard, M.J.⁴

83
- 77949643305
- Introduction to GPU programming with glsl
- Washington, DC, USA, IEEE Computer Society
- R. Marroquim and A. Maximo. Introduction to GPU programming with glsl. In Proceedings of the 2009 Tutorials of the XXII Brazilian Symposium on Computer Graphics and Image Processing, SIBGRAPI-TUTORIALS '09, pages 3-16, Washington, DC, USA, 2009. IEEE Computer Society.
- (2009) Proceedings of the 2009 Tutorials of the XXII Brazilian Symposium on Computer Graphics and Image Processing, SIBGRAPI-TUTORIALS '09 , pp. 3-16
- Marroquim, R.¹ Maximo, A.²

84
- 85031896513
- On parallel hashing and integer sorting
- Springer Berlin / Heidelberg, 10.1007/BFb0032070
- Y. Matias and U. Vishkin. On parallel hashing and integer sorting. In Michael Paterson, editor, Automata, Languages and Programming, volume 443 of Lecture Notes in Computer Science, pages 729-743. Springer Berlin / Heidelberg, 1990. 10.1007/BFb0032070.
- (1990) Michael Paterson, Editor, Automata, Languages and Programming, Volume 443 of Lecture Notes in Computer Science , pp. 729-743
- Matias, Y.¹ Vishkin, U.²

85
- 0036954153
- Shader metaprogramming
- Aire-la-Ville, Switzerland, Switzerland, Eurographics Association
- M. D. McCool, Z. Qin, and T. S. Popa. Shader metaprogramming. In Proceedings of the ACM SIGGRAPH/EUROGRAPHICS conference on Graphics hardware, HWWS '02, pages 57-68, Aire-la-Ville, Switzerland, Switzerland, 2002. Eurographics Association.
- (2002) Proceedings of the ACM SIGGRAPH/EUROGRAPHICS Conference on Graphics Hardware, HWWS '02 , pp. 57-68
- McCool, M.D.¹ Qin, Z.² Popa, T.S.³

86
- 5744249209
- Equation of state calculations by fast computing machines
- N. Metropolis, A. Rosenbluth, M. Rosenbluth, A. Teller, and E. Teller. Equation of state calculations by fast computing machines. J. Chem. Phys., 21:1087, 1953.
- (1953) J. Chem. Phys. , vol.21 , pp. 1087
- Metropolis, N.¹ Rosenbluth, A.² Rosenbluth, M.³ Teller, A.⁴ Teller, E.⁵

87
- 84887659408
- Tempor
- A. S. Mikhayhu. Embarrassingly Parallel. Tempor, 2012.
- (2012) Embarrassingly Parallel
- Mikhayhu, A.S.¹

88
- 0003466248
- University of Illinois Press, Champaign, IL, USA
- J. Von Neumann. Theory of Self-Reproducing Automata. University of Illinois Press, Champaign, IL, USA, 1966.
- (1966) Theory of Self-Reproducing Automata
- Von Neumann, J.¹

89
- 55649109070
- Addison-Wesley Professional, first edition
- H. Nguyen. GPU gems 3. Addison-Wesley Professional, first edition, 2007.
- (2007) GPU Gems 3
- Nguyen, H.¹

90
- 0003500941
- O'Reilly, 101 Morris Street, Sebastopol, CA 95472
- B. Nichols, D. Buttlar, and J. P. Farrell. Pthreads Programming. O'Reilly, 101 Morris Street, Sebastopol, CA 95472, 1998.
- (1998) Pthreads Programming
- Nichols, B.¹ Buttlar, D.² Farrell, J.P.³

91
- 0013139857
- Morgan Kaufmann, May
- R. Nikhil and Arvind. Implicit Parallel Programming in pH. Morgan Kaufmann, May 2001.
- (2001) Implicit Parallel Programming in pH
- Nikhil, R.¹ Arvind²

92
- 84876897748
- Nvidia
- Nvidia. Fermi Compute Architecture Whitepaper.
- Fermi Compute Architecture Whitepaper

93
- 79551704836
- Nvidia-Corporation
- Nvidia-Corporation. Nvidia CUDA C Programming Guide, 2012.
- (2012) Nvidia CUDA C Programming Guide

94
- 85086423044
- Hlsl shader model 4.0
- New York, NY, USA, ACM
- M. Oneppo. Hlsl shader model 4.0. In ACM SIGGRAPH 2007 courses, SIGGRAPH '07, pages 112-152, New York, NY, USA, 2007. ACM.
- (2007) ACM SIGGRAPH 2007 Courses, SIGGRAPH '07 , pp. 112-152
- Oneppo, M.¹

95
- 0141463137
- Routledge, New York, NY
- S. Openshaw and I. Turton. High Performance Computing and the Art of Parallel Programming: An Introduction for Geographers, Social Scientists, and Engineers. Routledge, New York, NY, 10001, 1999.
- (1999) High Performance Computing and the Art of Parallel Programming: An Introduction for Geographers, Social Scientists, and Engineers
- Openshaw, S.¹ Turton, I.²

96
- 78650496873
- Fast and scalable CPU/GPU collision detection for rigid and deformable surfaces
- S. Pabst, A. Koch, and W. Straßer. Fast and scalable CPU/GPU collision detection for rigid and deformable surfaces. Computer Graphics Forum, 29(5):1605-1612, 2010.
- (2010) Computer Graphics Forum , vol.29 , Issue.5 , pp. 1605-1612
- Pabst, S.¹ Koch, A.² Straßer, W.³

97
- 84863667059
- Springer
- D. A. Padua, editor. Encyclopedia of Parallel Computing, volume 4. Springer, 2011.
- (2011) Encyclopedia of Parallel Computing , vol.4
- Padua, D.A.¹

98
- 72449147182
- Parallel reduction in resource lambda-calculus
- M. Pagani and P. Tranquilli. Parallel reduction in resource lambda-calculus. In APLAS, pages 226-242, 2009.
- (2009) APLAS , pp. 226-242
- Pagani, M.¹ Tranquilli, P.²

99
- 38249040569
- Parallel efficiency can be greater than unity
- D. Parkinson. Parallel efficiency can be greater than unity. Parallel Computing, 3(3):261-262, 1986.
- (1986) Parallel Computing , vol.3 , Issue.3 , pp. 261-262
- Parkinson, D.¹

100
- 84887799548
- To teach Newton's square root algorithm
- December
- H. A. Peelle. To teach Newton's square root algorithm. SIGAPL APL Quote Quad, 5(4):48-50, December 1974.
- (1974) SIGAPL APL Quote Quad , vol.5 , Issue.4 , pp. 48-50
- Peelle, H.A.¹

101
- 0035417514
- Locating and computing in parallel all the simple roots of special functions using PVM
- DOI 10.1016/S0377-0427(00)00675-0, PII S0377042700006750, Special Issue: Orthogonal Polynomials Special Functions and their Applications
- V. P. Plagianakos, N. K. Nousis, and M. N. Vrahatis. Locating and computing in parallel all the simple roots of special functions using pvm. J. Comput. Appl. Math., 133(1-2):545-554, August 2001. (Pubitemid 32826690)
- (2001) Journal of Computational and Applied Mathematics , vol.133 , Issue.1-2 , pp. 545-554
- Plagianakos, V.P.¹ Nousis, N.K.² Vrahatis, M.N.³

102
- 67349267818
- GPU accelerated monte carlo simulation of the 2d and 3d ising model
- July
- T. Preis, P. Virnau, W. Paul, and J. J. Schneider. GPU accelerated monte carlo simulation of the 2d and 3d ising model. J. Comput. Phys., 228(12):4468-4477, July 2009.
- (2009) J. Comput. Phys. , vol.228 , Issue.12 , pp. 4468-4477
- Preis, T.¹ Virnau, P.² Paul, W.³ Schneider, J.J.⁴

103
- 85023166542
- A work-efficient GPU algorithm for level set segmentation
- Aire-la-Ville, Switzerland, Switzerland, Eurographics Association
- M. Roberts, J. Packer, M. C. Sousa, and J. R. Mitchell. A work-efficient GPU algorithm for level set segmentation. In Proceedings of the Conference on High Performance Graphics, HPG '10, pages 123-132, Aire-la-Ville, Switzerland, Switzerland, 2010. Eurographics Association.
- (2010) Proceedings of the Conference on High Performance Graphics, HPG '10 , pp. 123-132
- Roberts, M.¹ Packer, J.² Sousa, M.C.³ Mitchell, J.R.⁴

104
- 85008065154
- Why cpu frequency stalled
- April
- P. E. Ross. Why cpu frequency stalled. IEEE Spectr., 45(4):72-72, April 2008.
- (2008) IEEE Spectr. , vol.45 , Issue.4 , pp. 72-72
- Ross, P.E.¹

105
- 18844440719
- Automatic parallelization of divide and conquer algorithms
- R. Rugina and M. Rinard. Automatic parallelization of divide and conquer algorithms. In Proceedings of the 7th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pages 72-83, 1999. (Pubitemid 129694052)
- (1999) SIGPLAN Notices (ACM Special Interest Group on Programming Languages) , vol.34 , Issue.8 , pp. 72-83
- Rugina, R.¹ Rinard, M.²

106
- 74349129727
- Experiments with single core, multicore, and GPU based computation of cellular automata
- Washington, DC, USA
- S. Rybacki, J. Himmelspach, and A. M. Uhrmacher. Experiments with single core, multicore, and GPU based computation of cellular automata. In Proceedings of the 2009 First International Conference on Advances in System Simulation, SIMUL '09, pages 62-67, Washington, DC, USA, 2009. IEEE Computer Society.
- (2009) Proceedings of the 2009 First International Conference on Advances in System Simulation, SIMUL '09 , pp. 62-67
- Rybacki, S.¹ Himmelspach, J.² Uhrmacher, A.M.³

107
- 85122636849
- Progressive buffers: View-dependent geometry and texture lod rendering
- Aire-la-Ville, Switzerland, Switzerland, Eurographics Association
- P. V. Sander and J. L. Mitchell. Progressive buffers: view-dependent geometry and texture lod rendering. In Proceedings of the third Eurographics symposium on Geometry processing, SGP '05, Aire-la-Ville, Switzerland, Switzerland, 2005. Eurographics Association.
- (2005) Proceedings of the Third Eurographics Symposium on Geometry Processing, SGP '05
- Sander, P.V.¹ Mitchell, J.L.²

108
- 84870946930
- Evaluation of a nearest-neighbor load balancing strategy for parallel molecular simulations in mpi environment
- A. Di Serio and M. B. Ibáñez. Evaluation of a nearest-neighbor load balancing strategy for parallel molecular simulations in mpi environment. In PVM/MPI, pages 226-233, 2002.
- (2002) PVM/MPI , pp. 226-233
- Di Serio, A.¹ Ibáñez, M.B.²

109
- 49049132956
- An o(log n) parallel connectivity algorithm
- Y. Shiloach and U. Vishkin. An o(log n) parallel connectivity algorithm. J. Algorithms, 3(1):57-67, 1982.
- (1982) J. Algorithms , vol.3 , Issue.1 , pp. 57-67
- Shiloach, Y.¹ Vishkin, U.²

110
- 0003914107
- Oxford University Press, Inc., New York, NY, USA
- J. R. Smith. The design and analysis of parallel algorithms. Oxford University Press, Inc., New York, NY, USA, 1993.
- (1993) The Design and Analysis of Parallel Algorithms
- Smith, J.R.¹

111
- 84887775975
- Technical Report UCB/CSD-92-673, EECS Department, University of California, Berkeley, Mar
- R. Subramonian. An o(log n) time common CRCW PRAM algorithm for minimum spanning tree. Technical Report UCB/CSD-92-673, EECS Department, University of California, Berkeley, Mar 1992.
- (1992) An O(log N) Time Common CRCW PRAM Algorithm for Minimum Spanning Tree
- Subramonian, R.¹

112
- 60349097423
- Gramps: A programming model for graphics pipelines
- February
- J. Sugerman, K. Fatahalian, S. Boulos, K. Akeley, and P. Hanrahan. Gramps: A programming model for graphics pipelines. ACM Trans. Graph., 28(1):4:1-4:11, February 2009.
- (2009) ACM Trans. Graph. , vol.28 , Issue.1 , pp. 4:1-4:11
- Sugerman, J.¹ Fatahalian, K.² Boulos, S.³ Akeley, K.⁴ Hanrahan, P.⁵

113
- 33747349191
- Nonuniversal, critical dynamics in Monte Carlo simulations
- R. H. Swendsen and J. S. Wang. Nonuniversal, critical dynamics in Monte Carlo simulations. Phys. Rev. Lett., 58:86, 1987.
- (1987) Phys. Rev. Lett. , vol.58 , pp. 86
- Swendsen, R.H.¹ Wang, J.S.²

114
- 84887807755
- Preliminary evaluations for hybrid memory cube with gather functions using FPGA
- 2012-03-19
- N. Tanabe, N. Hori, B. Nuttapon, and H. Nakajo. Preliminary evaluations for hybrid memory cube with gather functions using FPGA. IPSJ SIG Notes, 2012(6):1-10, 2012-03-19.
- (2012) IPSJ SIG Notes , Issue.6 , pp. 1-10
- Tanabe, N.¹ Hori, N.² Nuttapon, B.³ Nakajo, H.⁴

115
- 70349468317
- Wiley Series on Parallel and Distributed Computing
- D. Taniar, C. H. C. Leung, W. Rahayu, and S. Goel. High-Performance Parallel Database Processing and Grid Databases. Wiley Series on Parallel and Distributed Computing, 2008.
- (2008) High-Performance Parallel Database Processing and Grid Databases
- Taniar, D.¹ Leung, C.H.C.² Rahayu, W.³ Goel, S.⁴

116
- 74849085797
- Data-parallel algorithms for large-scale real-time simulation of the cellular Potts model on graphics processing units
- J. J. Tapia and R. D'Souza. Data-parallel algorithms for large-scale real-time simulation of the cellular Potts model on graphics processing units. 2009 IEEE International Conference on Systems Man and Cybernetics, (10):1411-1418, 2009.
- (2009) 2009 IEEE International Conference on Systems Man and Cybernetics , Issue.10 , pp. 1411-1418
- Tapia, J.J.¹ D'Souza, R.²

117
- 79251598410
- Parallelizing the cellular potts model on graphics processing units
- J. J. Tapia and R. D'Souza. Parallelizing the cellular potts model on graphics processing units. Computer Physics Communications, 182(4):857-865, 2011.
- (2011) Computer Physics Communications , vol.182 , Issue.4 , pp. 857-865
- Tapia, J.J.¹ D'Souza, R.²

118
- 84865201971
- GpGPU implementation of cellular automata model of water flow
- Berlin, Heidelberg, Springer-Verlag
- P. Topa and P. Mlocek. GpGPU implementation of cellular automata model of water flow. In Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I, PPAM'11, pages 630-639, Berlin, Heidelberg, 2012. Springer-Verlag.
- (2012) Proceedings of the 9th International Conference on Parallel Processing and Applied Mathematics - Volume Part I, PPAM'11 , pp. 630-639
- Topa, P.¹ Mlocek, P.²

119
- 0025467711
- A bridging model for parallel computation
- August
- L. G. Valiant. A bridging model for parallel computation. Commun. ACM, 33(8):103-111, August 1990.
- (1990) Commun. ACM , vol.33 , Issue.8 , pp. 103-111
- Valiant, L.G.¹

120
- 57349106747
- A pram-on-chip vision (invited abstract)
- U. Vishkin. A pram-on-chip vision (invited abstract). In SPIRE, page 260, 2000.
- (2000) SPIRE , pp. 260
- Vishkin, U.¹

121
- 0031629796
- Explicit multi-threading (XMT) bridging models for instruction parallelism (extended abstract)
- U. Vishkin, S. Dascal, E. Berkovich, and J. Nuzman. Explicit multi-threading (XMT) bridging models for instruction parallelism (extended abstract). In SPAA, pages 140-151, 1998.
- (1998) SPAA , pp. 140-151
- Vishkin, U.¹ Dascal, S.² Berkovich, E.³ Nuzman, J.⁴

122
- 0013059825
- The general and logical theory of automata
- Wiley
- J. von Neumann. The general and logical theory of automata. In Cerebral Mechanisms in Behaviour. Wiley, 1951.
- (1951) Cerebral Mechanisms in Behaviour
- Von Neumann, J.¹

123
- 35248898344
- Springer-Verlag New York, Inc., New York, NY, USA
- G. J. Woeginger. Combinatorial optimization - eureka, you shrink! chapter Exact algorithms for NP-hard problems: a survey, pages 185-207. Springer-Verlag New York, Inc., New York, NY, USA, 2003.
- (2003) Combinatorial Optimization - Eureka, You Shrink! Chapter Exact Algorithms for NP-hard Problems: A Survey , pp. 185-207
- Woeginger, G.J.¹

124
- 5244336186
- Collective Monte Carlo updating for spin systems
- U. Wolff. Collective Monte Carlo updating for spin systems. Physical Review Letters, 62:361-364, 1989.
- (1989) Physical Review Letters , vol.62 , pp. 361-364
- Wolff, U.¹

125
- 0042492825
- The Potts model
- January
- F. Y. Wu. The Potts model. Reviews of Modern Physics, 54(1):235-268, January 1982.
- (1982) Reviews of Modern Physics , vol.54 , Issue.1 , pp. 235-268
- Wu, F.Y.¹

126
- 85093009412
- Scaling fast multipole methods up to 4000 GPUs
- Singapore, Singapore, A*STAR Computational Resource Centre
- R. Yokota, L. Barba, T. Narumi, and K. Yasuoka. Scaling fast multipole methods up to 4000 GPUs. In Proceedings of the ATIP/A CRC Workshop on Accelerator Technologies for High-Performance Computing: Does Asia Lead the Way?, ATIP '12, pages 9:1-9:6, Singapore, Singapore, 2012. A*STAR Computational Resource Centre.
- (2012) Proceedings of the ATIP/A CRC Workshop on Accelerator Technologies for High-Performance Computing: Does Asia Lead the Way?, ATIP '12 , pp. 9:1-9:6
- Yokota, R.¹ Barba, L.² Narumi, T.³ Yasuoka, K.⁴

127
- 84884893486
- CoRR, abs/1108.5815
- R. Yokota and L. A. Barba. Fast n-body simulations on GPUs. CoRR, abs/1108.5815, 2011.
- (2011) Fast N-body Simulations on GPUs
- Yokota, R.¹ Barba, L.A.²

128
- 84877061416
- CoRR, abs/1106.2176
- R. Yokota and L. A. Barba. A tuned and scalable fast multipole method as a preeminent algorithm for exascale systems. CoRR, abs/1106.2176, 2011.
- (2011) A Tuned and Scalable Fast Multipole Method As A Preeminent Algorithm for Exascale Systems
- Yokota, R.¹ Barba, L.A.²

129
- 84860491027
- Hierarchical n-body simulations with autotuning for heterogeneous systems
- R. Yokota and L. A. Barba. Hierarchical n-body simulations with autotuning for heterogeneous systems. Computing in Science and Engineering, 14(3):30-39, 2012.
- (2012) Computing in Science and Engineering , vol.14 , Issue.3 , pp. 30-39
- Yokota, R.¹ Barba, L.A.²

130
- 29844456426
- Cellular automata in non-euclidean spaces
- Stevens Point, Wisconsin, USA, World Scientific and Engineering Academy and Society (WSEAS)
- S. Yukita. Cellular automata in non-euclidean spaces. In Proceedings of the 7th WSEAS International Conference on Mathematical Methods and Computational Techniques In Electrical Engineering, MMACTE'05, pages 200-207, Stevens Point, Wisconsin, USA, 2005. World Scientific and Engineering Academy and Society (WSEAS).
- (2005) Proceedings of the 7th WSEAS International Conference on Mathematical Methods and Computational Techniques in Electrical Engineering, MMACTE'05 , pp. 200-207
- Yukita, S.¹

131
- 57749174539
- Real-time kd-tree construction on graphics hardware
- December
- K. Zhou, Q. Hou, R. Wang, and B. Guo. Real-time kd-tree construction on graphics hardware. ACM Trans. Graph., 27(5):126:1-126:11, December 2008.
- (2008) ACM Trans. Graph. , vol.27 , Issue.5 , pp. 126:1-126:11
- Zhou, K.¹ Hou, Q.² Wang, R.³ Guo, B.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.