SCOPUS Information Search Platform

SOSP 2013 - Proceedings of the 24th ACM Symposium on Operating Systems Principles

Volume , Issue , 2013, Pages 49-68

Dandelion: A compiler and runtime for heterogeneous systems

(5) Rossbach, Christopher J.ᵃ; Yu, Yuanᵃ; Currey, Jonᵃ; Martin, Jean Philippeᵃ; Fetterly, Dennisᵃ

ᵃ MICROSOFT RESEARCH (United States)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTING RESOURCE; DATA-PARALLEL APPLICATIONS; DESIGN AND IMPLEMENTATIONS; GENERAL-PURPOSE PROGRAMMING LANGUAGE; HETEROGENEOUS SYSTEMS; PARALLEL EXECUTIONS; PROGRAMMING ABSTRACTIONS; USER DEFINED FUNCTIONS;

COMPUTER PROGRAMMING LANGUAGES; ENERGY EFFICIENCY; ENERGY MANAGEMENT; PROGRAM PROCESSORS;

DISTRIBUTED COMPUTER SYSTEMS;

EID: 84889679621 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/2517349.2522715 Document Type: Conference Paper

Times cited : (121)

References (104)

1
- 84889653662
- Apache YARN. http://hadoop.apache.org/docs/current/hadoop-yarn/hadoop-yarn-site/YARN.html.

2
- 84889686365
- The CCI project
- The CCI project. http://cciast.codeplex.com/.

3
- 84889651481
- The LINQ project
- The LINQ project. http://msdn.microsoft.com/en-us/library/vstudio/bb397926.aspx.

4
- 84889673546
- The PLINQ project.
- The PLINQ project. http://msdn.microsoft.com/en-us/library/dd460688.aspx.

5
- 84889683042
- Sort benchmark home page
- Sort benchmark home page. http://sortbenchmark.org/.

6
- 82655168123
- I.B.M., White Plains, NY
- IBM 709 electronic data-processing system: advance description. I.B.M., White Plains, NY, 1957.
- (1957) IBM 709 Electronic Data-processing System: Advance Description

7
- 84889633391
- Matlab plug-in for CUDA. https://developer.nvidia.com/matlab-cuda, 2007.
- (2007) Matlab Plug-in for CUDA

8
- 84889670735
- JCuda: Java bindings for CUDA. http://www.jcuda.org/jcuda/JCuda.html, 2012.
- (2012) JCuda: Java Bindings for CUDA

9
- 78650145768
- Lime: A java-compatible and synthesizable language for heterogeneous architectures
- J. S. Auerbach, D. F. Bacon, P. Cheng, and R. M. Rabbah. Lime: a java-compatible and synthesizable language for heterogeneous architectures. In OOPSLA, 2010.
- (2010) OOPSLA
- Auerbach, J.S.¹ Bacon, D.F.² Cheng, P.³ Rabbah, R.M.⁴

10
- 79951765394
- Data-Aware Task Scheduling on Multi-Accelerator based Platforms
- C. Augonnet, J. Clet-Ortega, S. Thibault, and R. Namyst. Data-Aware Task Scheduling on Multi-Accelerator based Platforms. In 16th International Conference on Parallel and Distributed Systems, Shanghai, China, Dec. 2010.
- 16th International Conference on Parallel and Distributed Systems, Shanghai, China, Dec. 2010
- Augonnet, C.¹ Clet-Ortega, J.² Thibault, S.³ Namyst, R.⁴

11
- 82655178687
- C. Augonnet and R. Namyst. StarPU: A Unified Runtime System for Heterogeneous Multicore Architectures.
- StarPU: A Unified Runtime System for Heterogeneous Multicore Architectures
- Augonnet, C.¹ Namyst, R.²

12
- 70350416667
- Exploiting the Cell/BE Architecture with the StarPU Unified Runtime System
- C. Augonnet, S. Thibault, R. Namyst, and M. Nijhuis. Exploiting the Cell/BE Architecture with the StarPU Unified Runtime System. In SAMOS '09, pages 329-339, 2009.
- (2009) SAMOS '09 , pp. 329-339
- Augonnet, C.¹ Thibault, S.² Namyst, R.³ Nijhuis, M.⁴

13
- 70350635626
- An extension of the starss programming model for platforms with multiple gpus
- Berlin, Heidelberg, Springer-Verlag
- E. Ayguadé, R. M. Badia, F. D. Igual, J. Labarta, R. Mayo, and E. S. Quintana-Ortí. An extension of the starss programming model for platforms with multiple gpus. In Proceedings of the 15th International Euro-Par Conference on Parallel Processing, Euro-Par '09, pages 851-862, Berlin, Heidelberg, 2009. Springer-Verlag.
- (2009) Proceedings of the 15th International Euro-Par Conference on Parallel Processing, Euro-Par '09 , pp. 851-862
- Ayguadé, E.¹ Badia, R.M.² Igual, F.D.³ Labarta, J.⁴ Mayo, R.⁵ Quintana-Ortí, E.S.⁶

14
- 33646425180
- Programming Grid Applications with GRID Superscalar
- R. M. Badia, J. Labarta, R. Sirvent, J. M. Pérez, J. M. Cela, and R. Grima. Programming Grid Applications with GRID Superscalar. Journal of Grid Computing, 1:2003, 2003.
- (2003) Journal of Grid Computing , vol.1 , pp. 2003
- Badia, R.M.¹ Labarta, J.² Sirvent, R.³ Pérez, J.M.⁴ Cela, J.M.⁵ Grima, R.⁶

15
- 82655178682
- C. Banino, O. Beaumont, L. Carter, J. Ferrante, A. Legrand, and Y. Robert. Scheduling strategies for master-slave tasking on heterogeneous processor platforms. 2004.
- (2004) Scheduling Strategies for Master-slave Tasking on Heterogeneous Processor Platforms
- Banino, C.¹ Beaumont, O.² Carter, L.³ Ferrante, J.⁴ Legrand, A.⁵ Robert, Y.⁶

16
- 84877710346
- Legion: Expressing locality and independence with logical regions
- Los Alamitos, CA, USA, IEEE Computer Society Press
- M. Bauer, S. Treichler, E. Slaughter, and A. Aiken. Legion: expressing locality and independence with logical regions. In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, SC '12, pages 66:1-66:11, Los Alamitos, CA, USA, 2012. IEEE Computer Society Press.
- (2012) Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, SC '12
- Bauer, M.¹ Treichler, S.² Slaughter, E.³ Aiken, A.⁴

17
- 70450284396
- Scientific and Engineering Computing Using ATI Stream Technology
- A. Bayoumi, M. Chu, Y. Hanafy, P. Harrell, and G. Refai-Ahmed. Scientific and Engineering Computing Using ATI Stream Technology. Computing in Science and Engineering, 11(6):92-97, 2009.
- (2009) Computing in Science and Engineering , vol.11 , Issue.6 , pp. 92-97
- Bayoumi, A.¹ Chu, M.² Hanafy, Y.³ Harrell, P.⁴ Refai-Ahmed, G.⁵

18
- 34548265764
- CellSs: A programming model for the cell BE architecture
- P. Bellens, J. M. Perez, R. M. Badia, and J. Labarta. CellSs: a programming model for the cell BE architecture. In SC 2006.
- SC 2006
- Bellens, P.¹ Perez, J.M.² Badia, R.M.³ Labarta, J.⁴

19
- 84874228458
- Microsoft Research at TREC 2011 Web Track
- B. Billerbeck, N. Craswell, D. Fetterly, and M. Najork. Microsoft Research at TREC 2011 Web Track. In Proc. of the 20th Text Retrieval Conference, 2011.
- Proc. of the 20th Text Retrieval Conference, 2011
- Billerbeck, B.¹ Craswell, N.² Fetterly, D.³ Najork, M.⁴

20
- 79961204066
- Ffpf: Fairly fast packet filters
- H. Bos, W. de Bruijn, M. Cristea, T. Nguyen, and G. Portokalidis. Ffpf: Fairly fast packet filters. In Proceedings of OSDI'04, 2004.
- Proceedings of OSDI'04, 2004
- Bos, H.¹ De Bruijn, W.² Cristea, M.³ Nguyen, T.⁴ Portokalidis, G.⁵

21
- 79956351190
- Haloop: Efficient iterative data processing on large clusters
- Sept.
- Y. Bu, B. Howe, M. Balazinska, and M. D. Ernst. Haloop: efficient iterative data processing on large clusters. Proc. VLDB Endow., 3(1-2):285-296, Sept. 2010.
- (2010) Proc. VLDB Endow. , vol.3 , Issue.1-2 , pp. 285-296
- Bu, Y.¹ Howe, B.² Balazinska, M.³ Ernst, M.D.⁴

22
- 10644248153
- Brook for GPUs: Stream Computing on Graphics Hardware
- I. Buck, T. Foley, D. Horn, J. Sugerman, K. Fatahalian, M. Houston, and P. Hanrahan. Brook for GPUs: Stream Computing on Graphics Hardware. ACM TRANSACTIONS ON GRAPHICS, 2004.
- (2004) ACM Transactions on Graphics
- Buck, I.¹ Foley, T.² Horn, D.³ Sugerman, J.⁴ Fatahalian, K.⁵ Houston, M.⁶ Hanrahan, P.⁷

23
- 80052384784
- Productive cluster programming with ompss
- Berlin, Heidelberg, Springer-Verlag
- J. Bueno, L. Martinell, A. Duran, M. Farreras, X. Martorell, R. M. Badia, E. Ayguadé, and J. Labarta. Productive cluster programming with ompss. In Proceedings of the 17th international conference on Parallel processing - Volume Part I, Euro-Par'11, pages 555-566, Berlin, Heidelberg, 2011. Springer-Verlag.
- (2011) Proceedings of the 17th International Conference on Parallel Processing - Volume Part I, Euro-Par'11 , pp. 555-566
- Bueno, J.¹ Martinell, L.² Duran, A.³ Farreras, M.⁴ Martorell, X.⁵ Badia, R.M.⁶ Ayguadé, E.⁷ Labarta, J.⁸

24
- 84889645238
- University of Cambridge, June
- P. Calvert. Part II dissertation, Computer Science Tripos, University of Cambridge, June 2010.
- (2010) Part II Dissertation, Computer Science Tripos
- Calvert, P.¹

25
- 79952784184
- Copperhead: Compiling an embedded data parallel language
- B. Catanzaro, M. Garland, and K. Keutzer. Copperhead: compiling an embedded data parallel language. In Proceedings of the 16th ACM symposium on Principles and practice of parallel programming, PPoPP '11, pages 47-56, 2011.
- (2011) Proceedings of the 16th ACM Symposium on Principles and Practice of Parallel Programming, PPoPP '11 , pp. 47-56
- Catanzaro, B.¹ Garland, M.² Keutzer, K.³

26
- 63549103331
- A map reduce framework for programming graphics processors
- B. Catanzaro, N. Sundaram, and K. Keutzer. A map reduce framework for programming graphics processors. In Workshop on Software Tools for MultiCore Systems, 2008.
- Workshop on Software Tools for MultiCore Systems, 2008
- Catanzaro, B.¹ Sundaram, N.² Keutzer, K.³

27
- 77954727236
- FlumeJava: Easy, efficient data-parallel pipelines
- C. Chambers, A. Raniwala, F. Perry, S. Adams, R. Henry, R. Bradshaw, and N. Weizenbaum. FlumeJava: easy, efficient data-parallel pipelines. In PLDI'10.
- PLDI'10
- Chambers, C.¹ Raniwala, A.² Perry, F.³ Adams, S.⁴ Henry, R.⁵ Bradshaw, R.⁶ Weizenbaum, N.⁷

28
- 14944352193
- Processor-embedded distributed smart disks for I/O-intensive workloads: Architectures, performance models and evaluation
- S. C. Chiu, W.-k. Liao, A. N. Choudhary, and M. T. Kandemir. Processor-embedded distributed smart disks for I/O-intensive workloads: architectures, performance models and evaluation. J. Parallel Distrib. Comput., 65(4):532-551, 2005.
- (2005) J. Parallel Distrib. Comput. , vol.65 , Issue.4 , pp. 532-551
- Chiu, S.C.¹ Liao, W.-K.² Choudhary, A.N.³ Kandemir, M.T.⁴

29
- 84881142714
- Linqits: Big data on little clients
- E. Chung, J. Davis, and J. Lee. Linqits: Big data on little clients. In Proceedings of the 40th International Symposium on Computer Architecture (ISCA), 2013.
- Proceedings of the 40th International Symposium on Computer Architecture (ISCA), 2013
- Chung, E.¹ Davis, J.² Lee, J.³

30
- 56749165622
- Accelerating computing with the cell broadband engine processor
- C. H. Crawford, P. Henning, M. Kistler, and C. Wright. Accelerating computing with the cell broadband engine processor. In CF 2008, 2008.
- (2008) CF 2008
- Crawford, C.H.¹ Henning, P.² Kistler, M.³ Wright, C.⁴

31
- 26444562005
- Fpl-3: Towards language support for distributed packet processing
- M.-L. Cristea, W. de Bruijn, and H. Bos. Fpl-3: towards language support for distributed packet processing. In Proceedings of IFIP Networking 2005, 2005.
- (2005) Proceedings of IFIP Networking 2005
- Cristea, M.-L.¹ De Bruijn, W.² Bos, H.³

32
- 26444512024
- Fpl-3e: Towards language support for reconfigurable packet processing
- Berlin, Heidelberg, Springer-Verlag
- M. L. Cristea, C. Zissulescu, E. Deprettere, and H. Bos. Fpl-3e: towards language support for reconfigurable packet processing. In Proceedings of the 5th international conference on Embedded Computer Systems: architectures, Modeling, and Simulation, SAMOS'05, pages 82-92, Berlin, Heidelberg, 2005. Springer-Verlag.
- (2005) Proceedings of the 5th International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation, SAMOS'05 , pp. 82-92
- Cristea, M.L.¹ Zissulescu, C.² Deprettere, E.³ Bos, H.⁴

33
- 84889687312
- Supporting iteration in a heterogeneous dataflow engine
- J. Currey, S. Baker, and C. J. Rossbach. Supporting iteration in a heterogeneous dataflow engine. In SFMA, 2013.
- (2013) SFMA
- Currey, J.¹ Baker, S.² Rossbach, C.J.³

34
- 33746173613
- TCP offload to the rescue
- A. Currid. TCP offload to the rescue. Queue, 2(3):58-65, 2004.
- (2004) Queue , vol.2 , Issue.3 , pp. 58-65
- Currid, A.¹

35
- 0020087077
- Data flow program graphs
- A. L. Davis and R. M. Keller. Data flow program graphs. IEEE Computer, 15(2):26-41, 1982.
- (1982) IEEE Computer , vol.15 , Issue.2 , pp. 26-41
- Davis, A.L.¹ Keller, R.M.²

36
- 77952251155
- Pipesfs: Fast linux i/o in the unix tradition
- July Special Issue on R&D in the Linux Kernel
- W. de Bruijn and H. Bos. Pipesfs: Fast linux i/o in the unix tradition. ACM SigOps Operating Systems Review, 42(5), July 2008. Special Issue on R&D in the Linux Kernel.
- (2008) ACM SigOps Operating Systems Review , vol.42 , Issue.5
- De Bruijn, W.¹ Bos, H.²

37
- 79956128397
- Application-tailored i/o with streamline
- May
- W. de Bruijn, H. Bos, and H. Bal. Application-tailored i/o with streamline. ACM Trans. Comput. Syst., 29:6:1-6:33, May 2011.
- (2011) ACM Trans. Comput. Syst. , vol.29
- De Bruijn, W.¹ Bos, H.² Bal, H.³

38
- 83155190194
- Liszt: A domain specific language for building portable mesh-based pde solvers
- New York, NY, USA, ACM
- Z. DeVito, N. Joubert, F. Palacios, S. Oakley, M. Medina, M. Barrientos, E. Elsen, F. Ham, A. Aiken, K. Duraisamy, E. Darve, J. Alonso, and P. Hanrahan. Liszt: a domain specific language for building portable mesh-based pde solvers. In Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, SC '11, pages 9:1-9:12, New York, NY, USA, 2011. ACM.
- (2011) Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, SC '11
- DeVito, Z.¹ Joubert, N.² Palacios, F.³ Oakley, S.⁴ Medina, M.⁵ Barrientos, M.⁶ Elsen, E.⁷ Ham, F.⁸ Aiken, A.⁹ Duraisamy, K.¹⁰ Darve, E.¹¹ Alonso, J.¹² Hanrahan, P.¹³

39
- 84866597602
- Parallel pagerank computation using gpus
- N. T. Duong, Q. A. P. Nguyen, A. T. Nguyen, and H.-D. Nguyen. Parallel pagerank computation using gpus. In Proceedings of the Third Symposium on Information and Communication Technology, SoICT '12, pages 223-230, 2012.
- (2012) Proceedings of the Third Symposium on Information and Communication Technology, SoICT '12 , pp. 223-230
- Duong, N.T.¹ Nguyen, Q.A.P.² Nguyen, A.T.³ Nguyen, H.-D.⁴

40
- 78650003594
- Twister: A runtime for iterative mapreduce
- ACM
- J. Ekanayake, H. Li, B. Zhang, T. Gunarathne, S.-H. Bae, J. Qiu, and G. Fox. Twister: a runtime for iterative mapreduce. In HPDC '10. ACM, 2010.
- (2010) HPDC '10.
- Ekanayake, J.¹ Li, H.² Zhang, B.³ Gunarathne, T.⁴ Bae, S.-H.⁵ Qiu, J.⁶ Fox, G.⁷

41
- 84870501280
- Spinning fast iterative data flows
- S. Ewen, K. Tzoumas, M. Kaufmann, and V. Markl. Spinning fast iterative data flows. VLDB, 2012.
- (2012) VLDB
- Ewen, S.¹ Tzoumas, K.² Kaufmann, M.³ Markl, V.⁴

42
- 70449574756
- Self/star: A data-flow oriented component framework for pervasive dependability
- IEEE Computer Society
- C. Fetzer and K. Högstedt. Self/star: A data-flow oriented component framework for pervasive dependability. In 8th IEEE International Workshop on Object-Oriented Real-Time Dependable Systems (WORDS 2003), 15-17 January 2003, Guadalajara, Mexico, pages 66-73. IEEE Computer Society, 2003.
- (2003) 8th IEEE International Workshop on Object-Oriented Real-Time Dependable Systems (WORDS 2003), 15-17 January 2003, Guadalajara, Mexico , pp. 66-73
- Fetzer, C.¹ Högstedt, K.²

43
- 84895890460
- Fast computation of database operations using graphics processors
- N. K. Govindaraju, B. Lloyd, W. Wang, M. Lin, and D. Manocha. Fast computation of database operations using graphics processors. In ACM SIGGRAPH 2005 Courses, SIGGRAPH '05, 2005.
- ACM SIGGRAPH 2005 Courses, SIGGRAPH '05, 2005
- Govindaraju, N.K.¹ Lloyd, B.² Wang, W.³ Lin, M.⁴ Manocha, D.⁵

44
- 0042897759
- Data mining the SDSS SkyServer database
- Paris, France, March Carleton Scientific. also as MSR-TR-2002-01
- J. Gray, A. Szalay, A. Thakar, P. Kunszt, C. Stoughton, D. Slutz, and J. Vandenberg. Data mining the SDSS SkyServer database. In Distributed Data and Structures 4: Records of the 4th International Meeting, pages 189-210, Paris, France, March 2002. Carleton Scientific. also as MSR-TR-2002-01.
- (2002) Distributed Data and Structures 4: Records of the 4th International Meeting , pp. 189-210
- Gray, J.¹ Szalay, A.² Thakar, A.³ Kunszt, P.⁴ Stoughton, C.⁵ Slutz, D.⁶ Vandenberg, J.⁷

45
- 84874042639
- Microsoft Press Series. Microsoft GmbH
- K. Gregory and A. Miller. C++ Amp: Accelerated Massive Parallelism With Microsoft Visual C++. Microsoft Press Series. Microsoft GmbH, 2012.
- (2012) C++ Amp: Accelerated Massive Parallelism with Microsoft Visual C++
- Gregory, K.¹ Miller, A.²

46
- 79953286075
- A static task partitioning approach for heterogeneous systems using opencl
- D. Grewe and M. O'Boyle. A static task partitioning approach for heterogeneous systems using opencl. Compiler Construction, 6601:286-305, 2011.
- (2011) Compiler Construction , vol.6601 , pp. 286-305
- Grewe, D.¹ O'Boyle, M.²

47
- 84863043723
- Pegasus: Coordinated scheduling for virtualized accelerator-based systems
- Berkeley, CA, USA, USENIX Association
- V. Gupta, K. Schwan, N. Tolia, V. Talwar, and P. Ranganathan. Pegasus: coordinated scheduling for virtualized accelerator-based systems. In Proceedings of the 2011 USENIX conference on USENIX annual technical conference, USENIX-ATC '11, pages 3-3, Berkeley, CA, USA, 2011. USENIX Association.
- (2011) Proceedings of the 2011 USENIX Conference on USENIX Annual Technical Conference, USENIX-ATC '11 , pp. 3-3
- Gupta, V.¹ Schwan, K.² Tolia, N.³ Talwar, V.⁴ Ranganathan, P.⁵

48
- 67650673468
- hiCUDA: A high-level directive-based language for GPU programming
- T. D. Han and T. S. Abdelrahman. hiCUDA: a high-level directive-based language for GPU programming. In GPGPU 2009.
- GPGPU 2009
- Han, T.D.¹ Abdelrahman, T.S.²

49
- 63549097654
- Mars: A mapreduce framework on graphics processors
- B. He, W. Fang, Q. Luo, N. K. Govindaraju, and T. Wang. Mars: a mapreduce framework on graphics processors. In Proceedings of the 17th international conference on Parallel architectures and compilation techniques, PACT '08, pages 260-269, 2008.
- (2008) Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques, PACT '08 , pp. 260-269
- He, B.¹ Fang, W.² Luo, Q.³ Govindaraju, N.K.⁴ Wang, T.⁵

50
- 76149104641
- Relational query coprocessing on graphics processors
- Dec.
- B. He, M. Lu, K. Yang, R. Fang, N. K. Govindaraju, Q. Luo, and P. V. Sander. Relational query coprocessing on graphics processors. ACM Trans. Database Syst., 34(4):21:1-21:39, Dec. 2009.
- (2009) ACM Trans. Database Syst. , vol.34 , Issue.4
- He, B.¹ Lu, M.² Yang, K.³ Fang, R.⁴ Govindaraju, N.K.⁵ Luo, Q.⁶ Sander, P.V.⁷

51
- 54749089017
- Relational joins on graphics processors
- B. He, K. Yang, R. Fang, M. Lu, N. Govindaraju, Q. Luo, and P. Sander. Relational joins on graphics processors. SIGMOD '08, 2008.
- SIGMOD '08, 2008
- He, B.¹ Yang, K.² Fang, R.³ Lu, M.⁴ Govindaraju, N.⁵ Luo, Q.⁶ Sander, P.⁷

52
- 84889688020
- The HIVE project
- The HIVE project. http://hadoop.apache.org/hive/.

53
- 70449669477
- Flextream: Adaptive compilation of streaming applications for heterogeneous architectures
- A. Hormati, Y. Choi, M. Kudlur, R. M. Rabbah, T. Mudge, and S. A. Mahlke. Flextream: Adaptive compilation of streaming applications for heterogeneous architectures. In PACT, pages 214-223, 2009.
- (2009) PACT , pp. 214-223
- Hormati, A.¹ Choi, Y.² Kudlur, M.³ Rabbah, R.M.⁴ Mudge, T.⁵ Mahlke, S.A.⁶

54
- 77952283784
- Ruler: High-speed packet matching and rewriting on npus
- New York, NY, USA, ACM
- T. Hruby, K. van Reeuwijk, and H. Bos. Ruler: high-speed packet matching and rewriting on npus. In ANCS '07: Proceedings of the 3rd ACM/IEEE Symposium on Architecture for networking and communications systems, pages 1-10, New York, NY, USA, 2007. ACM.
- (2007) ANCS '07: Proceedings of the 3rd ACM/IEEE Symposium on Architecture for Networking and Communications Systems , pp. 1-10
- Hruby, T.¹ Van Reeuwijk, K.² Bos, H.³

55
- 49049098857
- Liquid metal: Object-oriented programming across the hardware/software boundary
- S. S. Huang, A. Hormati, D. F. Bacon, and R. M. Rabbah. Liquid metal: Object-oriented programming across the hardware/software boundary. In ECOOP, pages 76-103, 2008.
- (2008) ECOOP , pp. 76-103
- Huang, S.S.¹ Hormati, A.² Bacon, D.F.³ Rabbah, R.M.⁴

56
- 84872201157
- Intel. Math kernel library. http://developer.intel.com/software/products/mkl/.
- Math Kernel Library

57
- 35448961922
- Dryad: Distributed data-parallel programs from sequential building blocks
- M. Isard, M. Budiu, Y. Yu, A. Birrell, and D. Fetterly. Dryad: distributed data-parallel programs from sequential building blocks. In EuroSys 2007.
- EuroSys 2007
- Isard, M.¹ Budiu, M.² Yu, Y.³ Birrell, A.⁴ Fetterly, D.⁵

58
- 84866869010
- Mate-cg: A map reduce-like framework for accelerating data-intensive computations on heterogeneous clusters
- 0
- W. Jiang and G. Agrawal. Mate-cg: A map reduce-like framework for accelerating data-intensive computations on heterogeneous clusters. Parallel and Distributed Processing Symposium, International, 0:644-655, 2012.
- (2012) Parallel and Distributed Processing Symposium, International , pp. 644-655
- Jiang, W.¹ Agrawal, G.²

59
- 76749088304
- Predictive runtime code scheduling for heterogeneous architectures
- V. J. Jiménez, L. Vilanova, I. Gelado, M. Gil, G. Fursin, and N. Navarro. Predictive runtime code scheduling for heterogeneous architectures. In HiPEAC 2009.
- HiPEAC 2009
- Jiménez, V.J.¹ Vilanova, L.² Gelado, I.³ Gil, M.⁴ Fursin, G.⁵ Navarro, N.⁶

60
- 84889674251
- P. K., V. K. K., A. S. H. B., S. Balasubramanian, and P. Baruah. Cost efficient pagerank computation using gpu. 2011.
- (2011) Cost Efficient Pagerank Computation Using Gpu
- K, P.¹ K, V.K.² B, A.S.H.³ Balasubramanian, S.⁴ Baruah, P.⁵

61
- 85077032008
- Timegraph: GPU scheduling for real-time multi-tasking environments
- S. Kato, K. Lakshmanan, R. Rajkumar, and Y. Ishikawa. Timegraph: GPU scheduling for real-time multi-tasking environments. In Proceedings of the 2011 USENIX conference on USENIX annual technical conference, 2011.
- Proceedings of the 2011 USENIX Conference on USENIX Annual Technical Conference, 2011
- Kato, S.¹ Lakshmanan, K.² Rajkumar, R.³ Ishikawa, Y.⁴

62
- 0001939015
- A case for intelligent disks (IDISKs)
- K. Keeton, D. A. Patterson, and J. M. Hellerstein. A case for intelligent disks (IDISKs). SIGMOD Rec., 27(3):42-52, 1998.
- (1998) SIGMOD Rec. , vol.27 , Issue.3 , pp. 42-52
- Keeton, K.¹ Patterson, D.A.² Hellerstein, J.M.³

63
- 70349100958
- Khronos Group. Version 1.2
- Khronos Group. The OpenCL Specification, Version 1.2, 2012.
- (2012) The OpenCL Specification

64
- 84889671157
- A. Kloeckner. pycuda. https://pypi.python.org/pypi/pycuda, 2012.
- (2012) Pycuda
- Kloeckner, A.¹

65
- 0040291388
- The click modular router
- 18, August
- E. Kohler, R. Morris, B. Chen, J. Jannotti, and M. F. Kaashoek. The click modular router. ACM Trans. Comput. Syst., 18, August 2000.
- (2000) ACM Trans. Comput. Syst.
- Kohler, E.¹ Morris, R.² Chen, B.³ Jannotti, J.⁴ Kaashoek, M.F.⁵

66
- 67650046428
- Merge: A programming model for heterogeneous multi-core systems
- Mar.
- M. D. Linderman, J. D. Collins, H. Wang, and T. H. Meng. Merge: a programming model for heterogeneous multi-core systems. SIGPLAN Not., 43(3):287-296, Mar. 2008.
- (2008) SIGPLAN Not. , vol.43 , Issue.3 , pp. 287-296
- Linderman, M.D.¹ Collins, J.D.² Wang, H.³ Meng, T.H.⁴

67
- 1442310361
- Wordware Publishing Inc., Plano, TX, USA
- M. Linetsky. Programming Microsoft Directshow. Wordware Publishing Inc., Plano, TX, USA, 2001.
- (2001) Programming Microsoft Directshow
- Linetsky, M.¹

68
- 0031678357
- P-rio: A modular parallel-programming environment
- January
- O. Loques, J. Leite, and E. V. Carrera E. P-rio: A modular parallel-programming environment. IEEE Concurrency, 6:47-57, January 1998.
- (1998) IEEE Concurrency , vol.6 , pp. 47-57
- Loques, O.¹ Leite, J.² Carrera E, E.V.³

69
- 76749140917
- Qilin: Exploiting parallelism on heterogeneous multiprocessors with adaptive mapping
- C.-K. Luk, S. Hong, and H. Kim. Qilin: exploiting parallelism on heterogeneous multiprocessors with adaptive mapping. In Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 42, pages 45-55, 2009.
- (2009) Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 42 , pp. 45-55
- Luk, C.-K.¹ Hong, S.² Kim, H.³

70
- 77954723629
- Pregel: A system for large-scale graph processing
- ACM
- G. Malewicz, M. H. Austern, A. J. Bik, J. C. Dehnert, I. Horn, N. Leiser, and G. Czajkowski. Pregel: a system for large-scale graph processing. In SIGMOD. ACM, 2010.
- (2010) SIGMOD
- Malewicz, G.¹ Austern, M.H.² Bik, A.J.³ Dehnert, J.C.⁴ Horn, I.⁵ Leiser, N.⁶ Czajkowski, G.⁷

71
- 34548275522
- Programming using RapidMind on the Cell BE
- M. D. McCool and B. D'Amora. Programming using RapidMind on the Cell BE. In SC '06: Proceedings of the 2006 ACM/IEEE conference on Supercomputing, page 222, 2006.
- (2006) SC '06: Proceedings of the 2006 ACM/IEEE Conference on Supercomputing , pp. 222
- McCool, M.D.¹ D'Amora, B.²

72
- 85084014248
- Differential dataflow
- F. McSherry, D. G. Murray, R. Isaacs, and M. Isard. Differential dataflow. In CIDR, 2013.
- (2013) CIDR
- McSherry, F.¹ Murray, D.G.² Isaacs, R.³ Isard, M.⁴

73
- 84873171138
- Rex: Recursive, delta-based data-centric computation
- July
- S. R. Mihaylov, Z. G. Ives, and S. Guha. Rex: recursive, delta-based data-centric computation. Proc. VLDB Endow., 5(11):1280-1291, July 2012.
- (2012) Proc. VLDB Endow. , vol.5 , Issue.11 , pp. 1280-1291
- Mihaylov, S.R.¹ Ives, Z.G.² Guha, S.³

74
- 84889658377
- Naiad: A timely dataflow system
- D. G. Murray, F. McSherry, R. Isaacs, M. Isard, P. Barham, and M. Abadi. Naiad: a timely dataflow system. SOSP, 2013.
- (2013) SOSP
- Murray, D.G.¹ McSherry, F.² Isaacs, R.³ Isard, M.⁴ Barham, P.⁵ Abadi, M.⁶

75
- 85049119901
- Ciel: A universal execution engine for distributed dataflow computing
- D. G. Murray, M. Schwarzkopf, C. Smowton, S. Smith, A. Madhavapeddy, and S. Hand. Ciel: a universal execution engine for distributed dataflow computing. In NSDI, 2011.
- (2011) NSDI
- Murray, D.G.¹ Schwarzkopf, M.² Smowton, C.³ Smith, S.⁴ Madhavapeddy, A.⁵ Hand, S.⁶

76
- 84987200491
- The code 2.0 graphical parallel programming language
- P. Newton and J. C. Browne. The code 2.0 graphical parallel programming language. In Proceedings of the 6th international conference on Supercomputing, ICS '92, pages 167-177, 1992.
- (1992) Proceedings of the 6th International Conference on Supercomputing, ICS '92 , pp. 167-177
- Newton, P.¹ Browne, J.C.²

77
- 84889659877
- NVIDIA. The thrust library. https://developer.nvidia.com/thrust/.
- The Thrust Library

78
- 80054979751
- NVIDIA
- NVIDIA. CUDA Toolkit 4.0 CUBLAS Library, 2011.
- (2011) CUDA Toolkit 4.0 CUBLAS Library

79
- 84889654526
- NVIDIA
- NVIDIA. NVIDIA CUDA 5.0 Programming Guide, 2013.
- (2013) NVIDIA CUDA 5.0 Programming Guide

80
- 79959876216
- Automatic compilation of matlab programs for synergistic execution on heterogeneous processors
- A. Prasad, J. Anantpur, and R. Govindarajan. Automatic compilation of matlab programs for synergistic execution on heterogeneous processors. In Proceedings of the 32nd ACM SIGPLAN conference on Programming language design and implementation, PLDI '11, pages 152-163, 2011.
- (2011) Proceedings of the 32nd ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI '11 , pp. 152-163
- Prasad, A.¹ Anantpur, J.² Govindarajan, R.³

81
- 84870411879
- Rootbeer: Seamlessly using gpus from java
- P. C. Pratt-Szeliga, J. W. Fawcett, and R. D. Welch. Rootbeer: Seamlessly using gpus from java. In HPCC-ICESS, pages 375-380, 2012.
- (2012) HPCC-ICESS , pp. 375-380
- Pratt-Szeliga, P.C.¹ Fawcett, J.W.² Welch, R.D.³

82
- 84883116448
- Halide: A language and compiler for optimizing parallelism, locality, and recomputation in image processing pipelines
- New York, NY, USA, ACM
- J. Ragan-Kelley, C. Barnes, A. Adams, S. Paris, F. Durand, and S. Amarasinghe. Halide: a language and compiler for optimizing parallelism, locality, and recomputation in image processing pipelines. In Proceedings of the 34th ACM SIGPLAN conference on Programming language design and implementation, PLDI '13, pages 519-530, New York, NY, USA, 2013. ACM.
- (2013) Proceedings of the 34th ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI '13 , pp. 519-530
- Ragan-Kelley, J.¹ Barnes, C.² Adams, A.³ Paris, S.⁴ Durand, F.⁵ Amarasinghe, S.⁶

83
- 84863676008
- Scheduling concurrent applications on a cluster of cpu-gpu nodes
- V. T. Ravi, M. Becchi, W. Jiang, G. Agrawal, and S. Chakradhar. Scheduling concurrent applications on a cluster of cpu-gpu nodes. In Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012), CCGRID '12, pages 140-147, 2012.
- (2012) Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (Ccgrid 2012), CCGRID '12 , pp. 140-147
- Ravi, V.T.¹ Becchi, M.² Jiang, W.³ Agrawal, G.⁴ Chakradhar, S.⁵

84
- 0035371016
- Active disks for large-scale data processing
- E. Riedel, C. Faloutsos, G. A. Gibson, and D. Nagle. Active disks for large-scale data processing. Computer, 34(6):68-74, 2001.
- (2001) Computer , vol.34 , Issue.6 , pp. 68-74
- Riedel, E.¹ Faloutsos, C.² Gibson, G.A.³ Nagle, D.⁴

85
- 82655162782
- Ptask: Operating system abstractions to manage gpus as compute devices
- C. Rossbach, J. Currey, M. Silberstein, B. Ray, and E. Witchel. Ptask: Operating system abstractions to manage gpus as compute devices. In SOSP, 2011.
- (2011) SOSP
- Rossbach, C.¹ Currey, J.² Silberstein, M.³ Ray, B.⁴ Witchel, E.⁵

86
- 84862122847
- Fast pagerank computation on a gpu cluster
- A. Rungsawang and B. Manaskasemsak. Fast pagerank computation on a gpu cluster. In Proceedings of the 2012 20th Euromicro International Conference on Parallel, Distributed and Network-based Processing, PDP '12, pages 450-456, 2012.
- (2012) Proceedings of the 2012 20th Euromicro International Conference on Parallel, Distributed and Network-based Processing, PDP '12 , pp. 450-456
- Rungsawang, A.¹ Manaskasemsak, B.²

87
- 79959466764
- Optimization principles and application performance evaluation of a multithreaded GPU using CUDA
- S. Ryoo, C. I. Rodrigues, S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, and W.-m. Hwu. Optimization principles and application performance evaluation of a multithreaded GPU using CUDA. In PPoPP 2008.
- PPoPP 2008
- Ryoo, S.¹ Rodrigues, C.I.² Baghsorkhi, S.S.³ Stone, S.S.⁴ Kirk, D.B.⁵ Hwu, W.-M.⁶

88
- 0003622823
- version 4.3. Technical report, OpenGL.org
- M. Segal and K. Akeley. The opengl graphics system: A specification version 4.3. Technical report, OpenGL.org, 2012.
- (2012) The Opengl Graphics System: A Specification
- Segal, M.¹ Akeley, K.²

89
- 84875669260
- Gpufs: Integrating file systems with gpus
- ACM
- M. Silberstein, B. Ford, I. Keidar, and E. Witchel. Gpufs: integrating file systems with gpus. In Proceedings of the Eighteenth International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS '13. ACM, 2013.
- (2013) Proceedings of the Eighteenth International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS '13
- Silberstein, M.¹ Ford, B.² Keidar, I.³ Witchel, E.⁴

90
- 78249234704
- Maestro: Data orchestration and tuning for opencl devices
- P. D'Ambra, M. R. Guarracino, and D. Talia, editors, Euro-Par (2), Springer
- K. Spafford, J. S. Meredith, and J. S. Vetter. Maestro: Data orchestration and tuning for opencl devices. In P. D'Ambra, M. R. Guarracino, and D. Talia, editors, Euro-Par (2), volume 6272 of Lecture Notes in Computer Science, pages 275-286. Springer, 2010.
- (2010) Lecture Notes in Computer Science , vol.6272 , pp. 275-286
- Spafford, K.¹ Meredith, J.S.² Vetter, J.S.³

91
- 84858773648
- Enabling task-level scheduling on heterogeneous platforms
- E. Sun, D. Schaa, R. Bagley, N. Rubin, and D. Kaeli. Enabling task-level scheduling on heterogeneous platforms. In Proceedings of the 5th Annual Workshop on General Purpose Processing with Graphics Processing Units, GPGPU-5, pages 84-93, 2012.
- (2012) Proceedings of the 5th Annual Workshop on General Purpose Processing with Graphics Processing Units, GPGPU-5 , pp. 84-93
- Sun, E.¹ Schaa, D.² Bagley, R.³ Rubin, N.⁴ Kaeli, D.⁵

92
- 84889632679
- G. Teodoro, T. Pan, T. Kurc, J. Kong, L. Cooper, N. Podhorszki, S. Klasky, and J. Saltz. High-throughput analysis of large microscopy image datasets on cpu-gpu cluster platforms. 2013.
- (2013) High-throughput Analysis of Large Microscopy Image Datasets on Cpu-gpu Cluster Platforms
- Teodoro, G.¹ Pan, T.² Kurc, T.³ Kong, J.⁴ Cooper, L.⁵ Podhorszki, N.⁶ Klasky, S.⁷ Saltz, J.⁸

93
- 0037521913
- StreamIt: A Language for Streaming Applications
- W. Thies, M. Karczmarek, and S. P. Amarasinghe. StreamIt: A Language for Streaming Applications. In CC 2002.
- CC 2002
- Thies, W.¹ Karczmarek, M.² Amarasinghe, S.P.³

94
- 77952597755
- CUDA-Lite: Reducing GPU Programming Complexity
- S.-Z. Ueng, M. Lathara, S. S. Baghsorkhi, and W.-M.W. Hwu. CUDA-Lite: Reducing GPU Programming Complexity. In LCPC 2008.
- LCPC 2008
- Ueng, S.-Z.¹ Lathara, M.² Baghsorkhi, S.S.³ Hwu, W.-M.W.⁴

95
- 79959597180
- Processing data streams with hard real-time constraints on heterogeneous systems
- New York, NY, USA, ACM
- U. Verner, A. Schuster, and M. Silberstein. Processing data streams with hard real-time constraints on heterogeneous systems. In Proceedings of the international conference on Supercomputing, ICS '11, pages 120-129, New York, NY, USA, 2011. ACM.
- (2011) Proceedings of the International Conference on Supercomputing, ICS '11 , pp. 120-129
- Verner, U.¹ Schuster, A.² Silberstein, M.³

96
- 82655166298
- Tapping into the fountain of CPUs: On operating system support for programmable devices
- Y. Weinsberg, D. Dolev, T. Anker, M. Ben-Yehuda, and P. Wyckoff. Tapping into the fountain of CPUs: on operating system support for programmable devices. In ASPLOS 2008.
- ASPLOS 2008
- Weinsberg, Y.¹ Dolev, D.² Anker, T.³ Ben-Yehuda, M.⁴ Wyckoff, P.⁵

97
- 84870898987
- Accelerating text mining workloads in a mapreduce-based distributed gpu environment
- Feb.
- P. Wittek and S. Darányi. Accelerating text mining workloads in a mapreduce-based distributed gpu environment. J. Parallel Distrib. Comput., 73(2):198-206, Feb. 2013.
- (2013) J. Parallel Distrib. Comput. , vol.73 , Issue.2 , pp. 198-206
- Wittek, P.¹ Darányi, S.²

98
- 84875184822
- Kernel weaver: Automatically fusing database primitives for efficient gpu computation
- H. Wu, G. Diamos, S. Cadambi, and S. Yalamanchili. Kernel weaver: Automatically fusing database primitives for efficient gpu computation. In Proceedings of the 45th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO-45 '12, 2012.
- Proceedings of the 45th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO-45 '12, 2012
- Wu, H.¹ Diamos, G.² Cadambi, S.³ Yalamanchili, S.⁴

99
- 70350678845
- JCUDA: A programmer-friendly interface for accelerating java programs with CUDA
- Y. Yan, M. Grossman, and V. Sarkar. JCUDA: A programmer-friendly interface for accelerating java programs with CUDA. In Euro-Par, pages 887-899, 2009.
- (2009) Euro-Par , pp. 887-899
- Yan, Y.¹ Grossman, M.² Sarkar, V.³

100
- 72249089011
- Distributed aggregation for data-parallel computing: Interfaces and implementations
- Y. Yu, P. K. Gunda, and M. Isard. Distributed aggregation for data-parallel computing: interfaces and implementations. In SOSP, pages 247-260, 2009.
- (2009) SOSP , pp. 247-260
- Yu, Y.¹ Gunda, P.K.² Isard, M.³

101
- 85076882757
- DryadLINQ: A system for general-purpose distributed data-parallel computing using a high-level language
- Y. Yu, M. Isard, D. Fetterly, M. Budiu, Ú. Erlingsson, P. K. Gunda, and J. Currey. DryadLINQ: A system for general-purpose distributed data-parallel computing using a high-level language. In Proceedings of the 8th Symposium on Operating Systems Design and Implementation (OSDI), pages 1-14, 2008.
- (2008) Proceedings of the 8th Symposium on Operating Systems Design and Implementation (OSDI) , pp. 1-14
- Yu, Y.¹ Isard, M.² Fetterly, D.³ Budiu, M.⁴ Erlingsson, Ú.⁵ Gunda, P.K.⁶ Currey, J.⁷

102
- 70849123432
- Technical Report MSR-TR-2008-74, Microsoft Research, May
- Y. Yu, M. Isard, D. Fetterly, M. Budiu, U. Erlingsson, P. K. Gunda, J. Currey, F. McSherry, and K. Achan. Some sample programs written in DryadLINQ. Technical Report MSR-TR-2008-74, Microsoft Research, May 2008.
- (2008) Some Sample Programs Written in DryadLINQ
- Yu, Y.¹ Isard, M.² Fetterly, D.³ Budiu, M.⁴ Erlingsson, U.⁵ Gunda, P.K.⁶ Currey, J.⁷ McSherry, F.⁸ Achan, K.⁹

103
- 85040175609
- Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing
- M. Zaharia, M. Chowdhury, T. Das, A. Dave, J. Ma, M. McCauley, M. J. Franklin, S. Shenker, and I. Stoica. Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing. In NSDI, 2012.
- (2012) NSDI
- Zaharia, M.¹ Chowdhury, M.² Das, T.³ Dave, A.⁴ Ma, J.⁵ McCauley, M.⁶ Franklin, M.J.⁷ Shenker, S.⁸ Stoica, I.⁹

104
- 84889647470
- Microsoft Cambridge at TREC-13: Web and HARD tracks
- H. Zaragoza, N. Craswell, M. Taylor, S. Saria, and S. Robertson. Microsoft Cambridge at TREC-13: Web and HARD tracks. In Proc. of the 13th Text Retrieval Conference, 2004.
- Proc. of the 13th Text Retrieval Conference, 2004
- Zaragoza, H.¹ Craswell, N.² Taylor, M.³ Saria, S.⁴ Robertson, S.⁵

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.