SCOPUS 정보 검색 플랫폼

ACM SIGPLAN Notices

Volumn 43, Issue 3, 2008, Pages 297-307

Streamware: Programming general-purpose multicore processors using streams

(4) Gummaraju, Jayanth a Coburn, Joel a Turner, Yoshio b Rosenblum, Mendel a

a STANFORD UNIVERSITY (United States)

b HEWLETT PACKARD LABORATORIES (United States)

Author keywords

General Purpose Multicore Processors; Programming; Runtime System; Streams

Indexed keywords

AUTOMATIC PARALLELIZATION; DATA-INTENSIVE APPLICATION; GENERAL PURPOSE PROCESSORS; GENERAL-PURPOSE MULTICORE PROCESSORS; MULTI CORE; MULTI-CORE PROCESSOR; PROCESSOR CORES; PROGRAMMING; PROGRAMMING MODELS; RUNTIME ENVIRONMENTS; RUNTIME SYSTEM; SCIENTIFIC APPLICATIONS; SOFTWARE SYSTEMS; STREAM COMPILERS; STREAM LANGUAGES; STREAM PROCESSOR; STREAM PROGRAMMING; STREAMS; VIRTUAL-MACHINE CODE; WORKLOAD VARIATION;

GENERAL PURPOSE COMPUTERS; MACHINE DESIGN; QUERY LANGUAGES;

PROGRAM COMPILERS;

EID: 67650035153 PISSN: 15232867 EISSN: None Source Type: Journal
DOI: None Document Type: Article

Times cited : (2)

References (40)

1
- 67650065863
- Intel Thread Building Blocks. osstbb.intel.com
- Intel Thread Building Blocks. osstbb.intel.com.

2
- 84869355228
- MPI. www.open-mpi.org.

3
- 84869341273
- NVidia G80. www.nvidia.com.
- NVidia G80

4
- 84869355463
- OpenMP. www.openmp.org.

5
- 84869355461
- RStream Compiler
- RStream Compiler. www.reservoir.com.

6
- 33645956449
- Simplified discontinuous Galerkin methods for systems of conservation laws with convex extension
- Discontinuous Galerkin Methods, of, Springer-Verlag, Heidelberg
- T. Barth. Simplified discontinuous Galerkin methods for systems of conservation laws with convex extension. In Discontinuous Galerkin Methods, volume 11 of Lecture Notes in Computational Science and Engineering. Springer-Verlag, Heidelberg, 1999.
- (1999) Lecture Notes in Computational Science and Engineering , vol.11
- Barth, T.¹

7
- 0032689024
- Constitutive model and finite element formulation for large strain elasto-plastic analysis of shells
- Jun
- Y. Basar and M. Itskov. Constitutive model and finite element formulation for large strain elasto-plastic analysis of shells. In Journal of Computational Mechanics, Jun 1999.
- (1999) Journal of Computational Mechanics
- Basar, Y.¹ Itskov, M.²

8
- 2642548834
- Network-oriented full system simulation using M5
- N. Binkert, E. Hallnor, and S. Reinhardt. Network-oriented full system simulation using M5. In CAECW, 2003.
- (2003) CAECW
- Binkert, N.¹ Hallnor, E.² Reinhardt, S.³

9
- 33751032129
- McRT-STM: A high performance software transactional memory system for a multi-core runtime
- Bratin Saha et al. McRT-STM: a high performance software transactional memory system for a multi-core runtime. In PPoPP, 2006.
- (2006) PPoPP
- Saha, B.¹

10
- 51049084341
- Enabling scalability and performance in a large scale CMP environment
- Bratin Saha et al. Enabling scalability and performance in a large scale CMP environment. In Eurosys, 2007.
- (2007) Eurosys
- Saha, B.¹

11
- 84877609547
- Brook for GPUs: Stream computing on graphics hardware
- I. Buck, T. Foley, D. Horn, J. Sugerman, K. Fatahalian, M. Houston, and P. Hanrahan. Brook for GPUs: Stream computing on graphics hardware. In SIGGRAPH, 2004.
- (2004) SIGGRAPH
- Buck, I.¹ Foley, T.² Horn, D.³ Sugerman, J.⁴ Fatahalian, K.⁵ Houston, M.⁶ Hanrahan, P.⁷

12
- 34547679939
- Evaluating MapReduce for Multicore and Multiprocessor Systems
- C. Ranger et al. Evaluating MapReduce for Multicore and Multiprocessor Systems. In HPCA, 2007.
- (2007) HPCA
- Ranger, C.¹

13
- 2942753446
- SC, Nov
- W. Dally, P. Hanrahan, M. Erez, T. J. Knight, F. Labonte, J.-H. Ahn, N. Jayasena, U. J. Kapasi, A. Das, J. Gummaraju, and I. Buck. Merrimac: Supercomputing with streams. In SC, Nov 2003.
- (2003) Merrimac: Supercomputing with streams
- Dally, W.¹ Hanrahan, P.² Erez, M.³ Knight, T.J.⁴ Labonte, F.⁵ Ahn, J.-H.⁶ Jayasena, N.⁷ Kapasi, U.J.⁸ Das, A.⁹ Gummaraju, J.¹⁰ Buck, I.¹¹

14
- 34247114371
- Compiling for Stream Processing
- A. Das, W. Dally, and P. Mattson. Compiling for Stream Processing. In PACT, 2006.
- (2006) PACT
- Das, A.¹ Dally, W.² Mattson, P.³

15
- 0031622953
- The implementation of the Cilk-5 multithreaded language
- M. Frigo, C. E. Leiserson, and K. H. Randall. The implementation of the Cilk-5 multithreaded language. In PLDI, 1998.
- (1998) PLDI
- Frigo, M.¹ Leiserson, C.E.² Randall, K.H.³

16
- 34547423880
- Exploiting coarse-grained task, data, and pipeline parallelism in stream programs
- M. Gordon, W. Thies, and S. Amarasinghe. Exploiting coarse-grained task, data, and pipeline parallelism in stream programs. In ASPLOS, 2006.
- (2006) ASPLOS
- Gordon, M.¹ Thies, W.² Amarasinghe, S.³

17
- 47849087164
- Architectural Support for the Stream Execution Model on General-Purpose Processors
- J. Gummaraju, M. Erez, J. Coburn, M. Rosenblum, and W. Dally. Architectural Support for the Stream Execution Model on General-Purpose Processors. In PACT, 2007.
- (2007) PACT
- Gummaraju, J.¹ Erez, M.² Coburn, J.³ Rosenblum, M.⁴ Dally, W.⁵

18
- 33749377408
- Stream Programming on General-Purpose Processors
- J. Gummaraju and M. Rosenblum. Stream Programming on General-Purpose Processors. In International Symposium on Microarchitecture, 2005.
- (2005) International Symposium on Microarchitecture
- Gummaraju, J.¹ Rosenblum, M.²

19
- 0027262011
- Transactional memory: Architectural support for lock-free data structures
- M. Herlihy and J. E. B. Moss. Transactional memory: Architectural support for lock-free data structures. In ISCA, 1993.
- (1993) ISCA
- Herlihy, M.¹ Moss, J.E.B.²

20
- 27644567646
- Power efficient processor architecture and the Cell processor
- Feb
- H. P. Hofstee. Power efficient processor architecture and the Cell processor. In HPCA, Feb 2005.
- (2005) HPCA
- Hofstee, H.P.¹

21
- 35348861326
- Comparing Memory Systems for Chip Multiprocessors
- J. Leverich et al. Comparing Memory Systems for Chip Multiprocessors. In ISCA, 2007.
- (2007) ISCA
- Leverich, J.¹

22
- 34548207355
- K. Fatahalian et al. Sequoia: Programming the Memory Hierarchy. In SC, Nov 2006.
- K. Fatahalian et al. Sequoia: Programming the Memory Hierarchy. In SC, Nov 2006.

23
- 33745017747
- Large eddy simulation of reacting turbulent flows in complex geometries
- May
- K. Mahesh et al. Large eddy simulation of reacting turbulent flows in complex geometries. ASME J. of Applied Mechanics, May 2006.
- (2006) ASME J. of Applied Mechanics
- Mahesh, K.¹

24
- 0001310691
- Titanium: A high-performance Java dialect
- Feb
- K. Yelick et al. Titanium: A high-performance Java dialect. In ACM Workshop on Java for High-Performance Network Computing, Feb 1998.
- (1998) ACM Workshop on Java for High-Performance Network Computing
- Yelick, K.¹

25
- 0036396915
- The Imagine stream processor
- Sep
- U. Kapasi, W. Dally, S. Rixner, J. Owens, and B. Khailany. The Imagine stream processor. In ICCD, Sep 2002.
- (2002) ICCD
- Kapasi, U.¹ Dally, W.² Rixner, S.³ Owens, J.⁴ Khailany, B.⁵

26
- 10444269287
- The Stream Virtual Machine
- F. Labonte, P. Mattson, I. Buck, C. Kozyrakis, and M. Horowitz. The Stream Virtual Machine. In PACT, 2004.
- (2004) PACT
- Labonte, F.¹ Mattson, P.² Buck, I.³ Kozyrakis, C.⁴ Horowitz, M.⁵

27
- 0036505033
- The Raw microprocessor: A computational fabric for software circuits and general-purpose programs
- March
- M. B. Taylor et al. The Raw microprocessor: a computational fabric for software circuits and general-purpose programs. IEEE Micro, 22:25-35, March 2002.
- (2002) IEEE Micro , vol.22 , pp. 25-35
- Taylor, M.B.¹

28
- 34548052234
- M. Erez and J. Ahn and J. Gummaraju and M. Rosenblum and W. Dally. Executing Irregular Scientific Applications on Stream Architectures. In ICS, 2007.
- M. Erez and J. Ahn and J. Gummaraju and M. Rosenblum and W. Dally. Executing Irregular Scientific Applications on Stream Architectures. In ICS, 2007.

29
- 0036959649
- A Stream Compiler for Communication-Exposed Architectures
- M. Gordon et al. A Stream Compiler for Communication-Exposed Architectures. In ASPLOS, 2002.
- (2002) ASPLOS
- Gordon, M.¹

30
- 56849108794
- A Portable Run-time Interface for Multi-level Memory Hierarchies
- M. Houston et al. A Portable Run-time Interface for Multi-level Memory Hierarchies. In PPoPP, 2008.
- (2008) PPoPP
- Houston, M.¹

31
- 35448961922
- Dryad: Distributed Data Parallel Programs from Sequential Building Blocks
- M. Isard et al. Dryad: Distributed Data Parallel Programs from Sequential Building Blocks. In Eurosys, 2007.
- (2007) Eurosys
- Isard, M.¹

32
- 42549135730
- Data-parallel programming on Cell BE and the GPU using the Rapidmind development platform
- M. D. McCool. Data-parallel programming on Cell BE and the GPU using the Rapidmind development platform. In GSPx Multicore Applications Conference, 2006.
- (2006) GSPx Multicore Applications Conference
- McCool, M.D.¹

33
- 31744441529
- X10: An object-oriented approach to non-uniform cluster computing
- P. Charles et al. X10: An object-oriented approach to non-uniform cluster computing. In OOPSLA, 2005.
- (2005) OOPSLA
- Charles, P.¹

34
- 42549110926
- Sequoia: Programming the Memory Hierarchy
- T. Knight et al. Sequoia: Programming the Memory Hierarchy. In PPoPP, 2007.
- (2007) PPoPP
- Knight, T.¹

35
- 47249165359
- Thread Clustering: A Share-aware Scheduling on SMP-CMP-SMT Multiprocessors
- D. Tam, R. Azimi, and M. Stumm. Thread Clustering: A Share-aware Scheduling on SMP-CMP-SMT Multiprocessors. In EuroSys, 2007.
- (2007) EuroSys
- Tam, D.¹ Azimi, R.² Stumm, M.³

36
- 33947595619
- ACCELERATOR: Using data-parallelism to program GPUs for general-purpose uses
- D. Tarditi, S. Puri, and J. Oglesby. ACCELERATOR: Using data-parallelism to program GPUs for general-purpose uses. In ASPLOS, 2006.
- (2006) ASPLOS
- Tarditi, D.¹ Puri, S.² Oglesby, J.³

37
- 0037521913
- StreamIt: A language for streaming applications
- W. Thies, M. Karczmarek, and S. Amarasinghe. StreamIt: A language for streaming applications. In ICCC, 2002.
- (2002) ICCC
- Thies, W.¹ Karczmarek, M.² Amarasinghe, S.³

38
- 21644438927
- SC
- R. Vuduc, J. W. Demmel, K. A. Yelick, S. Kamil, R. Nishtala, and B. Lee. Performance optimizations and bounds for sparse matrixvector multiply. SC, 2002.
- (2002) Performance optimizations and bounds for sparse matrixvector multiply
- Vuduc, R.¹ Demmel, J.W.² Yelick, K.A.³ Kamil, S.⁴ Nishtala, R.⁵ Lee, B.⁶

39
- 35348861182
- DRAMsim: A memory system simulator
- September
- D. Wang, B. Ganesh, N. T. K. B. A. Jaleel, and B. Jacob. DRAMsim: A memory system simulator. In SIGARCH Computer Architecture News, September 2005.
- (2005) SIGARCH Computer Architecture News
- Wang, D.¹ Ganesh, B.² Jaleel, N.T.K.B.A.³ Jacob, B.⁴

40
- 57649169968
- A Lightweight Streaming Layer for Multicore Execution
- Dec
- D. Zhang, Q. Li, R. Rabbah, and S. Amarasinghe. A Lightweight Streaming Layer for Multicore Execution. In Workshop on Design, Architecture, and Simulation of Chip Multiprocessors, Dec 2007.
- (2007) Workshop on Design, Architecture, and Simulation of Chip Multiprocessors
- Zhang, D.¹ Li, Q.² Rabbah, R.³ Amarasinghe, S.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.