SCOPUS 정보 검색 플랫폼

Computer Journal

Volumn 48, Issue 5, 2005, Pages 588-601

Practical compiler techniques on efficient multithreaded code generation for OpenMP programs

(4) Tian, Xinmin a Girkar, Milind a Bik, Aart a Saito, Hideki a

a INTEL CORPORATION (United States)

Author keywords

[No Author keywords available]

Indexed keywords

C (PROGRAMMING LANGUAGE); FORTRAN (PROGRAMMING LANGUAGE); MULTIPROCESSING PROGRAMS; MULTIPROCESSING SYSTEMS; OPEN SYSTEMS; OPTIMIZATION; PARALLEL PROCESSING SYSTEMS;

COMPILER OPTIMIZATION; MULTITHREADED CODE GENERATION; MUTITHREADING PROCESSORS; PARALLELIZING COMPILERS;

PROGRAM COMPILERS;

EID: 24144495509 PISSN: 00104620 EISSN: None Source Type: Journal
DOI: 10.1093/comjnl/bxh109 Document Type: Article

Times cited : (14)

References (26)

1
- 8344233355
- The energy efficiency of CMP vs. SMT for multimedia workloads
- Saint-Malo, France, June 26-July 1, ACM Press, New York, NY
- Sasanka, R., Adve, S. V., Chen, Y.-K. and Debes, E. (2004) The energy efficiency of CMP vs. SMT for multimedia workloads. In Proc. 18th Annual ACM Int. Conf. on Supercomputing (ICS '04). Saint-Malo, France, June 26-July 1, pp. 196-206. ACM Press, New York, NY.
- (2004) Proc. 18th Annual ACM Int. Conf. on Supercomputing (ICS '04) , pp. 196-206
- Sasanka, R.¹ Adve, S.V.² Chen, Y.-K.³ Debes, E.⁴

2
- 0031235242
- A single-chip multi-processor
- Hammand, L., Nayfeh, B. A. and Olukotun, K. (1997) A single-chip multi-processor. IEEE Computer, 30(9), 79-85.
- (1997) IEEE Computer , vol.30 , Issue.9 , pp. 79-85
- Hammand, L.¹ Nayfeh, B.A.² Olukotun, K.³

3
- 4644226743
- Simultaneous multi-threading implementation in POWER5 - IBM's next generation POWER microprocessor
- Stanford University, Palo, Alto, CA, August 17-19, IEEE Computer Society. Available at
- Kalla, R., Sinharoy, B. and Tendler, J. (2003) Simultaneous multi-threading implementation in POWER5 - IBM's next generation POWER microprocessor. In Proc. Hot Chips Conf 15, Stanford University, Palo, Alto, CA, August 17-19, IEEE Computer Society. Available at http://www.hotchips.org/archive/hc15/pdf/11.ibm.pdf.
- (2003) Proc. Hot Chips Conf. 15
- Kalla, R.¹ Sinharoy, B.² Tendler, J.³

4
- 0001087280
- Hyper-threading technology microarchitecture and architecture
- Available at
- Marr, D., Binns, F., Hill, D. L., Hinton, G., Koufaty, D., Miller, J. and Upton, M. (2002) Hyper-threading technology microarchitecture and architecture. Intel Technol. J., 6(Q1). Available at http://www.intel.com/technology/itj.
- (2002) Intel Technol. J. , vol.6 , Issue.Q1
- Marr, D.¹ Binns, F.² Hill, D.L.³ Hinton, G.⁴ Koufaty, D.⁵ Miller, J.⁶ Upton, M.⁷

5
- 0037660155
- OpenMP Architecture Review Board Available at
- OpenMP Architecture Review Board (2002) OpenMP C and C++ Application Program Interface, Version 2.0. Available at http://www.openmp.org.
- (2002) OpenMP C and C++ Application Program Interface, Version 2.0

6
- 0003554133
- OpenMP Architecture Review Board Available at
- OpenMP Architecture Review Board (2000) OpenMP Fortran Application Program Interface, Version 2.0. Available at hftp://www.openmp.org.
- (2000) OpenMP Fortran Application Program Interface, Version 2.0

7
- 84947257473
- Exploring the use of hyper-threading technology for multimedia applications with Intel OpenMP compiler
- Nice, France, April 22-26, electronic edition. IEEE Computer Society
- Tian, X., Chen, Y-K., Girkar, M., Ge, S., Lienhart, R. and Shah, S. (2003) Exploring the use of hyper-threading technology for multimedia applications with Intel OpenMP compiler. In Proc. IEEE 17th Int. Parallel and Distributed Processing Symp., Nice, France, April 22-26, p. 36, electronic edition. IEEE Computer Society.
- (2003) Proc. IEEE 17th Int. Parallel and Distributed Processing Symp. , pp. 36
- Tian, X.¹ Chen, Y.-K.² Girkar, M.³ Ge, S.⁴ Lienhart, R.⁵ Shah, S.⁶

8
- 1942448564
- Intel® OpenMP* C++/Fortran compiler for hyper-threading technology: Implementation and performance
- Available at
- Tian, X., Bik, A., Girkar, M., Grey, P., Saito, H. and Su, E. (2002) Intel® OpenMP* C++/Fortran compiler for hyper-threading technology: implementation and performance. Intel Technol. J., 6(Q1). Available at http://www.intel.com/technology/itj.
- (2002) Intel Technol. J. , vol.6 , Issue.Q1
- Tian, X.¹ Bik, A.² Girkar, M.³ Grey, P.⁴ Saito, H.⁵ Su, E.⁶

9
- 24144496596
- Compiler support and performance tuning of OpenMP programs on Sun Fire Servers
- Aachen, Germany, September 22-23. Available at
- Lee, M., Meadows, L., Gove, D., Paulraj, D., Goil, S., Whitney, B., Copty, N. and Songl, Y. (2003) Compiler support and performance tuning of OpenMP programs on Sun Fire Servers. In Proc. Fifth European Workshop on OpenMP, Aachen, Germany, September 22-23. Available at http://wwwrz.rwth-aachen.de/ewomp03/omptalks/Tuesday/Session6/t14p.pdf.
- (2003) Proc. Fifth European Workshop on OpenMP
- Lee, M.¹ Meadows, L.² Gove, D.³ Paulraj, D.⁴ Goil, S.⁵ Whitney, B.⁶ Copty, N.⁷ Songl, Y.⁸

10
- 35248821174
- A practical OpenMP compiler for system on chips
- Toronto, Canada, June 26-27, LNCS Springer-Verlag, Berlin
- Liu, F. and Chaudhary, V. (2003) A practical OpenMP compiler for system on chips. In Proc. Int. Workshop on OpenUP Applications and Tools (WOMPAT'03), Toronto, Canada, June 26-27, LNCS 2716, 54-68, Springer-Verlag, Berlin.
- (2003) Proc. Int. Workshop on OpenUP Applications and Tools (WOMPAT'03) , vol.2716 , pp. 54-68
- Liu, F.¹ Chaudhary, V.²

11
- 35248836538
- A C++ infrastructure for automatic introduction and translation of OpenMP directives
- (WOMPAT'03), Toronto, Canada, June 26-27, LNCS Springer-Verlag, Berlin
- Quinlan, D., Schordan, M., Yi, Q. and de Supinski, B. R. (2003) A C++ infrastructure for automatic introduction and translation of OpenMP directives. In Proc. Int Workshop on OpenMP Applications and Tools, (WOMPAT'03), Toronto, Canada, June 26-27, LNCS 2716, 13-25. Springer-Verlag, Berlin.
- (2003) Proc. Int Workshop on OpenMP Applications and Tools , vol.2716 , pp. 13-25
- Quinlan, D.¹ Schordan, M.² Yi, Q.³ de Supinski, B.R.⁴

12
- 0037870924
- OdinMP/CCp - A portable implementation of OpenMP for C
- Lund University, Lund, Sweden, September 30-October 1. Available at
- Brunschen, C. and Brorsson, M. (1999) OdinMP/CCp - a portable implementation of OpenMP for C. In Proc. First European Workshop on GpenMP, Lund University, Lund, Sweden, September 30-October 1. Available at http://www.it.lth.se/ewomp99/papers/brunschen.pdf
- (1999) Proc. First European Workshop on GpenMP
- Brunschen, C.¹ Brorsson, M.²

13
- 12444316748
- Automatic parallelization for symmetric shared-memory multi-processors
- Toronto, ON, November 12-14, IBM. Available at
- Chow, J.-H., Lyon, L. and Sarkar, V. (1996) Automatic parallelization for symmetric shared-memory multi-processors. In Proc. CASCON'96, Toronto, ON, November 12-14, pp. 76-89. IBM. Available at http://www.cs. ubc.ca/local/reading/proceedings/cascon96/htm/english/frm/intro.htm.
- (1996) Proc. CASCON'96 , pp. 76-89
- Chow, J.-H.¹ Lyon, L.² Sarkar, V.³

14
- 84900342836
- SPEComp: A new benchmark suite for measuring parallel computer performance
- West Lafayette, IN, July 30-31, LNCS Springer-Verlag
- Aslot, V, Domeika, M., Eigenmann, R., Gaertner, G., Jones, W. B. and Parady, B. (2001) SPEComp: a new benchmark suite for measuring parallel computer performance. In Proc. Int. Workshop on OpenMP Applications and Tools (WOMPAT'01), West Lafayette, IN, July 30-31, LNCS 2104, 1-10. Springer-Verlag.
- (2001) Proc. Int. Workshop on OpenMP Applications and Tools (WOMPAT'01) , vol.2104 , pp. 1-10
- Aslot, V.¹ Domeika, M.² Eigenmann, R.³ Gaertner, G.⁴ Jones, W.B.⁵ Parady, B.⁶

15
- 0348126386
- SPEC OMP2001 benchmark on the Fujitsu PRIMEPOWER system
- Barcelona, Spain, September 8-9
- Iwashita, H., Yamanaka, E., Sueyasu, N., Waveren, M. and Miura, K, (2001) SPEC OMP2001 benchmark on the Fujitsu PRIMEPOWER system. In Proc. Third European Workshop on OpenMP (EWOMP'01). Barcelona, Spain, September 8-9.
- (2001) Proc. Third European Workshop on OpenMP (EWOMP'01)
- Iwashita, H.¹ Yamanaka, E.² Sueyasu, N.³ Waveren, M.⁴ Miura, K.⁵

16
- 24144474794
- Intel Press Hillsboro, OR. Available at
- Bik, A. J. C. (2004) The Software Vectorization Handbook. Intel Press Hillsboro, OR. Available at http://www.intel.com/intelpress.
- (2004) The Software Vectorization Handbook
- Bik, A.J.C.¹

17
- 0344908850
- Automatic intra-register vectorization for the Intel® architecture
- Bik, A., Girkar, M., Grey, P. and Tian, X. (2002) Automatic intra-register vectorization for the Intel® architecture. Int. J. Parallel Prog., 30(2), 65-98.
- (2002) Int. J. Parallel Prog. , vol.30 , Issue.2 , pp. 65-98
- Bik, A.¹ Girkar, M.² Grey, P.³ Tian, X.⁴

18
- 18844390479
- On the importance of points-to analysis and other memory disambiguation methods for C programs
- Snowbird, UT, June 20-22, ACM SIGPLAN Notices 47-58
- Ghiya, R., Lavery, D. and Sehr, D. (2001) On the importance of points-to analysis and other memory disambiguation methods for C programs. In Proc. 2001 ACM SIGPLAN Conf. on Programming Language Design and Implementation (PLDI), Snowbird, UT, June 20-22, pp. 47-58. ACM SIGPLAN Notices, 36, 47-58.
- (2001) Proc. 2001 ACM SIGPLAN Conf. on Programming Language Design and Implementation (PLDI) , vol.36 , pp. 47-58
- Ghiya, R.¹ Lavery, D.² Sehr, D.³

19
- 0003927035
- Addison-Wesley Publishing Company, Redwood City, CA
- Wolfe, M. (1996) High Performance Compilers for Parallel Computers. Addison-Wesley Publishing Company, Redwood City, CA.
- (1996) High Performance Compilers for Parallel Computers
- Wolfe, M.¹

20
- 0023535689
- Guided self-scheduling: A practical scheduling scheme for parallel supercomputers
- Polychronopoulos, C. D. and Kuck, D. J. (1987) Guided self-scheduling: a practical scheduling scheme for parallel supercomputers. IEEE Trans. Comput., 36(12), 1425-1439.
- (1987) IEEE Trans. Comput. , vol.36 , Issue.12 , pp. 1425-1439
- Polychronopoulos, C.D.¹ Kuck, D.J.²

21
- 12444339820
- Towards efficient multi-level threading of H.264 encoder on Intel hyper-threading architectures
- Santa Fe, NM, April 26-30, electronic edition. IEEE Computer Society
- Chen, Y.-K., Tian, X., Ge, S. and Girkar, M. (2004) Towards efficient multi-level threading of H.264 encoder on Intel hyper-threading architectures. In Proc. 18th Int. Parallel and Distributed Processing Symp. (IPDPS'04), Santa Fe, NM, April 26-30, pp. 63b, electronic edition. IEEE Computer Society.
- (2004) Proc. 18th Int. Parallel and Distributed Processing Symp. (IPDPS'04)
- Chen, Y.-K.¹ Tian, X.² Ge, S.³ Girkar, M.⁴

22
- 24144449684
- A portable and efficient thread library for OpenMP
- KTH Royal Institute of Technology, Stockholm, Sweden, October 18-22, John Wiley Available at
- Karlsson, S. (2004) A portable and efficient thread library for OpenMP. In Proc. 6th European Workshop on OpenMP, KTH Royal Institute of Technology, Stockholm, Sweden, October 18-22, pp. 43-47. John Wiley Available at http://www.imit.kth.se/ewomp2004/proceedings.pdf
- (2004) Proc. 6th European Workshop on OpenMP , pp. 43-47
- Karlsson, S.¹

23
- 0002663333
- Measuring synchronization and scheduling overheads in OpenMP
- Lund University, Lund, Sweden, September 30-October 1. Available at
- Bull, J. M. (1999) Measuring synchronization and scheduling overheads in OpenMP. In Proc. first European Workshop on OpenMP, Lund University, Lund, Sweden, September 30-October 1. Available at http://www.it.lth.se/ewomp99/papers/bull.pdf.
- (1999) Proc. First European Workshop on OpenMP
- Bull, J.M.¹

24
- 0003989360
- Morgan Kaufmann Publisher, Inc., San Francisco, CA
- Pacheco, S. (1997) Parallel Programming with MPI. Morgan Kaufmann Publisher, Inc., San Francisco, CA.
- (1997) Parallel Programming With MPI
- Pacheco, S.¹

25
- 0038379316
- Performance comparison of MPI and three OpenMP programming styles on shared memory multiprocessors
- San Diego, CA, June 7-9, ACM Press, New York, NY
- Cappello, F. and Etiemble, D. (2003) Performance comparison of MPI and three OpenMP programming styles on shared memory multiprocessors. In Proc. l5th Annual ACM Symp on Parallel Algorithms and Architectures, San Diego, CA, June 7-9, pp. 118-127. ACM Press, New York, NY
- (2003) Proc. L5th Annual ACM Symp. on Parallel Algorithms and Architectures , pp. 118-127
- Cappello, F.¹ Etiemble, D.²

26
- 84974695575
- OmniRPC: A grid RPC facility for cluster and global computing in OpenMP
- LNCS Springer-Verlag, Berlin
- Sato, M., Hirano, M., Tanaka, Y. and Sekiguchi, S. (2001) OmniRPC: a grid RPC facility for cluster and global computing in OpenMP. In Proc. Int. Workshop on OpenMP Applications and Tools (WOMPAT'01), LNCS 2104, 130-136. Springer-Verlag, Berlin.
- (2001) Proc. Int. Workshop on OpenMP Applications and Tools (WOMPAT'01) , vol.2104 , pp. 130-136
- Sato, M.¹ Hirano, M.² Tanaka, Y.³ Sekiguchi, S.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.