메뉴 건너뛰기




Volumn 48, Issue 5, 2005, Pages 588-601

Practical compiler techniques on efficient multithreaded code generation for OpenMP programs

Author keywords

[No Author keywords available]

Indexed keywords

C (PROGRAMMING LANGUAGE); FORTRAN (PROGRAMMING LANGUAGE); MULTIPROCESSING PROGRAMS; MULTIPROCESSING SYSTEMS; OPEN SYSTEMS; OPTIMIZATION; PARALLEL PROCESSING SYSTEMS;

EID: 24144495509     PISSN: 00104620     EISSN: None     Source Type: Journal    
DOI: 10.1093/comjnl/bxh109     Document Type: Article
Times cited : (14)

References (26)
  • 1
    • 8344233355 scopus 로고    scopus 로고
    • The energy efficiency of CMP vs. SMT for multimedia workloads
    • Saint-Malo, France, June 26-July 1, ACM Press, New York, NY
    • Sasanka, R., Adve, S. V., Chen, Y.-K. and Debes, E. (2004) The energy efficiency of CMP vs. SMT for multimedia workloads. In Proc. 18th Annual ACM Int. Conf. on Supercomputing (ICS '04). Saint-Malo, France, June 26-July 1, pp. 196-206. ACM Press, New York, NY.
    • (2004) Proc. 18th Annual ACM Int. Conf. on Supercomputing (ICS '04) , pp. 196-206
    • Sasanka, R.1    Adve, S.V.2    Chen, Y.-K.3    Debes, E.4
  • 3
    • 4644226743 scopus 로고    scopus 로고
    • Simultaneous multi-threading implementation in POWER5 - IBM's next generation POWER microprocessor
    • Stanford University, Palo, Alto, CA, August 17-19, IEEE Computer Society. Available at
    • Kalla, R., Sinharoy, B. and Tendler, J. (2003) Simultaneous multi-threading implementation in POWER5 - IBM's next generation POWER microprocessor. In Proc. Hot Chips Conf 15, Stanford University, Palo, Alto, CA, August 17-19, IEEE Computer Society. Available at http://www.hotchips.org/archive/hc15/pdf/11.ibm.pdf.
    • (2003) Proc. Hot Chips Conf. 15
    • Kalla, R.1    Sinharoy, B.2    Tendler, J.3
  • 4
    • 0001087280 scopus 로고    scopus 로고
    • Hyper-threading technology microarchitecture and architecture
    • Available at
    • Marr, D., Binns, F., Hill, D. L., Hinton, G., Koufaty, D., Miller, J. and Upton, M. (2002) Hyper-threading technology microarchitecture and architecture. Intel Technol. J., 6(Q1). Available at http://www.intel.com/technology/itj.
    • (2002) Intel Technol. J. , vol.6 , Issue.Q1
    • Marr, D.1    Binns, F.2    Hill, D.L.3    Hinton, G.4    Koufaty, D.5    Miller, J.6    Upton, M.7
  • 7
    • 84947257473 scopus 로고    scopus 로고
    • Exploring the use of hyper-threading technology for multimedia applications with Intel OpenMP compiler
    • Nice, France, April 22-26, electronic edition. IEEE Computer Society
    • Tian, X., Chen, Y-K., Girkar, M., Ge, S., Lienhart, R. and Shah, S. (2003) Exploring the use of hyper-threading technology for multimedia applications with Intel OpenMP compiler. In Proc. IEEE 17th Int. Parallel and Distributed Processing Symp., Nice, France, April 22-26, p. 36, electronic edition. IEEE Computer Society.
    • (2003) Proc. IEEE 17th Int. Parallel and Distributed Processing Symp. , pp. 36
    • Tian, X.1    Chen, Y.-K.2    Girkar, M.3    Ge, S.4    Lienhart, R.5    Shah, S.6
  • 8
    • 1942448564 scopus 로고    scopus 로고
    • Intel® OpenMP* C++/Fortran compiler for hyper-threading technology: Implementation and performance
    • Available at
    • Tian, X., Bik, A., Girkar, M., Grey, P., Saito, H. and Su, E. (2002) Intel® OpenMP* C++/Fortran compiler for hyper-threading technology: implementation and performance. Intel Technol. J., 6(Q1). Available at http://www.intel.com/technology/itj.
    • (2002) Intel Technol. J. , vol.6 , Issue.Q1
    • Tian, X.1    Bik, A.2    Girkar, M.3    Grey, P.4    Saito, H.5    Su, E.6
  • 9
    • 24144496596 scopus 로고    scopus 로고
    • Compiler support and performance tuning of OpenMP programs on Sun Fire Servers
    • Aachen, Germany, September 22-23. Available at
    • Lee, M., Meadows, L., Gove, D., Paulraj, D., Goil, S., Whitney, B., Copty, N. and Songl, Y. (2003) Compiler support and performance tuning of OpenMP programs on Sun Fire Servers. In Proc. Fifth European Workshop on OpenMP, Aachen, Germany, September 22-23. Available at http://wwwrz.rwth-aachen.de/ewomp03/omptalks/Tuesday/Session6/t14p.pdf.
    • (2003) Proc. Fifth European Workshop on OpenMP
    • Lee, M.1    Meadows, L.2    Gove, D.3    Paulraj, D.4    Goil, S.5    Whitney, B.6    Copty, N.7    Songl, Y.8
  • 10
    • 35248821174 scopus 로고    scopus 로고
    • A practical OpenMP compiler for system on chips
    • Toronto, Canada, June 26-27, LNCS Springer-Verlag, Berlin
    • Liu, F. and Chaudhary, V. (2003) A practical OpenMP compiler for system on chips. In Proc. Int. Workshop on OpenUP Applications and Tools (WOMPAT'03), Toronto, Canada, June 26-27, LNCS 2716, 54-68, Springer-Verlag, Berlin.
    • (2003) Proc. Int. Workshop on OpenUP Applications and Tools (WOMPAT'03) , vol.2716 , pp. 54-68
    • Liu, F.1    Chaudhary, V.2
  • 11
    • 35248836538 scopus 로고    scopus 로고
    • A C++ infrastructure for automatic introduction and translation of OpenMP directives
    • (WOMPAT'03), Toronto, Canada, June 26-27, LNCS Springer-Verlag, Berlin
    • Quinlan, D., Schordan, M., Yi, Q. and de Supinski, B. R. (2003) A C++ infrastructure for automatic introduction and translation of OpenMP directives. In Proc. Int Workshop on OpenMP Applications and Tools, (WOMPAT'03), Toronto, Canada, June 26-27, LNCS 2716, 13-25. Springer-Verlag, Berlin.
    • (2003) Proc. Int Workshop on OpenMP Applications and Tools , vol.2716 , pp. 13-25
    • Quinlan, D.1    Schordan, M.2    Yi, Q.3    de Supinski, B.R.4
  • 12
    • 0037870924 scopus 로고    scopus 로고
    • OdinMP/CCp - A portable implementation of OpenMP for C
    • Lund University, Lund, Sweden, September 30-October 1. Available at
    • Brunschen, C. and Brorsson, M. (1999) OdinMP/CCp - a portable implementation of OpenMP for C. In Proc. First European Workshop on GpenMP, Lund University, Lund, Sweden, September 30-October 1. Available at http://www.it.lth.se/ewomp99/papers/brunschen.pdf
    • (1999) Proc. First European Workshop on GpenMP
    • Brunschen, C.1    Brorsson, M.2
  • 13
    • 12444316748 scopus 로고    scopus 로고
    • Automatic parallelization for symmetric shared-memory multi-processors
    • Toronto, ON, November 12-14, IBM. Available at
    • Chow, J.-H., Lyon, L. and Sarkar, V. (1996) Automatic parallelization for symmetric shared-memory multi-processors. In Proc. CASCON'96, Toronto, ON, November 12-14, pp. 76-89. IBM. Available at http://www.cs. ubc.ca/local/reading/proceedings/cascon96/htm/english/frm/intro.htm.
    • (1996) Proc. CASCON'96 , pp. 76-89
    • Chow, J.-H.1    Lyon, L.2    Sarkar, V.3
  • 16
    • 24144474794 scopus 로고    scopus 로고
    • Intel Press Hillsboro, OR. Available at
    • Bik, A. J. C. (2004) The Software Vectorization Handbook. Intel Press Hillsboro, OR. Available at http://www.intel.com/intelpress.
    • (2004) The Software Vectorization Handbook
    • Bik, A.J.C.1
  • 17
    • 0344908850 scopus 로고    scopus 로고
    • Automatic intra-register vectorization for the Intel® architecture
    • Bik, A., Girkar, M., Grey, P. and Tian, X. (2002) Automatic intra-register vectorization for the Intel® architecture. Int. J. Parallel Prog., 30(2), 65-98.
    • (2002) Int. J. Parallel Prog. , vol.30 , Issue.2 , pp. 65-98
    • Bik, A.1    Girkar, M.2    Grey, P.3    Tian, X.4
  • 18
    • 18844390479 scopus 로고    scopus 로고
    • On the importance of points-to analysis and other memory disambiguation methods for C programs
    • Snowbird, UT, June 20-22, ACM SIGPLAN Notices 47-58
    • Ghiya, R., Lavery, D. and Sehr, D. (2001) On the importance of points-to analysis and other memory disambiguation methods for C programs. In Proc. 2001 ACM SIGPLAN Conf. on Programming Language Design and Implementation (PLDI), Snowbird, UT, June 20-22, pp. 47-58. ACM SIGPLAN Notices, 36, 47-58.
    • (2001) Proc. 2001 ACM SIGPLAN Conf. on Programming Language Design and Implementation (PLDI) , vol.36 , pp. 47-58
    • Ghiya, R.1    Lavery, D.2    Sehr, D.3
  • 20
    • 0023535689 scopus 로고
    • Guided self-scheduling: A practical scheduling scheme for parallel supercomputers
    • Polychronopoulos, C. D. and Kuck, D. J. (1987) Guided self-scheduling: a practical scheduling scheme for parallel supercomputers. IEEE Trans. Comput., 36(12), 1425-1439.
    • (1987) IEEE Trans. Comput. , vol.36 , Issue.12 , pp. 1425-1439
    • Polychronopoulos, C.D.1    Kuck, D.J.2
  • 21
    • 12444339820 scopus 로고    scopus 로고
    • Towards efficient multi-level threading of H.264 encoder on Intel hyper-threading architectures
    • Santa Fe, NM, April 26-30, electronic edition. IEEE Computer Society
    • Chen, Y.-K., Tian, X., Ge, S. and Girkar, M. (2004) Towards efficient multi-level threading of H.264 encoder on Intel hyper-threading architectures. In Proc. 18th Int. Parallel and Distributed Processing Symp. (IPDPS'04), Santa Fe, NM, April 26-30, pp. 63b, electronic edition. IEEE Computer Society.
    • (2004) Proc. 18th Int. Parallel and Distributed Processing Symp. (IPDPS'04)
    • Chen, Y.-K.1    Tian, X.2    Ge, S.3    Girkar, M.4
  • 22
    • 24144449684 scopus 로고    scopus 로고
    • A portable and efficient thread library for OpenMP
    • KTH Royal Institute of Technology, Stockholm, Sweden, October 18-22, John Wiley Available at
    • Karlsson, S. (2004) A portable and efficient thread library for OpenMP. In Proc. 6th European Workshop on OpenMP, KTH Royal Institute of Technology, Stockholm, Sweden, October 18-22, pp. 43-47. John Wiley Available at http://www.imit.kth.se/ewomp2004/proceedings.pdf
    • (2004) Proc. 6th European Workshop on OpenMP , pp. 43-47
    • Karlsson, S.1
  • 23
    • 0002663333 scopus 로고    scopus 로고
    • Measuring synchronization and scheduling overheads in OpenMP
    • Lund University, Lund, Sweden, September 30-October 1. Available at
    • Bull, J. M. (1999) Measuring synchronization and scheduling overheads in OpenMP. In Proc. first European Workshop on OpenMP, Lund University, Lund, Sweden, September 30-October 1. Available at http://www.it.lth.se/ewomp99/papers/bull.pdf.
    • (1999) Proc. First European Workshop on OpenMP
    • Bull, J.M.1
  • 24
    • 0003989360 scopus 로고    scopus 로고
    • Morgan Kaufmann Publisher, Inc., San Francisco, CA
    • Pacheco, S. (1997) Parallel Programming with MPI. Morgan Kaufmann Publisher, Inc., San Francisco, CA.
    • (1997) Parallel Programming With MPI
    • Pacheco, S.1
  • 25
    • 0038379316 scopus 로고    scopus 로고
    • Performance comparison of MPI and three OpenMP programming styles on shared memory multiprocessors
    • San Diego, CA, June 7-9, ACM Press, New York, NY
    • Cappello, F. and Etiemble, D. (2003) Performance comparison of MPI and three OpenMP programming styles on shared memory multiprocessors. In Proc. l5th Annual ACM Symp on Parallel Algorithms and Architectures, San Diego, CA, June 7-9, pp. 118-127. ACM Press, New York, NY
    • (2003) Proc. L5th Annual ACM Symp. on Parallel Algorithms and Architectures , pp. 118-127
    • Cappello, F.1    Etiemble, D.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.