메뉴 건너뛰기




Volumn 50, Issue 12, 2004, Pages 729-742

Exploitation of parallelism to nested loops with dependence cycles

Author keywords

Data dependence analysis; Dependence cycle; Loop optimization; Parallelism exploitation; Parallelizing compilers; Vectorizing compilers

Indexed keywords

ALGORITHMS; EMBEDDED SYSTEMS; OPTIMIZATION; PARALLEL PROCESSING SYSTEMS; PROGRAM COMPILERS; VECTORS;

EID: 9544241321     PISSN: 13837621     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.sysarc.2004.06.001     Document Type: Article
Times cited : (5)

References (30)
  • 2
    • 0004230378 scopus 로고    scopus 로고
    • Kluwer Academic Publishers, Norwell, MA
    • U. Banerjee, Dependence, Analysis, Kluwer Academic Publishers, Norwell, MA, 1997.
    • (1997) Dependence, Analysis
    • Banerjee, U.1
  • 6
    • 0031118682 scopus 로고    scopus 로고
    • Plugging anti and output dependence removal techniques into loop parallelization algorithm
    • P.-Y. Calland, A. Darte, Y. Robert, Plugging anti and output dependence removal techniques into loop parallelization algorithm, Parallel Computing 23 (1997) 251-266.
    • (1997) Parallel Computing , vol.23 , pp. 251-266
    • Calland, P.-Y.1    Darte, A.2    Robert, Y.3
  • 7
    • 0032293898 scopus 로고    scopus 로고
    • The extension of the interval test
    • W.-L. Chang, C.-P. Chu, The extension of the interval test, Parallel Computing 24 (14) (1998) 2101-2127.
    • (1998) Parallel Computing , vol.24 , Issue.14 , pp. 2101-2127
    • Chang, W.-L.1    Chu, C.-P.2
  • 8
    • 0035400667 scopus 로고    scopus 로고
    • The generalized direction vector I test
    • W.-L. Chang, C.-P. Chu, The generalized direction vector I test, Parallel Computing 27 (8) (2001) 1117-1144.
    • (2001) Parallel Computing , vol.27 , Issue.8 , pp. 1117-1144
    • Chang, W.-L.1    Chu, C.-P.2
  • 10
    • 0343825184 scopus 로고    scopus 로고
    • The infinity lambda test: A multidimensional version of Banerjee infinity test
    • W.-L. Chang, C.-P. Chu, The infinity lambda test: a multidimensional version of Banerjee infinity test, Parallel Computing 26 (10) (2000) 1275-1295.
    • (2000) Parallel Computing , vol.26 , Issue.10 , pp. 1275-1295
    • Chang, W.-L.1    Chu, C.-P.2
  • 11
    • 0010605401 scopus 로고
    • An analysis of recurrence relation in Fortran do-loops for vector processing
    • IEEE CS Press, Los Alamities, CA
    • C.-P. Chu, D.L. Carver, An analysis of recurrence relation in Fortran do-loops for vector processing, in: Proc. Fifth Parallel Processing Symp, IEEE CS Press, Los Alamities, CA, 1991, pp. 619-625.
    • (1991) Proc. Fifth Parallel Processing Symp , pp. 619-625
    • Chu, C.-P.1    Carver, D.L.2
  • 12
    • 9544247587 scopus 로고
    • Exploitation of parallelism in Fortran do-loops for vectoring processing
    • Department of Computer Science, LSU
    • C.-P. Chu, D.L. Carver, Exploitation of parallelism in Fortran do-loops for vectoring processing, Technical Report 91-004, Department of Computer Science, LSU, 1991.
    • (1991) Technical Report , vol.91 , Issue.4
    • Chu, C.-P.1    Carver, D.L.2
  • 13
    • 9544250236 scopus 로고
    • Digital Equipment Corporation, Part Number: AA-PV4QA-TE
    • MasPar Group, Parallel, Programming, language, Digital Equipment Corporation, Part number: AA-PV4QA-TE, 1993.
    • (1993) Parallel, Programming, Language
  • 15
    • 0026407190 scopus 로고
    • A comparative study of automatic vectorizing compilers
    • D. Levine, D. Callahan, J. Dongarra, A comparative study of automatic vectorizing compilers, Parallel Computing 17 (1991) 1223-1244.
    • (1991) Parallel Computing , vol.17 , pp. 1223-1244
    • Levine, D.1    Callahan, D.2    Dongarra, J.3
  • 17
    • 84904061321 scopus 로고
    • C.D. Polychronopoulos, Advanced loop optimizations for parallel computers
    • C.D. Polychronopoulos, Advanced loop optimizations for parallel computers, in: Proceedings of the 1987 International Conference on Supercomputing, 1987, pp. 255-277.
    • (1987) Proceedings of the 1987 International Conference on Supercomputing , pp. 255-277
  • 19
    • 0001790593 scopus 로고
    • Depth first search and linear graph algorithms
    • R. Tarjan, Depth first search and linear graph algorithms, SIAM Journal on Computing 1 (2) (1972) 146-160.
    • (1972) SIAM Journal on Computing , vol.1 , Issue.2 , pp. 146-160
    • Tarjan, R.1
  • 22
    • 0037104175 scopus 로고    scopus 로고
    • A precise dependence analysis for multi-dimensional arrays under specific dependence direction
    • W.-L. Chang, C.-P. Chu, J.-H. Wu, A precise dependence analysis for multi-dimensional arrays under specific dependence direction, The Journal of Systems and Software 63 (2) (2002) 99-107.
    • (2002) The Journal of Systems and Software , vol.63 , Issue.2 , pp. 99-107
    • Chang, W.-L.1    Chu, C.-P.2    Wu, J.-H.3
  • 23
    • 0035546689 scopus 로고    scopus 로고
    • A multi-dimensional version of the I test
    • W.-L. Chang, C.-P. Chu, J. Wu, A multi-dimensional version of the I test, Parallel Computing 27 (13) (2001) 1783-1799.
    • (2001) Parallel Computing , vol.27 , Issue.13 , pp. 1783-1799
    • Chang, W.-L.1    Chu, C.-P.2    Wu, J.3
  • 24
    • 14844343678 scopus 로고    scopus 로고
    • A polynomial-time dependence test for determininginteger-valued solutions in multi-dimensional arrays under variable bounds
    • in press
    • W.-L. Chang, C.-P. Chu, J.-H. Wu, A polynomial-time dependence test for determininginteger-valued solutions in multi-dimensional arrays under variable bounds, Journal of Supercomputing, in press.
    • Journal of Supercomputing
    • Chang, W.-L.1    Chu, C.-P.2    Wu, J.-H.3
  • 27
    • 9544220519 scopus 로고    scopus 로고
    • A proof method for the correctness of the interval test to be applied for determining whether there are integer-valued solutions for one-dimensional arrays with subscripts formed by induction variable
    • Kanazawa Bunka Hall, Kanazawa, Japan, 4-6 September
    • W.-L. Chang, B.-H. Chen, A proof method for the correctness of the interval test to be applied for determining whether there are integer-valued solutions for one-dimensional arrays with subscripts formed by induction variable, in: The Third International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT'02), Kanazawa Bunka Hall, Kanazawa, Japan, 4-6 September 2002, pp. 52-57.
    • (2002) The Third International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT'02) , pp. 52-57
    • Chang, W.-L.1    Chen, B.-H.2
  • 28
    • 0002921197 scopus 로고    scopus 로고
    • Efficient exploitation of parallelism on Pentium III and Pentium 4 processor-based systems
    • A. Bik, M. Girkar, P. Grey, X. Tian, Efficient exploitation of parallelism on Pentium III and Pentium 4 processor-based systems, Intel Technology Journal Q1 (March) (2001) 1-9.
    • (2001) Intel Technology Journal , vol.Q1 , Issue.MARCH , pp. 1-9
    • Bik, A.1    Girkar, M.2    Grey, P.3    Tian, X.4
  • 30
    • 0037104261 scopus 로고    scopus 로고
    • A simple and general approach to parallelize loops with arbitrary control flow and uniform data dependence distance
    • W.-L. Chang, C.-P. Chu, J.-H. Wu, A simple and general approach to parallelize loops with arbitrary control flow and uniform data dependence distance, The Journal of Systems and Software 63 (2) (2002) 91-98.
    • (2002) The Journal of Systems and Software , vol.63 , Issue.2 , pp. 91-98
    • Chang, W.-L.1    Chu, C.-P.2    Wu, J.-H.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.