메뉴 건너뛰기




Volumn 19, Issue 6, 1997, Pages 853-898

Parallelizing Nonnumerical Code with Selective Scheduling and Software Pipelining

Author keywords

Algorithms; Experimentation; Global instruction scheduling; Instruction level parallelism; Languages; Software pipelining; Speculative code motion; Superscalar; VLIW

Indexed keywords

ALGORITHMS; CODES (SYMBOLS); COMPUTER ARCHITECTURE; COMPUTER OPERATING PROCEDURES; COMPUTER PROGRAMMING LANGUAGES; PIPELINE PROCESSING SYSTEMS; PROGRAM COMPILERS; PROGRAM PROCESSORS;

EID: 0031274169     PISSN: 01640925     EISSN: None     Source Type: Journal    
DOI: 10.1145/267959.269966     Document Type: Article
Times cited : (42)

References (43)
  • 2
    • 0007941219 scopus 로고
    • A development environment for horizontal microcode
    • AIKEN, A. AND NICOLAU, A. 1988. A development environment for horizontal microcode. IEEE Trans. Softw. Eng. 14, 5 (May), 584-594.
    • (1988) IEEE Trans. Softw. Eng. , vol.14 , Issue.5 MAY , pp. 584-594
    • Aiken, A.1    Nicolau, A.2
  • 3
    • 0026242244 scopus 로고
    • Intel i860 processor
    • ATKINS, M. 1991. Intel i860 processor. IEEE Micro 11, 24-28.
    • (1991) IEEE Micro , vol.11 , pp. 24-28
    • Atkins, M.1
  • 6
    • 0026243790 scopus 로고
    • Efficiently computing static single assignment form and the control dependence graph
    • CYTRON, R., FERRANTE, J., ROSEN, B., WEGMAN, M., AND ZADECK, F. 1991. Efficiently computing static single assignment form and the control dependence graph. ACM Trans. Program. Lang. Syst. 13, 4 (Jan.), 451-490.
    • (1991) ACM Trans. Program. Lang. Syst. , vol.13 , Issue.4 JAN , pp. 451-490
    • Cytron, R.1    Ferrante, J.2    Rosen, B.3    Wegman, M.4    Zadeck, F.5
  • 7
    • 0027590187 scopus 로고
    • Compiling for cydra 5
    • DEHNERT, J. AND TOWLE, R. 1993. Compiling for cydra 5. J. Supercomput. 7, 1/2, 181-228.
    • (1993) J. Supercomput. , vol.7 , Issue.1-2 , pp. 181-228
    • Dehnert, J.1    Towle, R.2
  • 11
    • 0002106131 scopus 로고
    • A new compilation technique for parallelizing loops with unpredictable branches on a VLIW architecture
    • MIT Press, Cambridge, Mass.
    • EBCIOǦLU, K. AND NAKATANI, T. 1989. A new compilation technique for parallelizing loops with unpredictable branches on a VLIW architecture. In Languages and Compilers for Parallel Computing. MIT Press, Cambridge, Mass., 213-229.
    • (1989) Languages and Compilers for Parallel Computing , pp. 213-229
    • Ebcioǧlu, K.1    Nakatani, T.2
  • 16
    • 0028461905 scopus 로고
    • Avoidance and suppression of compension code in a trace scheduling compiler
    • FREUDENBERGER, S. M., GROSS, T. R., AND LOWNEY, P. 1994. Avoidance and suppression of compension code in a trace scheduling compiler. ACM Trans. Program. Lang. Syst. 16, 4, 1156-1214.
    • (1994) ACM Trans. Program. Lang. Syst. , vol.16 , Issue.4 , pp. 1156-1214
    • Freudenberger, S.M.1    Gross, T.R.2    Lowney, P.3
  • 18
    • 0025413768 scopus 로고
    • Region scheduling: An approach for detecting and redistributing parallelism
    • GUPTA, R. AND SOFFA, M. 1990. Region scheduling: An approach for detecting and redistributing parallelism. IEEE Trans. Softw. Eng. 16, 4 (Apr.), 421-431.
    • (1990) IEEE Trans. Softw. Eng. , vol.16 , Issue.4 APR , pp. 421-431
    • Gupta, R.1    Soffa, M.2
  • 20
    • 0347095853 scopus 로고
    • A special issue on IBM RISC System/600
    • IBM. 1990. A special issue on IBM RISC System/600. IBM J. Res. Devel. 34, 1 (Jan.).
    • (1990) IBM J. Res. Devel. , vol.34 , Issue.1 JAN
  • 25
    • 0031237555 scopus 로고    scopus 로고
    • Increasing cache bandwidth using multiport caches for exploiting ILP in non-numerical codes
    • MOON, S.-M. 1997. Increasing cache bandwidth using multiport caches for exploiting ILP in non-numerical codes. IEEE Proceedings - Computers and Digital Techniques 144, 5 (Sept.), 295-303.
    • (1997) IEEE Proceedings - Computers and Digital Techniques , vol.144 , Issue.5 SEPT , pp. 295-303
    • Moon, S.-M.1
  • 26
    • 0029352611 scopus 로고
    • Generalized multiway branch unit for VLIW microprocessors
    • MOON, S.-M. AND CARSON, S. 1995. Generalized multiway branch unit for VLIW microprocessors. IEEE Trans. Parall. Distrib. Syst. 6, 8 (Aug.), 850-862.
    • (1995) IEEE Trans. Parall. Distrib. Syst. , vol.6 , Issue.8 AUG , pp. 850-862
    • Moon, S.-M.1    Carson, S.2
  • 28
    • 0030703876 scopus 로고    scopus 로고
    • Performance analysis of tree VLIW architecture for exploiting branch ILP in non-numerical code
    • ACM, New York
    • MOON, S.-M. AND EBCIOǦLU, K. 1997. Performance analysis of tree VLIW architecture for exploiting branch ILP in non-numerical code. In Proceedings of the 1997 International Conference on Supercomputing. ACM, New York, 301-308.
    • (1997) Proceedings of the 1997 International Conference on Supercomputing , pp. 301-308
    • Moon, S.-M.1    Ebcioǧlu, K.2
  • 31
    • 0027659775 scopus 로고
    • Making compaction based parallelization affordable
    • NAKATANI, T. AND AND EBCIOǦLU, K. 1993. Making compaction based parallelization affordable. IEEE Trans. Parall. Distrib. Syst. 4, 9 (Sept.), 1014-1529.
    • (1993) IEEE Trans. Parall. Distrib. Syst. , vol.4 , Issue.9 SEPT , pp. 1014-1529
    • Nakatani, T.1    Ebcioǧlu, K.2
  • 32
    • 0022874874 scopus 로고
    • Advanced compiler optimizations for supercomputers
    • PADUA, D. AND WOLFE, M. 1986. Advanced compiler optimizations for supercomputers. Commun. ACM 29, 12 (Dec.), 1184-1201.
    • (1986) Commun. ACM , vol.29 , Issue.12 DEC , pp. 1184-1201
    • Padua, D.1    Wolfe, M.2
  • 34
    • 0021817378 scopus 로고
    • Reduced instruction set computers
    • PATTERSON, D. 1985. Reduced instruction set computers. Commun. ACM 28, 1 (Jan.), 8-21.
    • (1985) Commun. ACM , vol.28 , Issue.1 JAN , pp. 8-21
    • Patterson, D.1
  • 35
    • 0024480706 scopus 로고
    • The Cydra 5 departmental supercomputer: Design philosophies, decisions, and trade-offs
    • RAU, B. 1989. The Cydra 5 departmental supercomputer: Design philosophies, decisions, and trade-offs. IEEE Comput. 22, 1 (Jan.), 12-34.
    • (1989) IEEE Comput. , vol.22 , Issue.1 JAN , pp. 12-34
    • Rau, B.1
  • 36
    • 0002017307 scopus 로고
    • Instruction-level parallel processing: History, overview, and perspective
    • RAU, B. AND FISHER, J. 1993. Instruction-level parallel processing: History, overview, and perspective. J. Supercomput. 7, 1/2, 9-50.
    • (1993) J. Supercomput. , vol.7 , Issue.1-2 , pp. 9-50
    • Rau, B.1    Fisher, J.2
  • 37
    • 0003015894 scopus 로고
    • Some scheduling techniques and an easily schedulable horizontal architecture for high performance scientific computing
    • IEEE, New York
    • RAU, B. AND GLAESER, C. 1981. Some scheduling techniques and an easily schedulable horizontal architecture for high performance scientific computing. In Proceedings of the 14th Annual Workshop on Microprogramming. IEEE, New York, 183-198.
    • (1981) Proceedings of the 14th Annual Workshop on Microprogramming , pp. 183-198
    • Rau, B.1    Glaeser, C.2
  • 39
    • 0002228438 scopus 로고
    • An architectural framework for supporting heterogeneous instruction set architectures
    • SILBERMAN, G. AND EBCIOǦLU, K. 1993. An architectural framework for supporting heterogeneous instruction set architectures. IEEE Comput. 26, 6 (June), 39-56.
    • (1993) IEEE Comput. , vol.26 , Issue.6 JUNE , pp. 39-56
    • Silberman, G.1    Ebcioǧlu, K.2
  • 40
    • 0002790769 scopus 로고
    • Alpha AXP architecture
    • SITES, R. 1993. Alpha AXP architecture. Commun. ACM 36, 2 (Feb.), 33-44.
    • (1993) Commun. ACM , vol.36 , Issue.2 FEB , pp. 33-44
    • Sites, R.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.