메뉴 건너뛰기




Volumn 16, Issue 1, 1998, Pages 55-92

Tolerating Latency in Multiprocessors through Compiler-Inserted Prefetching

Author keywords

B.3.2 Memory Structures : Design Styles cache memories; Compiler optimization; D.3.4 Programming Languages : Processors compilers; Design; Experimentation; Optimization; Performance; Prefetching

Indexed keywords

ALGORITHMS; BUFFER STORAGE; COMPUTER ARCHITECTURE; COMPUTER SOFTWARE; FAULT TOLERANT COMPUTER SYSTEMS; MULTIPROCESSING SYSTEMS; OPTIMIZATION; PROGRAM PROCESSORS;

EID: 0031988272     PISSN: 07342071     EISSN: None     Source Type: Journal    
DOI: 10.1145/273011.273021     Document Type: Article
Times cited : (52)

References (43)
  • 5
    • 0026267802 scopus 로고
    • An effective on-chip preloading scheme to reduce data access penalty
    • BAER, J.-L. AND CHEN, T.-F. 1991. An effective on-chip preloading scheme to reduce data access penalty. In Proceedings of Supercomputing '91.
    • (1991) Proceedings of Supercomputing '91
    • Baer, J.-L.1    Chen, T.-F.2
  • 9
    • 0004269807 scopus 로고
    • CONVEX COMPUTER Convex Computer Corp.
    • CONVEX COMPUTER. 1994. Convex Exemplar Architecture. Convex Computer Corp.
    • (1994) Convex Exemplar Architecture
  • 10
    • 0027574855 scopus 로고
    • A methodology for procedure cloning
    • April
    • COOPER, K., HALL, M., AND KENNEDY, K. 1993. A methodology for procedure cloning. Comput. Lang. 19, 2 (April).
    • (1993) Comput. Lang. , vol.19 , Issue.2
    • Cooper, K.1    Hall, M.2    Kennedy, K.3
  • 12
    • 0023963509 scopus 로고
    • Synchronization, coherence, and event ordering in multiprocessors
    • Feb.
    • DUBOIS, M., SCHEURICH, C., AND BRIGGS, F. A. 1988. Synchronization, coherence, and event ordering in multiprocessors. Computer 21, 2 (Feb.), 9-21.
    • (1988) Computer , vol.21 , Issue.2 , pp. 9-21
    • Dubois, M.1    Scheurich, C.2    Briggs, F.A.3
  • 14
    • 0347662803 scopus 로고
    • The impact of hierarchical memory systems on linear algebra algorithm design
    • Univ. of Illinois, Urbana-Champaign, Ill.
    • GALLIVAN, K., JALBY, W., MEIER, U., AND SAMEH, A. 1987. The impact of hierarchical memory systems on linear algebra algorithm design. Tech. Rep. UIUCSRD 625, Univ. of Illinois, Urbana-Champaign, Ill.
    • (1987) Tech. Rep. UIUCSRD 625
    • Gallivan, K.1    Jalby, W.2    Meier, U.3    Sameh, A.4
  • 15
    • 0347662804 scopus 로고
    • The influence of memory hierarchy on algorithm organization: Programming FFTs on a vector multiprocessor
    • MIT Press, Cambridge, Mass.
    • GANNON, D. AND JALBY, W. 1987. The influence of memory hierarchy on algorithm organization: Programming FFTs on a vector multiprocessor. In The Characteristics of Parallel Algorithms. MIT Press, Cambridge, Mass.
    • (1987) The Characteristics of Parallel Algorithms
    • Gannon, D.1    Jalby, W.2
  • 18
    • 0011603313 scopus 로고
    • Tango introduction and tutorial
    • Stanford Univ., Palo Alto, Calif.
    • GOLDSCHMIDT, S. R. AND DAVIS, H. 1990. Tango introduction and tutorial. Tech. Rep. CSL-TR-90-410, Stanford Univ., Palo Alto, Calif.
    • (1990) Tech. Rep. CSL-TR-90-410
    • Goldschmidt, S.R.1    Davis, H.2
  • 19
  • 26
    • 0018518477 scopus 로고
    • How to make a multiprocessor computer that correctly executes multiprocess programs
    • Sept.
    • LAMPORT, L. 1979. How to make a multiprocessor computer that correctly executes multiprocess programs. IEEE Trans. Comput. C-28, 9 (Sept.), 241-248.
    • (1979) IEEE Trans. Comput. , vol.C-28 , Issue.9 , pp. 241-248
    • Lamport, L.1
  • 32
    • 84945709131 scopus 로고
    • The organization of matrices and matrix operations in a paged multiprogramming environment
    • MCKELLER, A. C. AND COFFMAN, E. G. 1969. The organization of matrices and matrix operations in a paged multiprogramming environment. Commun. ACM 12, 3, 153-165.
    • (1969) Commun. ACM , vol.12 , Issue.3 , pp. 153-165
    • Mckeller, A.C.1    Coffman, E.G.2
  • 33
    • 0002031606 scopus 로고
    • Tolerating latency through software-controlled prefetching in shared-memory multiprocessors
    • MOWRY, T. AND GUPTA, A. 1991. Tolerating latency through software-controlled prefetching in shared-memory multiprocessors. J. Parallel Distrib. Comput. 12, 2, 87-106.
    • (1991) J. Parallel Distrib. Comput. , vol.12 , Issue.2 , pp. 87-106
    • Mowry, T.1    Gupta, A.2
  • 37
    • 0003897840 scopus 로고
    • Splash: Stanford parallel applications for shared memory
    • Stanford Univ., Palo Alto, Calif.
    • SINGH, J. P., WEBER, W.-D., AND GUPTA, A. 1991. Splash: Stanford parallel applications for shared memory. Tech. Rep. CSL-TR-91-469, Stanford Univ., Palo Alto, Calif.
    • (1991) Tech. Rep. CSL-TR-91-469
    • Singh, J.P.1    Weber, W.-D.2    Gupta, A.3
  • 38
    • 0025440459 scopus 로고
    • A survey of cache coherence schemes for multiprocessors
    • June
    • STENSTROM, P. 1990. A survey of cache coherence schemes for multiprocessors. IEEE Comput. 23, 6 (June), 12-24.
    • (1990) IEEE Comput. , vol.23 , Issue.6 , pp. 12-24
    • Stenstrom, P.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.