SCOPUS 정보 검색 플랫폼

Proceedings of the Annual International Symposium on Microarchitecture, MICRO

Volumn Part F129425, Issue , 1994, Pages 118-127

Data relocation and prefetching for programs with large data sets

(4) Yamada, Yoji a Gyllenhall, John a Haab, Grant a Hwu, Wen Mei a

a UNIVERSITY OF ILLINOIS AT URBANA CHAMPAIGN (United States)

Author keywords

Cache conflicts; Data copying; Data relocation; Program optimization; Software prefetching

Indexed keywords

CACHE MEMORY; COMPUTER ARCHITECTURE; HARDWARE;

CACHE CONFLICTS; CACHE PERFORMANCE; DATA COPYING; HARDWARE AND SOFTWARE; NESTED LOOP STRUCTURE; NUMERICAL APPLICATIONS; PROGRAM OPTIMIZATION; SOFTWARE PREFETCHING;

COST REDUCTION;

EID: 0006424869 PISSN: 10724451 EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/192724.192740 Document Type: Conference Paper

Times cited : (2)

References (17)

1
- 84976859541
- The cache performance and optimizations of blocked algorithms
- Apr
- M. S. Lam, E. E. Rothberg, and M. E. Wolf, "The cache performance and optimizations of blocked algorithms", in Proc. Fourth Int'l Conf. on Architectural Support for Prog. Lang, and Operating Systems, pp. 63-74, Apr. 1991.
- (1991) Proc. Fourth Int'l Conf. on Architectural Support for Prog. Lang, and Operating Systems , pp. 63-74
- Lam, M.S.¹ Rothberg, E.E.² Wolf, M.E.³

2
- 0027764718
- To copy or not to copy: A compile-time technique for assessing when data copying should be used to eliminate cache conflicts
- Los Alamitos, California, IEEE Computer Society Press, Nov
- O. Temam, E. D. Granston, and W. Jalby, "To copy or not to copy: A compile-time technique for assessing when data copying should be used to eliminate cache conflicts", in Proceedings of Supercomputing '93, (Los Alamitos, California), pp. 410-419, IEEE Computer Society Press, Nov. 1993.
- (1993) Proceedings of Supercomputing '93 , pp. 410-419
- Temam, O.¹ Granston, E.D.² Jalby, W.³

3
- 1542423315
- ch. The influence of memory hierarchy on algorithm organization: Programming FFTs on a vector multiprocessor. MIT press
- D. Gannon and W. Jalby, The characteristics of parallel programs, ch. The influence of memory hierarchy on algorithm organization: Programming FFTs on a vector multiprocessor. MIT press, 1987.
- (1987) The Characteristics of Parallel Programs
- Gannon, D.¹ Jalby, W.²

4
- 0347662803
- Tech. Rep, Center for Supercomputing Research and Development, University of Illinois, Urbana, IL
- K. Gallivan, W. Jalby, U. Meier, and A. Sameh, "The impact of hierarchical memory systems on linear algebra design", Tech. Rep. CSRD-625, Center for Supercomputing Research and Development, University of Illinois, Urbana, IL, 1987.
- (1987) The Impact of Hierarchical Memory Systems on Linear Algebra Design
- Gallivan, K.¹ Jalby, W.² Meier, U.³ Sameh, A.⁴

5
- 0026157234
- Data prefetching in multiprocessor vector cache memories
- Toronto, Canada, June
- J. W. C. Fu and J. H. Patel, "Data prefetching in multiprocessor vector cache memories", in Proc. 18th Ann. Int'l Symp. Computer Architecture, (Toronto, Canada), pp. 54-63, June 1991.
- (1991) Proc. 18th Ann. Int'l Symp. Computer Architecture , pp. 54-63
- Fu, J.W.C.¹ Patel, J.H.²

6
- 0026186269
- Compile-time partitioning of iterative parallel loops to reduce cache coherence traffic
- S. G. Abraham and D. E. Hudak, "Compile-time partitioning of iterative parallel loops to reduce cache coherence traffic", J. Parallel and Distributed Computing, vol. 2, pp. 318-328, 1991.
- (1991) J. Parallel and Distributed Computing , vol.2 , pp. 318-328
- Abraham, S.G.¹ Hudak, D.E.²

7
- 0026267802
- An effective on-chip preloading scheme to reduce data access penalty
- Nov
- J.-L. Baer and T.-F. Chen, "An effective on-chip preloading scheme to reduce data access penalty", in Proceeding of Supercomputing '91, pp. 176-186, Nov. 1991.
- (1991) Proceeding of Supercomputing '91 , pp. 176-186
- Baer, J.-L.¹ Chen, T.-F.²

8
- 84976833735
- Design and evaluation of a compiler algorithm for prefetching
- Oct
- T. C. Mowry, M. S. Lam, and A. Gupta, "Design and evaluation of a compiler algorithm for prefetching", in Proc. Fifth Int'l Conf. on Architectural Support for Prog. Lang, and Operating Systems, pp. 62-73, Oct. 1992.
- (1992) Proc. Fifth Int'l Conf. on Architectural Support for Prog. Lang, and Operating Systems , pp. 62-73
- Mowry, T.C.¹ Lam, M.S.² Gupta, A.³

9
- 84944799568
- Data access microarchitectures for superscalar processors with compiler-assisted data prefetching
- Albuquerque, NM., Nov
- W. Y. Chen, S. A. Mahlke, P. P. Chang, and W. W. Hwu, "Data access microarchitectures for superscalar processors with compiler-assisted data prefetching", in Proc. 24th Ann. Workshop on Microprogramming and Microarchitectures, (Albuquerque, NM.), Nov. 1991.
- (1991) Proc. 24th Ann. Workshop on Microprogramming and Microarchitectures
- Chen, W.Y.¹ Mahlke, S.A.² Chang, P.P.³ Hwu, W.W.⁴

10
- 33646901785
- Tolerating data access latency with register preloading
- July
- W. Y. Chen, S. A. Mahlke, W. W. Hwu, T. Kiyohara, and P. P. Chang, "Tolerating data access latency with register preloading", in Proceedings of the 6th International Conference on Supercomputing, July 1992.
- (1992) Proceedings of the 6th International Conference on Supercomputing
- Chen, W.Y.¹ Mahlke, S.A.² Hwu, W.W.³ Kiyohara, T.⁴ Chang, P.P.⁵

11
- 0026157612
- IMPACT: An architectural framework for multiple-instruction-issue processors
- Toronto, Canada, June
- P. P. Chang, S. A. Mahlke, W. Y. Chen, N. J. Wärter, and W. W. Hwu, "IMPACT: An architectural framework for multiple-instruction-issue processors", in Proc. 18th Ann. Int'l Symp. Computer Architecture, (Toronto, Canada), pp. 266-275, June 1991.
- (1991) Proc. 18th Ann. Int'l Symp. Computer Architecture , pp. 266-275
- Chang, P.P.¹ Mahlke, S.A.² Chen, W.Y.³ Wärter, N.J.⁴ Hwu, W.W.⁵

12
- 0027595384
- The superblock: An effective technique for VLIW and superscalar compilation
- Jan
- W. W. Hwu, S. A. Mahlke, W. Y. Chen, P. P. Chang, N. J. Warter, R. A. Bringmann, R. G. Ouellette, R. E. Hank, T. Kiyohara, G. E. Haab, J. G. Holm, and D. M. Lavery, "The superblock: An effective technique for VLIW and superscalar compilation", Journal of Supercomputing, vol. 7, pp. 229-248, Jan. 1992.
- (1992) Journal of Supercomputing , vol.7 , pp. 229-248
- Hwu, W.W.¹ Mahlke, S.A.² Chen, W.Y.³ Chang, P.P.⁴ Warter, N.J.⁵ Bringmann, R.A.⁶ Ouellette, R.G.⁷ Hank, R.E.⁸ Kiyohara, T.⁹ Haab, G.E.¹⁰ Holm, J.G.¹¹ Lavery, D.M.¹²

13
- 84976676720
- A practical algorithm for exact array dependence analysis
- Aug
- W. Pugh, "A practical algorithm for exact array dependence analysis", Communications of the ACM, vol. 35, pp. 102-114, Aug. 1992.
- (1992) Communications of the ACM , vol.35 , pp. 102-114
- Pugh, W.¹

14
- 0026974538
- Eliminating false data dependences using the omega test
- June
- W. Pugh and D. Wonnacott, "Eliminating false data dependences using the omega test", in Proceedings of the SIGPLAN '92 Conference on Programming Language Design and Implementation, pp. 140-151, June 1992.
- (1992) Proceedings of the SIGPLAN '92 Conference on Programming Language Design and Implementation , pp. 140-151
- Pugh, W.¹ Wonnacott, D.²

15
- 0003477925
- Tech. Rep, Center for Supercomputing Research and Development, University of Illinois, Urbana, IL, May
- M. Berry, D. Chen, P. Koss, D. Kuck, S. Lo, Y. Pang, R. Roloff, A. Sameh, E. Clementi, S. Chin, D. Schneider, G. Fox, P. Messina, D. Walker, C. Hsiung, J. Schwarzmeier, K. Lue, S. Orzag, F. Seidl, O. Johnson, G. Swanson, R. Goodrum, and J. Martin, "The PERFECT club benchmarks: Effective performance evaluation of supercomputers", Tech. Rep. CSRD-827, Center for Supercomputing Research and Development, University of Illinois, Urbana, IL, May 1989.
- (1989) The PERFECT Club Benchmarks: Effective Performance Evaluation of Supercomputers
- Berry, M.¹ Chen, D.² Koss, P.³ Kuck, D.⁴ Lo, S.⁵ Pang, Y.⁶ Roloff, R.⁷ Sameh, A.⁸ Clementi, E.⁹ Chin, S.¹⁰ Schneider, D.¹¹ Fox, G.¹² Messina, P.¹³ Walker, D.¹⁴ Hsiung, C.¹⁵ Schwarzmeier, J.¹⁶ Lue, K.¹⁷ Orzag, S.¹⁸ Seidl, F.¹⁹ Johnson, O.²⁰ more..

16
- 6144224602
- Tech. Rep, Center for Reliable and High-Performance Computing, University of Illinois, Urbana, IL
- J. W. C. Fu and J. H. Patel, "How to simulate 100 billion references cheaply", Tech. Rep. CRHC-91-30, Center for Reliable and High-Performance Computing, University of Illinois, Urbana, IL, 1991.
- (1991) How to Simulate 100 Billion References Cheaply
- Fu, J.W.C.¹ Patel, J.H.²

17
- 0017922490
- The Cray-1 computer system
- Jan
- R. M. Russell, "The Cray-1 computer system", Communications of the ACM, vol. 21, pp. 63-72, Jan. 1978.
- (1978) Communications of the ACM , vol.21 , pp. 63-72
- Russell, R.M.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.