SCOPUS Information Search Platform

Proceedings - 15th Annual IEEE Symposium on High-Performance Interconnects, HOT Interconnects

2007, Pages 143-149

Reducing the impact of the memory wall for I/O using cache injection

(3) León, Edgar A. (a); Ferreira, Kurt B. (a); Maccabe, Arthur B. (a)

(a) MSC01 1070 (United States)

Author keywords

[No Author keywords available]

Indexed keywords

APPLICATION PERFORMANCE; DATA PREFETCHING; DATA USAGE; MEMORY BOUND APPLICATIONS; MEMORY CONTROLLERS; MEMORY LATENCIES; MEMORY PRESSURE; MEMORY SPEEDS; MEMORY WALL; SCIENTIFIC COMPUTATIONS;

CACHE MEMORY; COMPUTER GRAPHICS; CRYPTOGRAPHY; DATA STORAGE EQUIPMENT; GRAPHIC METHODS; IMAGE PROCESSING; IMAGING SYSTEMS; OPTICAL DATA PROCESSING; PRESSURE; PROGRAM COMPILERS;

DATA REDUCTION;

EID: 46449101559 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/HOTI.2007.8 Document Type: Conference Paper

Times cited : (16)

References (21)

1
- 0030783438
- An evaluation of fine-grain producer-initiated communication in cache-coherent multiprocessors
- San Antonio, TX
- H. Abdel-Shafi, J. Hall, S. V. Adve, and V. S. Adve. An evaluation of fine-grain producer-initiated communication in cache-coherent multiprocessors. In 3rd IEEE Symposium on High-Performance Computer Architecture (HPCA '97), pages 204-215, San Antonio, TX, 1997.
- (1997) 3rd IEEE Symposium on High-Performance Computer Architecture (HPCA '97) , pp. 204-215
- Abdel-Shafi, H.¹ Hall, J.² Adve, S.V.³ Adve, V.S.⁴

2
- 27544478808
- Mambo - a full system simulator for the PowerPC architecture
- Mar
- P. Bohrer, M. Elnozahy, A. Gheith, C. Lefurgy, T. Nakra, J. Peterson, R. Rajamony, R. Rockhold, H. Shafi, R. Simpson, E. Speight, K. Sudeep, E. V. Hensbergen, and L. Zhang. Mambo - a full system simulator for the PowerPC architecture. ACM SIGMETRICS Performance Evaluation Review, 31(4):8-12, Mar. 2004.
- (2004) ACM SIGMETRICS Performance Evaluation Review , vol.31 , Issue.4 , pp. 8-12
- Bohrer, P.¹ Elnozahy, M.² Gheith, A.³ Lefurgy, C.⁴ Nakra, T.⁵ Peterson, J.⁶ Rajamony, R.⁷ Rockhold, R.⁸ Shafi, H.⁹ Simpson, R.¹⁰ Speight, E.¹¹ Sudeep, K.¹² Hensbergen, E.V.¹³ Zhang, L.¹⁴

3
- 46349112166
- Method and apparatus for accelerating Input/Output processing using cache injections
- Mar, US Patent No. US 6,711,650 B1
- P. Bohrer, R. Rajamony, and H. Shafi. Method and apparatus for accelerating Input/Output processing using cache injections, Mar. 2004. US Patent No. US 6,711,650 B1.
- (2004)
- Bohrer, P.¹ Rajamony, R.² Shafi, H.³

4
- 0033708935
- Semicoarsening multigrid on distributed memory machines
- P. N. Brown, R. D. Falgout, and J. E. Jones. Semicoarsening multigrid on distributed memory machines. SIAM Journal on Scientific Computing, 21(5):1823-1834, 2000.
- (2000) SIAM Journal on Scientific Computing , vol.21 , Issue.5 , pp. 1823-1834
- Brown, P.N.¹ Falgout, R.D.² Jones, J.E.³

5
- 0033097556
- Producer-consumer communication in distributed shared memory multiprocessors
- G. T. Byrd and M. J. Flynn. Producer-consumer communication in distributed shared memory multiprocessors. Proceedings of the IEEE, 87(3):456-466, 1999.
- (1999) Proceedings of the IEEE , vol.87 , Issue.3 , pp. 456-466
- Byrd, G.T.¹ Flynn, M.J.²

6
- 46449091801
- MPI: A message-passing interface standard
- Message Passing Interface Forum. MPI: A message-passing interface standard. Technical Report UT-CS-94-230, Knoxville, TN, 1994.

7
- 0004116219
- AMS Chelsea, Translated from Russian
- F. R. Gantmacher. The Theory of Matrices, volume I. AMS Chelsea, 1959. Translated from Russian.
- (1959) The Theory of Matrices, volume I
- Gantmacher, F.R.¹

8
- 46449121306
- Advanced POWER Virtualization on IBM eServer p5 Servers: Architecture and Performance Considerations
- second edition
- B. Gibbs, B. Atyam, F. Berres, B. Blanchard, L. Castillo, P. Coelho, N. Guerin, L. Liu, C. D. Maciel, C. Sosa, and R. Thirumalai. Advanced POWER Virtualization on IBM eServer p5 Servers: Architecture and Performance Considerations. IBM Redbooks, second edition, 2005.
- (2005) IBM Redbooks
- Gibbs, B.¹ Atyam, B.² Berres, F.³ Blanchard, B.⁴ Castillo, L.⁵ Coelho, P.⁶ Guerin, N.⁷ Liu, L.⁸ Maciel, C.D.⁹ Sosa, C.¹⁰ Thirumalai, R.¹¹

9
- 27544482360
- Direct cache access for high bandwidth network I/O
- Madison, WI, June
- R. Huggahalli, R. Iyer, and S. Tetrick. Direct cache access for high bandwidth network I/O. In 32nd Annual International Symposium on Computer Architecture (ISCA'05), pages 50-59, Madison, WI, June 2005.
- (2005) 32nd Annual International Symposium on Computer Architecture (ISCA'05) , pp. 50-59
- Huggahalli, R.¹ Iyer, R.² Tetrick, S.³

10
- 0036949388
- An adaptive, nonuniform cache structure for wire-delay dominated on-chip caches
- San Jose, CA, Oct
- C. Kim, D. Burger, and S. W. Keckler. An adaptive, nonuniform cache structure for wire-delay dominated on-chip caches. In 10th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS X), pages 211-222, San Jose, CA, Oct. 2002.
- (2002) 10th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS X) , pp. 211-222
- Kim, C.¹ Burger, D.² Keckler, S.W.³

11
- 0026839484
- The Stanford Dash multiprocessor
- D. Lenoski, J. Laudon, K. Gharachorloo, W.-D. Weber, A. Gupta, J. Hennessy, M. Horowitz, and M. S. Lam. The Stanford Dash multiprocessor. Computer, 25(3):63-79, 1992.
- (1992) Computer , vol.25 , Issue.3 , pp. 63-79
- Lenoski, D.¹ Laudon, J.² Gharachorloo, K.³ Weber, W.-D.⁴ Gupta, A.⁵ Hennessy, J.⁶ Horowitz, M.⁷ Lam, M.S.⁸

12
- 46449134294
- Reducing memory bandwidth for chip-multiprocessors using cache injection
- Seattle, WA, Nov
- E. A. León and A. B. Maccabe. Reducing memory bandwidth for chip-multiprocessors using cache injection. In 7th USENIX Symposium on Operating Systems Design and Implementation (OSDI'06). Poster Session, Seattle, WA, Nov. 2006.
- (2006) 7th USENIX Symposium on Operating Systems Design and Implementation (OSDI'06). Poster Session
- León, E.A.¹ Maccabe, A.B.²

13
- 84885597319
- An infrastructure for the development of kernel network services
- Brighton, United Kingdom, Oct, ACM SIGOPS
- E. A. León and M. Ostrowski. An infrastructure for the development of kernel network services. In 20th ACM Symposium on Operating Systems Principles (SOSP'05). Poster Session, Brighton, United Kingdom, Oct. 2005. ACM SIGOPS.
- (2005) 20th ACM Symposium on Operating Systems Principles (SOSP'05). Poster Session
- León, E.A.¹ Ostrowski, M.²

14
- 4143107088
- Increasing memory bandwidth for vector computations
- Zurich, Switzerland, Mar
- S. A. McKee, S. A. Moyer, and W. A. Wulf. Increasing memory bandwidth for vector computations. In International Conference on Programming Languages and System Architectures, pages 87-104, Zurich, Switzerland, Mar. 1994.
- (1994) International Conference on Programming Languages and System Architectures , pp. 87-104
- McKee, S.A.¹ Moyer, S.A.² Wulf, W.A.³

15
- 0032317370
- Cache injection on bus based multiprocessors
- West Lafayette, IN
- A. Milenkovic and V. Milutinovic. Cache injection on bus based multiprocessors. In 17th Symposium on Reliable Distributed Systems (SRDS'98), pages 341-346, West Lafayette, IN, 1998.
- (1998) 17th Symposium on Reliable Distributed Systems (SRDS'98) , pp. 341-346
- Milenkovic, A.¹ Milutinovic, V.²

16
- 0002031606
- Tolerating latency through software-controlled prefetching in shared-memory multiprocessors
- T. Mowry and A. Gupta. Tolerating latency through software-controlled prefetching in shared-memory multiprocessors. Journal of Parallel and Distributed Computing, 12(2):87-106, 1991.
- (1991) Journal of Parallel and Distributed Computing , vol.12 , Issue.2 , pp. 87-106
- Mowry, T.¹ Gupta, A.²

17
- 0008602220
- PhD thesis, Department of Computer Science, University of Virginia, Apr
- S. A. Moyer. Access Ordering and Effective Memory Bandwidth. PhD thesis, Department of Computer Science, University of Virginia, Apr. 1993.
- (1993) Access Ordering and Effective Memory Bandwidth
- Moyer, S.A.¹

18
- 77954460854
- Data prefetching and data forwarding in shared memory multiprocessors
- North Carolina State University, NC
- D. K. Poulsen and P.-C. Yew. Data prefetching and data forwarding in shared memory multiprocessors. In International Conference on Parallel Processing (ICPP'94), pages 276-280, North Carolina State University, NC, 1994.
- (1994) International Conference on Parallel Processing (ICPP'94) , pp. 276-280
- Poulsen, D.K.¹ Yew, P.-C.²

19
- 25844437046
- POWER5 system microarchitecture
- B. Sinharoy, R. N. Kalla, J. M. Tendler, R. J. Eickemeyer, and J. B. Joyner. POWER5 system microarchitecture. IBM Journal of Research and Development, 49(4/5), 2005.
- (2005) IBM Journal of Research and Development , vol.49 , Issue.4-5
- Sinharoy, B.¹ Kalla, R.N.² Tendler, J.M.³ Eickemeyer, R.J.⁴ Joyner, J.B.⁵

20
- 85117198273
- An empirical performance evaluation of scalable scientific applications
- Baltimore, Maryland
- J. S. Vetter and A. Yoo. An empirical performance evaluation of scalable scientific applications. In 2002 ACM/IEEE Conference on Supercomputing (SC'02), pages 1-18, Baltimore, Maryland, 2002.
- (2002) 2002 ACM/IEEE Conference on Supercomputing (SC'02) , pp. 1-18
- Vetter, J.S.¹ Yoo, A.²

21
- 0003158656
- Hitting the memory wall: Implications of the obvious
- Mar
- W. A. Wulf and S. A. McKee. Hitting the memory wall: Implications of the obvious. ACM SIGARCH Computer Architecture News, 23(1):20-24, Mar. 1995.
- (1995) ACM SIGARCH Computer Architecture News , vol.23 , Issue.1 , pp. 20-24
- Wulf, W.A.¹ McKee, S.A.²

* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.