-
1
-
-
33646429702
-
Multi-level memory prefetching for media and stream processors
-
J. Fritts, "Multi-level memory prefetching for media and stream processors," in Proc. Int. Conf. Multimedia Expo (ICME), 2002, pp. 101-104.
-
(2002)
Proc. Int. Conf. Multimedia Expo (ICME)
, pp. 101-104
-
-
Fritts, J.1
-
2
-
-
33646414407
-
Paged control store prefech mechanism
-
Dec.
-
T. A. Enger, "Paged control store prefech mechanism," IBM Tech. Disci. Bull., vol. 7, no. 16, pp. 2140-2141, Dec. 1973.
-
(1973)
IBM Tech. Disci. Bull.
, vol.7
, Issue.16
, pp. 2140-2141
-
-
Enger, T.A.1
-
3
-
-
33646400221
-
Cache memory with prefetching of data by priority
-
May
-
B. T. Bennet and P. A. Franaczek, "Cache memory with prefetching of data by priority," IBM Technical Disclosure Bulleting, vol. 18, no. 12, pp. 4231-4232, May 1976.
-
(1976)
IBM Technical Disclosure Bulleting
, vol.18
, Issue.12
, pp. 4231-4232
-
-
Bennet, B.T.1
Franaczek, P.A.2
-
5
-
-
0018106484
-
Sequential program prefetching in memory hierarchies
-
Dec.
-
A. J. Smith, "Sequential program prefetching in memory hierarchies," IEEE Computer, vol. 11, no. 12, pp. 7-21, Dec. 1978.
-
(1978)
IEEE Computer
, vol.11
, Issue.12
, pp. 7-21
-
-
Smith, A.J.1
-
6
-
-
0025429331
-
Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers
-
N. P. Jouppi, "Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers," in Proc. Int. Symp. Comput. Arch., 1990, pp. 363-373.
-
(1990)
Proc. Int. Symp. Comput. Arch.
, pp. 363-373
-
-
Jouppi, N.P.1
-
7
-
-
0026267802
-
An effective on-chip preloading scheme to reduce data access penalty
-
J.-L. Baer and T.-F. Chen, "An effective on-chip preloading scheme to reduce data access penalty." in Proc. Supercomputing, 1991, pp. 176-186.
-
(1991)
Proc. Supercomputing
, pp. 176-186
-
-
Baer, J.-L.1
Chen, T.-F.2
-
8
-
-
33646429702
-
Multi-level memory prefetching for media and stream processors
-
J. Fritts, "Multi-level memory prefetching for media and stream processors," in Int. Conf. Multimedia Expo (ICME), 2002, pp. 101-104.
-
(2002)
Int. Conf. Multimedia Expo (ICME)
, pp. 101-104
-
-
Fritts, J.1
-
9
-
-
0036005098
-
Prefetching for improved bus wrapper performance in cores
-
Jan.
-
R. Lysecky and F. Vahid, "Prefetching for improved bus wrapper performance in cores," ACM Trans. Des. Automat. Electron. Syst., vol. 7, no. 1, pp. 58-90, Jan. 2002.
-
(2002)
ACM Trans. Des. Automat. Electron. Syst.
, vol.7
, Issue.1
, pp. 58-90
-
-
Lysecky, R.1
Vahid, F.2
-
10
-
-
0038344707
-
Improving data prefetching efficacy in multimedia applications
-
Jun.
-
R. Cucchiara, A. Prati, and M. Piccardi, "Improving data prefetching efficacy in multimedia applications." Multimedia Tools Appl., vol. 20, no. 2, pp. 159-178, Jun. 2003.
-
(2003)
Multimedia Tools Appl.
, vol.20
, Issue.2
, pp. 159-178
-
-
Cucchiara, R.1
Prati, A.2
Piccardi, M.3
-
11
-
-
84944748972
-
A hardware-based cache pollution filtering mechanism for aggressive prefetches
-
X. Zhuang and H.-H. S. Lee, "A hardware-based cache pollution filtering mechanism for aggressive prefetches," in Proc. IEEE Int. Conf. Parallel Process. (ICPP), 2003, pp. 286-293.
-
(2003)
Proc. IEEE Int. Conf. Parallel Process. (ICPP)
, pp. 286-293
-
-
Zhuang, X.1
Lee, H.-H.S.2
-
12
-
-
0003690936
-
Software methods for improvement of cache performance on supercomputer applications
-
Ph.D. dissertation, Rice University, Houston, TX
-
A. K. Porterfield, "Software methods for improvement of cache performance on supercomputer applications," Ph.D. dissertation, Rice University, Houston, TX, 1989, Tech. Rep. CRPC-TR89009.
-
(1989)
Tech. Rep.
, vol.CRPC-TR89009
-
-
Porterfield, A.K.1
-
13
-
-
0026138044
-
Software prefetching
-
D. Callahan, K. Kennedy, and A. Porterfield, "Software prefetching," in Proc. 4th Int. Conf. Arch. Support Prog. Lang. Oper. Syst. (ASPLOS), 1991, pp. 40-52.
-
(1991)
Proc. 4th Int. Conf. Arch. Support Prog. Lang. Oper. Syst. (ASPLOS)
, pp. 40-52
-
-
Callahan, D.1
Kennedy, K.2
Porterfield, A.3
-
14
-
-
0003278283
-
The microarchitecture of the Pentium 4 processor
-
H. Glenn, D. Sager, M. Upton, D. Boggs, D. Carmean, A. Kyker, and P. Roussel, "The microarchitecture of the Pentium 4 processor," Intel Technol. J., vol. Ql, pp. 1-10, 2001.
-
(2001)
Intel Technol. J.
, vol.QL
, pp. 1-10
-
-
Glenn, H.1
Sager, D.2
Upton, M.3
Boggs, D.4
Carmean, D.5
Kyker, A.6
Roussel, P.7
-
15
-
-
0030129806
-
The MIPS R10000 superscalar microprocessor
-
Apr.
-
K. Yeager, "The MIPS R10000 superscalar microprocessor," IEEE Micro, vol. 16, no. 2, pp. 28-40, Apr. 1996.
-
(1996)
IEEE Micro
, vol.16
, Issue.2
, pp. 28-40
-
-
Yeager, K.1
-
16
-
-
0030676682
-
Data prefetching on the HP PA-8000
-
V. Santhanam, E. H. Gornish, and W. C. Hsu, "Data prefetching on the HP PA-8000," in Proc. 24th Int. Symp. Comput. Arch. (ISCA), 1997, pp. 264-273.
-
(1997)
Proc. 24th Int. Symp. Comput. Arch. (ISCA)
, pp. 264-273
-
-
Santhanam, V.1
Gornish, E.H.2
Hsu, W.C.3
-
17
-
-
0026918402
-
Design and evaluation of a compiler algorithm for prefetching
-
T. Mowry, M. Lam, and A. Gupta, "Design and evaluation of a compiler algorithm for prefetching," in Proc. ACM 5th Int. Conf. Arch. Support Program, Lang. Oper. Syst., 1992, pp. 62-73.
-
(1992)
Proc. ACM 5th Int. Conf. Arch. Support Program, Lang. Oper. Syst.
, pp. 62-73
-
-
Mowry, T.1
Lam, M.2
Gupta, A.3
-
18
-
-
0002031606
-
Tolerating latency through software-controlled data prefetching
-
Jun.
-
T. Mowry and A. Gupta, "Tolerating latency through software-controlled data prefetching," J. Parallel Distrib. Comput., vol. 12, no. 2, pp. 87-106, Jun. 1991.
-
(1991)
J. Parallel Distrib. Comput.
, vol.12
, Issue.2
, pp. 87-106
-
-
Mowry, T.1
Gupta, A.2
-
19
-
-
0034839064
-
Tolerating memory latency through software-controlled pre-execution in simultaneous multithreading processors
-
C.-K. Luk, "Tolerating memory latency through software-controlled pre-execution in simultaneous multithreading processors," in Proc. 28th Int. Conf. Comput. Arch., 2001, pp. 40-51.
-
(2001)
Proc. 28th Int. Conf. Comput. Arch.
, pp. 40-51
-
-
Luk, C.-K.1
-
21
-
-
84858881855
-
-
[Online]
-
Intel, Intel Corp. [Online]. Available: http://www.intel.com, 2005
-
(2005)
-
-
-
22
-
-
0004864204
-
An integrated hardware/software scheme for shared-memory multiprocessors
-
E. H. Gornish and A. V. Veidenbaum, "An integrated hardware/software scheme for shared-memory multiprocessors," in Proc. Int. Conf. Parallel Process., 1994, pp. 281-284.
-
(1994)
Proc. Int. Conf. Parallel Process.
, pp. 281-284
-
-
Gornish, E.H.1
Veidenbaum, A.V.2
-
23
-
-
0029511258
-
An effective programmable prefetch engine for on-chip caches
-
T. Chen, "An effective programmable prefetch engine for on-chip caches," in Proc. 28th Int. Symp. Microarch., 1995, pp. 237-242.
-
(1995)
Proc. 28th Int. Symp. Microarch.
, pp. 237-242
-
-
Chen, T.1
-
25
-
-
53549132930
-
Scheduler-based prefetching for multilevel memories
-
MIT, Boston, MA, Group Memo 444
-
D. Chiou, S. Devadas, J. Jacos, P. Jain, V. Lee, E. Peserico, P. Portante, L. Rudolph, G. E. Suh, and D. Willenson, "Scheduler-Based Prefetching for Multilevel Memories," Lab. Comput. Sci., MIT, Boston, MA, Group Memo 444, 2001.
-
(2001)
Lab. Comput. Sci.
-
-
Chiou, D.1
Devadas, S.2
Jacos, J.3
Jain, P.4
Lee, V.5
Peserico, E.6
Portante, P.7
Rudolph, L.8
Suh, G.E.9
Willenson, D.10
-
26
-
-
0038345683
-
Guided region prefetching: A cooperative hardware/software approach
-
Z. Wang, D. Burger, K. S. McKinley, S. K. Reinhardt, and C. C. Weems, "Guided region prefetching: A cooperative hardware/software approach." in Proc. 30th Ann. Int. Symp. Comput. Arch., 2003, pp. 388-400.
-
(2003)
Proc. 30th Ann. Int. Symp. Comput. Arch.
, pp. 388-400
-
-
Wang, Z.1
Burger, D.2
McKinley, K.S.3
Reinhardt, S.K.4
Weems, C.C.5
-
27
-
-
33646389168
-
Compiler-directed contentaware prefetching for dynamic data structures
-
H. Al-Sukhni, I. Bratt, and D. A. Connors, "Compiler-directed contentaware prefetching for dynamic data structures," in Proc. 12th Int. Conf. Parallel Arch. Compilation Tech. (PACT), 2003, pp. 91-102.
-
(2003)
Proc. 12th Int. Conf. Parallel Arch. Compilation Tech. (PACT)
, pp. 91-102
-
-
Al-Sukhni, H.1
Bratt, I.2
Connors, D.A.3
-
28
-
-
84858877281
-
-
[Online]
-
IMEC [Online]. Available: http://www.imec.be/design/atomium/, 2005
-
(2005)
-
-
-
29
-
-
0003913538
-
-
Exploration of Memory Organization for Embedded Multimedia System Design. Boston, MA: Kluwer
-
F. Catthoor, S. Wuytack, E. D. Greef, F. Balasa, L. Nachtergaele, and A. Vandecappelle, Custom Memory Management Methodology, Exploration of Memory Organization for Embedded Multimedia System Design. Boston, MA: Kluwer, 1998.
-
(1998)
Custom Memory Management Methodology
-
-
Catthoor, F.1
Wuytack, S.2
Greef, E.D.3
Balasa, F.4
Nachtergaele, L.5
Vandecappelle, A.6
-
31
-
-
33947534419
-
-
Texas Instruments, Dallas, TX
-
"TMS320C620x/C670x DSP Program and Date Memory Controller, Direct Memory Access (DMA) Controller," ver. A, Texas Instruments, Dallas, TX, 2004.
-
(2004)
"TMS320C620x/C670x DSP Program and Date Memory Controller, Direct Memory Access (DMA) Controller," Ver. A
-
-
-
32
-
-
84893726637
-
Laver assignment techniques for low energy in multi-layered memory organizations
-
E. Brockmeyer, M. Miranda, H. Corporaal, and F. Catthoor, "Laver assignment techniques for low energy in multi-layered memory organizations," Proc. Des. Automat. Test Eur., pp. 1070-1075, 2003.
-
(2003)
Proc. Des. Automat. Test Eur.
, pp. 1070-1075
-
-
Brockmeyer, E.1
Miranda, M.2
Corporaal, H.3
Catthoor, F.4
-
33
-
-
3042615493
-
Data reuse analysis technique for software-controlled memory hierarchies
-
I. Issenin, E. Brockmeyer, M. Miranda, and N. Dutt, "Data reuse analysis technique for software-controlled memory hierarchies." in Proc. Des. Automat. Test Eur., 2004, pp. 202-207.
-
(2004)
Proc. Des. Automat. Test Eur.
, pp. 202-207
-
-
Issenin, I.1
Brockmeyer, E.2
Miranda, M.3
Dutt, N.4
-
34
-
-
0033279857
-
Minimizing the required memory bandwidth in VLSI system realizations
-
Dec.
-
S. Wuytack, F. Catthoor, G. D. Jong, and H. J. D. Man, "Minimizing the required memory bandwidth in VLSI system realizations," IEEE Trans. Very Large Scale Integr. (VLSI) Syst., vol. 7, no. 4, pp. 433-441. Dec. 1999.
-
(1999)
IEEE Trans. Very Large Scale Integr. (VLSI) Syst.
, vol.7
, Issue.4
, pp. 433-441
-
-
Wuytack, S.1
Catthoor, F.2
Jong, G.D.3
Man, H.J.D.4
-
35
-
-
2342635671
-
CACTI 3.0: An integrated cache timing, power and area model
-
COMPAQ, Palo Alto, CA
-
P. Shivakumar and N. Jouppi, "CACTI 3.0: An Integrated Cache Timing, Power and Area Model," COMPAQ, Palo Alto, CA, WRL Res. Rep. 2001/2, 2001.
-
(2001)
WRL Res. Rep.
, vol.2001
, Issue.2
-
-
Shivakumar, P.1
Jouppi, N.2
-
37
-
-
0029356792
-
A fast hierarchical motion vector estimation algorithm using mean pyramid
-
Aug.
-
M. Nam, J.-S. Kim, R.-H. Park, and Y. S. Shim, "A fast hierarchical motion vector estimation algorithm using mean pyramid," IEEE Trans. Circuits Syst. Video Technol., vol. 5, no. 4, pp. 344-351, Aug. 1995.
-
(1995)
IEEE Trans. Circuits Syst. Video Technol.
, vol.5
, Issue.4
, pp. 344-351
-
-
Nam, M.1
Kim, J.-S.2
Park, R.-H.3
Shim, Y.S.4
-
40
-
-
33646424580
-
Platform independent data transfer and storage exploration illustrated on a parallel cavity detection algorithm
-
K. Danckaert, F. Catthoor, and H. D. Man, "Platform independent data transfer and storage exploration illustrated on a parallel cavity detection algorithm," in Proc. ACM Conf. Parrallel Distrib. Process. Tech. Appl., 1999, pp. 1669-1675.
-
(1999)
Proc. ACM Conf. Parrallel Distrib. Process. Tech. Appl.
, pp. 1669-1675
-
-
Danckaert, K.1
Catthoor, F.2
Man, H.D.3
-
41
-
-
0033875764
-
The local wavelet transform: A memory-efficient, high-speed architecture optimized to a region-oriented zero-tree coder
-
G. Lafruit, L. Nachtergaele, B. Vahnhoof, and F. Catthoor, "The local wavelet transform: A memory-efficient, high-speed architecture optimized to a region-oriented zero-tree coder," Integr. Comput.-Aided Eng., vol. 7, no. 2, pp. 89-103, 2000.
-
(2000)
Integr. Comput.-aided Eng.
, vol.7
, Issue.2
, pp. 89-103
-
-
Lafruit, G.1
Nachtergaele, L.2
Vahnhoof, B.3
Catthoor, F.4
-
42
-
-
84898793811
-
One shot active 3D shape reconstruction
-
M. Proesmans, L. V. Gool, and A. Ossterlinkck, "One shot active 3D shape reconstruction," in Proc. 13th Int. Conf. Pattern Recognit.: Appl. Robot. Syst. (ICPR), 1996, pp. 336-340.
-
(1996)
Proc. 13th Int. Conf. Pattern Recognit.: Appl. Robot. Syst. (ICPR)
, pp. 336-340
-
-
Proesmans, M.1
Gool, L.V.2
Ossterlinkck, A.3
-
43
-
-
33646413590
-
-
Texas Instruments, Dallas, TX, SPRA486C
-
"Power Consumption Summary," Texas Instruments, Dallas, TX, SPRA486C, 2002.
-
(2002)
Power Consumption Summary
-
-
|