메뉴 건너뛰기




Volumn , Issue , 2010, Pages 191-202

Cashing in on hints for better prefetching and caching in PVFS and MPI-IO

Author keywords

Caching; Hints; I O; MPI IO; Performance; Prefetching; PVFS

Indexed keywords

CACHING; HINTS; I/O; MPI-IO; PERFORMANCE; PREFETCHING; PVFS;

EID: 78650035887     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1851476.1851499     Document Type: Conference Paper
Times cited : (13)

References (51)
  • 3
    • 0029181135 scopus 로고
    • A model and compilation strategy for out-of-core data parallel programs
    • R. Bordawekar, A. Choudhary, K. Kennedy, C. Koelbel, and M. Paleczny. A model and compilation strategy for out-of-core data parallel programs. In PPOPP, pages 1-10, 1995.
    • (1995) PPOPP , pp. 1-10
    • Bordawekar, R.1    Choudhary, A.2    Kennedy, K.3    Koelbel, C.4    Paleczny, M.5
  • 4
    • 33244494252 scopus 로고    scopus 로고
    • The performance impact of kernel prefetching on buffer cache replacement algorithms
    • A. R. Butt, C. Gniady, and Y. C. Hu. The performance impact of kernel prefetching on buffer cache replacement algorithms. SIGMETRICS Perf. Eval. Rev., 33(1):157-168, 2005.
    • (2005) SIGMETRICS Perf. Eval. Rev. , vol.33 , Issue.1 , pp. 157-168
    • Butt, A.R.1    Gniady, C.2    Hu, Y.C.3
  • 5
    • 70350771132 scopus 로고    scopus 로고
    • Parallel I/O prefetching using MPI file caching and I/O signatures
    • S. Byna, Y. Chen, X.-H. Sun, R. Thakur, and W. Gropp. Parallel I/O prefetching using MPI file caching and I/O signatures. In SC, pages 1-12, 2008.
    • (2008) SC , pp. 1-12
    • Byna, S.1    Chen, Y.2    Sun, X.-H.3    Thakur, R.4    Gropp, W.5
  • 6
    • 79955680825 scopus 로고    scopus 로고
    • PVFS: A parallel file system for linux clusters
    • P. H. Carns, W. B. Ligon III, R. B. Ross, and R. Thakur. PVFS: A parallel file system for linux clusters. In ALS, pages 317-327, 2000.
    • (2000) ALS , pp. 317-327
    • Carns, P.H.1    Ligon Iii, W.B.2    Ross, R.B.3    Thakur, R.4
  • 7
    • 0001945085 scopus 로고    scopus 로고
    • Automatic I/O hint generation through speculative execution
    • F. Chang and G. A. Gibson. Automatic I/O hint generation through speculative execution. In OSDI, pages 1-14, 1999.
    • (1999) OSDI , pp. 1-14
    • Chang, F.1    Gibson, G.A.2
  • 8
    • 70350757788 scopus 로고    scopus 로고
    • Hiding I/O latency with pre-execution prefetching for parallel applications
    • Y. Chen, S. Byna, X.-H. Sun, R. Thakur, and W. Gropp. Hiding I/O latency with pre-execution prefetching for parallel applications. In SC, pages 1-10, 2008.
    • (2008) SC , pp. 1-10
    • Chen, Y.1    Byna, S.2    Sun, X.-H.3    Thakur, R.4    Gropp, W.5
  • 9
    • 10044225941 scopus 로고    scopus 로고
    • Co-array Fortran performance and potential: An NPB experimental study
    • C. Coarfa, Y. Dotsenko, J. Eckhardt, and J. Mellor-Crummey. Co-array Fortran performance and potential: An NPB experimental study. In LCPC, pages 2-4, 2003.
    • (2003) LCPC , pp. 2-4
    • Coarfa, C.1    Dotsenko, Y.2    Eckhardt, J.3    Mellor-Crummey, J.4
  • 10
    • 78650025890 scopus 로고    scopus 로고
    • U. Consortium, v1.2. Technical Report LBNL-59208, Lawrence Berkeley National Lab Tech Report
    • U. Consortium. UPC specifications, v1.2. Technical Report LBNL-59208, Lawrence Berkeley National Lab Tech Report, 2005.
    • (2005) UPC Specifications
  • 11
    • 70350214299 scopus 로고    scopus 로고
    • Diskseen: Exploiting disk layout and access history to enhance I/O prefetch
    • X. Ding, S. Jiang, F. Chen, K. Davis, and X. Zhang. Diskseen: exploiting disk layout and access history to enhance I/O prefetch. In USENIX, pages 1-14, 2007.
    • (2007) USENIX , pp. 1-14
    • Ding, X.1    Jiang, S.2    Chen, F.3    Davis, K.4    Zhang, X.5
  • 13
    • 33845462269 scopus 로고    scopus 로고
    • Operating system I/O speculation: How two invocations are faster than one
    • K. Fraser and F. Chang. Operating system I/O speculation: How two invocations are faster than one. In USENIX, pages 325-338, 2003.
    • (2003) USENIX , pp. 325-338
    • Fraser, K.1    Chang, F.2
  • 14
    • 85060311108 scopus 로고    scopus 로고
    • AMP: Adaptive multi-stream prefetching in a shared cache
    • B. Gill and L. A. Bathen. AMP: adaptive multi-stream prefetching in a shared cache. In FAST, pages 185-198, 2007.
    • (2007) FAST , pp. 185-198
    • Gill, B.1    Bathen, L.A.2
  • 15
    • 84991634667 scopus 로고    scopus 로고
    • SARC: Sequential prefetching in adaptive replacement cache
    • B. Gill and D. Modha. SARC: sequential prefetching in adaptive replacement cache. In USENIX, pages 293-308, 2005.
    • (2005) USENIX , pp. 293-308
    • Gill, B.1    Modha, D.2
  • 16
    • 85084161825 scopus 로고
    • Reducing file system latency using a predictive approach
    • J. Griffioen and R. Appleton. Reducing file system latency using a predictive approach. In USENIX, pages 13-13, 1994.
    • (1994) USENIX , pp. 13-13
    • Griffioen, J.1    Appleton, R.2
  • 17
    • 78650029832 scopus 로고    scopus 로고
    • Report on the experimental language X10
    • IBM. Report on the experimental language X10. http://x10.codehaus.org/, 2006.
    • (2006) IBM
  • 18
    • 0030710004 scopus 로고    scopus 로고
    • Data access reorganizations in compiling out-of-core data parallel programs on distributed memory machines
    • M. Kandemir, R. Bordawekar, and A. Choudhary. Data access reorganizations in compiling out-of-core data parallel programs on distributed memory machines. In IPPS, pages 559-564, 1997.
    • (1997) IPPS , pp. 559-564
    • Kandemir, M.1    Bordawekar, R.2    Choudhary, A.3
  • 20
    • 4444263176 scopus 로고    scopus 로고
    • A study of source-level compiler algorithms for automatic construction of pre-execution code
    • D. Kim and D. Yeung. A study of source-level compiler algorithms for automatic construction of pre-execution code. ACM TOCS, 22(3):326-379, 2004.
    • (2004) ACM TOCS , vol.22 , Issue.3 , pp. 326-379
    • Kim, D.1    Yeung, D.2
  • 21
    • 0028734944 scopus 로고
    • Dynamic file-access characteristics of a production parallel scientific workload
    • D. Kotz and N. Nieuwejaar. Dynamic file-access characteristics of a production parallel scientific workload. In SC, pages 640-649, 1994.
    • (1994) SC , pp. 640-649
    • Kotz, D.1    Nieuwejaar, N.2
  • 22
    • 0025413929 scopus 로고
    • Prefetching in file systems for MIMD multiprocessors
    • D. F. Kotz and C. S. Ellis. Prefetching in file systems for MIMD multiprocessors. IEEE TPDS, 1(2):218-230, 1990.
    • (1990) IEEE TPDS , vol.1 , Issue.2 , pp. 218-230
    • Kotz, D.F.1    Ellis, C.S.2
  • 23
    • 85084160069 scopus 로고    scopus 로고
    • Design and implementation of a predictive file prefetching algorithm
    • T. M. Kroeger and D. D. E. Long. Design and implementation of a predictive file prefetching algorithm. In USENIX, pages 105-118, 2001.
    • (2001) USENIX , pp. 105-118
    • Kroeger, T.M.1    Long, D.D.E.2
  • 24
    • 34548020997 scopus 로고    scopus 로고
    • Competitive prefetching for concurrent sequential I/O
    • C. Li, K. Shen, and A. Papathanasiou. Competitive prefetching for concurrent sequential I/O. In EuroSys, pages 189-202, 2007.
    • (2007) EuroSys , pp. 189-202
    • Li, C.1    Shen, K.2    Papathanasiou, A.3
  • 25
    • 84890088494 scopus 로고    scopus 로고
    • Second-tier cache management using write hints
    • X. Li, A. Aboulnaga, K. Salem, A. Sachedina, and S. Gao. Second-tier cache management using write hints. In FAST, pages 115-128, 2005.
    • (2005) FAST , pp. 115-128
    • Li, X.1    Aboulnaga, A.2    Salem, K.3    Sachedina, A.4    Gao, S.5
  • 26
    • 34548726061 scopus 로고    scopus 로고
    • Implementation and evaluation of client-side file caching for MPI-IO
    • W.-K. Liao, A. Ching, K. Coloma, A. Choudhary, and L. Ward. Implementation and evaluation of client-side file caching for MPI-IO. In IPDPS, pages 1-10, 2007.
    • (2007) IPDPS , pp. 1-10
    • Liao, W.-K.1    Ching, A.2    Coloma, K.3    Choudhary, A.4    Ward, L.5
  • 27
    • 0036669170 scopus 로고    scopus 로고
    • Learning to classify parallel input/output access patterns
    • T. Madhyastha and D. Reed. Learning to classify parallel input/output access patterns. IEEE TPDS, 13(8):802-813, 2002.
    • (2002) IEEE TPDS , vol.13 , Issue.8 , pp. 802-813
    • Madhyastha, T.1    Reed, D.2
  • 28
    • 78650033446 scopus 로고    scopus 로고
    • The Fortress language specification version 1.0
    • S. Microsystems. The Fortress language specification version 1.0. http://research.sun.com/projects/plrg/fortress.pdf, 2008.
    • (2008) S. Microsystems
  • 29
    • 0026287511 scopus 로고
    • Input/output behavior of supercomputing applications
    • E. Miller and R. Katz. Input/output behavior of supercomputing applications. In SC, pages 567-576, 1991.
    • (1991) SC , pp. 567-576
    • Miller, E.1    Katz, R.2
  • 30
    • 85088074507 scopus 로고    scopus 로고
    • Automatic compiler-inserted I/O prefetching for out-of-core applications
    • T. Mowry, A. Demke, and O. Krieger. Automatic compiler-inserted I/O prefetching for out-of-core applications. In OSDI, pages 3-17, 1996.
    • (1996) OSDI , pp. 3-17
    • Mowry, T.1    Demke, A.2    Krieger, O.3
  • 31
    • 0030259496 scopus 로고    scopus 로고
    • File-access characteristics of parallel scientific workloads
    • N. Nieuwejaar, D. Kotz, A. Purakayastha, C. S. Ellis, and M. Best. File-access characteristics of parallel scientific workloads. IEEE TPDS, 7(10):1075-1089, 1996.
    • (1996) IEEE TPDS , vol.7 , Issue.10 , pp. 1075-1089
    • Nieuwejaar, N.1    Kotz, D.2    Purakayastha, A.3    Ellis, C.S.4    Best, M.5
  • 32
    • 84893311843 scopus 로고    scopus 로고
    • Energy efficient prefetching and caching
    • A. Papathanasiou and M. Scott. Energy efficient prefetching and caching. In USENIX, pages 255-268, 2004.
    • (2004) USENIX , pp. 255-268
    • Papathanasiou, A.1    Scott, M.2
  • 35
    • 33751105800 scopus 로고    scopus 로고
    • MPI-IO/GPFS, an optimized implementation of MPI-IO on top of GPFS
    • J.-P. Prost, R. Treumann, R. Hedges, B. Jia, and A. Koniges. MPI-IO/GPFS, an optimized implementation of MPI-IO on top of GPFS. SC, pages 58-58, 2001.
    • (2001) SC , pp. 58-58
    • Prost, J.-P.1    Treumann, R.2    Hedges, R.3    Jia, B.4    Koniges, A.5
  • 36
    • 57349148140 scopus 로고    scopus 로고
    • Enabling cross-layer optimizations in storage systems with custom metadata
    • E. Santos-Neto, S. Al-Kiswany, N. Andrade, S. Gopalakrishnan, and M. Ripeanu. Enabling cross-layer optimizations in storage systems with custom metadata. In HPDC, pages 213-216, 2008.
    • (2008) HPDC , pp. 213-216
    • Santos-Neto, E.1    Al-Kiswany, S.2    Andrade, N.3    Gopalakrishnan, S.4    Ripeanu, M.5
  • 37
    • 85086687574 scopus 로고    scopus 로고
    • Efficient cooperative caching using hints
    • P. Sarkar and J. Hartman. Efficient cooperative caching using hints. In OSDI, pages 35-46, 1996.
    • (1996) OSDI , pp. 35-46
    • Sarkar, P.1    Hartman, J.2
  • 38
    • 0038023772 scopus 로고    scopus 로고
    • Hint-based cooperative caching
    • P. Sarkar and J. H. Hartman. Hint-based cooperative caching. ACM TOCS, 18(4):387-419, 2000.
    • (2000) ACM TOCS , vol.18 , Issue.4 , pp. 387-419
    • Sarkar, P.1    Hartman, J.H.2
  • 40
    • 63549141550 scopus 로고    scopus 로고
    • Profiler and compiler assisted adaptive I/O prefetching for shared storage caches
    • S. W. Son, S. P. Muralidhara, O. Ozturk, M. Kandemir, I. Kolcu, and M. Karakoy. Profiler and compiler assisted adaptive I/O prefetching for shared storage caches. In PACT, pages 112-121, 2008.
    • (2008) PACT , pp. 112-121
    • Son, S.W.1    Muralidhara, S.P.2    Ozturk, O.3    Kandemir, M.4    Kolcu, I.5    Karakoy, M.6
  • 42
    • 85029696725 scopus 로고    scopus 로고
    • Data sieving and collective I/O in ROMIO
    • R. Thakur, W. Gropp, and E. Lusk. Data sieving and collective I/O in ROMIO. In FRONTIERS, page 182, 1999.
    • (1999) FRONTIERS , pp. 182
    • Thakur, R.1    Gropp, W.2    Lusk, E.3
  • 43
    • 0026881152 scopus 로고
    • Improving disk cache hit-ratios through cache partitioning
    • D. Thiébaut, H. S. Stone, and J. L. Wolf. Improving disk cache hit-ratios through cache partitioning. IEEE TOC, 41(6):665-676, 1992.
    • (1992) IEEE TOC , vol.41 , Issue.6 , pp. 665-676
    • Thiébaut, D.1    Stone, H.S.2    Wolf, J.L.3
  • 44
    • 0031164230 scopus 로고    scopus 로고
    • Informed multi-process prefetching and caching
    • A. Tomkins, H. Patterson, and G. Gibson. Informed multi-process prefetching and caching. In SIGMETRICS, pages 100-114, 1997.
    • (1997) SIGMETRICS , pp. 100-114
    • Tomkins, A.1    Patterson, H.2    Gibson, G.3
  • 45
    • 0001589803 scopus 로고    scopus 로고
    • Data prefetch mechanisms
    • S. Vanderwiel and D. Lilja. Data prefetch mechanisms. ACM CSUR, 32(2):174-199, 2000.
    • (2000) ACM CSUR , vol.32 , Issue.2 , pp. 174-199
    • Vanderwiel, S.1    Lilja, D.2
  • 46
    • 78650001539 scopus 로고    scopus 로고
    • Kernel-level caching for optimizing I/O by exploiting inter-application data sharing
    • M. Vilayannur, M. Kandemir, and A. Sivasubramaniam. Kernel-level caching for optimizing I/O by exploiting inter-application data sharing. Cluster Computing, 0:425, 2002.
    • (2002) Cluster Computing , pp. 425
    • Vilayannur, M.1    Kandemir, M.2    Sivasubramaniam, A.3
  • 47
    • 84889647199 scopus 로고    scopus 로고
    • Argon: Performance insulation for shared storage servers
    • M. Wachs, M. Abd-El-Malek, E. Thereska, and G. R. Ganger. Argon: performance insulation for shared storage servers. In FAST, pages 61-76, 2007.
    • (2007) FAST , pp. 61-76
    • Wachs, M.1    Abd-El-Malek, M.2    Thereska, E.3    Ganger, G.R.4
  • 49
    • 51849164886 scopus 로고    scopus 로고
    • MC2: Multiple clients on a multilevel cache
    • G. Yadgar, M. Factor, K. Li, and A. Schuster. MC2: Multiple clients on a multilevel cache. In ICDCS, pages 722-730, 2008.
    • (2008) ICDCS , pp. 722-730
    • Yadgar, G.1    Factor, M.2    Li, K.3    Schuster, A.4
  • 50
    • 82655174090 scopus 로고    scopus 로고
    • Karma: Know-it-all replacement for a multilevel cache
    • G. Yadgar, M. Factor, and A. Schuster. Karma: Know-it-all replacement for a multilevel cache. In FAST, pages 169-184, 2006.
    • (2006) FAST , pp. 169-184
    • Yadgar, G.1    Factor, M.2    Schuster, A.3
  • 51
    • 85084162062 scopus 로고    scopus 로고
    • A decoupled architecture for application-specific file prefetching
    • C.-K. Yang, T. Mitra, and T.-C. Chiueh. A decoupled architecture for application-specific file prefetching. In USENIX, pages 157-170, 2002.
    • (2002) USENIX , pp. 157-170
    • Yang, C.-K.1    Mitra, T.2    Chiueh, T.-C.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.