-
1
-
-
77950611743
-
HPCToolkit: Tools for performance analysis of optimized parallel programs
-
April
-
L. Adhianto, S. Banerjee, M. Fagan, M. Krentel, G. Marin, J. Mellor-Crummey, and N. R. Tallent. HPCToolkit: Tools for performance analysis of optimized parallel programs. http://hpctoolkit.org. Concurr. Comput.: Pract. Exper., 22:685-701, April 2010.
-
(2010)
Concurr. Comput.: Pract. Exper.
, vol.22
, pp. 685-701
-
-
Adhianto, L.1
Banerjee, S.2
Fagan, M.3
Krentel, M.4
Marin, G.5
Mellor-Crummey, J.6
Tallent, N.R.7
-
2
-
-
32844456410
-
Online performance analysis by statistical sampling of microprocessor performance counters
-
ACM
-
R. Azimi, M. Stumm, and R. Wisniewski. Online performance analysis by statistical sampling of microprocessor performance counters. In Proc. of International Conference on Supercomputing, pages 101-110. ACM, 2005.
-
(2005)
Proc. of International Conference on Supercomputing
, pp. 101-110
-
-
Azimi, R.1
Stumm, M.2
Wisniewski, R.3
-
3
-
-
0032186636
-
Parsec: A parallel simulation environment for complex systems
-
R. Bagrodia, R. Meyer, M. Takai, Y. Chen, X. Zeng, J. Martin, and H. Song. Parsec: A parallel simulation environment for complex systems. Computer, 31(10):77-85, 1998.
-
(1998)
Computer
, vol.31
, Issue.10
, pp. 77-85
-
-
Bagrodia, R.1
Meyer, R.2
Takai, M.3
Chen, Y.4
Zeng, X.5
Martin, J.6
Song, H.7
-
4
-
-
66749161432
-
Coordinated management of multiple interacting resources in chip multiprocessors: A machine learning approach
-
IEEE Computer Society
-
R. Bitirgen, E. Ipek, and J. Martinez. Coordinated management of multiple interacting resources in chip multiprocessors: A machine learning approach. In Proc. of IEEE/ACM International Symposium on Microarchitecture, pages 318-329. IEEE Computer Society, 2008.
-
(2008)
Proc. of IEEE/ACM International Symposium on Microarchitecture
, pp. 318-329
-
-
Bitirgen, R.1
Ipek, E.2
Martinez, J.3
-
6
-
-
0035478854
-
Random forests
-
L. Breiman. Random forests. Machine Learning, 45(1):5-32, 2001.
-
(2001)
Machine Learning
, vol.45
, Issue.1
, pp. 5-32
-
-
Breiman, L.1
-
7
-
-
0034268943
-
A portable programming interface for performance evaluation on modern processors
-
S. Browne, J. Dongarra, N. Garner, G. Ho, and P. Mucci. A portable programming interface for performance evaluation on modern processors. International Journal of High Performance Computing Applications, 14(3), 2000.
-
(2000)
International Journal of High Performance Computing Applications
, vol.14
, Issue.3
-
-
Browne, S.1
Dongarra, J.2
Garner, N.3
Ho, G.4
Mucci, P.5
-
8
-
-
84877042382
-
A scalable cross-platform infrastructure for application performance tuning using hardware counters
-
IEEE Computer Society
-
S. Browne, J. Dongarra, N. Garner, K. London, and P. Mucci. A scalable cross-platform infrastructure for application performance tuning using hardware counters. In Proc. of ACM/IEEE Conference on Supercomputing. IEEE Computer Society, 2000.
-
(2000)
Proc. of ACM/IEEE Conference on Supercomputing
-
-
Browne, S.1
Dongarra, J.2
Garner, N.3
London, K.4
Mucci, P.5
-
9
-
-
78650804470
-
PerfExpert: An easy-to-use performance diagnosis tool for HPC applications
-
IEEE Computer Society
-
M. Burtscher, B.-D. Kim, J. Diamond, J. McCalpin, L. Koesterke, and J. Browne. PerfExpert: An easy-to-use performance diagnosis tool for HPC applications. In Proc. of International Conference for High Performance Computing, Networking, Storage and Analysis, SC '10, pages 1-11. IEEE Computer Society, 2010.
-
(2010)
Proc. of International Conference for High Performance Computing, Networking, Storage and Analysis, SC '10
, pp. 1-11
-
-
Burtscher, M.1
Kim, B.-D.2
Diamond, J.3
McCalpin, J.4
Koesterke, L.5
Browne, J.6
-
10
-
-
77955299196
-
Automatic phase detection and structure extraction of MPI applications
-
August
-
M. Casas, R. M. Badia, and J. Labarta. Automatic phase detection and structure extraction of MPI applications. Int. J. High Perform. Comput. Appl., 24:335-360, August 2010.
-
(2010)
Int. J. High Perform. Comput. Appl.
, vol.24
, pp. 335-360
-
-
Casas, M.1
Badia, R.M.2
Labarta, J.3
-
11
-
-
0002607026
-
Bayesian classification (AutoClass): Theory and results
-
American Association for Artificial Intelligence
-
P. Cheeseman and J. Stutz. Bayesian classification (AutoClass): Theory and results. In Advances in Knowledge Discovery and Data Mining. American Association for Artificial Intelligence, 1996.
-
(1996)
Advances in Knowledge Discovery and Data Mining
-
-
Cheeseman, P.1
Stutz, J.2
-
12
-
-
51649112844
-
Prediction-based power-performance adaptation of multithreaded scientific codes
-
M. Curtis-Maury, F. Blagojevic, C. Antonopoulos, and D. Nikolopoulos. Prediction-based power-performance adaptation of multithreaded scientific codes. IEEE Transactions on Parallel and Distributed Systems, pages 1396-1410, 2008.
-
(2008)
IEEE Transactions on Parallel and Distributed Systems
, pp. 1396-1410
-
-
Curtis-Maury, M.1
Blagojevic, F.2
Antonopoulos, C.3
Nikolopoulos, D.4
-
15
-
-
77950606394
-
Automatic performance analysis with Periscope
-
M. Gerndt and M. Ott. Automatic performance analysis with Periscope. Concurr. Comput.: Pract. Exper., 22:736-748, 2010.
-
(2010)
Concurr. Comput.: Pract. Exper.
, vol.22
, pp. 736-748
-
-
Gerndt, M.1
Ott, M.2
-
17
-
-
85065703189
-
Correlation-based feature selection for discrete and numeric class machine learning
-
M. A. Hall. Correlation-based feature selection for discrete and numeric class machine learning. In Proc. of International Conference on Machine Learning, pages 359-366, 2000.
-
(2000)
Proc. of International Conference on Machine Learning
, pp. 359-366
-
-
Hall, M.A.1
-
18
-
-
34547466206
-
Mercury and Freon: Temperature emulation and management for server systems
-
ACM
-
T. Heath, A. Centeno, P. George, L. Ramos, Y. Jaluria, and R. Bianchini. Mercury and Freon: Temperature emulation and management for server systems. In Proc. of International Conference on Architectural Support for Programming Languages and Operating Systems, pages 106-116. ACM, 2006.
-
(2006)
Proc. of International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 106-116
-
-
Heath, T.1
Centeno, A.2
George, P.3
Ramos, L.4
Jaluria, Y.5
Bianchini, R.6
-
20
-
-
0001815269
-
Constructing optimal binary decision trees is NP-complete
-
L. Hyafil and R. Rivest. Constructing optimal binary decision trees is NP-complete. Information Processing Letters, 5(1):15-17, 1976.
-
(1976)
Information Processing Letters
, vol.5
, Issue.1
, pp. 15-17
-
-
Hyafil, L.1
Rivest, R.2
-
21
-
-
79959840009
-
-
Web site
-
Intel. VTune Amplifier XE. Web site: www.intel.com/software/products/vtune, 2011.
-
(2011)
VTune Amplifier XE
-
-
-
22
-
-
34548776288
-
PerfSuite: An accessible, open source performance analysis environment for Linux
-
R. Kufrin. PerfSuite: An accessible, open source performance analysis environment for Linux. In The International Conference on Linux Clusters, volume 151, 2005.
-
(2005)
The International Conference on Linux Clusters
, vol.151
-
-
Kufrin, R.1
-
24
-
-
33745304805
-
Pin: Building customized program analysis tools with dynamic instrumentation
-
C. Luk, R. Cohn, R. Muth, H. Patil, A. Klauser, G. Lowney, S. Wallace, V. Reddi, and K. Hazelwood. Pin: building customized program analysis tools with dynamic instrumentation. ACM SIGPLAN Notices, 40(6):190-200, 2005.
-
(2005)
ACM SIGPLAN Notices
, vol.40
, Issue.6
, pp. 190-200
-
-
Luk, C.1
Cohn, R.2
Muth, R.3
Patil, H.4
Klauser, A.5
Lowney, G.6
Wallace, S.7
Reddi, V.8
Hazelwood, K.9
-
25
-
-
0001087280
-
Hyper-threading technology architecture and microarchitecture
-
D. T. Marr, F. Binns, D. L. Hill, G. Hinton, D. A. Koufaty, J. A. Miller, and M. Upton. Hyper-threading technology architecture and microarchitecture. Intel Technology Journal, 6(1):1-12, 2002.
-
(2002)
Intel Technology Journal
, vol.6
, Issue.1
, pp. 1-12
-
-
Marr, D.T.1
Binns, F.2
Hill, D.L.3
Hinton, G.4
Koufaty, D.A.5
Miller, J.A.6
Upton, M.7
-
26
-
-
77954607799
-
Resource-conscious scheduling for energy efficiency on multicore processors
-
ACM
-
A. Merkel, J. Stoess, and F. Bellosa. Resource-conscious scheduling for energy efficiency on multicore processors. In Proc. of European Conference on Computer Systems, EuroSys, pages 153-166. ACM, 2010.
-
(2010)
Proc. of European Conference on Computer Systems, EuroSys
, pp. 153-166
-
-
Merkel, A.1
Stoess, J.2
Bellosa, F.3
-
27
-
-
0002438680
-
Vampir: Visualization and analysis of MPI resources
-
W. E. Nagel, A. Arnold, M. Weber, H.-C. Hoppe, and K. Solchenbach. Vampir: Visualization and analysis of MPI resources. Supercomputer, 12:69-80, 1996.
-
(1996)
Supercomputer
, vol.12
, pp. 69-80
-
-
Nagel, W.E.1
Arnold, A.2
Weber, M.3
Hoppe, H.-C.4
Solchenbach, K.5
-
28
-
-
36949001762
-
Using model trees for computer architecture performance analysis of software applications
-
0
-
E. Ould-Ahmed-Vall, J. Woodlee, C. Yount, K. Doshi, and S. Abraham. Using model trees for computer architecture performance analysis of software applications. IEEE International Symposium on Performance Analysis of Systems and Software, 0:116-125, 2007.
-
(2007)
IEEE International Symposium on Performance Analysis of Systems and Software
, pp. 116-125
-
-
Ould-Ahmed-Vall, E.1
Woodlee, J.2
Yount, C.3
Doshi, K.4
Abraham, S.5
-
29
-
-
33751095034
-
Paraver: A tool to visualize and analyze parallel code
-
V. Pillet, J. Labarta, T. Cortes, and S. Girona. Paraver: A tool to visualize and analyze parallel code. In WOTUG-18, pages 17-31, 1995.
-
(1995)
WOTUG-18
, pp. 17-31
-
-
Pillet, V.1
Labarta, J.2
Cortes, T.3
Girona, S.4
-
30
-
-
33744584654
-
Induction of decision trees
-
J. Quinlan. Induction of decision trees. Machine Learning, 1(1):81-106, 1986.
-
(1986)
Machine Learning
, vol.1
, Issue.1
, pp. 81-106
-
-
Quinlan, J.1
-
32
-
-
0024627518
-
Inferring decision trees using the minimum description length principle
-
J. R. Quinlan and R. L. Rivest. Inferring decision trees using the minimum description length principle. Inf. Comput., 80:227-248, 1989.
-
(1989)
Inf. Comput.
, vol.80
, pp. 227-248
-
-
Quinlan, J.R.1
Rivest, R.L.2
-
36
-
-
77957769762
-
Hardware counter driven on-the-fly request signatures
-
K. Shen, M. Zhong, S. Dwarkadas, C. Li, C. Stewart, and X. Zhang. Hardware counter driven on-the-fly request signatures. ACM SIGOPS Operating Systems Review, 42(2), 2008.
-
(2008)
ACM SIGOPS Operating Systems Review
, vol.42
, Issue.2
-
-
Shen, K.1
Zhong, M.2
Dwarkadas, S.3
Li, C.4
Stewart, C.5
Zhang, X.6
-
39
-
-
0029200683
-
Simultaneous multithreading: Maximizing on-chip parallelism
-
ACM
-
D. M. Tullsen, S. J. Eggers, and H. M. Levy. Simultaneous multithreading: maximizing on-chip parallelism. In Proc. of International Symposium on Computer Architecture, ISCA '95, pages 392-403. ACM, 1995.
-
(1995)
Proc. of International Symposium on Computer Architecture, ISCA '95
, pp. 392-403
-
-
Tullsen, D.M.1
Eggers, S.J.2
Levy, H.M.3
-
40
-
-
0032594959
-
An overview of statistical learning theory
-
V. Vapnik. An overview of statistical learning theory. Neural Networks, IEEE Transactions on, 10(5):988-999, 1999.
-
(1999)
Neural Networks, IEEE Transactions on
, vol.10
, Issue.5
, pp. 988-999
-
-
Vapnik, V.1
-
41
-
-
0033691589
-
Performance analysis of distributed applications using automatic classification of communication inefficiencies
-
ACM
-
J. Vetter. Performance analysis of distributed applications using automatic classification of communication inefficiencies. In Proc. of International Conference on Supercomputing, pages 245-254. ACM, 2000.
-
(2000)
Proc. of International Conference on Supercomputing
, pp. 245-254
-
-
Vetter, J.1
-
43
-
-
72249121870
-
Detecting large-scale system problems by mining console logs
-
ACM
-
W. Xu, L. Huang, A. Fox, D. Patterson, and M. Jordan. Detecting large-scale system problems by mining console logs. In Proc. of ACM Symposium on Operating Systems Principles, pages 117-132. ACM, 2009.
-
(2009)
Proc. of ACM Symposium on Operating Systems Principles
, pp. 117-132
-
-
Xu, W.1
Huang, L.2
Fox, A.3
Patterson, D.4
Jordan, M.5
-
44
-
-
85017232404
-
Automated Fingerprinting of Performance Pathologies Using Performance Monitoring Units (PMUs)
-
USENIX Association
-
W. Yoo, K. Larson, S. Kim, W. Ahn, R. Campbell, and L. Baugh. Automated Fingerprinting of Performance Pathologies Using Performance Monitoring Units (PMUs). In Proc. of USENIX Workshop on Hot Topics in Parallelism. USENIX Association, 2011.
-
(2011)
Proc. of USENIX Workshop on Hot Topics in Parallelism
-
-
Yoo, W.1
Larson, K.2
Kim, S.3
Ahn, W.4
Campbell, R.5
Baugh, L.6
-
45
-
-
0032593334
-
Toward scalable performance visualization with Jumpshot
-
O. Zaki, E. Lusk, W. Gropp, and D. Swider. Toward scalable performance visualization with Jumpshot. International Journal of High Performance Computing Applications, 13(3), 1999.
-
(1999)
International Journal of High Performance Computing Applications
, vol.13
, Issue.3
-
-
Zaki, O.1
Lusk, E.2
Gropp, W.3
Swider, D.4