Download - perc.nersc

Transcript
Page 1: perc.nersc

http://perc.nersc.gov

Performance Science and Engineering

Measuring memoryhierarchy performance

Analytic Performance Bounds for a PETSc Kernel

Mfl

op/s

0

100

200

300

400

500

600

700

800

900

SP Origin T3E Pentium Ultra II

Theoretical Peak Mem BW PeakOper. Issue Peak Observed

Bounding performancebased on fundamental

application characteristics

Block#

ProcedureName

Memory

Ref. Mem. Ref. %

L1 hit

Rate

L2 hit

RateRatio

Random

Memory Bandwidth

WeightedBandwidth

180155 dgemv_n 4.82E+09 0.9198 93.47 93.48 0.07 4166.0 3831.7

180159 dgemv_n 1.42E+08 0.0271 90.33 90.39 0.00 1809.2 49.1

180160 dgemv_n 1.22E+08 0.0232 94.81 99.89 0.00 5561.3 129.3

5885 MatSetValues 6.56E+07 0.0125 77.32 90.00 0.20 1522.6 19.0

MAPS for TCSini for random and non-random loads

Block #

Bandwidth180155180159180160

5885

2831.749.1

129.319.0

Convolvingapplication &

machine topredict

performance

Compilerframeworkto optimizehigh-level

abstractions

Infrastructurefor accessing

hardwareperformance

monitors

Infrastructurefor dynamic

instrumentation

Tools formeasuring &understanding

applicationperformance

ENABLING TECHNOLOGIESC O N V O L U T I O

N S

Enhanced Simulations

& Experiments

ApplicationSignatures

MachineSignatures

BoundModels

PhaseModels

PAPI Sigma++ DynInst

Scientific Simulations

& ExperimentsROSE SvPablo Tau

CacheSimulator

PredictionTool

MemoryRef Tool

dumpMap .addr

sourcefiles

.lstfiles

tracefiles

ProgramExecution

InstrumentedbinarySigma

Compile/Link

Infrastructurefor capturing& analyzing

memory accesses

Lawrence LivermoreNational LaboratoryDan QuinlanBronis de Supinski

Universityof Maryland

Jeff Hollingsworth

Oak RidgeNational LaboratoryPatrick Worley Jeffrey Vetter

San DiegoSupercomputing Center

Allan Snavely

University ofNorth Carolina

Dan Reed

ArgonneNational LaboratoryPaul HovlandBoyana Norris

Universityof Tennessee

Jack Dongarra

Lawrence BerkeleyNational LaboratoryDavid BaileyErich Strohmaier

Primary participants:

Supplementaryparticipants:

Technical Universityof Catalonia

Jesús Labarta

Los AlamosNational LaboratoryAdolfy HoisieHarvey Wasserman

Portland StateUniversity

Karen Karavanic

Universityof Oregon

Allen Malony

RiceUniversity

J. Mellor-Crummey

Universityof Wisconsin

Barton P. Miller

Thomas J. Watson Research CenterPO Box 218Yorktown Heights, NY 10598

IBMResearch

Luiz DeRose