mirror of
https://github.com/ClusterCockpit/cc-metric-collector.git
synced 2024-11-14 22:17:26 +01:00
51 lines
2.1 KiB
Plaintext
51 lines
2.1 KiB
Plaintext
|
SHORT Overview of arithmetic and memory performance
|
||
|
|
||
|
EVENTSET
|
||
|
FIXC0 INSTR_RETIRED_ANY
|
||
|
FIXC1 CPU_CLK_UNHALTED_CORE
|
||
|
FIXC2 CPU_CLK_UNHALTED_REF
|
||
|
PMC0 FP_COMP_OPS_EXE_SSE_FP_PACKED
|
||
|
PMC1 FP_COMP_OPS_EXE_SSE_FP_SCALAR
|
||
|
PMC2 FP_COMP_OPS_EXE_SSE_SINGLE_PRECISION
|
||
|
PMC3 FP_COMP_OPS_EXE_SSE_DOUBLE_PRECISION
|
||
|
UPMC0 UNC_QMC_NORMAL_READS_ANY
|
||
|
UPMC1 UNC_QMC_WRITES_FULL_ANY
|
||
|
UPMC2 UNC_QHL_REQUESTS_REMOTE_READS
|
||
|
UPMC3 UNC_QHL_REQUESTS_LOCAL_READS
|
||
|
UPMC4 UNC_QHL_REQUESTS_REMOTE_WRITES
|
||
|
|
||
|
METRICS
|
||
|
Runtime (RDTSC) [s] time
|
||
|
Runtime unhalted [s] FIXC1*inverseClock
|
||
|
Clock [MHz] 1.E-06*(FIXC1/FIXC2)/inverseClock
|
||
|
CPI FIXC1/FIXC0
|
||
|
DP [MFLOP/s] (DP assumed) 1.0E-06*(PMC0*2.0+PMC1)/time
|
||
|
SP [MFLOP/s] (SP assumed) 1.0E-06*(PMC0*4.0+PMC1)/time
|
||
|
Packed [MUOPS/s] 1.0E-06*PMC0/time
|
||
|
Scalar [MUOPS/s] 1.0E-06*PMC1/time
|
||
|
SP [MUOPS/s] 1.0E-06*PMC2/time
|
||
|
DP [MUOPS/s] 1.0E-06*PMC3/time
|
||
|
Memory bandwidth [MBytes/s] 1.0E-06*(UPMC0+UPMC1)*64/time
|
||
|
Memory data volume [GBytes] 1.0E-09*(UPMC0+UPMC1)*64
|
||
|
Remote Read BW [MBytes/s] 1.0E-06*(UPMC2)*64/time
|
||
|
Remote Write BW [MBytes/s] 1.0E-06*(UPMC4)*64/time
|
||
|
Remote BW [MBytes/s] 1.0E-06*(UPMC2+UPMC4)*64/time
|
||
|
|
||
|
LONG
|
||
|
Formulas:
|
||
|
DP [MFLOP/s] = (FP_COMP_OPS_EXE_SSE_FP_PACKED*2 + FP_COMP_OPS_EXE_SSE_FP_SCALAR)/ runtime
|
||
|
SP [MFLOP/s] = (FP_COMP_OPS_EXE_SSE_FP_PACKED*4 + FP_COMP_OPS_EXE_SSE_FP_SCALAR)/ runtime
|
||
|
Packed [MUOPS/s] = 1.0E-06*FP_COMP_OPS_EXE_SSE_FP_PACKED/time
|
||
|
Scalar [MUOPS/s] = 1.0E-06*FP_COMP_OPS_EXE_SSE_FP_SCALAR/time
|
||
|
SP [MUOPS/s] = 1.0E-06*FP_COMP_OPS_EXE_SSE_SINGLE_PRECISION/time
|
||
|
DP [MUOPS/s] = 1.0E-06*FP_COMP_OPS_EXE_SSE_DOUBLE_PRECISION/time
|
||
|
Memory bandwidth [MBytes/s] = 1.0E-06*(UNC_QMC_NORMAL_READS_ANY+UNC_QMC_WRITES_FULL_ANY)*64/time
|
||
|
Memory data volume [GBytes] = 1.0E-09*(UNC_QMC_NORMAL_READS_ANY+UNC_QMC_WRITES_FULL_ANY)*64
|
||
|
Remote Read BW [MBytes/s] = 1.0E-06*(UNC_QHL_REQUESTS_REMOTE_READS)*64/time
|
||
|
Remote Write BW [MBytes/s] = 1.0E-06*(UNC_QHL_REQUESTS_REMOTE_WRITES)*64/time
|
||
|
Remote BW [MBytes/s] = 1.0E-06*(UNC_QHL_REQUESTS_REMOTE_READS+UNC_QHL_REQUESTS_REMOTE_WRITES)*64/time
|
||
|
-
|
||
|
This is a overview group using the capabilities of Westmere to measure multiple events at
|
||
|
the same time.
|
||
|
|