mirror of
https://github.com/ClusterCockpit/cc-metric-collector.git
synced 2025-08-03 01:50:34 +02:00
Add likwid collector
35
collectors/likwid/groups/westmereEX/FLOPS_SP.txt
Normal file
@@ -0,0 +1,35 @@
SHORT Single Precision MFLOP/s

EVENTSET
FIXC0 INSTR_RETIRED_ANY
FIXC1 CPU_CLK_UNHALTED_CORE
FIXC2 CPU_CLK_UNHALTED_REF
PMC0 FP_COMP_OPS_EXE_SSE_FP_PACKED
PMC1 FP_COMP_OPS_EXE_SSE_FP_SCALAR
PMC2 FP_COMP_OPS_EXE_SSE_SINGLE_PRECISION
PMC3 FP_COMP_OPS_EXE_SSE_DOUBLE_PRECISION

METRICS
Runtime (RDTSC) [s] time
Runtime unhalted [s] FIXC1*inverseClock
Clock [MHz] 1.E-06*(FIXC1/FIXC2)/inverseClock
CPI FIXC1/FIXC0
SP [MFLOP/s] 1.0E-06*(PMC0*4.0+PMC1)/time
Packed [MUOPS/s] 1.0E-06*PMC0/time
Scalar [MUOPS/s] 1.0E-06*PMC1/time
SP [MUOPS/s] 1.0E-06*PMC2/time
DP [MUOPS/s] 1.0E-06*PMC3/time

LONG
Formulas:
SP [MFLOP/s] = 1.0E-06*(FP_COMP_OPS_EXE_SSE_FP_PACKED*4+FP_COMP_OPS_EXE_SSE_FP_SCALAR)/runtime
Packed [MUOPS/s] = 1.0E-06*FP_COMP_OPS_EXE_SSE_FP_PACKED/runtime
Scalar [MUOPS/s] = 1.0E-06*FP_COMP_OPS_EXE_SSE_FP_SCALAR/runtime
SP [MUOPS/s] = 1.0E-06*FP_COMP_OPS_EXE_SSE_SINGLE_PRECISION/runtime
DP [MUOPS/s] = 1.0E-06*FP_COMP_OPS_EXE_SSE_DOUBLE_PRECISION/runtime
-
Westmere EX cannot measure MFLOP/s when single and double precision calculations are mixed.
Therefore both single and double precision operations are measured to ensure the correctness
of the measurements. You can check whether your code was vectorized by comparing the count of
FP_COMP_OPS_EXE_SSE_FP_PACKED with that of FP_COMP_OPS_EXE_SSE_FP_SCALAR.
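As an illustration only, the METRICS formulas of this group can be evaluated by hand as in the following Python sketch. It is not part of the commit or of LIKWID: the function names, the dictionary layout, and the sample counter values are hypothetical, and `inverse_clock` is assumed to be the reciprocal of the nominal clock frequency in Hz. The factor 4.0 reflects that one packed SSE single precision operation works on four values.

```python
def flops_sp_metrics(counts, time, inverse_clock):
    """Evaluate the FLOPS_SP METRICS formulas for one measurement interval.

    counts: raw counter values keyed by counter name (FIXC0..FIXC2, PMC0..PMC3)
    time: measured runtime in seconds
    inverse_clock: assumed to be 1 / nominal clock frequency in Hz
    """
    return {
        "Runtime (RDTSC) [s]": time,
        "Runtime unhalted [s]": counts["FIXC1"] * inverse_clock,
        "Clock [MHz]": 1.0e-06 * (counts["FIXC1"] / counts["FIXC2"]) / inverse_clock,
        "CPI": counts["FIXC1"] / counts["FIXC0"],
        # Packed operations are weighted by 4, scalar operations by 1.
        "SP [MFLOP/s]": 1.0e-06 * (counts["PMC0"] * 4.0 + counts["PMC1"]) / time,
        "Packed [MUOPS/s]": 1.0e-06 * counts["PMC0"] / time,
        "Scalar [MUOPS/s]": 1.0e-06 * counts["PMC1"] / time,
        "SP [MUOPS/s]": 1.0e-06 * counts["PMC2"] / time,
        "DP [MUOPS/s]": 1.0e-06 * counts["PMC3"] / time,
    }


def packed_fraction(counts):
    """Share of packed uops among packed + scalar uops (0.0 if none were counted)."""
    total = counts["PMC0"] + counts["PMC1"]
    return counts["PMC0"] / total if total else 0.0


if __name__ == "__main__":
    # Hypothetical counter values for a one-second interval on a 2 GHz core.
    sample = {
        "FIXC0": 4.0e9, "FIXC1": 2.0e9, "FIXC2": 2.0e9,
        "PMC0": 3.0e8, "PMC1": 1.0e8, "PMC2": 4.0e8, "PMC3": 0.0,
    }
    for name, value in flops_sp_metrics(sample, 1.0, 1.0 / 2.0e9).items():
        print(f"{name}: {value:.3f}")
    print(f"packed fraction: {packed_fraction(sample):.2f}")
```

A packed fraction close to 1 suggests that most floating-point work was vectorized, while a nonzero `DP [MUOPS/s]` value signals mixed precision, in which case the `SP [MFLOP/s]` figure should not be trusted.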