mirror of
https://github.com/ClusterCockpit/cc-metric-collector.git
synced 2025-01-15 00:29:09 +01:00
37 lines
1.4 KiB
Plaintext
37 lines
1.4 KiB
Plaintext
SHORT L3 cache bandwidth in MBytes/s
|
|
|
|
EVENTSET
|
|
FIXC0 INSTR_RETIRED_ANY
|
|
FIXC1 CPU_CLK_UNHALTED_CORE
|
|
FIXC2 CPU_CLK_UNHALTED_REF
|
|
MBOX0C1 DRAM_READS
|
|
MBOX0C2 DRAM_WRITES
|
|
|
|
METRICS
|
|
Runtime (RDTSC) [s] time
|
|
Runtime unhalted [s] FIXC1*inverseClock
|
|
Clock [MHz] 1.E-06*(FIXC1/FIXC2)/inverseClock
|
|
CPI FIXC1/FIXC0
|
|
Memory load bandwidth [MBytes/s] 1.0E-06*MBOX0C1*64.0/time
|
|
Memory load data volume [GBytes] 1.0E-09*MBOX0C1*64.0
|
|
Memory evict bandwidth [MBytes/s] 1.0E-06*MBOX0C2*64.0/time
|
|
Memory evict data volume [GBytes] 1.0E-09*MBOX0C2*64.0
|
|
Memory bandwidth [MBytes/s] 1.0E-06*(MBOX0C1+MBOX0C2)*64.0/time
|
|
Memory data volume [GBytes] 1.0E-09*(MBOX0C1+MBOX0C2)*64.0
|
|
|
|
LONG
|
|
Formulas:
|
|
L3 load bandwidth [MBytes/s] = 1.0E-06*L2_LINES_IN_ALL*64.0/time
|
|
L3 load data volume [GBytes] = 1.0E-09*L2_LINES_IN_ALL*64.0
|
|
L3 evict bandwidth [MBytes/s] = 1.0E-06*L2_TRANS_L2_WB*64.0/time
|
|
L3 evict data volume [GBytes] = 1.0E-09*L2_TRANS_L2_WB*64.0
|
|
L3 bandwidth [MBytes/s] = 1.0E-06*(L2_LINES_IN_ALL+L2_TRANS_L2_WB)*64/time
|
|
L3 data volume [GBytes] = 1.0E-09*(L2_LINES_IN_ALL+L2_TRANS_L2_WB)*64
|
|
-
|
|
Profiling group to measure L3 cache bandwidth. The bandwidth is computed by the
|
|
number of cache line allocated in the L2 and the number of modified cache lines
|
|
evicted from the L2. This group also output data volume transferred between the
|
|
L3 and measured cores L2 caches. Note that this bandwidth also includes data
|
|
transfers due to a write allocate load on a store miss in L2.
|
|
|