mirror of
https://github.com/ClusterCockpit/cc-metric-collector.git
synced 2024-12-28 16:19:05 +01:00
36 lines
1.8 KiB
Plaintext
36 lines
1.8 KiB
Plaintext
SHORT L2 cache bandwidth in MBytes/s (experimental)
|
|
|
|
EVENTSET
|
|
FIXC1 ACTUAL_CPU_CLOCK
|
|
FIXC2 MAX_CPU_CLOCK
|
|
PMC0 DATA_CACHE_REFILLS_LOCAL_ALL
|
|
PMC1 DATA_CACHE_REFILLS_REMOTE_ALL
|
|
PMC2 HWPREF_DATA_CACHE_FILLS_LOCAL_ALL
|
|
PMC3 HWPREF_DATA_CACHE_FILLS_REMOTE_ALL
|
|
|
|
METRICS
|
|
Runtime (RDTSC) [s] time
|
|
Runtime unhalted [s] FIXC1*inverseClock
|
|
Clock [MHz] 1.E-06*(FIXC1/FIXC2)/inverseClock
|
|
CPI PMC1/PMC0
|
|
Local bandwidth [MBytes/s] 1.0E-06*(PMC0+PMC2)*64.0/time
|
|
Local data volume [GBytes] 1.0E-09*(PMC0+PMC2)*64.0
|
|
Remote bandwidth [MBytes/s] 1.0E-06*(PMC1+PMC3)*64.0/time
|
|
Remote data volume [GBytes] 1.0E-09*(PMC1+PMC3)*64.0
|
|
Total bandwidth [MBytes/s] 1.0E-06*(PMC0+PMC2+PMC1+PMC3)*64.0/time
|
|
Total data volume [GBytes] 1.0E-09*(PMC0+PMC2+PMC1+PMC3)*64.0
|
|
|
|
LONG
|
|
Formulas:
|
|
Local bandwidth [MBytes/s] = 1.0E-06*(DATA_CACHE_REFILLS_LOCAL_ALL+HWPREF_DATA_CACHE_FILLS_LOCAL_ALL)*64.0/time
|
|
Local data volume [GBytes] = 1.0E-09*(DATA_CACHE_REFILLS_LOCAL_ALL+HWPREF_DATA_CACHE_FILLS_LOCAL_ALL)*64.0
|
|
Remote bandwidth [MBytes/s] = 1.0E-06*(DATA_CACHE_REFILLS_REMOTE_ALL+HWPREF_DATA_CACHE_FILLS_REMOTE_ALL)*64.0/time
|
|
Remote data volume [GBytes] = 1.0E-09*(DATA_CACHE_REFILLS_REMOTE_ALL+HWPREF_DATA_CACHE_FILLS_REMOTE_ALL)*64.0
|
|
Total bandwidth [MBytes/s] = 1.0E-06*(DATA_CACHE_REFILLS_LOCAL_ALL+HWPREF_DATA_CACHE_FILLS_LOCAL_ALL+DATA_CACHE_REFILLS_REMOTE_ALL+HWPREF_DATA_CACHE_FILLS_REMOTE_ALL)*64.0/time
|
|
Total data volume [GBytes] = 1.0E-09*(DATA_CACHE_REFILLS_LOCAL_ALL+HWPREF_DATA_CACHE_FILLS_LOCAL_ALL+DATA_CACHE_REFILLS_REMOTE_ALL+HWPREF_DATA_CACHE_FILLS_REMOTE_ALL)*64.0
|
|
-
|
|
Profiling group to measure NUMA traffic. The data sources range from
|
|
local L2, CCX and memory for the local metrics and remote CCX and memory
|
|
for the remote metrics. There are also events that measure the software
|
|
prefetches from local and remote domain but AMD Zen provides only 4 counters.
|