mirror of
https://github.com/ClusterCockpit/cc-metric-collector.git
synced 2025-01-15 00:29:09 +01:00
24 lines
719 B
Plaintext
24 lines
719 B
Plaintext
|
SHORT TLB Misses
|
||
|
|
||
|
EVENTSET
|
||
|
PMC0 LONG_DATA_PAGE_WALK
|
||
|
PMC1 DATA_PAGE_WALK
|
||
|
|
||
|
METRICS
|
||
|
Runtime (RDTSC) [s] time
|
||
|
L1 TLB misses [misses/s] PMC1/time
|
||
|
L2 TLB misses [misses/s] PMC0/time
|
||
|
L1 TLB misses per L2 TLB miss PMC1/PMC0
|
||
|
|
||
|
LONG
|
||
|
Formulas:
|
||
|
L1 TLB misses [misses/s] = DATA_PAGE_WALK/time
|
||
|
L2 TLB misses [misses/s] = LONG_DATA_PAGE_WALK/time
|
||
|
L1 TLB misses per L2 TLB miss = DATA_PAGE_WALK/LONG_DATA_PAGE_WALK
|
||
|
-
|
||
|
Analysis of the layered TLB of the Intel Xeon Phi. According to the book
|
||
|
'Intel Xeon Phi Coprocessor High-Performance Programming' by James Jeffers and
|
||
|
James Reinders, a high L1 TLB misses per L2 TLB miss ratio suggests that your
|
||
|
working set fits into the L2 TLB but not in L1 TLB. Using large pages may be
|
||
|
beneficial.
|