mirror of
https://github.com/ClusterCockpit/cc-metric-collector.git
synced 2024-11-13 05:27:31 +01:00
23 lines
727 B
Plaintext
23 lines
727 B
Plaintext
|
SHORT L1 compute to data access ratio
|
||
|
|
||
|
EVENTSET
|
||
|
PMC0 VPU_ELEMENTS_ACTIVE
|
||
|
PMC1 DATA_READ_OR_WRITE
|
||
|
|
||
|
METRICS
|
||
|
Runtime (RDTSC) [s] time
|
||
|
L1 compute intensity PMC0/PMC1
|
||
|
|
||
|
LONG
|
||
|
Formulas:
|
||
|
L1 compute intensity = VPU_ELEMENTS_ACTIVE/DATA_READ_OR_WRITE
|
||
|
-
|
||
|
These metric is a way to measure the computational density of an
|
||
|
application, or how many computations it is performing on average for each
|
||
|
piece of data loaded. L1 compute to data access ratio should be
|
||
|
used to judge suitability of an application for running on the Intel MIC
|
||
|
architecture. Applications that will perform well on the Intel MIC
|
||
|
architecture should be vectorized, and ideally be able to perform multiple
|
||
|
operations on the same pieces of data (or same cache lines).
|
||
|
|