89 lines
4.2 KiB
Plaintext
89 lines
4.2 KiB
Plaintext
--------------------------------------------------------------------------------
|
|
CPU name: Intel(R) Xeon(R) Gold 6248 CPU @ 2.50GHz
|
|
CPU type: Intel Cascadelake SP processor
|
|
CPU clock: 2.49 GHz
|
|
--------------------------------------------------------------------------------
|
|
Initializing parameters...
|
|
Initializing atoms...
|
|
Creating atoms...
|
|
Pattern: seq
|
|
Number of timesteps: 200
|
|
Number of atoms: 256
|
|
Number of neighbors per atom: 1024
|
|
Number of times to replicate neighbor lists: 1
|
|
Estimated total data volume (kB): 1056.7680
|
|
Estimated atom data volume (kB): 3.0720
|
|
Estimated neighborlist data volume (kB): 1050.6240
|
|
Initializing neighbor lists...
|
|
Creating neighbor lists...
|
|
Computing forces...
|
|
Total time: 0.2466, Mega atom updates/s: 0.2076
|
|
Cycles per atom: 9631.9934, Cycles per neighbor: 9.4062
|
|
Statistics:
|
|
Vector width: 16, Processor frequency: 2.0000 GHz
|
|
Average neighbors per atom: 1018.9055
|
|
Average SIMD iterations per atom: 63.6816
|
|
Total number of computed pair interactions: 52428800
|
|
Total number of SIMD iterations: 3276800
|
|
Useful read data volume for force computation: 0.84GB
|
|
Cycles/SIMD iteration: 150.4999
|
|
--------------------------------------------------------------------------------
|
|
Region force, Group 1: MEM_SP
|
|
+-------------------+------------+
|
|
| Region Info | HWThread 0 |
|
|
+-------------------+------------+
|
|
| RDTSC Runtime [s] | 0.085843 |
|
|
| call count | 200 |
|
|
+-------------------+------------+
|
|
|
|
+------------------------------------------+---------+------------+
|
|
| Event | Counter | HWThread 0 |
|
|
+------------------------------------------+---------+------------+
|
|
| INSTR_RETIRED_ANY | FIXC0 | 129769100 |
|
|
| CPU_CLK_UNHALTED_CORE | FIXC1 | 172300100 |
|
|
| CPU_CLK_UNHALTED_REF | FIXC2 | 215371300 |
|
|
| PWR_PKG_ENERGY | PWR0 | 9.2849 |
|
|
| PWR_DRAM_ENERGY | PWR3 | 0 |
|
|
| FP_ARITH_INST_RETIRED_128B_PACKED_SINGLE | PMC0 | 0 |
|
|
| FP_ARITH_INST_RETIRED_SCALAR_SINGLE | PMC1 | 154000 |
|
|
| FP_ARITH_INST_RETIRED_256B_PACKED_SINGLE | PMC2 | 0 |
|
|
| FP_ARITH_INST_RETIRED_512B_PACKED_SINGLE | PMC3 | 89088000 |
|
|
| CAS_COUNT_RD | MBOX0C0 | 8354 |
|
|
| CAS_COUNT_WR | MBOX0C1 | 1126 |
|
|
| CAS_COUNT_RD | MBOX1C0 | 7863 |
|
|
| CAS_COUNT_WR | MBOX1C1 | 1105 |
|
|
| CAS_COUNT_RD | MBOX2C0 | 7990 |
|
|
| CAS_COUNT_WR | MBOX2C1 | 1113 |
|
|
| CAS_COUNT_RD | MBOX3C0 | 4775 |
|
|
| CAS_COUNT_WR | MBOX3C1 | 1112 |
|
|
| CAS_COUNT_RD | MBOX4C0 | 4201 |
|
|
| CAS_COUNT_WR | MBOX4C1 | 1127 |
|
|
| CAS_COUNT_RD | MBOX5C0 | 4035 |
|
|
| CAS_COUNT_WR | MBOX5C1 | 1120 |
|
|
+------------------------------------------+---------+------------+
|
|
|
|
+-----------------------------------+------------+
|
|
| Metric | HWThread 0 |
|
|
+-----------------------------------+------------+
|
|
| Runtime (RDTSC) [s] | 0.0858 |
|
|
| Runtime unhalted [s] | 0.0691 |
|
|
| Clock [MHz] | 1995.2787 |
|
|
| CPI | 1.3277 |
|
|
| Energy [J] | 9.2849 |
|
|
| Power [W] | 108.1610 |
|
|
| Energy DRAM [J] | 0 |
|
|
| Power DRAM [W] | 0 |
|
|
| SP [MFLOP/s] | 16606.5397 |
|
|
| AVX SP [MFLOP/s] | 16604.7458 |
|
|
| Packed [MUOPS/s] | 1037.7966 |
|
|
| Scalar [MUOPS/s] | 1.7940 |
|
|
| Memory read bandwidth [MBytes/s] | 27.7476 |
|
|
| Memory read data volume [GBytes] | 0.0024 |
|
|
| Memory write bandwidth [MBytes/s] | 4.9974 |
|
|
| Memory write data volume [GBytes] | 0.0004 |
|
|
| Memory bandwidth [MBytes/s] | 32.7450 |
|
|
| Memory data volume [GBytes] | 0.0028 |
|
|
| Operational intensity | 507.1471 |
|
|
+-----------------------------------+------------+
|
|
|