Author | Commit | Message | Signed-off-by | Date
Rafael Ravedutti | 5814a86125 | Small fixes | Rafael Ravedutti <rafaelravedutti@gmail.com> | 2023-02-12 01:44:48 +01:00
Rafael Ravedutti | 90f30d26a3 | Add lammps-avx512 kernels without correction instructions | Rafael Ravedutti <rafaelravedutti@gmail.com> | 2023-01-23 22:13:33 +01:00
Rafael Ravedutti | df09c2861e | Add first version with more than one optimization scheme | Rafael Ravedutti <rafaelravedutti@gmail.com> | 2022-01-17 14:15:02 +01:00
Rafael Ravedutti | f7010113bf | Include commented timestamping on asm | Rafael Ravedutti <rafaelravedutti@gmail.com> | 2021-11-10 14:39:44 +01:00
Rafael Ravedutti | f8b9a095cf | Add working version of force.s | Rafael Ravedutti <rafaelravedutti@gmail.com> | 2021-11-09 01:23:15 +01:00
Rafael Ravedutti | b2dada6179 | Fix param references and assembler errors for force.s | Rafael Ravedutti <rafaelravedutti@gmail.com> | 2021-11-09 00:05:42 +01:00
Rafael Ravedutti | d8c8733cb2 | Update asm force again | Rafael Ravedutti <rafaelravedutti@gmail.com> | 2021-11-08 23:32:24 +01:00
Rafael Ravedutti | cf7ea1460c | Update asm version to be integrated | Rafael Ravedutti <rafaelravedutti@gmail.com> | 2021-11-08 22:31:58 +01:00
Rafael Ravedutti | 9b615cf0b3 | Update force kernel to be integrated | Rafael Ravedutti <rafaelravedutti@gmail.com> | 2021-11-08 14:05:29 +01:00
Jan Eitzinger | 3c3d27b48a | Introduce separate version for traced force routine. | | 2021-10-26 09:11:17 +02:00
Rafael Ravedutti | 4e0390404a | Add correct version (without traces) of CACHES_PMC results with and without force calculation on casclakesp2 | Rafael Ravedutti <rafaelravedutti@gmail.com> | 2021-10-06 16:08:46 +02:00
Rafael Ravedutti | b1f8e91f0f | Add cache measurements for versions with and without force computation | Rafael Ravedutti <rafaelravedutti@gmail.com> | 2021-10-04 18:13:17 +02:00
Rafael Ravedutti | 6b75c158d6 | Include atom reference on index tracer and add only_mem ASM version | Rafael Ravedutti <rafaelravedutti@gmail.com> | 2021-08-10 01:19:26 +02:00
Jan Eitzinger | b6d4753c2a | Add LIKWID Option. Allow to overwrite with asm variant. | | 2021-06-11 09:38:34 +02:00
Rafael Ravedutti | b8b364d265 | Move analysis and result files to log directory | Rafael Ravedutti <rafaelravedutti@gmail.com> | 2021-06-11 02:16:10 +02:00
Rafael Ravedutti | 6c4168fdb5 | Add OSACA analysis and ASM code for AVX2 with AoS, lt600 variant | Rafael Ravedutti <rafaelravedutti@gmail.com> | 2021-05-08 01:03:44 +02:00
Rafael Ravedutti | 9c28ff1e9e | Add arch_analysis directory and first AVX2 results | Rafael Ravedutti <rafaelravedutti@gmail.com> | 2021-05-06 23:18:28 +02:00
Rafael Ravedutti | e53d9961ef | Fix compilation when INTERNAL_LOOP_NTIMES is not set and create avx512 directory | Rafael Ravedutti <rafaelravedutti@gmail.com> | 2021-05-06 13:59:02 +02:00
Rafael Ravedutti | 15de65303e | Add version iterating most internal loop multiple times | Rafael Ravedutti <rafaelravedutti@gmail.com> | 2021-05-05 03:04:41 +02:00
Rafael Ravedutti | faf1e2ae85 | Update results and assembly for SoA | Rafael Ravedutti <rafaelravedutti@gmail.com> | 2021-04-30 15:58:54 +02:00
Rafael Ravedutti | 02ff7de18f | Add markers for all kernel variants | Rafael Ravedutti <rafaelravedutti@gmail.com> | 2021-04-28 23:19:06 +02:00
Rafael Ravedutti | 10dacc5a4e | Add force for AoS data layout with markers | Rafael Ravedutti <rafaelravedutti@gmail.com> | 2021-04-28 17:48:40 +02:00
Rafael Ravedutti | 5131b7bcaa | Add comments for second kernel variant on Intel AOS assembly | Rafael Ravedutti <rafaelravedutti@gmail.com> | 2021-04-16 16:31:27 +02:00
Rafael Ravedutti | e656490a38 | Add annotated assembly files | Rafael Ravedutti <rafaelravedutti@gmail.com> | 2021-04-16 12:44:18 +02:00