Rafael Ravedutti
|
ab2eb1ff50
|
Write LAMMPS kernel with SIMD intrinsics and implement AVX512 with double-precision functions
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2022-04-05 02:57:23 +02:00 |
|
Rafael Ravedutti
|
fdbeed4368
|
Fix AVX2 versions with half neighbor lists
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2022-03-27 16:39:39 +02:00 |
|
Rafael Ravedutti
|
0e742766b7
|
Add working version of Simd4xn kernel with half neighbor lists
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2022-03-23 15:54:18 +01:00 |
|
Rafael Ravedutti
|
e72323ab6a
|
Fix Simd2xnn Kernel with half neighbor lists and add AVX512 intrinsics with double
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2022-03-23 15:21:07 +01:00 |
|
Rafael Ravedutti
|
94521f03b3
|
Fix reference version with half neighbor lists
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2022-03-23 14:31:47 +01:00 |
|
Rafael Ravedutti
|
8709bc2a06
|
Add first version for half neighbor lists in GROMACS variant
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2022-03-22 23:47:05 +01:00 |
|
Rafael Ravedutti
|
887f41871c
|
Add parameter reading for LAMMPS variant
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2022-03-17 02:44:34 +01:00 |
|
Rafael Ravedutti
|
d4b34e1fa4
|
Fix intrinsics for AVX2
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2022-03-17 00:35:21 +01:00 |
|
Rafael Ravedutti
|
4090f43095
|
Optimize partial forces reduction for compute_4xn kernel
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2022-03-16 17:54:52 +01:00 |
|
Rafael Ravedutti
|
f3263a2d48
|
Separate simd file into multiple files
Signed-off-by: Rafael Ravedutti <rafaelravedutti@gmail.com>
|
2022-03-16 14:52:55 +01:00 |
|