cdb1d5b9f1Add version with AVX2 intrinsics for gromacs scheme
Rafael Ravedutti
2022-02-04 17:52:48 +0100
34ce407f18Update stats for gromacs scheme
Rafael Ravedutti
2022-02-04 14:47:37 +0100
6e6a3f6502Use aligned loads when gathering j atoms
Rafael Ravedutti
2022-02-04 14:29:32 +0100
7b90800a2bSetting forces to zero before calculation is not required
Rafael Ravedutti
2022-02-04 14:05:04 +0100
9daf9e5f4dFix exclusion masks and add SIMD debug tools
Rafael Ravedutti
2022-02-02 21:54:18 +0100
4c5f013bf4Assign masked adds results to forces
Rafael Ravedutti
2022-02-02 18:07:56 +0100
6ad1e58a3eAdd first kernel using SIMD instrinsics for 4xn cases
Rafael Ravedutti
2022-02-02 18:00:44 +0100
5fd2d422eeAdjust kernels to work with MxN loops
Rafael Ravedutti
2022-02-02 00:49:55 +0100
85e7954932Check all clusters in cell when building neighbor lists because ghost clusters may not be sorted
Rafael Ravedutti
2022-02-01 20:16:04 +0100
4a5216a177Remove bb z-check on while loop when building neighbor lists
Rafael Ravedutti
2022-02-01 00:46:12 +0100
e64c3345bcFix a few more bugs on gromacs variant
Rafael Ravedutti
2022-01-31 23:46:20 +0100
696e6da01dImplement Neighbour list AoS memory layout + performance measurement
Maximilian Gaul
2022-01-31 20:27:59 +0100
e0e6b6a68cPerform a few fixes for gromacs variant
Rafael Ravedutti
2022-01-31 17:49:22 +0100
6691803910Add first version of force calculation with cluster scheme
Rafael Ravedutti
2022-01-28 18:07:41 +0100
eedcc97e4aRemove segfaults
Rafael Ravedutti
2022-01-28 15:18:54 +0100
a119fcdfddFix some segfaults and add function to update single atoms
Rafael Ravedutti
2022-01-27 03:07:31 +0100
aa0f4048d0Rename default directory to lammps and reorganize gromacs variant steps
Rafael Ravedutti
2022-01-25 21:00:11 +0100
cbe42b8149Fix errors to make gromacs approach compilable so far
Rafael Ravedutti
2022-01-25 12:19:28 +0100
6291709ae7Add first draft code with GROMACS approach
Rafael Ravedutti
2022-01-25 00:43:10 +0100
b2a6574426Remove unnecessary atom force backcopy in computeForce
Maximilian Gaul
2022-01-24 18:09:27 +0100
c4080e866eMake integrate kernels aware of neighbour list update
Maximilian Gaul
2022-01-24 18:04:50 +0100
72730bc27bUpdate Makefile and config.mk
Rafael Ravedutti
2022-01-17 14:16:39 +0100
df09c2861eAdd first version with more than one optimization scheme
Rafael Ravedutti
2022-01-17 14:15:02 +0100
489e7ee9d3Update .gitignore
Rafael Ravedutti
2022-01-17 11:46:57 +0100
165335cea0Update compilation flags for all available compilers
Rafael Ravedutti
2022-01-17 11:40:44 +0100
7b592b5fc7Moved presentation resources to second presentation
Maximilian Gaul
2022-01-05 12:48:37 +0100
4690542db5Added CPU metrics {Cache, FLOPS, L2, L3}, restructured resource folders
Maximilian Gaul
2022-01-05 12:31:47 +0100
8c131a7699Reminder for likwid perf measurements
Maximilian Gaul
2022-01-04 13:51:53 +0100
dc4d5f1a9cPorting atom velocity memory layout to AoS, porting velocity integration to CUDA, adding measurements + logbook update
Maximilian Gaul
2022-01-01 18:18:12 +0100
50007216edImplemented atom force AoS memory layout, added performance measurements + logbook Update
Maximilian Gaul
2022-01-01 16:09:21 +0100
72e4599accCopy neighbour lists only when reneighbouring happens, added measurements + logbook update
Maximilian Gaul
2022-01-01 12:56:42 +0100
8fa03733e9Copy parameters & cutforces threshold only once at the start + measurements
Maximilian Gaul
2021-12-28 16:48:26 +0100
bf1ae3d013Removed debug prints, only zero atom forces and not copy them, added measurements
Maximilian Gaul
2021-12-28 16:32:54 +0100
8009b54113Trying to debug segfault if cudaMemcpy is limited to neighbour list update
Maximilian Gaul
2021-12-25 15:36:08 +0100
0ea0587442Only malloc once at the beginning plus measurement csv
Maximilian Gaul
2021-12-25 13:52:33 +0100
134e3f4b78Also pinnend neighbor-struct memory, added additional performance measurements, added nvprof result to logbook
Maximilian Gaul
2021-12-18 15:58:56 +0100
c2bfa3ca3fAdd scripts for perf measurement, made atom-memory allocation pinnend using 'cudaMallocHost', added measurements for atom pinnend memory
Maximilian Gaul
2021-12-18 13:02:04 +0100
2a099da5b7Started cuda profiling, added first result to logbook
Maximilian Gaul
2021-12-03 08:13:43 +0100
7691b23d67Measure memory transfer of CPU to GPU, add explanation how to distribute calculation among multiple GPUs
Maximilian Gaul
2021-12-01 17:16:32 +0100
35c110155eSeparate tracing from force computation and fix stubbed version
Rafael Ravedutti
2021-12-01 00:07:45 +0100
bb21a885a1Add new setups for Copper melting with LJ and EAM
Rafael Ravedutti
2021-11-30 01:33:55 +0100
da90466f98Added first performance measurements with threads per block from 1 to 32
Maximilian Gaul
2021-11-25 08:09:20 +0100
8f723c1299Added command line description of MD-Bench, added memory transfer rate from CPU to GPU to force.cu
Maximilian Gaul
2021-11-23 15:55:23 +0100
0586ef150aFix num of threads instead of num of blocks, add logbook template
Maximilian Gaul
2021-11-15 19:39:09 +0100
2e5d973f7dRough rewrite to execute outer loop of force calculation in parallel, not inner loop
Maximilian Gaul
2021-11-14 10:02:23 +0100
e2fd1a0476Fixed bug, results are now equal to master branch (but still slow)
Maximilian Gaul
2021-11-11 21:00:30 +0100
4105c844c6Runs fine (but slow), results seem to be slightly off from original
Maximilian Gaul
2021-11-11 20:47:06 +0100
1f5c9c4b23Fixed segfault error, added more cudaErrorChecks, added cudaFree to avoid memory leak
Maximilian Gaul
2021-11-11 20:29:14 +0100
29e115464bFixed cudaMemcpy for AOS data layout, added debug outputs, added cudaErrorChecks
Maximilian Gaul
2021-11-11 20:14:30 +0100
1a54314c8bFirst run but segfault at the moment after a few seconds
Maximilian Gaul
2021-11-11 15:23:46 +0100
280f595b7fFixed linker error by putting includes and cuda function in extern 'C'
Maximilian Gaul
2021-11-11 14:49:29 +0100
3428974730getTimeStamp() couldn't get linked
Maximilian Gaul
2021-11-11 08:03:56 +0100
b54842f764Added Makefile instructions for .cu files
Maximilian Gaul
2021-11-11 07:27:12 +0100
9730164e6fRename force.c to force.cu because of cuda build errors
Maximilian Gaul
2021-11-10 16:20:04 +0100
0f5fdd3708Sum results after cuda function executed
Maximilian Gaul
2021-11-10 16:02:05 +0100
f7010113bfInclude commented timestamping on asm
Rafael Ravedutti
2021-11-10 14:39:44 +0100
841dfb9490Fix data types for rdr and rdrho
Rafael Ravedutti
2021-11-09 20:36:23 +0100
3f7fb7f22acudaMemcpy of Atom and other properties, first draft implementation of CUDA kernel
Maximilian Gaul
2021-11-09 16:40:25 +0100