deepmd-kit icon indicating copy to clipboard operation
deepmd-kit copied to clipboard

LAMMPS fix dplr issue

Open cesaremalosso opened this issue 1 year ago • 6 comments

Summary

ERROR: find a bonded pair that is not on the same processor, something should not happen (../fix_dplr.cpp:345) when restart from a previous simulation run.

DeePMD-kit Version

2.2.3

Backend and its version

2.13.0

Python Version, CUDA Version, GCC Version, LAMMPS Version, etc

No response

Details

Hi,

I'm encountering the following error: ERROR: find a bonded pair that is not on the same processor, something should not happen (../fix_dplr.cpp:345)

when I try to restart a LAMMPS run from the previous ended one. I'm using a DPLR model and I'm running it just with 1 MPI process (mpirun -np 1) for the LAMMPS lmp_mpi. I attach the input file for LAMMPS as well as the two restart file from the previous run

files.zip

cesaremalosso avatar Apr 17 '24 11:04 cesaremalosso

Hi, has this issue been fixed?

cesaremalosso avatar Aug 17 '24 16:08 cesaremalosso

Hi, has this issue been fixed?

Hi @cesaremalosso, I am sorry that I was so overwhelmed in the past few weeks. In the recent days I have more time. I am working on this issue. Thanks for your patience.

Yi-FanLi avatar Aug 24 '24 06:08 Yi-FanLi

Thank you very much @Yi-FanLi ! I have another related question. It seems that the kspace_style pppm/dplr, which is used to account for the long-range interactions, is quite slow in LAMMPS, significantly slowing down the MD simulation. Do you think it would be beneficial to implement OpenMP thread parallelization to speed this up? Perhaps using GPUs for both the short-range NNP and the Wannier NN, while using CPU processes for the particle-particle particle-mesh solver?

cesaremalosso avatar Aug 24 '24 16:08 cesaremalosso

Perhaps using GPUs for both the short-range NNP and the Wannier NN, while using CPU processes for the particle-particle particle-mesh solver

This is the current implementation. pppm/dplr is based on pppm and the pppm solver is implemented in LAMMPS.

I know pppm has some variants, such as pppm/gpu, pppm/omp, pppm/intel, but they need to be tested.

Could you raise a separate issue?

njzjz avatar Aug 26 '24 02:08 njzjz

Hi @njzjz , I raised a separate issue here: #4078

cesaremalosso avatar Aug 26 '24 09:08 cesaremalosso

Thank you very much @Yi-FanLi ! I have another related question. It seems that the kspace_style pppm/dplr, which is used to account for the long-range interactions, is quite slow in LAMMPS, significantly slowing down the MD simulation. Do you think it would be beneficial to implement OpenMP thread parallelization to speed this up? Perhaps using GPUs for both the short-range NNP and the Wannier NN, while using CPU processes for the particle-particle particle-mesh solver?

Hi @cesaremalosso, I am replying to you in #4078 : )

Yi-FanLi avatar Aug 28 '24 06:08 Yi-FanLi