amirhajibabaei / autoforce
Sparse Gaussian Process Potentials
License: MIT License
It looks like ActiveCalculator does not accept the None type.
In on-the-fly MD, if the initial atomic forces are zero and the model is empty or immature, the model may remain blind in the following steps. The solution is to set covdiff to a finite value (~0.1).
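A minimal sketch of the suggested workaround; the import path and the assumption that covdiff is accepted as a keyword argument are not verified against the current API:

```python
# Hedged sketch: import path and keyword argument are assumptions.
from theforce.calculator.active import ActiveCalculator

# A finite covdiff (~0.1) lets the empty model trigger sampling even
# when all initial atomic forces are zero.
calc = ActiveCalculator(covdiff=0.1)
```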
In the calculation of the gradients of spherical harmonics, there is a division by sin(theta) in these lines:
....
Y_theta = cos_theta * self.l_float * Y / sin_theta
Y_theta[1:, 1:] -= _r * Y[:-1, :-1] * self.coef / sin_theta
....
When sin_theta is 0, Y_theta becomes nan.
At the moment, nan_to_num is applied when the gradient is returned, which simply replaces nan with zeros.
But in the calculation of the gradients of SOAP, inconsistencies with autograd exist when xyz = [0, 0, 1].
A possible workaround is to add a small positive number where sin_theta is zero.
But a better solution may be possible.
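The failure mode and both fixes can be illustrated with a toy example (not the actual autoforce code): at the pole both the harmonic and sin(theta) vanish, so the division is 0/0:

```python
import numpy as np

# Toy illustration of the 0/0 -> nan problem at theta = 0.
theta = np.array([0.0, np.pi / 2])
sin_theta = np.sin(theta)
Y = sin_theta.copy()  # stand-in for a harmonic that vanishes at the pole

with np.errstate(invalid="ignore"):
    Y_theta = np.cos(theta) * Y / sin_theta          # nan at theta = 0

patched = np.nan_to_num(Y_theta)                     # current fix: nan -> 0

# Proposed workaround: replace zeros in sin_theta by a small positive eps.
eps = 1e-12
safe_sin = np.where(sin_theta == 0.0, eps, sin_theta)
Y_theta_safe = np.cos(theta) * Y / safe_sin          # finite everywhere
```

Replacing the zero denominator keeps the expression finite without a post-hoc nan_to_num, but (as the issue notes) it does not by itself guarantee agreement with autograd at xyz = [0, 0, 1].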
Define an addition kernel class, where two (or more) kernels are given at init.
func, leftgrad, rightgrad, gradgrad, etc. for the sum kernel can be deduced from the argument kernels.
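A minimal sketch of the idea; the method names (func, leftgrad, ...) follow the issue text and the toy kernels are hypothetical, not a confirmed API:

```python
class SumKernel:
    """Combine kernels by addition; each derived quantity is simply the
    sum of the corresponding quantities of the argument kernels."""

    def __init__(self, *kernels):
        self.kernels = kernels

    def func(self, x, y):
        return sum(k.func(x, y) for k in self.kernels)

    def leftgrad(self, x, y):
        return sum(k.leftgrad(x, y) for k in self.kernels)

    # rightgrad, gradgrad, ... follow the same pattern


# Toy scalar kernels for illustration:
class Linear:
    def func(self, x, y):
        return x * y

    def leftgrad(self, x, y):  # d(x*y)/dx
        return y


class Shift:
    def func(self, x, y):
        return x + y

    def leftgrad(self, x, y):  # d(x+y)/dx
        return 1.0


k = SumKernel(Linear(), Shift())
```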
Using sparse methods, calculating the full Gram matrix is bypassed.
Similarities are only calculated between the potentially many large (data) systems
and a few small inducing systems, which reduces the computational complexity considerably.
Only the diagonal elements of the full Gram matrix are needed,
either for the calculation of the variance or for the "trace" term in the variational ELBO.
At the moment, the covariance of all forces with each other (in one system) is calculated and
then the diagonal elements are extracted.
Instead, the diagonal elements should be calculated directly, inside the similarity kernel.
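The proposed change can be sketched with a toy RBF kernel (a stand-in for the similarity kernel): evaluating only the k(x_i, x_i) terms costs O(n) instead of building the O(n^2) Gram matrix and discarding the off-diagonal part:

```python
import numpy as np

# Toy RBF kernel; stand-in for the similarity kernel in the issue.
def rbf(a, b):
    return np.exp(-np.sum((a - b) ** 2, axis=-1))

X = np.random.rand(100, 3)

# Wasteful: build the full 100x100 Gram matrix, keep only its diagonal.
full_diag = np.diag(rbf(X[:, None, :], X[None, :, :]))

# Direct: evaluate only the k(x_i, x_i) terms, O(n) instead of O(n^2).
direct_diag = rbf(X, X)
```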
Parallel variance calculation in the atoms-distributed regime is missing.
Theoretically,
stress = stress1 + stress2,
where
stress1 = (sum over all atoms of F.T @ r) / volume,
with r the coordinates of and F the forces on each atom.
stress2 is related to derivatives with respect to the cell.
This works fine in ParametricCalculator (test using calc.calculate_numerical_stress(atoms)).
But in AutoForceCalculator (machine-learned) we have to multiply stress1 by -1 in order to get the right numerical stresses.
Everything seemingly works just fine, but I still can't explain the -1 multiplier.
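A minimal numpy sketch of the stress1 term exactly as written above; since the sign convention is the open question, no sign is asserted here:

```python
import numpy as np

def stress1(forces, positions, volume):
    """Virial-like contribution (sum_i F_i^T r_i) / volume.
    The overall sign is deliberately left as in the issue text."""
    return forces.T @ positions / volume

# Two atoms pulled toward each other along y:
F = np.array([[0.0,  1.0, 0.0],
              [0.0, -1.0, 0.0]])
r = np.array([[0.0,  0.5, 0.0],
              [0.0, -0.5, 0.0]])
s = stress1(F, r, volume=8.0)   # 3x3 tensor; only the yy component is nonzero
```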
At the moment, when Local objects are sampled from data as inducing geometries, they include i, j indices which belong to the structure that they are embedded in.
Converting these to atoms, saving them in a traj file, and reloading them causes these indices to change.
The i, j indices are relevant via the "bothways" keyword in Local.select: if bothways=False, generally only neighbors with j > i are returned.
Therefore, in kernels such as PairKernel where bothways=False is frequently used, it is possible that an empty array is returned even if there are relevant atoms in the local environment.
Moreover, the behavior might change simply by saving and reloading locs.
This issue needs to be fixed.
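A toy illustration (not autoforce code) of why j > i filtering is fragile under re-indexing: the same local environment can yield a non-empty or empty selection depending only on the global atom indices:

```python
# With bothways=False, only neighbors with j > i are kept, so the result
# depends on global atom indices, which change on save/reload.
def select(i, neighbor_indices, bothways=False):
    if bothways:
        return list(neighbor_indices)
    return [j for j in neighbor_indices if j > i]

# In the original structure, atom 0 sees neighbors 3 and 7:
kept = select(0, [3, 7])           # both pairs kept

# After saving/reloading, the same environment may be re-indexed so the
# central atom gets the largest index:
kept_reloaded = select(7, [0, 3])  # empty, although the neighbors exist
```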
When I chose noise=1e-6 (too small) in the kernel, the energies predicted by the posterior potential contained a large shift from the actual energies.
I call it a shift because the predictions were otherwise perfectly correlated with the data (R2 ~ 1).
This could be related to the jitter added to the diagonal of the Gram matrix, but other possibilities also exist.
First, we need to find out why this happens.
Second, the program should issue an error or a warning when this happens.
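The proposed warning could look something like the following hypothetical check (the function name and tolerances are illustrative, not part of the codebase): flag the case where predictions are almost perfectly correlated with the targets yet carry a large constant offset:

```python
import warnings
import numpy as np

def check_for_shift(pred, target, shift_tol=1e-2, r2_tol=0.99):
    """Hypothetical sanity check: warn when R^2 ~ 1 but the mean
    residual (a constant shift) is large."""
    pred, target = np.asarray(pred), np.asarray(target)
    shift = float(np.mean(pred - target))
    r2 = float(np.corrcoef(pred, target)[0, 1] ** 2)
    if r2 > r2_tol and abs(shift) > shift_tol:
        warnings.warn(f"predictions shifted by {shift:.3g} (R^2 = {r2:.4f})")
    return shift, r2

target = np.linspace(-1.0, 1.0, 50)
pred = target + 0.5                      # perfectly correlated, but shifted
shift, r2 = check_for_shift(pred, target)
```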
In conditional adding of data or references to a model, sometimes the order in which data and references are added becomes important.
For instance, if an atoms object is added first, the addition of its locals to the references becomes less likely.
Therefore, in training by MD, situations arise where multiple data are added consecutively but no references are added.
Usually this is accompanied by sharp discontinuities in the energy time series every time a data point is inserted.
One might refer to this as a stressed model.
This stress is usually released eventually, when a few references are successfully added to the model.
What is the best order of adding data and references to a model, to avoid these stressed phases?
Define a product kernel class, where two (or more) kernels are given at init.
func, leftgrad, rightgrad, gradgrad, etc. for the product kernel can be deduced from the argument kernels (the gradients via the product rule).
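A minimal sketch with two kernels; unlike a sum kernel, the gradients here need the product rule. The method names follow the issue text and the toy kernels are hypothetical:

```python
class ProductKernel:
    """Hypothetical sketch: combine two kernels by multiplication."""

    def __init__(self, k1, k2):
        self.k1, self.k2 = k1, k2

    def func(self, x, y):
        return self.k1.func(x, y) * self.k2.func(x, y)

    def leftgrad(self, x, y):
        # product rule: d(k1*k2)/dx = k1' * k2 + k1 * k2'
        return (self.k1.leftgrad(x, y) * self.k2.func(x, y)
                + self.k1.func(x, y) * self.k2.leftgrad(x, y))

    # rightgrad and gradgrad follow from the product rule as well


# Toy scalar kernels for a quick check:
class Linear:
    def func(self, x, y):
        return x * y

    def leftgrad(self, x, y):  # d(x*y)/dx
        return y


class Shift:
    def func(self, x, y):
        return x + y

    def leftgrad(self, x, y):  # d(x+y)/dx
        return 1.0


k = ProductKernel(Linear(), Shift())
```

For k1 = x*y and k2 = x+y, func(2, 3) = 6 * 5 = 30 and leftgrad(2, 3) = 3*5 + 6*1 = 21, matching the analytic derivative of x*y*(x+y).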