sum_hills truncates gaussians #420

mnmelo · 2018-12-09T22:18:38Z

Dear devs,

I ran into this issue, which can be exemplified by a simple MWE:

I create a HILLS file of a single CV, with a single deposed gaussian at 0.0. (SIGMA=0.05 and HEIGHT=0.25):

#! FIELDS time cv.val sigma_cv.val height biasf
#! SET multivariate false
#! SET kerneltype gaussian
      0.0    0.0                   0.05                   0.25                     -1

I then ask sum_hills to give me the FES from 0 to 1, over 1000 bins:

plumed sum_hills --min 0 --max 1 --bin 1000 --hills HILLS

which yields a fes.dat with the following values (abridged above and below):

    ...
    0.175000000   -0.000546873    0.038281095
    0.176000000   -0.000509799    0.035889838
    0.177000000   -0.000475048    0.033633410
    0.178000000   -0.000000000   -0.000000000
    0.179000000   -0.000000000   -0.000000000
    0.180000000   -0.000000000   -0.000000000
    ...

As can be seen, the single gaussian does not extend all the way till the end of the [0,1] range, and is instead truncated around 3.5*σ. Granted, the truncation error is small, but still well above the machine precision limit. I couldn't find any documentation for sum_hills describing it.

Tied to this is the summation handling during METAD: are gaussians similarly truncated when added to a GRID?

Thanks!
(using v2.5, at commit b1c3c74, but had the issue at least since 2c38c46)

The text was updated successfully, but these errors were encountered:

carlocamilloni · 2018-12-10T07:24:55Z

Gaussians have always been truncated in plumed since plumed1

…

Sent from my iPhone

Il giorno 9 dic 2018, alle ore 23:18, mnmelo ***@***.***> ha scritto: Dear devs, I ran into this issue, which can be exemplified by a simple MWE: I create a HILLS file of a single CV, with a single deposed gaussian at 0.0. (SIGMA=0.05 and HEIGHT=0.25): #! FIELDS time cv.val sigma_cv.val height biasf #! SET multivariate false #! SET kerneltype gaussian 0.0 0.0 0.05 0.25 -1 I then ask sum_hills to give me the FES from 0 to 1, over 1000 bins: plumed sum_hills --min 0 --max 1 --bin 1000 --hills HILLS which yields a fes.dat with the following values (abridged above and below): ... 0.175000000 -0.000546873 0.038281095 0.176000000 -0.000509799 0.035889838 0.177000000 -0.000475048 0.033633410 0.178000000 -0.000000000 -0.000000000 0.179000000 -0.000000000 -0.000000000 0.180000000 -0.000000000 -0.000000000 ... As can be seen, the single gaussian does not extend all the way till the end of the [0,1] range, and is instead truncated around 3.5*σ. Granted, the truncation error is small, but still well above the machine precision limit. I couldn't find any documentation for sum_hills describing it. Tied to this is the summation handling during METAD: are gaussians similarly truncated when added to a GRID? Thanks! (using v2.5, at commit b1c3c74, but had the issue at least since 2c38c46) — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub, or mute the thread.

maxbonomi · 2018-12-10T07:33:59Z

I think even before that! I remember this truncation was present in one of the first metadynamics implementation in an in-house version of ORAC from the Parrinello lab. Max

…

On Dec 10, 2018, at 08:24, Carlo Camilloni ***@***.***> wrote: Gaussians have always been truncated in plumed since plumed1 Sent from my iPhone > Il giorno 9 dic 2018, alle ore 23:18, mnmelo ***@***.***> ha scritto: > > Dear devs, > > I ran into this issue, which can be exemplified by a simple MWE: > > I create a HILLS file of a single CV, with a single deposed gaussian at 0.0. (SIGMA=0.05 and HEIGHT=0.25): > > #! FIELDS time cv.val sigma_cv.val height biasf > #! SET multivariate false > #! SET kerneltype gaussian > 0.0 0.0 0.05 0.25 -1 > I then ask sum_hills to give me the FES from 0 to 1, over 1000 bins: > > plumed sum_hills --min 0 --max 1 --bin 1000 --hills HILLS > which yields a fes.dat with the following values (abridged above and below): > > ... > 0.175000000 -0.000546873 0.038281095 > 0.176000000 -0.000509799 0.035889838 > 0.177000000 -0.000475048 0.033633410 > 0.178000000 -0.000000000 -0.000000000 > 0.179000000 -0.000000000 -0.000000000 > 0.180000000 -0.000000000 -0.000000000 > ... > As can be seen, the single gaussian does not extend all the way till the end of the [0,1] range, and is instead truncated around 3.5*σ. Granted, the truncation error is small, but still well above the machine precision limit. I couldn't find any documentation for sum_hills describing it. > > Tied to this is the summation handling during METAD: are gaussians similarly truncated when added to a GRID? > > Thanks! > (using v2.5, at commit b1c3c74, but had the issue at least since 2c38c46) > > — > You are receiving this because you are subscribed to this thread. > Reply to this email directly, view it on GitHub, or mute the thread. — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub, or mute the thread.

GiovanniBussi · 2018-12-10T10:49:28Z

Truncation is required in order to make the addition of a Gaussian to a grid computationally efficient. Notice that Gaussians are consistently truncated in sum_hills and when applied by the METAD keyword.

We can expect a small error coming from this, in particular due to the fact that there is a discontinuity in the potential. I suspect that the problem will be very small and likely smaller than the typical statistical error.

One way to fix it would be to "stretch" the potential as we do for switching functions since PLUMED 2.2. The main problem is that this breaks backward compatibility. My guess is that the discontinuity on switching functions is most critical (especially because people can play with D_MAX). Still I see that it would be nicer to have a continuous energy function..

Implementing the stretching of Gaussians would be very easy (actually, it would affect all Gaussian kernels that we use). We could make it optional to allow people to keep results consistent. Instead of using a NOSTRETCH flag (as in switching functions) I would use an environment variable, basically because we use Gaussian kernels in many places and it would be difficult to add a NOSTRETCH flag to all of them. Something like export PLUMED_GAUSSIAN_STRETCH=no.

An additional issue would be that if we stretch the Gaussians used in histograms we should also
scale them to be correctly normalized.

What do you think?

Giovanni

GiovanniBussi · 2018-12-10T10:56:00Z

Actually the last problem is not a problem. For histograms we already use truncated Gaussians with correct normalization:

plumed2/src/tools/KernelFunctions.cpp

Line 177 in b1c3c74

} else if( ktype==truncatedgaussian ) {

So it would be easy to add a "stretched Gaussian" to the list of kernels and use that in METAD and SumHills by default, perhaps with a env var to select the new/old behavior.

carlocamilloni · 2018-12-10T10:57:49Z

I agree with this solution, I think it is rather inelegant that we still have the discontinuity in the energy

…

On 10 Dec 2018, at 11:56, Giovanni ***@***.***> wrote: Actually the last problem is not a problem. For histograms we already use truncated Gaussians with correct normalization: https://github.com/plumed/plumed2/blob/b1c3c7406918e99f4300af7912dd6445549332cd/src/tools/KernelFunctions.cpp#L177 <https://github.com/plumed/plumed2/blob/b1c3c7406918e99f4300af7912dd6445549332cd/src/tools/KernelFunctions.cpp#L177> So it would be easy to add a "stretched Gaussian" to the list of kernels and use that in METAD and SumHills by default, perhaps with a env var to select the new/old behavior. — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#420 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AEXl9rBU0x1Cb8FZF0t-TDl0v-n-yAenks5u3j3AgaJpZM4ZKWns>.

GiovanniBussi · 2018-12-10T11:10:42Z

Actually the code that computes Gaussians is repeated in multiple places. I think we can find all of them using git grep DP2CUTOFF

spiwokv · 2018-12-10T14:09:18Z

You can use moving pre-calculated Gaussians as we did in metadynview and metadynminer.
Vojtech

spiwokv · 2018-12-10T14:10:44Z

Unless you have variable hill widths.

GiovanniBussi · 2018-12-10T14:15:40Z

@spiwokv I see this is making the calculation of the exponential functions faster. However, it will mean that the Gaussian's centers are discretized on the grid, right?

In case we have some evidence that this can make the calculation faster, I think we could add it as an optional feature. I would be careful however in using this as the only choice.

spiwokv · 2018-12-10T15:18:54Z

@GiovanniBussi yes, it is discretized on a grid (i don't see any simple interpolation). I did mean it for sum_hills, not for metadynamics. If you use for example periodic cv from -pi to pi, you can make a hill centered at 0,0 and calculate potential for all points. Next you can role it to center it to CV values. This does all periodicity. For a non-periodic cv you have to calculate it for -2pi to +2pi.

spiwokv · 2018-12-11T10:59:18Z

In metadynminer by fast algorithm at 256x256 grid 2.304877 secs

by slow algorithm 54.28688 secs

difference

histogram of differences

GiovanniBussi mentioned this issue Nov 17, 2020

WIP: MetaD NEIGHBOR list option to sum gaussians #639

Merged

This was referenced Nov 17, 2021

Stretched Gaussians #755

Closed

Stretched Gaussians (2nd version) #757

Merged

GiovanniBussi closed this as completed in #757 Nov 19, 2021

This was referenced Mar 31, 2022

Plumed 2.8.0 fails metadynamics unit test PHOTOX/ABIN#83

Closed

GHA: support Plumed 2.8 PHOTOX/ABIN#102

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sum_hills truncates gaussians #420

sum_hills truncates gaussians #420

mnmelo commented Dec 9, 2018

carlocamilloni commented Dec 10, 2018 via email

maxbonomi commented Dec 10, 2018 via email

GiovanniBussi commented Dec 10, 2018

GiovanniBussi commented Dec 10, 2018

carlocamilloni commented Dec 10, 2018 via email

GiovanniBussi commented Dec 10, 2018

spiwokv commented Dec 10, 2018

spiwokv commented Dec 10, 2018

GiovanniBussi commented Dec 10, 2018

spiwokv commented Dec 10, 2018

spiwokv commented Dec 11, 2018

sum_hills truncates gaussians #420

sum_hills truncates gaussians #420

Comments

mnmelo commented Dec 9, 2018

carlocamilloni commented Dec 10, 2018 via email

maxbonomi commented Dec 10, 2018 via email

GiovanniBussi commented Dec 10, 2018

GiovanniBussi commented Dec 10, 2018

carlocamilloni commented Dec 10, 2018 via email

GiovanniBussi commented Dec 10, 2018

spiwokv commented Dec 10, 2018

spiwokv commented Dec 10, 2018

GiovanniBussi commented Dec 10, 2018

spiwokv commented Dec 10, 2018

spiwokv commented Dec 11, 2018