Remove empty unused x1 and x2 grid values in subgrids #151

Closed
Tracked by #45
cschwan opened this issue Jul 2, 2022 · 7 comments · Fixed by #193
Assignees: cschwan
Labels: enhancement (New feature or request)

Comments

cschwan (Contributor) commented Jul 2, 2022

We already remove unused grid points in the muf2/mur2 dimension, and we should be able to do the same with x1 and x2. This might require some modifications here and there, but it should reduce the size of each subgrid, since subgrids are usually non-zero only in a small range of x.

This will require changing Grid::optimize to also modify ImportOnlySubgridV1 subgrids.
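
As a rough illustration of the intended optimization, here is a minimal sketch in plain Rust; the function name `trim_empty_x` and the dense `Vec<Vec<f64>>` layout are assumptions for illustration and not the actual `ImportOnlySubgridV1` representation.

```rust
/// Hypothetical sketch: drop x1/x2 grid values whose rows/columns are entirely
/// zero, so that a subgrid only stores the x range in which it is non-zero.
/// The dense `array[ix1][ix2]` layout stands in for one slice of a real
/// subgrid; it is not the actual `ImportOnlySubgridV1` representation.
fn trim_empty_x(
    array: &[Vec<f64>], // array[ix1][ix2]
    x1: &[f64],
    x2: &[f64],
) -> (Vec<Vec<f64>>, Vec<f64>, Vec<f64>) {
    // indices of x1 values whose row contains at least one non-zero entry
    let keep1: Vec<usize> = (0..x1.len())
        .filter(|&i| array[i].iter().any(|&v| v != 0.0))
        .collect();
    // indices of x2 values whose column contains at least one non-zero entry
    let keep2: Vec<usize> = (0..x2.len())
        .filter(|&j| array.iter().any(|row| row[j] != 0.0))
        .collect();

    // keep only the selected rows and columns together with their x values
    let new_array = keep1
        .iter()
        .map(|&i| keep2.iter().map(|&j| array[i][j]).collect())
        .collect();
    let new_x1 = keep1.iter().map(|&i| x1[i]).collect();
    let new_x2 = keep2.iter().map(|&j| x2[j]).collect();

    (new_array, new_x1, new_x2)
}

fn main() {
    // 3x3 toy subgrid that is non-zero only at the central (x1, x2) point
    let array = vec![
        vec![0.0, 0.0, 0.0],
        vec![0.0, 1.5, 0.0],
        vec![0.0, 0.0, 0.0],
    ];
    let (trimmed, x1, x2) = trim_empty_x(&array, &[0.1, 0.2, 0.4], &[0.1, 0.2, 0.4]);
    assert_eq!(trimmed, vec![vec![1.5]]);
    assert_eq!((x1, x2), (vec![0.2], vec![0.2]));
}
```

In the actual grid, `Grid::optimize` would apply this kind of filtering to each `ImportOnlySubgridV1`, as noted above.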

cschwan added the enhancement label and self-assigned this on Jul 2, 2022
cschwan (Contributor, Author) commented Sep 9, 2022

@alecandido @felixhekhorn I am working on this change; it will not decrease the file sizes, but I hope it will improve the performance of the evolution a bit. The price we have to pay for this is that, instead of always having the same x-grid everywhere - the usual 50 points - the grids return a subset of these 50 points. So while the x-grid points themselves are still the same, each process uses a different range of them. This might interfere with the caching of the EKOs; what do you think?

felixhekhorn (Contributor):

Caching? The best we do at the moment is just the slice here (and that is on pid, so it is unaffected): https://github.com/N3PDF/pineappl/blob/d924e9e41798c1f755849fd054938c05bbdb99c2/pineappl/src/grid.rs#L1740
Afterwards we have the index remap, which, as far as I understand, would need to be shifted down to the subgrid level:
https://github.com/N3PDF/pineappl/blob/d924e9e41798c1f755849fd054938c05bbdb99c2/pineappl/src/grid.rs#L1754

cschwan (Contributor, Author) commented Sep 9, 2022

Sorry, I meant caching of the EKOs in pineko, or probably somewhere higher up. It might be a problem that every grid we'd like to evolve then has a different x-grid (well, technically a subset, as I said).

felixhekhorn (Contributor):

OK, that is not a problem - with the output rework my idea was to compute the EKOs just by Q2 and then do the interpolation in x, at the fitting or process scale, only on demand.
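
A sketch of what "interpolate in x on demand" could look like, reduced to a single flavour and written in plain Rust; `rotate_to_requested_x` and the flat matrix layout are illustrative assumptions, not the eko or pineko API.

```rust
/// Hypothetical sketch: an EKO computed once per Q2 on the full theory x-grid
/// (here a plain 2D matrix `op[x_out][x_in]` for a single flavour) is rotated
/// on demand to whatever x points a grid requests, by applying an
/// interpolation matrix `r[requested][x_out]`.
fn rotate_to_requested_x(r: &[Vec<f64>], op: &[Vec<f64>]) -> Vec<Vec<f64>> {
    // plain matrix product: result[a][j] = sum_i r[a][i] * op[i][j]
    r.iter()
        .map(|row| {
            (0..op[0].len())
                .map(|j| row.iter().zip(op).map(|(ri, opi)| ri * opi[j]).sum())
                .collect()
        })
        .collect()
}

fn main() {
    // toy operator on a 3-point x-grid; the "requested" grid is the subset
    // {x_0, x_2}, so the interpolation matrix is just a row selection
    let op = vec![
        vec![1.0, 0.1, 0.0],
        vec![0.0, 0.8, 0.2],
        vec![0.0, 0.0, 0.9],
    ];
    let r = vec![vec![1.0, 0.0, 0.0], vec![0.0, 0.0, 1.0]];
    assert_eq!(
        rotate_to_requested_x(&r, &op),
        vec![vec![1.0, 0.1, 0.0], vec![0.0, 0.0, 0.9]]
    );
}
```

When the requested points are a subset of the theory x-grid, as in the toy example, the interpolation matrix reduces to a row selection.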

alecandido (Member) commented Sep 9, 2022

Of course we can do that if needed (that's why we implemented it in the first place), but I'd try to avoid re-interpolating as much as possible.

I.e.: everything is fine, and we'll do as @felixhekhorn said if we have to. But since we agreed to have a single x-grid for the theory, it would be ideal to have all the PineAPPL grids use the same one, if possible. This will automatically skip re-interpolation when it is not needed.

In any case, a subset is better than the other options, since there will be less interpolation error.

cschwan (Contributor, Author) commented Sep 10, 2022

> I.e.: everything is fine, and we'll do as @felixhekhorn said if we have to. But since we agreed to have a single x-grid for the theory,

This won't change.

> it would be ideal to have all the PineAPPL grids use the same one, if possible. This will automatically skip re-interpolation when it is not needed.

That's the case: you should be able to feed the same set of EKOs to Grid::convolute_eko, and it'll select the necessary subset. But Grid::eko_info will return a smaller set of x-grid points, and I wonder whether, higher up in the toolchain, this will lead to regenerating many subsets of EKOs of the original 50x50 over and over for the newly implemented datasets in the current pineko implementation. But it's certainly possible to avoid that.

alecandido (Member):

> That's the case: you should be able to feed the same set of EKOs to Grid::convolute_eko, and it'll select the necessary subset.

The point is that if you are going to use a smaller set of points, the interpolation basis is different, and thus so are the interpolation coefficients (you also have to account for the missing part). So re-interpolation is needed (though for a subset it is certainly advantageous).
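
A generic illustration of this point, using plain Lagrange interpolation rather than the interpolation actually used by eko or PineAPPL: the basis polynomials, and therefore every coefficient, depend on the whole node set, so dropping nodes changes all of them.

```rust
/// Lagrange basis weights at the point `x` for the given interpolation nodes:
/// weight_j(x) = prod_{k != j} (x - x_k) / (x_j - x_k).
fn lagrange_weights(nodes: &[f64], x: f64) -> Vec<f64> {
    nodes
        .iter()
        .enumerate()
        .map(|(j, &xj)| {
            nodes
                .iter()
                .enumerate()
                .filter(|&(k, _)| k != j)
                .map(|(_, &xk)| (x - xk) / (xj - xk))
                .product()
        })
        .collect()
}

fn main() {
    let full = [0.1, 0.2, 0.4, 0.8];
    let subset = [0.2, 0.4, 0.8]; // the same nodes minus one
    // removing a single node changes every remaining weight
    println!("{:?}", lagrange_weights(&full, 0.3));
    println!("{:?}", lagrange_weights(&subset, 0.3));
}
```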

> But Grid::eko_info will return a smaller set of x-grid points, and I wonder whether, higher up in the toolchain, this will lead to regenerating many subsets of EKOs of the original 50x50 over and over for the newly implemented datasets in the current pineko implementation. But it's certainly possible to avoid that.

As @felixhekhorn said: if it's not already the case, we'll make sure that a new EKO won't be recomputed (unless strictly needed, e.g. because of a new Q2 request); instead, the available one will just be re-interpolated.

However, what is already happening is that one EKO is generated for each dataset, since in general they have different Q2 values, even when the x-grid is actually the same. The "unique" EKO for the theory is only the one needed to evolve the fit result.
Our goal is to transition to a two-EKOs-per-theory scheme, in which we'll have:

  • a small EKO, the one already present, to evolve the fit;
  • a big EKO, which will accumulate all the pieces needed to evolve all the datasets (only possible if the accumulation happens on disk; it is unfeasible for a fully in-memory approach).

cschwan linked a pull request on Dec 7, 2022 that will close this issue