vegasflow+pineappl example #56

Merged: 18 commits into master from pineappl, Sep 24, 2020
Conversation

@scarrazza (Contributor) commented Sep 11, 2020

Here is a first prototype. However, there are 3 points which require investigation:

  • apply multiple custom cuts
  • make the fill function async (i.e. fight with pickle)
  • replace lhapdf with pdfflow (i.e. make sure pdfflow works in eager mode)

@scarlehoff (Member)

Pdfflow should work in eager mode, 100%.

What problem do you have with pickle? In #55 I found out that tensorflow 2.2 has some unpicklable internal state that tf 2.3 doesn't have.

@scarrazza (Contributor, Author)

Concerning pdfflow (master, tf2.3), here is a short example which fails in eager mode:

import tensorflow as tf
tf.config.run_functions_eagerly(True)  # force eager execution
from pdfflow.pflow import mkPDF

pdf = mkPDF('NNPDF31_nlo_as_0118_luxqed/0')
pdf.alphasQ2([10.5])  # called with a plain python list

this fails with:

InvalidArgumentError: cannot compute GreaterEqual as input #1 (zero-based) was expected to be a float tensor but is a double tensor [Op:GreaterEqual]

Concerning pickle, I have tried the multiprocessing apply_async function:

pool.apply_async(fill, [x1, x2, q2, yll, weight])

however this fails with:

AttributeError: Can't get attribute 'fill' on <module '__main__' from 'example_pineappl.py'>

@scarlehoff (Member)

> Concerning pdfflow (master, tf2.3), here is a short example which fails in eager mode:
>
> import tensorflow as tf
> tf.config.run_functions_eagerly(True)
> from pdfflow.pflow import mkPDF
> pdf = mkPDF('NNPDF31_nlo_as_0118_luxqed/0')
> pdf.alphasQ2([10.5])
>
> this fails with:
>
> InvalidArgumentError: cannot compute GreaterEqual as input #1 (zero-based) was expected to be a float tensor but is a double tensor [Op:GreaterEqual]

You are calling the tf interface with a python object. You have to either use py_alpha or use float_me.

This is a very tricky point; in one of the pdfflow PRs the error is caught and you get told (at least for xfq... maybe I forgot alphas!)
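For illustration, a minimal sketch of the fixed call (the import path of float_me from pdfflow.configflow is an assumption here):

import tensorflow as tf
tf.config.run_functions_eagerly(True)

from pdfflow.configflow import float_me  # assumed location of float_me
from pdfflow.pflow import mkPDF

pdf = mkPDF('NNPDF31_nlo_as_0118_luxqed/0')
# cast the python list to a tensor with the right float dtype
# before calling the tf interface
pdf.alphasQ2(float_me([10.5]))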

> Concerning pickle, I have tried the multiprocessing apply_async function:
>
> pool.apply_async(fill, [x1, x2, q2, yll, weight])
>
> however this fails with:
>
> AttributeError: Can't get attribute 'fill' on <module '__main__' from 'example_pineappl.py'>

But how do you know this is a pickle error?

@scarrazza (Contributor, Author)

Great, thanks for the pdfflow hint, now it works.

@scarlehoff (Member)

Many thanks for this, this is a fairly complicated and real-world use case.

So, seeing the problem you were having, I should have understood it immediately, but I had completely overlooked the imports. Sorry. For many of these things you cannot pickle stuff to send to different processes; you are allowed to open threads, but they must all be children of the same process (see the sketch below).
(btw, dask would probably also fail for pineappl, pickles are bad)
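As a sketch, the thread-based alternative looks like this (threads share the parent process, so fill never has to be pickled; fill and its arguments are the names from the example above):

from multiprocessing.pool import ThreadPool

pool = ThreadPool(processes=1)
# threads live in the same process, so `fill` needs no pickling
result = pool.apply_async(fill, (x1, x2, q2, yll, weight))
result.wait()  # make sure the fill finished before reusing the buffers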

Btw, this example highlights the need for a feature to pass arguments around. Neither the usage of partial nor the importance of avoiding tf.function before the partial is obvious. So compile needs to check whether the input function is a tensorflow function and, if it is, it should fail, telling the user "look, you can't do this in this order".
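A sketch of such a guard (compile_integrand is a hypothetical name; checking for the python_function attribute, which tf.function-wrapped callables expose, is one heuristic way to detect them):

from functools import partial
import tensorflow as tf

def compile_integrand(integrand, **bound_args):
    # tf.function objects expose the wrapped callable as .python_function;
    # if we see one here, the user applied tf.function before partial
    if hasattr(integrand, "python_function"):
        raise TypeError("look, you can't do this in this order: "
                        "bind arguments with partial first, then tf.function")
    return tf.function(partial(integrand, **bound_args))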

@scarrazza (Contributor, Author)

Right, thanks for spotting this. The current async implementation is 10x faster (CPU).
At this point the only remaining minor point is how to apply cuts in an "elegant" way, in particular the conditions in lines 78-80.
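One possible pattern for stacking cuts (a sketch with hypothetical cut values and variable names, not the actual conditions in lines 78-80): build one boolean mask per condition, combine them, and select the passing events.

import tensorflow as tf

def apply_cuts(yll, mll, weight):
    # combine each per-event condition into a single boolean mask
    mask = tf.math.logical_and(yll > -2.4, yll < 2.4)
    mask = tf.math.logical_and(mask, mll > 60.0)
    # keep only the events that pass every cut
    return tf.boolean_mask(weight, mask)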

@scarlehoff scarlehoff added this to the v1.2 milestone Sep 14, 2020
@scarlehoff scarlehoff mentioned this pull request Sep 14, 2020
@scarrazza scarrazza marked this pull request as ready for review September 17, 2020 12:09
@scarrazza (Contributor, Author)

@cschwan, in principle this PR is ready. After applying cuts, the numbers I get are similar to the python example in pineappl.
Could you please have a look and cross-check?

@scarrazza (Contributor, Author)

This default configuration takes ~2s on CPU and ~1s on GPU, so close to a 50% improvement.
If I remove pineappl the GPU time drops to 0.5s, so overall the performance is good.

@cschwan (Member) commented Sep 18, 2020

I need the patch NNPDF/pineappl@74ae0d0 to make the PineAPPL Python API work. The bad news is that the numbers from the grid seem random: when I increase the statistics successively by factors of ten, the numbers change wildly. Since the integrated results from vegasflow are stable, something must be wrong with filling the grid. The integrated result seems to be missing the multiplication with the number of calls.

@cschwan (Member) commented Sep 18, 2020

Are you filling all iterations into a single grid? If so, that will surely give wrong numbers. In that case let's try to use only the last iteration.
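A sketch of how that could look in the driver loop (run_integration taking an iteration count is assumed to be vegasflow's API; the fill_pineappl flag is hypothetical, read by the integrand before calling fill):

# warm up: let vegas adapt its grid without touching the PineAPPL grid
vegas_instance.run_integration(n_iter - 1)
# hypothetical global flag checked inside the integrand
fill_pineappl = True
# only this last iteration fills the PineAPPL grid
vegas_instance.run_integration(1)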

@cschwan (Member) commented Sep 18, 2020

Another concern is that the example doesn't convolve the matrix elements with PDFs. With a PLAIN MC that's fine, but I reckon that the adaptation can go wrong if you use VEGAS without PDFs, because the PDFs change the importance of the integrand.

@cschwan (Member) commented Sep 18, 2020

The last commit breaks the correctness. You can't fill the grids asynchronously, at least not that easily.

@scarlehoff (Member)

Ah, I was indeed wondering whether the result seemed correct by chance.
In any case, passing the .numpy() seems to make it go much faster (I think due to the loop). This should make it work.

@cschwan (Member) commented Sep 18, 2020

Strange, the results are still wrong and I can't tell you why. The fact that it's faster suggests it does less work, but that's only a guess, of course. I didn't pull the latest commit...

@scarlehoff (Member) commented Sep 18, 2020

No, I just checked, and it is slower because iterating over a tensorflow tensor is slower than over a numpy array.
I compared the values in the .numpy() and they are numerically the same as in the tensorflow tensor.

What do you mean by wrong results? (what are the right ones)

I see :P so they are right, and the problem is simply that the grid needs to be filled synchronously, which is fair I'd say.

@cschwan (Member) commented Sep 18, 2020

The 'correct' results are the results that are calculated by the PineAPPL example program, which I check against mg5_aMC@NLO.

@scarlehoff (Member)

Oh, I know now why my results were correct in the async case. Of course they were: I was running the whole batch in one go.

@cschwan (Member) commented Sep 18, 2020

When I compare the numbers from commits 1dcf172 and a9f626a I don't get the exact same numbers, but they seem to differ only within the MC uncertainty.

@scarrazza (Contributor, Author)

The async issue should be fixed in my last commit. We forgot to explicitly ask the pool to "wait" for the pineappl fill. @cschwan could you please double-check?

The apply_async seems to queue the fill evaluations. The .numpy() call is pretty useful; it accelerates the integration procedure a lot, since otherwise eager mode performs the tensor conversion element by element, adding a huge overhead.
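For illustration, the pattern looks roughly like this (the per-event fill signature is an assumption, not PineAPPL's actual API):

# convert each tensor to numpy once per batch, outside the loop
x1_np, x2_np, q2_np = x1.numpy(), x2.numpy(), q2.numpy()
yll_np, weight_np = yll.numpy(), weight.numpy()
for i in range(len(weight_np)):
    # hypothetical per-event fill signature
    grid.fill(x1_np[i], x2_np[i], q2_np[i], yll_np[i], weight_np[i])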

Anyway, the bottleneck of the implementation seems to be the pineappl fill: for 10M events and events_limit=1e6 it takes 1 minute on my laptop to complete, whether we run sequentially or asynchronously. Asynchronously the vegasflow time drops from 50s to 23s, but the pool dominates the calculation.

So, following our discussion, I believe we should try the approach suggested by Christopher: create n_events/events_limit pineappl grids and fill them in separate processes. @scarlehoff, is there some way to extract the thread id (batch id) inside the integrand function? If we have that, we can pass an array of pineappl grids and try to fill with apply_async using more than 1 process.

@cschwan (Member) commented Sep 21, 2020

The numbers from the last commit look fine, but they seem to have changed again. Is there a way to ensure that, given N evaluations, the random numbers are always the same? In that case you could unit-test it.

@scarlehoff (Member)

Why would you need the batch id? Every new call should be a different process regardless of the id, right?

Wrt reproducibility, this has been an outstanding issue with tf. One would need to seed numpy and tensorflow, and then hope that the multithreading works similarly between the two runs. Let me have a go at it; maybe it works better with the GPU (for n3fit I can get reproducible results only when running on 1 thread).

@scarrazza (Contributor, Author)

> Why would you need the batch id? Every new call should be a different process regardless of the id, right?

In principle we need to know the thread number only for the last iteration (the one where fill is called). I just tried with a global variable and it works; however, the performance is pretty bad because we are forced to allocate tons of threads, withdrawing computing power from vegasflow. So I think we should keep the single-thread fill implementation and try to get rid of the python for loop.

@scarlehoff (Member)

But all batches must fill the grid.

The python loop is surely hurting; can't pineappl take arrays? Numpy arrays can be passed to C.

@cschwan (Member) commented Sep 21, 2020

> Anyway, the bottleneck of the implementation seems to be the pineappl fill: for 10M events and events_limit=1e6 it takes 1 minute on my laptop to complete, whether we run sequentially or asynchronously. Asynchronously the vegasflow time drops from 50s to 23s, but the pool dominates the calculation.

This is expected, since the matrix element and the phase space generation in this example are extremely cheap. For more realistic scenarios the timings will be more favourable.

@scarlehoff (Member)

This seems to work. I'm not sure it would work as expected if not running eagerly, though... I need to look into this, and I'll add a set_seed method; it will be useful.
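A minimal sketch of what such a set_seed could do (seeding both RNGs; as noted above, this alone does not guarantee reproducibility across threads):

import numpy as np
import tensorflow as tf

def set_seed(seed):
    # seed both generators; thread scheduling can still break reproducibility
    np.random.seed(seed)
    tf.random.set_seed(seed)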

@cschwan (Member) commented Sep 21, 2020

> The python loop is surely hurting; can't pineappl take arrays? Numpy arrays can be passed to C.

I've implemented a function in NNPDF/pineappl@bfb0d40. Is this what you need? If yes, we also need the corresponding function in the Python interface.
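Once exposed in the Python interface, the call could look like this (a sketch; the argument list of fill_array is assumed, not the actual signature):

# one vectorized call per batch instead of a python-level loop
grid.fill_array(x1.numpy(), x2.numpy(), q2.numpy(),
                yll.numpy(), weight.numpy())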

@scarrazza (Contributor, Author)

Yes, that's good, thanks!

@scarrazza (Contributor, Author)

Could you please port this function to the C API?

@cschwan (Member) commented Sep 21, 2020

@scarrazza That is the C API.

@scarrazza (Contributor, Author)

Strange, I have recompiled and the header does not contain this function, and I get an undefined symbol: pineappl_grid_fill_array.

@scarrazza (Contributor, Author)

Ok, here are some results (CPU and GPU) for 10M events.

  • i9-9980xe:
    • vegas time without pineappl: 7.6s
    • vegas time with pineappl: 7.8s
    • vegas+pool time: 9.4s
  • rtx2080ti:
    • vegas time without pineappl: 2.8s
    • vegas time with pineappl: 2.8s
    • vegas+pool time: 5.1s

So, pineappl is really well integrated and adds a minimal overhead.

@scarlehoff (Member)

I guess point two shows one interesting advantage: if your integrator is running on the GPU, the CPU is free to do its thing without having an effect on the integrator. Although that 0.2s difference might be a fluctuation :P

@scarlehoff scarlehoff merged commit 1bf91f2 into master Sep 24, 2020
@scarlehoff scarlehoff deleted the pineappl branch September 24, 2020 18:20