Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

in bbWW we use only one part of HH sample for training #15

Closed
acarvalh opened this issue Jul 20, 2020 · 3 comments
Closed

in bbWW we use only one part of HH sample for training #15

acarvalh opened this issue Jul 20, 2020 · 3 comments

Comments

@acarvalh
Copy link

acarvalh commented Jul 20, 2020

We do not need two trainings.

By now, the half of the HH non-res LO events (== events that need reweighing) not used in application is the odd one.
In the BDT mode it loads all.

That is followed in this commit and this commit

One needs to pay attention

  1. add resonant to the logic in default mode
  2. In training code explicitly throw away the part to be used on application

ps.: as no training is done in HH non-res NLO that does not need to be added to this logic

@veelken
Copy link

veelken commented Jul 21, 2020

Hi Xandra,

sorry, what is the motivation for introducing this special case ?
Why should we not use the standard odd-even method for all BDTs and all LBNs regardless of if we are processing signal MC, background MC, or data ?

Cheers,

Christian

@acarvalh
Copy link
Author

Hi

There is nothing new there.
That is what we do for tHq sample in ttH analysis, for instance (we used 1/3 to application and 2/3 for training). Also there I wrote first really like this there, then Karl encapsulated in a fancy way.. I just did not make his fancy way.

The motivation of doing differently from HH multilepton is that in bbWW we do have sample statistics to make only one MVA file. That is what I argued in this talk slide 5 and there was no opposition to it, so I went ahead.

However, as in bbWW there are other groups doing MVA's other than in TLL fw I will do later today a survey in Skype on what the other groups are doing since at least you did not get that comment.
The idea is we all be uniform and avoid confusion of each BDT case doing it differently (in case they intend to do differently).

cheers

@acarvalh
Copy link
Author

By not I will convert best to the 2-fold method for the sake of baseline.

We keep in mind thought that for DNNs the plan seem to be using cross validation == in poor wording, having one MVA file to all events and this file will do the magic

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants