The scripts in this repo are designed to calibrate the tau energy scale (pt spectrum) at the ATLAS detector using Mixture Density Networks (MDN).
This code runs out of the box on a lxplus
node with a cern computing account. If the user does not have access to these, the dataset will have to be downloaded.
If you are on a computing node with access to CVMFS (for e.g. CERN's lxplus
, or BNL, Berkley, UChicago, etc.), then run the following before proceeding
setupATLAS
lsetup "python 3.9.14-x86_64-centos7"
First clone the repository, then create a python environment with the provided requirements.txt
file as follows:
python3 -m venv tesenv
source tesenv/bin/activate
pip install -r taunet/requirements.txt
If you are not on an lxplus
node or would like to use a different dataset, specify the path to this data by running
export TAUNET_PATH="<path_to_ntuples>"
Once you have the environment set up you can run the file fit.py
to learn a good network. Running plot.py
will then create plots from this network. One good network is already in the folder final_MDN
. To get plots for this using data from part of the dataset run
python plot.py --path final_MDN --debug --nfiles 1
There is still some validation to be performed in order to merge with ATLAS software. Some ideas are listed below:
- Make some plots of the standard deviation given by the network and compare this to the resolution. Notice that this is not trivial to do based on the target we use to train out network.
- Perhaps train the network with target of
ptTruthVisDressed
. This makes interpretation of the network standard deviation more straightforward. - Remove dependance on combined variables and attempt to gain same performance as in the above work.
- Try networks with more gaussian components until performance begins to fall.
- Perform a final hyperparameter scan to try and squeeze as much out of the network as possible.
To learn a bit more about the current state of this work, see the following presentations of the project:
- An overview of the important results was presented for the tau working group at ATLAS.
- Specifics of the learning used were presented for the EPE-ML group at the University of Washington.
- A general overview of the project was presented to the University of Washington REU program.
Finally, a paper written for the UW REU program giving a full description of the project can be found on the UW REU 2022 page.