Experiments with end-to-end training #12

jpata · 2020-01-18T04:39:06Z

Merge GNN code from @jmduarte (Benchmarking GNN #10).
Retrain all baseline models with the Run3 TTbar dataset.
Improve graph_data.py to prepare a more coherent set of input elements x, ordered by block id, with the output candidates y_candidates padded within each block to the same length.
Add tunable, locality-based elements to the distance matrix to improve graph connectivity for message-passing approaches.
Implement an efficient block-by-block loss function, inspired by awkward-array, that compares the average pt, eta, phi of the true and predicted candidates within each true block.
Test various end-to-end training approaches that regress PFCandidates directly from the elements via an intermediate clustering step:
- Message-passing GNN with EdgePooling clustering + declustering (cannot overtrain, max pt correlation ~0.6 to 0.7)
- GCN (as above)
- GraphUNet (as above)
- All-to-all dense baseline (can overtrain by memorizing the dataset)
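
To make the padding and the block-by-block comparison described above concrete, here is a minimal NumPy sketch. It is not the code in graph_data.py: the function names (`pad_by_block`, `masked_block_mean`, `block_mean_loss`), the (pt, eta, phi) column layout and the squared-error form of the loss are all assumptions chosen for illustration.

```python
import numpy as np

def pad_by_block(values, block_ids, pad_value=0.0):
    """Group rows of `values` by block id and pad each block to the same length.

    Hypothetical helper. Returns (padded, mask):
    padded has shape (n_blocks, max_len, n_features),
    mask is True wherever a real (non-padded) row sits.
    """
    values = np.asarray(values, dtype=float)
    block_ids = np.asarray(block_ids)
    unique_ids = np.unique(block_ids)
    max_len = max(int(np.sum(block_ids == b)) for b in unique_ids)
    padded = np.full((len(unique_ids), max_len, values.shape[1]), pad_value)
    mask = np.zeros((len(unique_ids), max_len), dtype=bool)
    for i, b in enumerate(unique_ids):
        rows = values[block_ids == b]
        padded[i, :len(rows)] = rows
        mask[i, :len(rows)] = True
    return padded, mask

def masked_block_mean(padded, mask):
    """Per-block mean over real rows only; empty blocks yield zeros."""
    counts = np.maximum(mask.sum(axis=1, keepdims=True), 1)
    return (padded * mask[..., None]).sum(axis=1) / counts

def block_mean_loss(true_padded, true_mask, pred_padded, pred_mask):
    """Mean squared difference of the per-block averages (assumed loss form)."""
    diff = (masked_block_mean(true_padded, true_mask)
            - masked_block_mean(pred_padded, pred_mask))
    return float(np.mean(diff ** 2))

# Three candidates with assumed columns (pt, eta, phi); two of them share block 0.
cands = [[10.0, 0.5, 0.1], [20.0, 0.7, 0.3], [5.0, -1.0, 2.0]]
padded, mask = pad_by_block(cands, block_ids=[0, 0, 1])
loss = block_mean_loss(padded, mask, padded, mask)
```

One caveat of this sketch: phi is a periodic variable, so a plain arithmetic mean is only meaningful when the candidates in a block do not straddle the wrap-around.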


jpata · 2020-01-27T18:56:08Z

This is now ready to be merged: we have a baseline end-to-end training from elements to PFCandidates that seems to work reasonably well.

The following plots were made with 1k test events not seen during training.

Confusion matrix of true vs predicted candidate pdgid (0 - no candidate):
confusion.pdf
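
For readers who want to reproduce a matrix like this one, a minimal sketch follows. The function name `confusion_matrix` and the class list are assumptions; the source fixes only that class 0 means "no candidate".

```python
import numpy as np

def confusion_matrix(true_ids, pred_ids, classes):
    """Count (true, predicted) label pairs; rows are true classes, columns predicted."""
    index = {c: i for i, c in enumerate(classes)}
    mat = np.zeros((len(classes), len(classes)), dtype=int)
    for t, p in zip(true_ids, pred_ids):
        mat[index[t], index[p]] += 1
    return mat

# Assumed label set: 0 = no candidate, then a few PDG ids for illustration.
classes = [0, 211, 130, 22]
true_ids = [0, 211, 211, 22]
pred_ids = [0, 211, 130, 22]
mat = confusion_matrix(true_ids, pred_ids, classes)
```

Dividing each row by its sum gives per-class efficiencies, which is usually easier to read when the classes are strongly imbalanced.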

Number of true vs predicted candidates per event:
num_corr.pdf

True vs predicted pt of 1000 candidates in one test event:
pt_corr_0.pdf
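
A true-vs-predicted pt comparison like this can be summarized by a single correlation number. The sketch below assumes a Pearson correlation and assumes that padded or "no candidate" slots carry pt = 0 and should be skipped; neither choice is confirmed by this thread, and `pt_correlation` is a hypothetical name.

```python
import numpy as np

def pt_correlation(true_pt, pred_pt):
    """Pearson correlation of true vs predicted pt over slots where both are non-zero."""
    true_pt = np.asarray(true_pt, dtype=float)
    pred_pt = np.asarray(pred_pt, dtype=float)
    # Assumption: pt == 0 marks a padded slot or a missing candidate.
    matched = (true_pt > 0) & (pred_pt > 0)
    if matched.sum() < 2:
        return float("nan")  # correlation is undefined for fewer than two points
    return float(np.corrcoef(true_pt[matched], pred_pt[matched])[0, 1])

corr = pt_correlation([1.0, 2.0, 3.0, 0.0], [2.0, 4.0, 6.0, 5.0])
```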

* start adding gnn
* add gnn to benchmarks
* Update run_training.sh
* update graph_data and EdgeNet to include edge_attr and benchmarking
* add notebook for plotting
* added first end-to-end training example
* up
* up
* up
* added end2end training examples
* up
* up
* up
* up
* up
* up
* cmdline args
* added sequential conv
* added cls accuracy monitoring
* up
* remove additional edges
* add act
* dataset location
* elem id encoding, fix norm
* fix nans
* add ntest
* added num pred and true plotting:
* added npy file saving
* switch to relu
* pfnet7 same setup as others
* up
* loss coefs configurable
* added model to predict only id
* fix bugs with relabeling
* fix plot title
* cosmetic
* up
* up
* added sinkhorn loss
* fixes
* added reordering code
* fix printout, reweighting
* added class weighting
* update readme
* dropout configurable, simplify cross-check model
* fix weight application
* update weights

Co-authored-by: Javier Duarte <jduarte@ucsd.edu>
Former-commit-id: 779431e


jmduarte and others added 30 commits November 27, 2019 16:01

start adding gnn

97dc87d

Merge branch 'master' of github.com:jpata/particleflow

747947b

add gnn to benchmarks

fead6b5

Update run_training.sh

9e7bf53

update graph_data and EdgeNet to include edge_attr and benchmarking

4247d5c

add notebook for plotting

8a7c654

added first end-to-end training example

bbca252

up

380f3d0

Merge branch 'gnn_jmd_v2' into endtoend_gnn

dd5df68

up

0267ba6

Merge branch 'endtoend_gnn' of https://github.com/jpata/particleflow into endtoend_gnn

3f6bd14

up

af1046d

added end2end training examples

3621e15

up

d30b858

up

6cc0a74

up

ef87251

up

37e1cfd

up

d9b0299

up

cdb38b6

cmdline args

020ab79

added sequential conv

b2cddd4

added cls accuracy monitoring

092504a

up

21abfe1

remove additional edges

68a57d7

add act

628e5c0

dataset location

6e19786

elem id encoding, fix norm

3bba476

fix nans

184b093

add ntest

9fa2ab0

added num pred and true plotting:

784ed1e

jpata and others added 16 commits January 23, 2020 09:23

Merge branch 'endtoend_gnn' of https://github.com/jpata/particleflow into endtoend_gnn

0b787c2

added npy file saving

6d36494

switch to relu

4a1df05

pfnet7 same setup as others

2fd750e

up

c09a839

loss coefs configurable

ba999c9

added model to predict only id

9b6339b

fix bugs with relabeling

a9ba8ec

fix plot title

a706599

cosmetic

23f708f

Merge branch 'endtoend_gnn' of https://github.com/jpata/particleflow into endtoend_gnn

b410b39

up

81d5d52

up

fac6586

added sinkhorn loss

fa53fcb

fixes

e969a67

added reordering code

53d7ff4

jpata and others added 2 commits January 28, 2020 14:12

fix printout, reweighting

926eef3

added class weighting

283c31e

jpata mentioned this pull request Jan 29, 2020

end to end PF regression #14

Closed

4 tasks

jpata and others added 4 commits January 29, 2020 11:32

update readme

5a9e791

dropout configurable, simplify cross-check model

cd62a86

fix weight application

49d7589

update weights

31e8d07

jpata merged commit 779431e into master Jan 30, 2020

jpata mentioned this pull request Jan 30, 2020

Benchmarking GNN #10

Closed

jpata deleted the endtoend_gnn branch February 19, 2020 23:08

erwulff added a commit that referenced this pull request Feb 9, 2024

Merge pull request #12 from erwulff/dev_feb24_flatiron

08f0572

Dev feb24 flatiron
