[WIP] MET finetuning #320

farakiko · 2024-05-15T08:21:20Z

Implements a training script that performs MET finetuning using a pre-trained backbone MLPF.

* Update README.md * Update README.md * Update README.md * document running script * md5sum * up * clean

* May 2024 training

* fix validation scripts * dqm script * fccee_cld postprocessing * update submission script * update validation script * finalize dqm part * add data link

* enable onnx export via dynamo with dynamic shapes * added standalone export script * fp16 quantization sort of works also * use sdpa * MultiheadAttention op runs * update timing study * cleanup * model closes * update timing study * onnx is factorized * update onnx script * revert main model code * move to notebook

* update validation for cmssw 14 * it's running * update dqm for cmssw 14 * update with link * update recipe * added runtime plot notebook

…nd vbf to training (jpata#330) * update cmssw plots, add ttbar sample to valid * update validation notebook * disable ray for now * update README [skip ci] * remove DQM part [skip ci]

… add per-particle ispu flag (jpata#332) * generate ttbar nopu events * up * update postprocessing * small sample generation * v3_1 run * updates for CMSSE 14 generation * [skip ci] cleanup postprocessing * [skip ci] update pu gen * update postprocessing with new truth definition based only on caloparticles * remove pdb, switch genjet to energy * [skip ci] prepare for v3_3 * [skip ci] fix flag * added time and mem limits * pu files from scratch * 20240702_cptruthdef submission * ttbar nopu v2 * up * added genjet, genmet to clic postprocessing * remove delphes * update tests * add postprocessing jobs * update torch * update dataset version * propagate genjets, genmet * shared memory error * training on v2.0.0 for cms * fix occasional root file load bug * add jmenano * fix qq * clic training * up

* CMS training instructions

* remove correct dir * update samples * add samples * fix supervised key * resubmit training * add missing * add mpgun * plt target and gen separately * separate submission scripts * add stats * fix softmax bug * add CLD sample * add CLD sample * add finetuning script * add genjob script

* chore: update raytune search space, utils and startscript * fix: raytune deprecated env var for storage_path Also add num samples to draw in HPO as cmd line arg * chore: update clic config file for jureap57 * feat: script to build python env from scratch * chore: update startscripts for raytrain and raytune * fix CMS model path for ACAT2022 * MLPF datasets v2.0.0: track pythia-level genjets, genmet in datasets; add per-particle ispu flag (jpata#332) * generate ttbar nopu events * up * update postprocessing * small sample generation * v3_1 run * updates for CMSSE 14 generation * [skip ci] cleanup postprocessing * [skip ci] update pu gen * update postprocessing with new truth definition based only on caloparticles * remove pdb, switch genjet to energy * [skip ci] prepare for v3_3 * [skip ci] fix flag * added time and mem limits * pu files from scratch * 20240702_cptruthdef submission * ttbar nopu v2 * up * added genjet, genmet to clic postprocessing * remove delphes * update tests * add postprocessing jobs * update torch * update dataset version * propagate genjets, genmet * shared memory error * training on v2.0.0 for cms * fix occasional root file load bug * add jmenano * fix qq * clic training * up * CMS training instructions (jpata#336) * CMS training instructions * Update pyg-clic.yaml * Update pyg-clic.yaml * fix: black formatting * Enable CI/CD test of HPO workflow * fix: typo in test script --------- Co-authored-by: Joosep Pata <[email protected]>

… classifier (jpata#340) * dataset relabeling

farakiko added 18 commits April 29, 2024 12:01

update clic backbone config

48cbbef

up MET training v1

57aeaa5

add ttbar yaml script

1814d7c

update args of loader

aa0fb6c

ue only the last latentX

2884c4a

must retrieve "ycand"

2f5633f

add --freeze-backbone arg

5a5ebc5

remove charge

b397188

update input_dim

9f28112

fix --freeze-backbone

1442cbb

up

98bf94a

up

e0fce79

up

5359da9

up

05811e4

up

3f4dd74

up

b2d2fdd

up fix float

5a1175f

up freezing backbone and opt

447465c

jpata mentioned this pull request May 15, 2024

implement a per-particle weight to predict a MET correction #299

Open

farakiko added 11 commits May 15, 2024 10:49

fix mlpf.train

7caacab

from bfloat16 to float16

a7500b6

up bfloat16

41f3d4f

up

484e9e3

float64

0685aa2

up

f860bc5

up

2e9c8e8

up 20

b1292fa

up 10

36046eb

up 5

d23ea61

revert

3dd5157

farakiko and others added 30 commits July 1, 2024 12:40

up

88a198a

up ReLU and MSE

7996dce

up

651cd39

CMSSW documentation (jpata#319)

4f2cf7a

* Update README.md * Update README.md * Update README.md * document running script * md5sum * up * clean

Full CMS pytorch training in May 2024 (jpata#316)

4c213a0

* May 2024 training

update CMSSW validation scripts and documentation (jpata#322)

6f25fa3

* fix validation scripts * dqm script * fccee_cld postprocessing * update submission script * update validation script * finalize dqm part * add data link

switch onnx model to full float for cmssw compat (jpata#325)

eafc016

Update validation scripts to CMSSW_14_1_0 (jpata#323)

a01e74d

* update validation for cmssw 14 * it's running * update dqm for cmssw 14 * update with link * update recipe * added runtime plot notebook

update cmssw plots, add ttbar sample to valid, add multiparticlegun a…

fe81bd4

…nd vbf to training (jpata#330) * update cmssw plots, add ttbar sample to valid * update validation notebook * disable ray for now * update README [skip ci] * remove DQM part [skip ci]

Update README.md

7658452

fix CMS instructions (jpata#334)

5f4384f

fix CMS model path for ACAT2022

08dcbe3

CMS training instructions (jpata#336)

1499838

* CMS training instructions

CMS dataset relabel, generate v2.1.0 with more stats, separate binary…

4e7236e

… classifier (jpata#340) * dataset relabeling

back to where we were

ddfb1a4

push cld config

344259d

allow cld as a choice for --dataset

901632e

add cld options

a873209

oops

347b565

fix load

dc5b926

up

eddcbdb

up

14da090

up

e1bb077

fix stale patience

ce40222

fix

f3533b1

fix start_epoch

f540d1b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] MET finetuning #320

[WIP] MET finetuning #320

farakiko commented May 15, 2024

[WIP] MET finetuning #320

Are you sure you want to change the base?

[WIP] MET finetuning #320

Conversation

farakiko commented May 15, 2024