Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[WIP] MET finetuning #320

Draft
wants to merge 97 commits into
base: main
Choose a base branch
from
Draft

[WIP] MET finetuning #320

wants to merge 97 commits into from

Conversation

farakiko
Copy link
Collaborator

Implements a training script that performs MET finetuning using a pre-trained backbone MLPF.

farakiko and others added 30 commits July 1, 2024 12:40
* Update README.md

* Update README.md

* Update README.md

* document running script

* md5sum

* up

* clean
* fix validation scripts

* dqm script

* fccee_cld postprocessing

* update submission script

* update validation script

* finalize dqm part

* add data link
* enable onnx export via dynamo with dynamic shapes

* added standalone export script

* fp16 quantization sort of works also

* use sdpa

* MultiheadAttention op runs

* update timing study

* cleanup

* model closes

* update timing study

* onnx is factorized

* update onnx script

* revert main model code

* move to notebook
* update validation for cmssw 14

* it's running

* update dqm for cmssw 14

* update with link

* update recipe

* added runtime plot notebook
…nd vbf to training (jpata#330)

* update cmssw plots, add ttbar sample to valid

* update validation notebook

* disable ray for now

* update README [skip ci]

* remove DQM part [skip ci]
… add per-particle ispu flag (jpata#332)

* generate ttbar nopu events

* up

* update postprocessing

* small sample generation

* v3_1 run

* updates for CMSSE 14 generation

* [skip ci] cleanup postprocessing

* [skip ci] update pu gen

* update postprocessing with new truth definition based only on caloparticles

* remove pdb, switch genjet to energy

* [skip ci] prepare for v3_3

* [skip ci] fix flag

* added time and mem limits

* pu files from scratch

* 20240702_cptruthdef submission

* ttbar nopu v2

* up

* added genjet, genmet to clic postprocessing

* remove delphes

* update tests

* add postprocessing jobs

* update torch

* update dataset version

* propagate genjets, genmet

* shared memory error

* training on v2.0.0 for cms

* fix occasional root file load bug

* add jmenano

* fix qq

* clic training

* up
* CMS training instructions
* remove correct dir

* update samples

* add samples

* fix supervised key

* resubmit training

* add missing

* add mpgun

* plt target and gen separately

* separate submission scripts

* add stats

* fix softmax bug

* add CLD sample

* add CLD sample

* add finetuning script

* add genjob script
* chore: update raytune search space, utils and startscript

* fix: raytune deprecated env var for storage_path

Also add num samples to draw in HPO as cmd line arg

* chore: update clic config file for jureap57

* feat: script to build python env from scratch

* chore: update startscripts for raytrain and raytune

* fix CMS model path for ACAT2022

* MLPF datasets v2.0.0: track pythia-level genjets, genmet in datasets; add per-particle ispu flag (jpata#332)

* generate ttbar nopu events

* up

* update postprocessing

* small sample generation

* v3_1 run

* updates for CMSSE 14 generation

* [skip ci] cleanup postprocessing

* [skip ci] update pu gen

* update postprocessing with new truth definition based only on caloparticles

* remove pdb, switch genjet to energy

* [skip ci] prepare for v3_3

* [skip ci] fix flag

* added time and mem limits

* pu files from scratch

* 20240702_cptruthdef submission

* ttbar nopu v2

* up

* added genjet, genmet to clic postprocessing

* remove delphes

* update tests

* add postprocessing jobs

* update torch

* update dataset version

* propagate genjets, genmet

* shared memory error

* training on v2.0.0 for cms

* fix occasional root file load bug

* add jmenano

* fix qq

* clic training

* up

* CMS training instructions (jpata#336)

* CMS training instructions

* Update pyg-clic.yaml

* Update pyg-clic.yaml

* fix: black formatting

* Enable CI/CD test of HPO workflow

* fix: typo in test script

---------

Co-authored-by: Joosep Pata <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants