A simple config-based tool for high-energy-physics machine learning tasks.
Full documentation available here: https://a-kapoor.github.io/ID-Trainer/
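Since the trainer is driven entirely by a config file, a run usually amounts to pointing the tool at a small Python config. The snippet below is a purely illustrative, hypothetical sketch; every key name here is an assumption, and the real keys and options are defined in the documentation linked above.

```python
# Purely illustrative, hypothetical config sketch -- the real key names and
# options are defined in the ID-Trainer documentation linked above.
OutputDirName = "Output"                    # hypothetical: where plots and models land
Tree = "ntuplizer/tree"                     # hypothetical: TTree path inside the ROOT files
Classes = ["Signal", "Background"]          # hypothetical: binary-classification labels
features = ["pt", "eta", "sieie", "hoe"]    # hypothetical: training variables
MVAs = ["XGB", "DNN"]                       # hypothetical: one entry per MVA to train
```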
| Currently supports | Examples |
|---|---|
| Binary classification (currently using XGBoost and DNN); see the standalone sketch after this table | DY vs ttbar, DY prompt vs DY fake, good electrons vs bad electrons |
| Multi-sample classification (currently using XGBoost and DNN) | DY vs (ttbar and QCD) |
| Multi-class classification (currently using XGBoost and DNN) | DY vs ttbar vs QCD, good photons vs bad photons |
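Independent of the tool itself, the binary-classification task in the first row boils down to something like the following standalone XGBoost sketch. The toy Gaussian arrays stand in for signal (e.g. DY) and background (e.g. ttbar) ntuples; nothing here is ID-Trainer code.

```python
# Standalone sketch of a binary-classification task (not ID-Trainer code).
# Toy arrays stand in for signal (e.g. DY) and background (e.g. ttbar) events.
import numpy as np
import xgboost as xgb
from sklearn.model_selection import train_test_split
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(0)
sig = rng.normal(loc=1.0, scale=1.0, size=(5000, 4))   # toy "signal" features
bkg = rng.normal(loc=0.0, scale=1.0, size=(5000, 4))   # toy "background" features
X = np.vstack([sig, bkg])
y = np.concatenate([np.ones(len(sig)), np.zeros(len(bkg))])

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)

clf = xgb.XGBClassifier(n_estimators=200, max_depth=4, learning_rate=0.1)
clf.fit(X_train, y_train)

scores = clf.predict_proba(X_test)[:, 1]               # MVA score per event
print("ROC AUC:", roc_auc_score(y_test, scores))
```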
| Salient features |
|---|
| Parallel reading of ROOT files (using DASK); see the sketch after this table |
| Runs on flat ntuples (even NanoAODs) out of the box |
| Adding multiple MVAs is trivial (subject to available computing power) |
| Cross-section and pT-eta reweighting can be handled together |
| Multi-sample training possible |
| Multi-class training possible |
| Ability to customize thresholds |
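The parallel-reading row above refers to DASK inside the tool. As a rough stand-in, here is a hedged sketch that reads several flat ntuples concurrently using uproot with a plain thread pool instead of DASK; the file paths, tree name, and branch names are placeholders, not ID-Trainer defaults.

```python
# Rough stand-in for the tool's DASK-based parallel reading: load several flat
# ntuples concurrently with uproot and a thread pool. Paths, the tree name,
# and branch names below are placeholders, not ID-Trainer defaults.
from concurrent.futures import ThreadPoolExecutor

import pandas as pd
import uproot

FILES = ["ntuple_1.root", "ntuple_2.root", "ntuple_3.root"]  # placeholder paths
TREE = "Events"                                              # placeholder tree name
BRANCHES = ["pt", "eta"]                                     # placeholder branches

def read_one(path: str) -> pd.DataFrame:
    # Read the requested branches of one file into a pandas DataFrame.
    with uproot.open(path) as f:
        return f[TREE].arrays(BRANCHES, library="pd")

with ThreadPoolExecutor(max_workers=4) as pool:
    frames = list(pool.map(read_one, FILES))

df = pd.concat(frames, ignore_index=True)
print(len(df), "events read from", len(FILES), "files")
```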
| What the trainer outputs |
|---|
| Feature distributions |
| Statistics for training and testing |
| ROCs, loss plots, MVA scores |
| Confusion matrices |
| Correlation plots |
| Trained models (.h5 for DNN, .pkl for XGBoost) |
| Threshold values of scores for chosen working points; see the sketch after this table |
| Efficiency vs pT and efficiency vs eta plots for all classes |
| Reweighting plots for pT and eta |
| Comparison of the new ID's performance with benchmark ID flags |
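For the working-point thresholds listed above, the underlying computation is typically a scan of the ROC curve: pick the score cut whose signal efficiency is closest to a target. A minimal sketch follows, assuming an illustrative 90% signal-efficiency target and toy scores; none of it reflects ID-Trainer internals.

```python
# Minimal sketch of choosing a score threshold for a working point:
# pick the score cut whose signal efficiency (TPR) is closest to a target.
# The 90% target and the toy scores/labels are illustrative, not tool defaults.
import numpy as np
from sklearn.metrics import roc_curve

rng = np.random.default_rng(0)
y_true = np.concatenate([np.ones(1000), np.zeros(1000)])
y_score = np.concatenate([rng.normal(0.7, 0.2, 1000), rng.normal(0.3, 0.2, 1000)])

fpr, tpr, thresholds = roc_curve(y_true, y_score)
target_eff = 0.90                                  # illustrative working point
idx = np.argmin(np.abs(tpr - target_eff))
print(f"threshold {thresholds[idx]:.3f} gives signal eff {tpr[idx]:.3f}, bkg eff {fpr[idx]:.3f}")
```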
