Uncertainty in Gradient Boosting via Ensembles

This is the supplementary code for our paper "Uncertainty in Gradient Boosting via Ensembles" by Andrey Malinin, Liudmila Prokhorenkova, and Aleksei Ustimenko (ICLR 2021).

See also our tutorials on uncertainty estimation with CatBoost: a blog post with a synthetic regression example and a blog post with a practical classification example.
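For readers new to the topic, the following is a minimal sketch of how an ensemble's predictions are commonly split into total, data, and knowledge uncertainty for classification. It is an illustration in plain Python, not code from this repository: the function names `entropy` and `decompose_uncertainty` and the toy probabilities are assumptions made for the example.

```python
import math


def entropy(probs):
    """Shannon entropy (in nats) of a discrete distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def decompose_uncertainty(member_probs):
    """Decompose ensemble uncertainty for a single input.

    member_probs: one list of class probabilities per ensemble member.
    Returns (total, data, knowledge), where
      total     = entropy of the averaged prediction,
      data      = average entropy of the individual predictions,
      knowledge = total - data (disagreement between members).
    """
    n_models = len(member_probs)
    n_classes = len(member_probs[0])
    mean_probs = [
        sum(member[k] for member in member_probs) / n_models
        for k in range(n_classes)
    ]
    total = entropy(mean_probs)
    data = sum(entropy(member) for member in member_probs) / n_models
    return total, data, total - data


# Toy inputs (hypothetical): members that agree vs. members that disagree.
agree = [[0.9, 0.1], [0.9, 0.1], [0.9, 0.1]]
disagree = [[0.9, 0.1], [0.1, 0.9], [0.5, 0.5]]

print("agree:   ", decompose_uncertainty(agree))
print("disagree:", decompose_uncertainty(disagree))
```

When all members agree, the knowledge term is close to zero and the remaining uncertainty is attributed to the data. When members disagree, the knowledge term grows, which is the signal typically used to flag unfamiliar inputs.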

Datasets can be found here.

Training models

python train_models.py regression 1

First argument (mode): regression, classification, regression_rf, or classification_rf
Second argument (CatBoost only): 0 or 1, indicating whether to tune hyperparameters or reuse previously obtained ones
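To train every configuration in one go, the command line above can be assembled programmatically. The sketch below only builds and prints the commands. The helper name `build_train_command` is hypothetical, and it assumes that the `_rf` modes are the non-CatBoost ones and therefore take no second argument, which should be checked against `train_models.py`.

```python
import sys

# Modes listed in this README.
MODES = ["regression", "classification", "regression_rf", "classification_rf"]
# Assumption: only these two modes are the CatBoost ones that take the 0/1 flag.
CATBOOST_MODES = {"regression", "classification"}


def build_train_command(mode, tune=None):
    """Return the argument list for one train_models.py run."""
    if mode not in MODES:
        raise ValueError(f"unknown mode: {mode!r}")
    cmd = [sys.executable, "train_models.py", mode]
    if mode in CATBOOST_MODES:
        if tune not in (0, 1):
            raise ValueError("CatBoost modes need tune=0 or tune=1")
        cmd.append(str(tune))
    return cmd


# Dry run: print the commands instead of executing them.
for mode in MODES:
    tune = 1 if mode in CATBOOST_MODES else None
    print(" ".join(build_train_command(mode, tune)[1:]))
```

Each returned list can be passed to `subprocess.run(cmd, check=True)` from the repository root to launch the run.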

Aggregating results and getting tables in LaTeX format

Regression:

python aggregate_results_regression.py prr_auc

Options: std, nll_rmse, prr_auc, rf_rmse, rf_prr_auc

Classification:

python aggregate_results_classification.py prr_auc

Options: nll_error, prr_auc, rf_nll_error, rf_prr_auc
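To regenerate every table, the two aggregation scripts can be called once per option. The following sketch builds those commands from the option lists above. The names `AGGREGATORS` and `build_aggregate_commands` are hypothetical, and the sketch makes no assumption about where the scripts write their output.

```python
import sys

# Script name and accepted options per task, as listed in this README.
AGGREGATORS = {
    "regression": (
        "aggregate_results_regression.py",
        ["std", "nll_rmse", "prr_auc", "rf_rmse", "rf_prr_auc"],
    ),
    "classification": (
        "aggregate_results_classification.py",
        ["nll_error", "prr_auc", "rf_nll_error", "rf_prr_auc"],
    ),
}


def build_aggregate_commands(task):
    """Return one argument list per table option for the given task."""
    script, options = AGGREGATORS[task]
    return [[sys.executable, script, option] for option in options]


# Dry run: print the commands instead of executing them.
for task in AGGREGATORS:
    for cmd in build_aggregate_commands(task):
        print(" ".join(cmd[1:]))
```

As with training, each list can be handed to `subprocess.run(cmd, check=True)` once the trained models and their results are in place.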

Synthetic experiments

synthetic_regression.ipynb

synthetic_classification.ipynb

Additional experiment on KDD-99 Intrusion Detection dataset

(not included in the paper)

gbdt_uncertainty/kdd/kdd.sh
