Data preparation and training - artificial data set

This directory contains Jupyter notebooks for preparing data and training a model for the coupon recommendation service.

In order to run the Jupyter notebooks, the original dataset needs to be present locally, and the path to its directory needs to be specified at the top of each notebook.
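For illustration, the path cell at the top of a notebook might look like the following; the DATA_DIR variable name and the assertion are assumptions for this sketch, not taken from the notebooks:

from pathlib import Path

# Location of the original dataset; adjust to your local copy.
DATA_DIR = Path("/path/to/original/dataset")
assert DATA_DIR.exists(), f"Dataset directory not found: {DATA_DIR}"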

  • 01_data_prep.ipynb - Data preparation. This notebook covers data cleaning, merging, feature engineering, and encoding. It produces the input dataset for training.

  • 02_training_automl.ipynb - Training using H2O AutoML.

  • 03_training.ipynb - Training a model using scikit-learn. The algorithm (GBM) and its parameters are selected based on the AutoML results. The notebook compares training on unbalanced and balanced datasets (see the sketch after this list).

  • 04_demo_data_selection.ipynb - Using the model trained in the previous notebook, selects 'good' customer-coupon pairs, i.e. customers for whom many 'hit' coupons are predicted, along with those coupons. The data are saved in the demo_data directory; details can be found in the notebook.
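The unbalanced-vs-balanced comparison in 03_training.ipynb could take roughly the following shape; the file name, the 'hit' target column, the upsampling strategy, and the GBM parameters here are assumptions for illustration, not taken from the notebook:

import pandas as pd
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import classification_report
from sklearn.model_selection import train_test_split
from sklearn.utils import resample

# Output of 01_data_prep.ipynb (file name assumed).
df = pd.read_csv("training_data.csv")
X, y = df.drop(columns=["hit"]), df["hit"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42
)

# Unbalanced baseline: train directly on the skewed class distribution.
gbm = GradientBoostingClassifier(random_state=42)
gbm.fit(X_train, y_train)
print(classification_report(y_test, gbm.predict(X_test)))

# Balanced variant: upsample the minority class to match the majority.
train = pd.concat([X_train, y_train], axis=1)
minority, majority = train[train["hit"] == 1], train[train["hit"] == 0]
minority_up = resample(minority, replace=True, n_samples=len(majority), random_state=42)
balanced = pd.concat([majority, minority_up])

gbm_bal = GradientBoostingClassifier(random_state=42)
gbm_bal.fit(balanced.drop(columns=["hit"]), balanced["hit"])
print(classification_report(y_test, gbm_bal.predict(X_test)))

Upsampling is only one way to balance the data; the notebook may instead use downsampling or class weights.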

To set up the environment and start the Jupyter server, run (adjust the IP and port as needed):

python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

jupyter notebook --ip 0.0.0.0 --port 8000 --no-browser

The notebooks should be run in the order they are numbered.

Docker images

This repository contains a Dockerfile for building a Docker image. To build it, use:

docker build -t coupon-rec:0.0.1 . --build-arg DATA_DIR=<path to data dir>
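A build argument like DATA_DIR is typically consumed along these lines; this is a hypothetical sketch, not the repository's actual Dockerfile, and it assumes the data directory lies inside the build context:

# Hypothetical sketch; not the repository's actual Dockerfile.
FROM python:3.10-slim
ARG DATA_DIR
WORKDIR /app
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt
COPY . .
# Copy the dataset into the image; DATA_DIR must be inside the build context.
COPY ${DATA_DIR} /app/data
EXPOSE 8000
CMD ["jupyter", "notebook", "--ip", "0.0.0.0", "--port", "8000", "--no-browser", "--allow-root"]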

To run it, use:

docker run -it -p 0.0.0.0:8002:8000 coupon-rec:0.0.1
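The -p 0.0.0.0:8002:8000 mapping publishes the container's port 8000 on port 8002 of the host; if the image starts the Jupyter server as shown above, it should then be reachable at http://localhost:8002.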