Probabilistic prediction of travel times

We use a neural network to predict the travel time distribution between two locations. To predict a distribution instead of a single value, we modify the neural network in one of two ways:

- feeding its outputs into the parameters of a probability distribution (LogNormal or Normal). Optimizing the model against the observed data is equivalent to minimizing the negative log-likelihood of the observations under the joint pdf predicted by the model.
- adding an extra layer with monotonically increasing outputs that represent a fixed set of quantiles of the distribution. Optimizing the model against the observed data is equivalent to minimizing the average of the pinball losses over all quantiles.
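
The two objectives above can be sketched in NumPy. This is a minimal illustration and not the code in src/: the function names are hypothetical, the Normal case stands in for both distributions, and building monotone outputs from a cumulative sum of softplus increments is one common choice, assumed here rather than taken from the repository.

```python
import numpy as np


def normal_nll(y, mu, sigma):
    """Mean negative log-likelihood of observations y under Normal(mu, sigma)."""
    y, mu, sigma = (np.asarray(a, dtype=float) for a in (y, mu, sigma))
    return float(np.mean(0.5 * np.log(2.0 * np.pi * sigma**2)
                         + 0.5 * ((y - mu) / sigma) ** 2))


def monotone_quantiles(raw):
    """Map raw outputs of shape (n, k) to strictly increasing quantile values.

    Assumption: the first column is a free offset and the remaining columns
    become positive increments via softplus, accumulated with a cumulative sum.
    """
    raw = np.asarray(raw, dtype=float)
    first = raw[:, :1]
    increments = np.logaddexp(0.0, raw[:, 1:])  # softplus(x) = log(1 + exp(x)) > 0
    return np.concatenate([first, first + np.cumsum(increments, axis=1)], axis=1)


def pinball_loss(y, q_pred, quantiles):
    """Pinball loss averaged over samples and quantile levels.

    y has shape (n,), q_pred has shape (n, k), quantiles has shape (k,)
    with levels in (0, 1).
    """
    y = np.asarray(y, dtype=float)[:, None]
    q_pred = np.asarray(q_pred, dtype=float)
    tau = np.asarray(quantiles, dtype=float)[None, :]
    diff = y - q_pred
    # Under-prediction is weighted by tau, over-prediction by (1 - tau).
    return float(np.mean(np.maximum(tau * diff, (tau - 1.0) * diff)))
```

In both cases a lower value means a better fit, so either function can serve as the training loss for the corresponding model variant.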

We apply both models to the NYC taxi trip data, which is available on the nyc.gov website.

Docker image and environment variables

Build the image:

docker build -t travel_time -f Dockerfile .

The environment variables used by the model are defined in src/settings.py. Adjust them to your setup:

export DATA_DIR="$(pwd)"/data
export TFBOARD_DIR="$(pwd)"/tfboard
export ARTIFACTS_DIR="$(pwd)"/artifacts

# maximum memory in GB that can be allocated by tensorflow
export GPU_MEMORY_LIMIT=16
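
The contents of src/settings.py are not shown here, so the following is only a sketch of how such a settings module might read these variables. The function name `load_settings`, the dictionary keys and the fallback values are assumptions; the fallback paths mirror the container mount points used in the docker commands in this README.

```python
import os


def load_settings(env=None):
    """Read the experiment settings from environment variables.

    Hypothetical helper: names and defaults are illustrative only.
    """
    env = os.environ if env is None else env
    gpu_limit_gb = float(env.get("GPU_MEMORY_LIMIT", "16"))
    return {
        "data_dir": env.get("DATA_DIR", "/tf/data"),
        "tfboard_dir": env.get("TFBOARD_DIR", "/tf/tfboard"),
        "artifacts_dir": env.get("ARTIFACTS_DIR", "/tf/artifacts"),
        # The variable is given in GB; convert to MB for APIs that expect MB.
        "gpu_memory_limit_mb": int(gpu_limit_gb * 1024),
    }
```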

Collect & preprocess data

We can collect the NYC taxi trip data (drives and taxi zones) for all of 2016 from the nyc.gov website and store it in DATA_DIR:

docker run -it --rm --runtime=nvidia --gpus all --name=experiment \
  -v $DATA_DIR:/tf/data \
  -v $TFBOARD_DIR:/tf/tfboard \
  -v $ARTIFACTS_DIR:/tf/artifacts \
  --env GPU_MEMORY_LIMIT=$GPU_MEMORY_LIMIT \
  travel_time:latest python src/main.py collect-data --year=2016

To generate features from the data and split it into training, validation and test datasets, execute:

docker run -it --rm --runtime=nvidia --gpus all --name=experiment \
  -v $DATA_DIR:/tf/data \
  -v $TFBOARD_DIR:/tf/tfboard \
  -v $ARTIFACTS_DIR:/tf/artifacts \
  --env GPU_MEMORY_LIMIT=$GPU_MEMORY_LIMIT \
  travel_time:latest python src/main.py preprocess-data \
  --tr=0.8 --va=0.1 --te=0.1

This command splits the data from every month into training, validation and test datasets and stores each of them in a separate folder, as shown below:

$DATA_DIR
├── raw/             # obtained in the data collection step
└── preprocessed/    # obtained in the data preprocessing step
    ├── train/
    ├── validation/
    └── test/

If you are worried about data drift over time you might need a different type of data splitting.
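
To make the difference concrete, here is a small sketch that contrasts a random split with the given fractions against a chronological split, which keeps the most recent samples for validation and testing. Both functions are hypothetical illustrations and not the splitting code used by preprocess-data.

```python
import numpy as np


def _split_sizes(n, tr, va, te):
    """Return the number of training and validation samples for n rows."""
    if abs(tr + va + te - 1.0) > 1e-9:
        raise ValueError("tr, va and te must sum to 1")
    return int(round(tr * n)), int(round(va * n))


def random_split(n, tr=0.8, va=0.1, te=0.1, seed=0):
    """Shuffle the row indices and cut them into three disjoint parts."""
    n_tr, n_va = _split_sizes(n, tr, va, te)
    idx = np.random.default_rng(seed).permutation(n)
    return idx[:n_tr], idx[n_tr:n_tr + n_va], idx[n_tr + n_va:]


def chronological_split(timestamps, tr=0.8, va=0.1, te=0.1):
    """Sort rows by time so that later data never leaks into earlier splits."""
    order = np.argsort(np.asarray(timestamps), kind="stable")
    n_tr, n_va = _split_sizes(len(order), tr, va, te)
    return order[:n_tr], order[n_tr:n_tr + n_va], order[n_tr + n_va:]
```

With a chronological split the test set measures how well the model extrapolates to a later period, which is the relevant question when the travel time distribution drifts.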

Train model

The JSON strings passed on the command line override the default arguments used by the model:

docker run -it --rm --runtime=nvidia --gpus all --name=experiment \
  -v $DATA_DIR:/tf/data \
  -v $TFBOARD_DIR:/tf/tfboard \
  -v $ARTIFACTS_DIR:/tf/artifacts \
  --env GPU_MEMORY_LIMIT=$GPU_MEMORY_LIMIT \
  travel_time:latest python src/main.py train \
  --model_wrapper=ModelPDF \
  --model_args='{"l2": 0.0001, "batch_normalization": false, "layer_sizes": [64, [64, 64], [64, 64], 32, 8]}' \
  --ds_args='{"max_files": 2}' \
  --callbacks_args='{"period": 10, "profile_batch": 0}' \
  --training_args='{"epochs": 100}'
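
How a JSON string can override default arguments may be sketched as follows. The default values and the helper name `override_defaults` are hypothetical and chosen only for illustration; the real defaults live in the repository code.

```python
import json

# Hypothetical defaults, for illustration only.
DEFAULT_MODEL_ARGS = {
    "l2": 0.001,
    "batch_normalization": True,
    "layer_sizes": [32, 8],
}


def override_defaults(defaults, json_string=None):
    """Return a copy of defaults updated with the keys found in json_string."""
    overrides = json.loads(json_string) if json_string else {}
    if not isinstance(overrides, dict):
        raise ValueError("expected a JSON object, e.g. '{\"l2\": 0.0001}'")
    # Shallow merge: keys that are not overridden keep their default value.
    return {**defaults, **overrides}
```

Note that JSON literals are lowercase (`false`, `true`, `null`) and that the whole string needs single quotes in the shell so that the inner double quotes survive.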

After training the model the following directory structure will be generated (the experiment id is generated automatically):

$ARTIFACTS_DIR
└── ex_<id>
    ├── checkpoints/
    ├── model/
    └── model_attributes.json

$TFBOARD_DIR
└── ex_<id>
    ├── train/
    └── validation/

Check the training logs with tensorboard:

docker run --rm --name=tfboard \
  -p 6006:6006 \
  -v $TFBOARD_DIR:/tf/tfboard \
  travel_time:latest tensorboard --logdir /tf/tfboard --host 0.0.0.0

# visit http://localhost:6006/

Evaluate model

We can evaluate the accuracy and the uncertainty estimates of the predicted probability distributions or quantiles with the following metrics/plots:
- Fraction of cases (y-axis) where the observed percentile is below a given value (x-axis). By observed percentile we mean the percentile of the predicted distribution at which the observation falls. For example, if we have predicted a normal pdf with zero mean and unit standard deviation, and the observed value is 0, then the observed percentile is 50. In the ideal case the plot follows a straight line from (x,y) = (0,0) to (100,1).
- Histograms of the ratio between the mean and the standard deviation of the predicted distribution for every datapoint. For the model that predicts a fixed number of percentiles we replace the standard deviation with the difference between two percentiles, for example the 15th and 85th. This makes it harder to compare the predictions of both models.
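
The first metric can be sketched for the Normal case as follows. The helper names are hypothetical, and the code assumes that each prediction is given by a mean and a standard deviation; for a LogNormal prediction the same functions could be applied to the logarithm of the observation.

```python
import math

import numpy as np


def normal_cdf(y, mu, sigma):
    """Cumulative distribution function of Normal(mu, sigma) evaluated at y."""
    return 0.5 * (1.0 + math.erf((y - mu) / (sigma * math.sqrt(2.0))))


def observed_percentiles(y, mu, sigma):
    """Percentile (0 to 100) of each observation under its predicted Normal."""
    return np.array([100.0 * normal_cdf(a, m, s) for a, m, s in zip(y, mu, sigma)])


def calibration_curve(percentiles, grid=None):
    """Fraction of cases whose observed percentile does not exceed each grid value."""
    grid = np.linspace(0.0, 100.0, 101) if grid is None else np.asarray(grid, dtype=float)
    p = np.asarray(percentiles, dtype=float)
    fractions = np.array([np.mean(p <= x) for x in grid])
    return grid, fractions
```

Plotting `fractions` against `grid` gives the curve described above. A curve below the diagonal at low percentiles and above it at high percentiles indicates that the predicted distributions are too narrow.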

Serve model

We serve a ModelPDF that can output the mean and standard deviation of the predicted travel time distribution.

First, we create a servable from the trained model. For example, we will use the model from experiment ex_000 that is stored in $ARTIFACTS_DIR/ex_000. The command below generates a new servable in $ARTIFACTS_DIR/ex_000/model_mean_std:
```
docker run -it --rm --runtime=nvidia --gpus all --name=experiment \
  -v $ARTIFACTS_DIR:/tf/artifacts \
  travel_time:latest python src/main.py prepare-servable \
  --load_dir=/tf/artifacts/ex_000
```

Next, we start the TensorFlow Serving container and mount the newly created servable into it:

MODEL_DIR=$ARTIFACTS_DIR/ex_000/model_mean_std

docker run -t --rm -p 8501:8501 \
    --name=serving \
    -v "$MODEL_DIR:/models/model_mean_std/1" \
    -e MODEL_NAME=model_mean_std \
    tensorflow/serving:2.13.0

Test the exported model by requesting the predicted mean travel time:

curl -X POST http://localhost:8501/v1/models/model_mean_std/versions/1:predict \
-H 'Content-type: application/json' \
-d '{"signature_name": "mean_value", "instances": [{"time": [571.0], "trip_distance": [1.1], "pickup_lon": [-73.991791], "pickup_lat": [40.736072], "pickup_area": [1e-5], "dropoff_lon": [-73.991142], "dropoff_lat": [40.734538], "dropoff_area": [2e-5], "passenger_count": [1], "vendor_id": [1], "weekday": [1], "month": [1]}]}'

To get the predicted standard deviation, change the value of signature_name from mean_value to std:

curl -X POST http://localhost:8501/v1/models/model_mean_std/versions/1:predict \
-H 'Content-type: application/json' \
-d '{"signature_name": "std", "instances": [{"time": [571.0], "trip_distance": [1.1], "pickup_lon": [-73.991791], "pickup_lat": [40.736072], "pickup_area": [1e-5], "dropoff_lon": [-73.991142], "dropoff_lat": [40.734538], "dropoff_area": [2e-5], "passenger_count": [1], "vendor_id": [1], "weekday": [1], "month": [1]}]}'
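
The same requests can be sent from Python. The sketch below uses only the standard library and assumes the serving container from the previous step is reachable on localhost:8501; the helper names `build_payload` and `predict` are hypothetical, while the URL, signature names and feature values are copied from the curl commands above.

```python
import json
import urllib.request

URL = "http://localhost:8501/v1/models/model_mean_std/versions/1:predict"

# One example trip, with the same feature values as in the curl commands.
FEATURES = {
    "time": [571.0], "trip_distance": [1.1],
    "pickup_lon": [-73.991791], "pickup_lat": [40.736072], "pickup_area": [1e-5],
    "dropoff_lon": [-73.991142], "dropoff_lat": [40.734538], "dropoff_area": [2e-5],
    "passenger_count": [1], "vendor_id": [1], "weekday": [1], "month": [1],
}


def build_payload(features, signature_name):
    """Build the request body in the row ("instances") format of the REST API."""
    return {"signature_name": signature_name, "instances": [features]}


def predict(features, signature_name, url=URL, timeout=10.0):
    """POST one instance to the serving container and return its predictions."""
    body = json.dumps(build_payload(features, signature_name)).encode("utf-8")
    request = urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}, method="POST"
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))["predictions"]
```

With the container running, `predict(FEATURES, "mean_value")` requests the predicted mean and `predict(FEATURES, "std")` requests the predicted standard deviation.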
