Model Training Instructions

In order to train a model, data must be downloaded and placed in this directory.

Files required are:

ascii-all.tar.gz - https://fki.tic.heia-fr.ch/DBs/iamOnDB/data/ascii-all.tar.gz
lineStrokes-all.tar.gz - https://fki.tic.heia-fr.ch/DBs/iamOnDB/data/lineStrokes-all.tar.gz
original-xml-part.tar.gz - https://fki.tic.heia-fr.ch/DBs/iamOnDB/data/original-xml-part.tar.gz

Only a subset of the downloaded data is required. Move the relevant download data so the directory structure is as folllows:

data/
├── raw/
│   ├── ascii/
│   ├── lineStrokes/
│   ├── original/
|   blacklist.npy

Once this is completed, run

from handwriting_synthesis.training.preparation import prepare
prepare()

extract the data and dump it to numpy files.

To train the model, run

from handwriting_synthesis.training import train
train()

This takes a couple of days on a single Tesla K80.

Provide feedback