bert_document_classification/examples at master · ArneDefauw/bert_document_classification

History

Name		Name	Last commit message	Last commit date
parent directory ..
ml4health_2019_replication		ml4health_2019_replication
README.md		README.md
config.ini		config.ini
prediction.py		prediction.py
training.py		training.py

README.md

Replicating paper results

Unfortunately, one cannot include the exact data utilized to the train both the clinical models due to HIPPA constraints. The data can be found here if you fill out the appropriate agreements: https://portal.dbmi.hms.harvard.edu/data-challenges/

To replicate results in paper, please see the folder: /examples/ml4health_2019_replication. This will simply requiring copying the downloaded data into the appropriate directory and running scripts.

Training on new datasets

For training, simply alter the config.ini present in /examples file for your purposes. Relevant variables are:

model_storage_directory: directory to store logging information, tensorboard checkpoints, model checkpoints
bert_model_path: the file path to a pretrained bert model. Can be the pytorch-transformers alias.
labels: an ordered list of labels you are training against. this should match the order given in a .fit() instance.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

examples

examples

README.md

Replicating paper results

Training on new datasets

Files

examples

Directory actions

More options

Directory actions

More options

Latest commit

History

examples

Folders and files

parent directory

README.md

Replicating paper results

Training on new datasets