This is a repository that provides a Docker environment that challenges Kaggle's KDD Cup 2013 - Author-Paper Identification Challenge (Track 1).
Run Jupyter notebook:
$ $ docker run -it --name kdd_2013 -p 8888:8888 kdd_2013
Go to http://localhost:8888
and you'll see the notebooks.
Clone repository:
$ git clone https://github.com/PiroHiroPiro/author_paper_identification_challenge.git
$ cd author_paper_identification_challenge
Download dataset from KDD Cup 2013:
Set dataset to source/data
$ cp -r [Download Dir]/* source/data/
Build image:
$ docker build -t kdd_2013 ./author_paper_identification_challenge
This software is released under the MIT License, see LICENSE.