Skip to content

Latest commit

 

History

History
47 lines (36 loc) · 2.09 KB

README.md

File metadata and controls

47 lines (36 loc) · 2.09 KB

Manga text segmentation

🔎 Deep learning

This project is about japanese text segmentation in manga. It contains implementation of a of Unet + Resnet34 to perform this task. It also has a semi-supervised approach to enhance state of the art results.

📁 Project layout

.
├── models                  # Contains weights of the trained neural nets
│   ├── ancillary_model.pth # The ancillary model weights
│   ├── simple_Unet34.pth   # The base unet + resnet34 auto-encorder
├── tools                   # Contains a set of helper scripts
│   ├── get_bbox.py         # Draws bounding boxes on image from xml for manga109 dataset
│   ├── img_cropper.py      # Crops and save image for dataset selection
│   ├── viz_dataset.py      # Side by side image viz to validate segmentation
├── data                    # Contains the financial data for the environment
├── notebooks               # Training and prediction notebooks
├── exemples                # Contains a manga image and the ouput
├── config.py               # Contains training specs for models
├── data.py                 # Pytorch dataset class
├── evaluation.py           # For model testing. Prints out some metrics
├── train_base.py           # A train script for Unet34 model
├── train_ancillary.py      # A train script for ancillary model
├── models.py               # Contains neural nets
├── utils.py                # Some helper functions
├── report.pdf              # Article to get details about the project
├── requirement.txt
├── .gitignore
└── README.md


Dataset

Thanks the Manga109 dataset providers, check it out here Manga 109 website. Thanks also to this annotated version for segmentation zenodo for Manga109 datset from https://github.com/juvian.

🔧 How to run the code

  1. Install requirements.txt in your Python environment
  2. Run the predict notebook after setting right paths to model etc