notes-partition-BERT

This is a rough implementation of the paper "Text Segmentation by Cross Segment Attention" by Lukasik, et. al.

The model is to be trained on the partitions made in Wikipedia articles in the Wiki-727K dataset.

This model is also going to be trained on more granuarly segmented documents in the future such as atomizing segments in Wikipedia articles into paragraphs or class Powerpoint slides.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
BERTdataset.py		BERTdataset.py
DFtoDataset.py		DFtoDataset.py
README.md		README.md
text_manipulation.py		text_manipulation.py
wiki_utils.py		wiki_utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

notes-partition-BERT

About

Releases

Packages

Languages

bullybutcher/notes-partition-BERT

Folders and files

Latest commit

History

Repository files navigation

notes-partition-BERT

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages