Document similarity and topic clues A historiographical study case

DH2022 files

data_structured

`encpos_structured_sents.json`

dump JSON du corpus structuré (dictionnaire par position puis chapitre > liste des phrases)

{
  "ENCPOS_ID": {
    "metadata": [
    ],
    "chapter_title": [
      "first sentence",
      "second sentence",
      "…"
    ],
    "chapter_title": [
    ]
   },
  "ENCPOS_ID": {
    "metadata": [
    ],
    "chapter_title": [
      "first sentence",
      "second sentence",
      "…"
    ],
    "chapter_title": [
    ]
   }

Tutorial (Google Colab/Drive)

A Jupyter Notebook is available to demo run, check out the tutorial on Google Colab :

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Document similarity and topic clues A historiographical study case

data_structured

`encpos_structured_sents.json`

Tutorial (Google Colab/Drive)

Files

README.md

Latest commit

History

README.md

File metadata and controls

Document similarity and topic clues A historiographical study case

data_structured

encpos_structured_sents.json

Tutorial (Google Colab/Drive)

`encpos_structured_sents.json`