notoglot-mini

Simple Python package and dataset that provides information about the world’s scripts (writing systems).

Currently, the package is primarily tailored for the Noto fonts project, but is already useful beyond it.

This package is currently at “alpha stage”, though we believe that the quality of the data is good.
Currently, this package is not published to PyPI.
In future, this package will be replaced by a more comprehensive notoglot package, so the current API may change.
Additional data points about each script will be published in future.

Dataset

The dataset is a JSON file that contains information about the world’s scripts, keyed by the script’s ISO 15924 code, and structured like so:

{
    "Armn": {
        "id": "Armn",
        "name": "Armenian",
        "status": "",
        "family": "European",
        "type": "alphabet",
        "summary": "Armenian (<span class='autonym'>Հայոց գրեր</span>) is a European bicameral alphabet, written left-to-right (12 million users). Created around 405 CE by Mesrop Mashtots. Used for the Armenian language to this day. Was widespread in the 18th–19th centuries CE in the Ottoman Empire. Armenia uses a reformed spelling introduced in the Soviet Union, the Armenian diaspora mostly uses the original Mesropian orthography.",
        "urls": {
            "ScriptSource": "https://scriptsource.org/scr/Armn",
            "Unicode": "https://www.unicode.org/versions/Unicode15.0.0/ch07.pdf#G3334",
            "Wikipedia": "https://en.wikipedia.org/wiki/ISO_15924:Armn",
            "Wiktionary": "https://en.wiktionary.org/wiki/Category:Armenian_script",
            "WiktionaryLangs": "https://en.wiktionary.org/wiki/Category:Armenian_script_languages",
            "r12a": "https://r12a.github.io/scripts/links?iso=Armn"
        }
    }
}

The most unique data point in each script entry is summary, which contains a single-paragraph, hand-curated description of the script.

Python package

Installation

python3 -m pip install --upgrade git+https://github.com/notofonts/notoglot-mini/

Usage in Python

import notoglot_mini
scripts = notoglot_mini.LoadScripts()
print(scripts["Armn"]["summary"])

Other Python packages of interest

gflanguages: additional info about writing systems, languages and regions
fontTools.unicodedata: info about scripts and their code points
unicodedata2: info about the Unicode codepoints and their properties
aksharamukha: transliteration between various scripts

License

Python code

Dataset

The dataset contains some information from:

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
notoglot_mini		notoglot_mini
.gitignore		.gitignore
LICENSE		LICENSE
LICENSE_DATA		LICENSE_DATA
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

notoglot-mini

Dataset

Python package

Installation

Usage in Python

Other Python packages of interest

License

Python code

Dataset

About

Releases

Packages

Languages

License

notofonts/notoglot-mini

Folders and files

Latest commit

History

Repository files navigation

notoglot-mini

Dataset

Python package

Installation

Usage in Python

Other Python packages of interest

License

Python code

Dataset

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages