This repository has been archived by the owner on Dec 15, 2023. It is now read-only.

Information about the world’s writing systems, tailored for the Noto fonts project but useful beyond


notofonts/notoglot-mini


notoglot-mini

A simple Python package and dataset that provide information about the world’s scripts (writing systems).

Currently, the package is primarily tailored for the Noto fonts project, but is already useful beyond it.

  • This package is at “alpha stage”, though we believe that the quality of the data is good.

  • The package is not published to PyPI.

  • In the future, this package will be replaced by a more comprehensive notoglot package, so the current API may change.

  • Additional data points about each script will be published in the future.

Dataset

The dataset is a JSON file that contains information about the world’s scripts, keyed by the script’s ISO 15924 code, and structured like so:

{
    "Armn": {
        "id": "Armn",
        "name": "Armenian",
        "status": "",
        "family": "European",
        "type": "alphabet",
        "summary": "Armenian (<span class='autonym'>Հայոց գրեր</span>) is a European bicameral alphabet, written left-to-right (12 million users). Created around 405 CE by Mesrop Mashtots. Used for the Armenian language to this day. Was widespread in the 18th–19th centuries CE in the Ottoman Empire. Armenia uses a reformed spelling introduced in the Soviet Union, the Armenian diaspora mostly uses the original Mesropian orthography.",
        "urls": {
            "ScriptSource": "https://scriptsource.org/scr/Armn",
            "Unicode": "https://www.unicode.org/versions/Unicode15.0.0/ch07.pdf#G3334",
            "Wikipedia": "https://en.wikipedia.org/wiki/ISO_15924:Armn",
            "Wiktionary": "https://en.wiktionary.org/wiki/Category:Armenian_script",
            "WiktionaryLangs": "https://en.wiktionary.org/wiki/Category:Armenian_script_languages",
            "r12a": "https://r12a.github.io/scripts/links?iso=Armn"
        }
    }
}
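
  • The most distinctive data point in each script entry is summary, which contains a single-paragraph, hand-curated description of the script. The summary may include inline HTML, such as the span with class autonym that wraps the script’s native name in the example above.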
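
The entry structure shown above can be read with the Python standard library alone. The following is a minimal sketch, not part of the package: it parses an abbreviated copy of the Armn entry (summary shortened, urls reduced to one link) and strips the inline HTML from summary. The names SAMPLE_JSON, TAG_RE and plain_summary are hypothetical, introduced only for this illustration.

```python
import json
import re

# Abbreviated copy of the "Armn" entry shown above (assumption: the summary is
# shortened and only one URL is kept, purely to keep the example small).
SAMPLE_JSON = """
{
    "Armn": {
        "id": "Armn",
        "name": "Armenian",
        "status": "",
        "family": "European",
        "type": "alphabet",
        "summary": "Armenian (<span class='autonym'>Հայոց գրեր</span>) is a European bicameral alphabet, written left-to-right (12 million users).",
        "urls": {
            "ScriptSource": "https://scriptsource.org/scr/Armn"
        }
    }
}
"""

# Matches any HTML tag, e.g. <span class='autonym'> or </span>.
TAG_RE = re.compile(r"<[^>]+>")


def plain_summary(entry):
    """Return the entry's summary with inline HTML tags removed (hypothetical helper)."""
    return TAG_RE.sub("", entry["summary"])


scripts = json.loads(SAMPLE_JSON)
armn = scripts["Armn"]  # entries are keyed by ISO 15924 code

print(armn["name"], armn["type"])
print(plain_summary(armn))
```

A regular expression is enough here because the markup in the example is a single flat span. For summaries with nested or more complex markup, a real HTML parser would be the safer choice.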

Python package

Installation

python3 -m pip install --upgrade git+https://github.com/notofonts/notoglot-mini/

Usage in Python

import notoglot_mini
scripts = notoglot_mini.LoadScripts()
print(scripts["Armn"]["summary"])
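
Because the usage above indexes the result of LoadScripts() by ISO 15924 code, it appears to behave like a dictionary of entries, so ordinary dictionary code can query it. The sketch below assumes that shape and groups script codes by a chosen field. The helper group_by and the one-entry sample are hypothetical; the sample stands in for the loaded data so the snippet runs without the package installed.

```python
from collections import defaultdict


def group_by(scripts, field):
    """Group script codes by the value of one entry field (hypothetical helper)."""
    groups = defaultdict(list)
    for code, entry in scripts.items():
        # Entries that lack the field are grouped under the empty string.
        groups[entry.get(field, "")].append(code)
    return dict(groups)


# Stand-in for the loaded dataset, reduced to fields from the Armn example.
# With the package installed, one would pass notoglot_mini.LoadScripts() instead,
# assuming it returns a mapping of this shape.
sample = {
    "Armn": {
        "id": "Armn",
        "name": "Armenian",
        "family": "European",
        "type": "alphabet",
    }
}

print(group_by(sample, "type"))
print(group_by(sample, "family"))
```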

Other Python packages of interest

License

Python code

Copyright 2021–2022 Noto Authors. Distributed under the Apache License 2.0.

Dataset

The dataset contains some information from:
