TT-1042: Read language codes for initialized files from .env #9

Merged
merged 15 commits into from
Sep 14, 2023

Conversation

fredrikmonsen
Collaborator

This commit makes the application read from .env which language codes to load from the JSON files in metadata_extract.data.txt.

Additionally, this commit changes the language codes to ISO 639-2 and separates 'nob' and 'nno'. Furthermore, the Norwegian Nynorsk word lists have been expanded.

The undetermined language code 'und' is added, with universally used keywords such as 'issn' and 'isbn'.
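
For readers who have not opened the diff, here is a minimal sketch of the idea described above, not the code merged in this PR. It assumes a comma-separated variable named `LANGUAGES` in the environment and a JSON object keyed by language code; the variable name, both helper functions, and every sample keyword except 'issn' and 'isbn' are hypothetical.

```python
import json
import os


def selected_languages(env=None, var="LANGUAGES", default="und"):
    """Return the language codes listed in a comma-separated variable.

    `var` and `default` are hypothetical; the real .env key may differ.
    """
    env = os.environ if env is None else env
    raw = env.get(var, default)
    return [code.strip().lower() for code in raw.split(",") if code.strip()]


def load_keywords(json_text, languages):
    """Merge the keyword lists of the requested languages, keeping order."""
    per_language = json.loads(json_text)
    merged = []
    for code in languages:
        # Languages missing from the file contribute nothing.
        for word in per_language.get(code, []):
            if word not in merged:
                merged.append(word)
    return merged


# Made-up sample data shaped like a keywords file keyed by language code.
SAMPLE = json.dumps({
    "und": ["issn", "isbn"],
    "nob": ["utgiver", "forfatter"],
    "nno": ["utgjevar", "forfattar"],
    "eng": ["publisher", "author"],
})

languages = selected_languages({"LANGUAGES": "und, nob, nno"})
keywords = load_keywords(SAMPLE, languages)
print(keywords)
```

Passing a plain dict instead of `os.environ` keeps the sketch testable without touching the real environment; 'eng' is skipped here only because it is not in the configured list.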

@pierrebeauguitte pierrebeauguitte self-assigned this Sep 6, 2023
.env.example (outdated, resolved)
test/pytest.ini (outdated, resolved)
metadata_extract/init_files.py (outdated, resolved)
metadata_extract/data/txt/info_page_keywords.json (outdated, resolved)
metadata_extract/data/txt/info_page_keywords.json (outdated, resolved)
metadata_extract/data/txt/info_page_keywords.json (outdated, resolved)
metadata_extract/resource_loader.py (outdated, resolved)
src/util.py (outdated, resolved)
metadata_extract/text.py (outdated, resolved)
metadata_extract/resource_loader.py (outdated, resolved)
metadata_extract/text.py (outdated, resolved)
src/util.py (outdated, resolved)
@pierrebeauguitte pierrebeauguitte merged commit 84ca0ca into main Sep 14, 2023
1 check passed
@pierrebeauguitte pierrebeauguitte deleted the feature/TT-1042 branch September 14, 2023 07:44
2 participants