A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
-
Updated
Feb 17, 2023
A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
A list of Indonesian NLP resources.
Data Science Learning Path - A complete guide to learn data science for beginners
data resource untuk NLP bahasa indonesia
Repositori ini merupakan kumpulan dataset terkait analisis sentimen Berbahasa Indonesia. Apabila Anda menggunakan dataset-dataset yang ada pada repositori ini untuk penelitian, maka cantumkanlah/kutiplah jurnal artikel terkait dataset tersebut. Dataset yang tersedia telah diimplementasikan dalam beberapa penelitian dan hasilnya telah dipublikasi…
IndoLEM is a comprehensive Indonesian NLU benchmark, comprising three pillars NLP task: morpho-syntax, semantic, and discourse. Presented in COLING 2020.
Pujangga - Indonesian Natural Language Processing Tool with REST API, an Interface for InaNLP and Deeplearning4j's Word2Vec
A Python module that fetches a page of a word/phrase from the Online Indonesian Dictionary (https://kbbi.kemdikbud.go.id).
Database kamus kumpulan kata dalam bahasa Indonesia sesuai KBBI (Indonesian word list database based on KBBI)
A benchmark dataset for Indonesian text summarization.
Convert numbers into words in Indonesian language
Json hari libur indonesia yang slalu update.
A curated list of natural language processing courses, video lectures, books, library and many more.
IndoBERTweet is the first large-scale pretrained model for Indonesian Twitter. Published at EMNLP 2021 (main conference)
Introduction to Node.js and Example Applications (Bahasa Indonesia)
Baik Language Next Release
The first large-scale summarization corpus for the Indonesian language. AACL 2020.
Indonesian Grapheme-to-Phoneme (IPA notation)
Add a description, image, and links to the indonesian-language topic page so that developers can more easily learn about it.
To associate your repository with the indonesian-language topic, visit your repo's landing page and select "manage topics."