From c51b242dec842fb9a93722e7e68ec2e6d74b7823 Mon Sep 17 00:00:00 2001 From: "Michael A. Hedderich" Date: Mon, 9 Aug 2021 11:34:21 +0200 Subject: [PATCH] Extended Awesome List Tools Extended list of weak supervision tools and added section on data augmentation. (issue #34 ) --- awesome-lists/tools.md | 10 ++++++++++ 1 file changed, 10 insertions(+) diff --git a/awesome-lists/tools.md b/awesome-lists/tools.md index e92d1bd..45c44ec 100644 --- a/awesome-lists/tools.md +++ b/awesome-lists/tools.md @@ -1,5 +1,15 @@

Tools

## Data Programming & Weak Supervision +### General ML - [Cleanlab](https://github.com/cgnorthcutt/cleanlab): A Python library for machine learning with noisy labels. It supports finding label errors in datasets and training models on noisily labeled data. - [Knodle](http://knodle.cc/): A modular Python framework that allows combining different classifiers, weak supervision approaches and denoising methods. + +### NLP +- [skweak](https://github.com/NorskRegnesentral/skweak): A Python library to support weak supervision and labeling functions for sequence and text classification. +- [ANEA](https://github.com/uds-lsv/anea): A GUI and Python library to annotate NER tags via distant supervision for many languages and entity types. + +## Data Augmentation +- [Albumentations](https://github.com/albumentations-team/albumentations): A Python library supporting more than 70 types of image augmentation and including support for bounding boxes and keypoints. +- [nlpaug](https://github.com/makcedward/nlpaug): A Python libraries for data augmentation for text and signal data. +- [TextAttack](https://github.com/QData/TextAttack): A Python library for adversarial attacks and data augmentation for NLP tasks.