forked from HazyResearch/data-centric-ai
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Extended list of weak supervision tools and added section on data augmentation. (issue HazyResearch#34 )
- Loading branch information
1 parent
c0c33ae
commit c51b242
Showing
1 changed file
with
10 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,5 +1,15 @@ | ||
<h1 id="tools">Tools</h1> | ||
|
||
## Data Programming & Weak Supervision | ||
### General ML | ||
- [Cleanlab](https://github.com/cgnorthcutt/cleanlab): A Python library for machine learning with noisy labels. It supports finding label errors in datasets and training models on noisily labeled data. | ||
- [Knodle](http://knodle.cc/): A modular Python framework that allows combining different classifiers, weak supervision approaches and denoising methods. | ||
|
||
### NLP | ||
- [skweak](https://github.com/NorskRegnesentral/skweak): A Python library to support weak supervision and labeling functions for sequence and text classification. | ||
- [ANEA](https://github.com/uds-lsv/anea): A GUI and Python library to annotate NER tags via distant supervision for many languages and entity types. | ||
|
||
## Data Augmentation | ||
- [Albumentations](https://github.com/albumentations-team/albumentations): A Python library supporting more than 70 types of image augmentation and including support for bounding boxes and keypoints. | ||
- [nlpaug](https://github.com/makcedward/nlpaug): A Python libraries for data augmentation for text and signal data. | ||
- [TextAttack](https://github.com/QData/TextAttack): A Python library for adversarial attacks and data augmentation for NLP tasks. |