Skip to content

Commit

Permalink
Extended Awesome List Tools
Browse files Browse the repository at this point in the history
Extended list of weak supervision tools and added section on data augmentation. (issue HazyResearch#34 )
  • Loading branch information
michael-aloys authored Aug 9, 2021
1 parent c0c33ae commit c51b242
Showing 1 changed file with 10 additions and 0 deletions.
10 changes: 10 additions & 0 deletions awesome-lists/tools.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,15 @@
<h1 id="tools">Tools</h1>

## Data Programming & Weak Supervision
### General ML
- [Cleanlab](https://github.com/cgnorthcutt/cleanlab): A Python library for machine learning with noisy labels. It supports finding label errors in datasets and training models on noisily labeled data.
- [Knodle](http://knodle.cc/): A modular Python framework that allows combining different classifiers, weak supervision approaches and denoising methods.

### NLP
- [skweak](https://github.com/NorskRegnesentral/skweak): A Python library to support weak supervision and labeling functions for sequence and text classification.
- [ANEA](https://github.com/uds-lsv/anea): A GUI and Python library to annotate NER tags via distant supervision for many languages and entity types.

## Data Augmentation
- [Albumentations](https://github.com/albumentations-team/albumentations): A Python library supporting more than 70 types of image augmentation and including support for bounding boxes and keypoints.
- [nlpaug](https://github.com/makcedward/nlpaug): A Python libraries for data augmentation for text and signal data.
- [TextAttack](https://github.com/QData/TextAttack): A Python library for adversarial attacks and data augmentation for NLP tasks.

0 comments on commit c51b242

Please sign in to comment.