From b260d1172942d8b4308b47013dcb6c2a28ccdbef Mon Sep 17 00:00:00 2001 From: Lilian Date: Tue, 28 Jun 2022 18:41:25 +0200 Subject: [PATCH] Add dirty_cat to Python/General-Purpose Machine Learning --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index 17d3edd..03bf528 100644 --- a/README.md +++ b/README.md @@ -1220,6 +1220,7 @@ be * [Shapash](https://github.com/MAIF/shapash) : Shapash is a Python library that provides several types of visualization that display explicit labels that everyone can understand. * [Eurybia](https://github.com/MAIF/eurybia): Eurybia monitors data and model drift over time and securizes model deployment with data validation. * [Colossal-AI](https://github.com/hpcaitech/ColossalAI): An open-source deep learning system for large-scale model training and inference with high efficiency and low cost. +* dirty_cat](https://github.com/dirty-cat/dirty_cat) - facilitates machine-learning on dirty, non-curated categories. It provides transformers and encoders robust to morphological variants, such as typos. #### Data Analysis / Data Visualization