cleanlab/cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Language: Python
Topics: python, data-science, data-validation, annotations, weak-supervision, dataset, data-analysis, outlier-detection, crowdsourcing, labeling, data-cleaning, hacktoberfest, active-learning, data-quality, data-profiling, robust-machine-learning, noisy-labels, out-of-distribution-detection, data-labeling, data-centric-ai

These are the star and fork statistics for the cleanlab/cleanlab repository. As of 26 Apr 2024, the repository has 6,744 stars and 552 forks.

cleanlab helps you clean data and labels by automatically detecting issues in an ML dataset. To facilitate machine learning with messy, real-world data, this data-centric AI package uses your existing models to estimate dataset problems that can be fixed to train even better models.

    # cleanlab works with **any classifier**. Yup, you can use PyTorch/TensorFlow/OpenAI/XGBoost/etc.
    cl = cleanlab.classification.CleanLearning(sklearn.YourFavoriteClassifier())
    # cleanlab finds data and label issues in **any...
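
To make the idea of "using your existing models to estimate dataset problems" concrete, here is a minimal, dependency-free sketch of one way a classifier's predicted probabilities can flag likely label errors. It is a simplified illustration of per-class confidence thresholding, not cleanlab's actual implementation. The function names (`class_thresholds`, `flag_label_issues`) and the toy data are hypothetical and introduced only for this example.

```python
from typing import List, Sequence


def class_thresholds(labels: Sequence[int],
                     pred_probs: Sequence[Sequence[float]]) -> List[float]:
    """Per-class threshold: mean predicted probability of class k
    over the examples whose given label is k."""
    n_classes = len(pred_probs[0])
    sums = [0.0] * n_classes
    counts = [0] * n_classes
    for label, probs in zip(labels, pred_probs):
        sums[label] += probs[label]
        counts[label] += 1
    # Assumption: a class with no labeled examples gets threshold 1.0,
    # so the model is rarely "confident" about it.
    return [sums[k] / counts[k] if counts[k] else 1.0 for k in range(n_classes)]


def flag_label_issues(labels: Sequence[int],
                      pred_probs: Sequence[Sequence[float]]) -> List[int]:
    """Return indices where the model is confident in a class
    that differs from the given label."""
    thresholds = class_thresholds(labels, pred_probs)
    flagged = []
    for i, (label, probs) in enumerate(zip(labels, pred_probs)):
        # Classes whose probability reaches that class's threshold.
        confident = [k for k, p in enumerate(probs) if p >= thresholds[k]]
        if not confident:
            continue  # the model is not confident about any class
        best = max(confident, key=lambda k: probs[k])
        if best != label:
            flagged.append(i)
    return flagged


# Toy data (hypothetical): example 2 is labeled 0, but the model strongly predicts 1.
labels = [0, 0, 0, 1, 1, 1]
pred_probs = [
    [0.9, 0.1],
    [0.8, 0.2],
    [0.1, 0.9],
    [0.2, 0.8],
    [0.1, 0.9],
    [0.3, 0.7],
]
issues = flag_label_issues(labels, pred_probs)
print(issues)
```

In practice the predicted probabilities should come from out-of-sample (for example, cross-validated) predictions, so the model has not simply memorized the labels it is being asked to audit.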
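| repo | techs | stars | weekly | forks | weekly |
| --- | --- | --- | --- | --- | --- |
| mindee/doctr | Python, Other | 2.2k | 0 | 285 | 0 |
| kottory/NJU-health-report | Python | 107 | 0 | 368 | 0 |
| F-Society-Freaks/TikTok-Shares-Botter | Python | 127 | 0 | 160 | 0 |
| maurosoria/dirsearch | Python, HTML, Dockerfile | 10.4k | 0 | 2.2k | 0 |
| jwyang/faster-rcnn.pytorch | Python, C, Cuda | 7.4k | 0 | 2.3k | 0 |
| neuml/txtai | Python, Other | 5.2k | 0 | 398 | 0 |
| Ultimaker/Cura | Python, QML, GLSL | 5.3k | 0 | 1.9k | 0 |
| zendesk/zendesk_jwt_sso_examples | Classic ASP, Java, Ruby | 140 | 0 | 84 | 0 |
| Xilinx/linux-xlnx | C, Assembly, Shell | 1.2k | 0 | 1.5k | 0 |
| Azure/azure-iot-sdk-c | C, C++, CMake | 558 | +1 | 753 | -1 |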