huggingface/datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

PythonJupyter Notebooknlpmachine-learningnatural-language-processingcomputer-visiondeep-learningtensorflownumpyspeechpandaspytorchdatasetshacktoberfest
This is stars and forks stats for /huggingface/datasets repository. As of 25 Apr, 2024 this repository has 17291 stars and 2339 forks.

🤗 Datasets is a lightweight library providing two main features: one-line dataloaders for many public datasets: one-liners to download and pre-process any of the major public datasets (image datasets, audio datasets, text datasets in 467 languages and dialects, etc.) provided on the HuggingFace Datasets Hub. With a simple command like squad_dataset = load_dataset("squad"), get any of these datasets ready to use in a dataloader for training/evaluating a ML model (Numpy/Pandas/PyTorch/TensorFlow/JAX), efficient...
Read on GithubGithub Stats Page
repotechsstarsweeklyforksweekly
tssovi/grokking-the-object-oriented-design-interviewPython3.4k+181.9k+1
decidim/decidimRubyHTMLJavaScript1.3k03750
QuantlabFinancial/cpp_tip_of_the_weekPython1.3k0930
Nusantara-ROM/android_bionicAssemblyCC++2020
CN-annotation-team/redis7.0-chinese-annotatedCTclRuby57301500
kokke/tiny-AES-cCPythonMakefile3.8k01.3k0
steinwurf/fmtCMakePythonC++0000
HermanMartinus/bearblogCSSJavaScriptPython1.8k0590
atsign-foundation/sshnoportsDartShellPython2450130
CVMix/CVMix-srcFortranNCLPython240300