gururise/AlpacaDataCleaned

Alpaca dataset from Stanford, cleaned and curated

PythonHTMLJavaScript
This is stars and forks stats for /gururise/AlpacaDataCleaned repository. As of 06 May, 2024 this repository has 1260 stars and 133 forks.

🦙🛁 Cleaned Alpaca Dataset Welcome to the Cleaned Alpaca Dataset repository! This repository hosts a cleaned and curated version of a dataset used to train the Alpaca LLM (Large Language Model). The original dataset had several issues that are addressed in this cleaned version. On April 8, 2023 the remaining uncurated instructions (~50,000) were replaced with data from the GPT-4-LLM dataset. Curation of the incoming GPT-4 data is ongoing. A 7b Lora model (trained on April 8, 2023) is available on...
Read on GithubGithub Stats Page
repotechsstarsweeklyforksweekly
binary-husky/chatgpt_academicPythonCSSOther42.9k+4375.6k+41
sahil280114/codealpacaPython1.3k0960
feizc/MLE-LLaMAPython2920190
Helixform/CodeCursorTypeScriptRustCSS1.3k0520
matter-labs/zksync-eraRustTypeScriptSolidity89008440
neilmiddleton/neilmiddleton.github.ioSassCSSHTML0020
liusj5257/azurlane_anti_nameShellPython4280930
bazelbuild/bazel-central-registryStarlarkPythonShell17401360
t3dotgg/chirpTypeScriptJavaScriptCSS3400590
luoxuhai/AlockTypeScriptObjective-CSwift5510400