This is stars and forks stats for /CarperAI/trlx repository. As of 29 Apr, 2024 this repository has 3932 stars and 411 forks.
Transformer Reinforcement Learning X trlX is a distributed training framework designed from the ground up to focus on fine-tuning large language models with reinforcement learning using either a provided reward function or a reward-labeled dataset. Training support for 🤗 Hugging Face models is provided by Accelerate-backed trainers, allowing users to fine-tune causal and T5-based language models of up to 20B parameters, such as facebook/opt-6.7b, EleutherAI/gpt-neox-20b, and google/flan-t5-xxl. For...
Transformer Reinforcement Learning X trlX is a distributed training framework designed from the ground up to focus on fine-tuning large language models with reinforcement learning using either a provided reward function or a reward-labeled dataset. Training support for 🤗 Hugging Face models is provided by Accelerate-backed trainers, allowing users to fine-tune causal and T5-based language models of up to 20B parameters, such as facebook/opt-6.7b, EleutherAI/gpt-neox-20b, and google/flan-t5-xxl. For...
repo | techs | stars | weekly | forks | weekly |
---|---|---|---|---|---|
IBM-EPBL/IBM-Project-35221-1660282887 | PythonCSSHTML | 57 | 0 | 63 | 0 |
octallium/modern-python-101 | Python | 289 | 0 | 168 | 0 |
duncantl/RExcelXML | RShellC | 6 | 0 | 2 | 0 |
pharo-nosql/voyage | SmalltalkShell | 34 | 0 | 21 | 0 |
ZeframLou/foundry-canary | SolidityShell | 44 | 0 | 0 | 0 |
robotichead/NearBeach | VueCSSPython | 133 | 0 | 59 | 0 |
alfg/docker-nginx-rtmp | XSLTDockerfileShell | 979 | 0 | 392 | 0 |
edk2-porting/edk2-msm | ASLCC++ | 2.1k | +3 | 404 | +2 |
Edwinliby/Hacktoberfest2022 | Jupyter NotebookCPython | 57 | 0 | 216 | 0 |
Mridul-1-Sharma/Hacktoberfest2022-DataStructuresAndAlgorithms | C++JavaC | 42 | 0 | 114 | 0 |