CarperAI/trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

PythonShellDockerfilemachine-learningreinforcement-learningpytorch
This is stars and forks stats for /CarperAI/trlx repository. As of 29 Apr, 2024 this repository has 3932 stars and 411 forks.

Transformer Reinforcement Learning X trlX is a distributed training framework designed from the ground up to focus on fine-tuning large language models with reinforcement learning using either a provided reward function or a reward-labeled dataset. Training support for 🤗 Hugging Face models is provided by Accelerate-backed trainers, allowing users to fine-tune causal and T5-based language models of up to 20B parameters, such as facebook/opt-6.7b, EleutherAI/gpt-neox-20b, and google/flan-t5-xxl. For...
Read on GithubGithub Stats Page
repotechsstarsweeklyforksweekly
IBM-EPBL/IBM-Project-35221-1660282887PythonCSSHTML570630
octallium/modern-python-101Python28901680
duncantl/RExcelXMLRShellC6020
pharo-nosql/voyageSmalltalkShell340210
ZeframLou/foundry-canarySolidityShell44000
robotichead/NearBeachVueCSSPython1330590
alfg/docker-nginx-rtmpXSLTDockerfileShell97903920
edk2-porting/edk2-msmASLCC++2.1k+3404+2
Edwinliby/Hacktoberfest2022Jupyter NotebookCPython5702160
Mridul-1-Sharma/Hacktoberfest2022-DataStructuresAndAlgorithmsC++JavaC4201140