lvwerra/trl

Train transformer language models with reinforcement learning.

PythonShellMakefile
This is stars and forks stats for /lvwerra/trl repository. As of 05 May, 2024 this repository has 5887 stars and 624 forks.

TRL - Transformer Reinforcement Learning Full stack transformer language models with reinforcement learning. What is it? trl is a full stack library where we provide a set of tools to train transformer language models and stable diffusion models with Reinforcement Learning, from the Supervised Fine-tuning step (SFT), Reward Modeling step (RM) to the Proximal Policy Optimization (PPO) step. The library is built on top of the transformers library...
Read on GithubGithub Stats Page
repotechsstarsweeklyforksweekly
jessarcher/dotfilesLuaShellJavaScript5190720
wolfSSL/documentationCMakefileCSS90170
quicklyon/zentao-dockerShellPHPMakefile16080
nf-core/hgtseqNextflowGroovyPython18030
nf-core/rnaspliceNextflowPythonR90130
nf-core/cageseqNextflowPythonPerl9090
nf-core/kmermaidNextflowPythonHTML18090
ArulselvanMadhavan/diffusers-ocamlOCamlDockerfileMakefile21010
mohammadpz/pytorch_forward_forwardPython1.4k01360
Fantasy-Studio/Paint-by-ExamplePythonShell765+6750