This is stars and forks stats for /lvwerra/trl repository. As of 05 May, 2024 this repository has 5887 stars and 624 forks.
TRL - Transformer Reinforcement Learning Full stack transformer language models with reinforcement learning. What is it? trl is a full stack library where we provide a set of tools to train transformer language models and stable diffusion models with Reinforcement Learning, from the Supervised Fine-tuning step (SFT), Reward Modeling step (RM) to the Proximal Policy Optimization (PPO) step. The library is built on top of the transformers library...
TRL - Transformer Reinforcement Learning Full stack transformer language models with reinforcement learning. What is it? trl is a full stack library where we provide a set of tools to train transformer language models and stable diffusion models with Reinforcement Learning, from the Supervised Fine-tuning step (SFT), Reward Modeling step (RM) to the Proximal Policy Optimization (PPO) step. The library is built on top of the transformers library...
repo | techs | stars | weekly | forks | weekly |
---|---|---|---|---|---|
jessarcher/dotfiles | LuaShellJavaScript | 519 | 0 | 72 | 0 |
wolfSSL/documentation | CMakefileCSS | 9 | 0 | 17 | 0 |
quicklyon/zentao-docker | ShellPHPMakefile | 16 | 0 | 8 | 0 |
nf-core/hgtseq | NextflowGroovyPython | 18 | 0 | 3 | 0 |
nf-core/rnasplice | NextflowPythonR | 9 | 0 | 13 | 0 |
nf-core/cageseq | NextflowPythonPerl | 9 | 0 | 9 | 0 |
nf-core/kmermaid | NextflowPythonHTML | 18 | 0 | 9 | 0 |
ArulselvanMadhavan/diffusers-ocaml | OCamlDockerfileMakefile | 21 | 0 | 1 | 0 |
mohammadpz/pytorch_forward_forward | Python | 1.4k | 0 | 136 | 0 |
Fantasy-Studio/Paint-by-Example | PythonShell | 765 | +6 | 75 | 0 |