lvwerra/trl

Train transformer language models with reinforcement learning.

Python, Shell, Makefile

Stars and forks stats for /lvwerra/trl

Forks chart: daily fork count from 2023-06-26 (418 forks) to 2023-09-23 (598 forks).

598 forks in total, +192 in the last 90 days

Stars chart: daily star count from 2023-06-26 (3857 stars) to 2023-09-23 (5699 stars).

5.7k stars in total, +2k in the last 90 days

These are the star and fork stats for the lvwerra/trl repository. As of 23 Sep 2023, the repository has 5699 stars and 598 forks.

TRL - Transformer Reinforcement Learning

Full stack transformer language models with reinforcement learning.

What is it?

trl is a full stack library where we provide a set of tools to train transformer language models and stable diffusion models with Reinforcement Learning, from the Supervised Fine-tuning step (SFT), Reward Modeling step (RM) to the Proximal Policy Optimization (PPO) step. The library is built on top of the transformers library...