allenai/RL4LMs

A modular RL library to fine-tune language models to human preferences

Languages: Python, HTML, Perl, Shell, Dockerfile
Topics: nlp, natural-language-processing, reinforcement-learning, machine-translation, text-generation, language-modeling, summarization, dialogue-generation, table-to-text
Star and fork statistics for the allenai/RL4LMs repository: as of 23 Apr 2024, the repository has 1838 stars and 168 forks.

🤖 RL4LMs 🚀

A modular RL library to fine-tune language models to human preferences

We provide easily customizable building blocks for training language models including implementations of on-policy algorithms, reward functions, metrics, datasets and LM based actor-critic policies

Paper Link: https://arxiv.org/abs/2210.01241
Website Link: https://rl4lms.apps.allenai.org/

Thoroughly tested and benchmarked with over 2000 experiments 🔥 (GRUE benchmark 🏆) on a comprehensive set of: 7 different Natural...