This is stars and forks stats for /allenai/RL4LMs repository. As of 23 Apr, 2024 this repository has 1838 stars and 168 forks.
🤖 RL4LMs 🚀 A modular RL library to fine-tune language models to human preferences We provide easily customizable building blocks for training language models including implementations of on-policy algorithms, reward functions, metrics, datasets and LM based actor-critic policies Paper Link: https://arxiv.org/abs/2210.01241 Website Link: https://rl4lms.apps.allenai.org/ Thoroughly tested and benchmarked with over 2000 experiments 🔥 (GRUE benchmark 🏆) on a comprehensive set of: 7 different Natural...
🤖 RL4LMs 🚀 A modular RL library to fine-tune language models to human preferences We provide easily customizable building blocks for training language models including implementations of on-policy algorithms, reward functions, metrics, datasets and LM based actor-critic policies Paper Link: https://arxiv.org/abs/2210.01241 Website Link: https://rl4lms.apps.allenai.org/ Thoroughly tested and benchmarked with over 2000 experiments 🔥 (GRUE benchmark 🏆) on a comprehensive set of: 7 different Natural...
repo | techs | stars | weekly | forks | weekly |
---|---|---|---|---|---|
openai/glide-text2im | PythonJupyter Notebook | 3.3k | 0 | 465 | 0 |
De3vil/KLogger | PythonShell | 281 | 0 | 46 | 0 |
Methexis-Inc/terminal-copilot | Python | 528 | 0 | 33 | 0 |
Rongjiehuang/GenerSpeech | PythonShell | 279 | 0 | 43 | 0 |
prisma/prisma-engines | RustTypeScriptNix | 973 | 0 | 171 | 0 |
RedisJSON/RedisJSON | RustPythonShell | 3.7k | 0 | 313 | 0 |
mariussoutier/PlayBasics | ScalaHTMLJava | 165 | 0 | 63 | 0 |
typelevel/fabric | ScalaShell | 114 | 0 | 6 | 0 |
manojthemiracle/poc | SmartyJavaDockerfile | 0 | 0 | 14 | 0 |
coinspect/learn-evm-attacks | SolidityShell | 1.3k | 0 | 147 | 0 |