opendilab/awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

reinforcement-learningdeep-learningdeep-reinforcement-learninglarge-language-modelshuman-feedbackrlhf
This is stars and forks stats for /opendilab/awesome-RLHF repository. As of 04 May, 2024 this repository has 1907 stars and 123 forks.

Awesome RLHF (RL with Human Feedback) This is a collection of research papers for Reinforcement Learning with Human Feedback (RLHF). And the repository will be continuously updated to track the frontier of RLHF. Welcome to follow and star! Table of Contents Awesome RLHF (RL with Human Feedback) Table of Contents Overview of RLHF Detailed Explanation Papers 2023 2022 2021 2020 and before Codebases Dataset Blogs Other Language Support Contributing License Overview of RLHF The idea of RLHF is to...
Read on GithubGithub Stats Page
repotechsstarsweeklyforksweekly
34j/so-vits-svc-forkPythonJupyter NotebookBatchfile7.1k+511k+4
andri27-ts/Reinforcement-LearningJupyter NotebookPython3.9k05880
microsoft/LoRAPython6.7k+85396+8
enricoros/nextjs-chatgpt-appTypeScriptCSSOther1.6k+14443+8
binary-husky/chatgpt_academicPythonCSSOther42.9k+4375.6k+41
stochasticai/xturingPython2.3k01840
waldo-vision/waldoTypeScriptJavaScriptOther1990210
junshutang/Make-It-3DPythonCudaOther1.4k+19800
OpenNMT/CTranslate2C++PythonCuda2k+25181+1
ymcui/Chinese-LLaMA-AlpacaPythonShell14.9k01.5k0