rl

repotechsstarsweeklyforksweekly
hiyouga/LLaMA-Efficient-TuningPython4.8k+249946+26
ray-project/rayPythonC++Java28.1k+1134.8k+13
GaryYufei/AlignLLMHumanSurvey421+15170
Docta-ai/doctaPythonShell1.6k+12132+4
THUDM/WebGLMPythonShell1.3k+51240
udacity/deep-reinforcement-learningJupyter NotebookPythonTeX4.6k02.3k0
Upbolt/HydroxideLua32001150
q9f/eth.rbRuby1800600
thu-ml/tianshouPython6.7k01k0
google/dopamineJupyter NotebookPythonOther10.2k01.4k0
status-im/nim-ethNim740300
LAION-AI/Open-AssistantPythonTypeScriptJavaScript35.4k03.2k0
opendilab/awesome-RLHF1.9k01230
RUCAIBox/LLMSurveyPythonShellJavaScript5.9k04740
argilla-io/argillaPythonVueJavaScript2.6k02470
tatsu-lab/alpaca_evalJupyter NotebookPython6580900
hiyouga/ChatGLM-Efficient-TuningPython3.1k04550

Popular technologies

Popular topics