EleutherAI/lm-evaluation-harness

A framework for few-shot evaluation of autoregressive language models.

PythonOther
This is stars and forks stats for /EleutherAI/lm-evaluation-harness repository. As of 29 Apr, 2024 this repository has 2538 stars and 675 forks.

Language Model Evaluation Harness We're Refactoring LM-Eval! (as of 6/15/23) We have a revamp of the Evaluation Harness library internals staged on the big-refactor branch! It is far along in progress, but before we start to move the master branch of the repository over to this new design with a new version release, we'd like to ensure that it's been tested by outside users and there are no glaring bugs. We’d like your help to test it out! you can help by: Trying out your current workloads on the...
Read on GithubGithub Stats Page
repotechsstarsweeklyforksweekly
uberduck-ai/uberduck-ml-devRoffPythonOther378+1630
SJTU-ViSYS/M2DGRSCSSLiquidJavaScript67401020
Qonfused/WarpSpeedStarlarkPython1000
noise-lab/research-courseTeXHTMLRuby91090
GSI-CS-CO/bel_projectsVHDLCSystemVerilog120130
OblivionTime/chatVueJavaScriptOther2990650
wanglin2/mind-mapVueJavaScriptOther1.2k02060
buroa/k8s-gitopsYAMLHCLOther67010
adityarajpt/HelloWorldInatorBrainfuckCShell12080
snltty/p2p-tunnelC#JavaScriptVue40701010