This is stars and forks stats for /openai/evals repository. As of 27 Apr, 2024 this repository has 12088 stars and 2278 forks.
OpenAI Evals Evals is a framework for evaluating LLMs (large language models) or systems built using LLMs as components. It also includes an open-source registry of challenging evals. We now support evaluating the behavior of any system including prompt chains or tool-using agents, via the Completion Function Protocol. With Evals, we aim to make it as simple as possible to build an eval while writing as little code as possible. An "eval" is a task used to evaluate the quality of a system's behavior. To...
OpenAI Evals Evals is a framework for evaluating LLMs (large language models) or systems built using LLMs as components. It also includes an open-source registry of challenging evals. We now support evaluating the behavior of any system including prompt chains or tool-using agents, via the Completion Function Protocol. With Evals, we aim to make it as simple as possible to build an eval while writing as little code as possible. An "eval" is a task used to evaluate the quality of a system's behavior. To...
repo | techs | stars | weekly | forks | weekly |
---|---|---|---|---|---|
THUDM/GLM | PythonShellDockerfile | 2.7k | +6 | 271 | 0 |
awslabs/mountpoint-s3 | RustShellPython | 3.4k | 0 | 90 | 0 |
apache/incubator-opendal | RustJavaMDX | 1.9k | +17 | 275 | +2 |
facebook/buck2 | RustStarlarkPython | 2.9k | 0 | 151 | 0 |
creativetimofficial/corporate-ui-dashboard | SCSSCSSHTML | 19 | 0 | 68 | 0 |
giuspek/FormalMethods2023 | SMTPython | 3 | 0 | 8 | 0 |
QingyangKong/ChainlinkLearningPath | SolidityJavaScript | 40 | 0 | 38 | 0 |
facebook/buck2-prelude | StarlarkPythonErlang | 31 | 0 | 20 | 0 |
sveltia/sveltia-cms | JavaScriptSvelte | 279 | 0 | 15 | 0 |
vda-lab/datavis-technologies-handson | SvelteJavaScriptOther | 1 | 0 | 125 | 0 |