CLUEbenchmark/SuperCLUE - stats on ReviewGithub

evaluation chinese gpt-4 foundation-models chatgpt

This is stars and forks stats for /CLUEbenchmark/SuperCLUE repository. As of 08 May, 2024 this repository has 1818 stars and 61 forks.

SuperCLUE 中文通用大模型综合性基准SuperCLUE SuperCLUE最新9月榜单文章地址：www.cluebenchmarks.com/superclue.html 技术报告：SuperCLUE: A Comprehensive Chinese Large Language Model Benchmark 【2023-9-12】 SuperCLUE-Safety：中文大模型多轮对抗安全基准【9月26日】，SuperCLUE发布中文大模型9月榜单。 SuperCLUE是一个综合性大模型评测基准，本次评测主要聚焦于大模型的四个能力象限，包括语言理解与生成、专业技能与知识、Agent智能体和安全性，进而细化为12项基础能力。相比与上月，新增了AI Agent智能体 SuperCLUE能力评估结构图 SuperCLUE多维度测评方案为什么新增AI Agent智能体能力？ AI agent（智能体）是当前与大语言模型相关的前沿研究热点，拥有类似贾维斯等科幻电影中人类超级助手的能力，可以根据需求自主的完成任务。然而，面向AI agent智能体，缺乏针对中文大模型的广泛评估。为了解决这一问题，我们在SuperCLUE新的榜单中新增了AI...

Read on Github Github Stats Page

repo	techs	stars	weekly	forks	weekly
curiousily/Get-Things-Done-with-Prompt-Engineering-and-LangChain	Jupyter Notebook	442	0	147	0
e-johnstonn/FableForge	Python	318	0	53	0
HqWu-HITCS/Awesome-Chinese-LLM		3.3k	+110	288	+5
refuel-ai/autolabel	PythonJupyter NotebookMakefile	1.4k	0	89	0
zjunlp/DeepKE	PythonJupyter NotebookOther	2.4k	0	565	0
chat2db/Chat2DB	JavaTypeScriptLess	7.7k	0	1k	0
ramonvc/freegpt-webui	PythonJavaScriptCSS	5.3k	0	1.6k	0
rshipp/awesome-malware-analysis		10.3k	0	2.5k	0
facexl/AI-chat	LessVueTypeScript	32	0	5	0
embedchain/embedchain	PythonTypeScriptJupyter Notebook	5.5k	0	1k	0