onejune2018/Awesome-LLM-Eval

Awesome-LLM-Eval

Awesome-LLM-Eval: a curated list of tools, datasets/benchmarks, demos, leaderboards, papers, docs, and models, mainly for the evaluation of foundation LLMs, aiming to explore the technical boundaries of generative AI.

58.5/100
Stars: 615, Forks: 51

Similar Projects

Static-to-Dynamic-LLMEval

65

The official GitHub repository of the paper "Recent advances in large language model benchmarks against data contamination: From static to dynamic evaluation"

547

awesome-chatgpt

47

Curated list of awesome tools, demos, docs for ChatGPT and GPT-3

8.2K

SuperCLUE

58

SuperCLUE: A comprehensive benchmark for general-purpose foundation models in Chinese

3.3K

Awesome-LLM-Long-Context-Modeling

73

📰 Must-read papers and blogs on LLM-based Long Context Modeling 🔥

1.9K