Notice: This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Python · bigcode-project/bigcodebench

bigcodebench

[ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI

59.2/100
505 · Forks: 72

Similar Projects

kani

85

kani (カニ) is a highly hackable microframework for tool-calling language models. (NLP-OSS @ EMNLP 2023)

Python · 603

promptbench

56

A unified evaluation framework for large language models

Python · 2.8K

vim-ai

56

AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim and Neovim.

Python · 1.2K

langflow

95

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

Python · 149.5K