bigcodebench
[ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI
Maintained by bigcode-project
Project Information
- GitHub Stars
- 338
- Language
- Python
- Last Updated
- April 17, 2025 at 11:58 AM
Topics
benchmark
chatgpt
code-generation
large-language-models
program-synthesis
tool-use
function-calling
instruction-following
claude-3
gemini
gpt-4
deepseek
llm
agent
agents