bigcodebench logo

bigcodebench

[ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI

Maintained by bigcode-project

Project Information

GitHub Stars
338
Language
Python
Last Updated
April 17, 2025 at 11:58 AM

Topics

benchmark
chatgpt
code-generation
large-language-models
program-synthesis
tool-use
function-calling
instruction-following
claude-3
gemini
gpt-4
deepseek
llm
agent
agents

Explore More

Discover similar projects or browse the full catalog.