agentic · Pending curation
TheAgentCompany
175 professional workplace tasks set in a simulated software company, with 16 AI colleagues, 4 integrated platforms, and 6 job-role domains. Best agent: ~43%. Published at NeurIPS 2025.
Year: 2024
Why our crawl picked it up
Notes the discovery agent wrote when proposing this benchmark.
(no notes recorded)
This entry was added by an automated crawl and hasn't been curated yet. Once it's reviewed and promoted into the bundled set, you'll see task anatomy, examples, scores, and richer context here.