Launches
Follow our journey building SkillsBench.
Follow our journey building SkillsBench.
Announcing SkillsBench Week 1 updates: We're building the first benchmark that measures how well skills work and how well agents use skills. In the first week we are able to grow significantly in task numbers and the contributor community. Early results: Skills boost agent Show more
Introducing SkillsBench - also call for open-source contributors SkillsBench is the first benchmark that tests whether agent skills can improve performance, and how well agents use skills. If you are interested, check out the GitHub in the comment, Dm, and join our Discord