benchflow-ai / skillsbench
SkillsBench evaluates how well skills work and how effective agents are at using them
8000
See what the GitHub community is most excited about today.
SkillsBench evaluates how well skills work and how effective agents are at using them