Paper page - OpenSkillEval: Automatically Auditing the Open Skill Ecosystem for LLM Agents
… It first measures how well different models and agent frameworks handle real downstream tasks — with and without skill augmentation — and then runs controlled, same-task comparisons across community-contributed skills, logging quality alongside token and time cost. …