Train AI agent skill documents like neural networks to systematically improve performance without touching model weights.
SkillOpt is a framework from Microsoft researchers for systematically improving the performance of AI agents. It addresses the challenge of unreliable, hand-crafted agent skills by treating the skill document itself as a trainable component. The framework applies a rigorous, deep-learning-style optimization process to refine these skills based on performance feedback, without altering the agent's core model weights.
SkillOpt is an open-source Python library used via the command line. Users configure a benchmark and an initial skill document. The framework runs an optimization loop that executes agent tasks, scores the outcomes, and uses an optimizer model to suggest edits to the skill document. Accepted edits are compiled into a final best_skill.md file, which can be used as instructions for the original frozen agent model.
This framework is best for AI developers and researchers who need to systematically optimize and reproduce improvements in agent capabilities for specific tasks, moving beyond manual prompt engineering.
SkillOpt is a specialized tool for developers, not a consumer application. It requires significant setup, including defining benchmark environments and evaluation metrics, to be used effectively.