hermes - 💡(How to fix) Fix Feature Request: Integrate Microsoft SkillOpt for Self-Evolving Agent Skills

hermes2026-05-27 01:33:14

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

Microsoft recently released SkillOpt — a text-space optimizer that trains reusable natural-language skills for frozen LLM agents through trajectory-driven edits, validation-gated updates, and deployable best_skill.md artifacts.

This issue proposes exploring SkillOpt integration with Hermes to enable automated skill optimization without touching model weights.

Root Cause

This issue proposes exploring SkillOpt integration with Hermes to enable automated skill optimization without touching model weights.

Fix Action

Fix / Workaround

Pipeline (6 stages):

Rollout — execute episodes with current skill
Reflect — analyze trajectories, generate patches
Aggregate — hierarchical merge of patches
Select — rank and select top edits
Update — apply edits to skill document
Evaluate — validate candidate skill, accept/reject

Code Example

hermes skillopt train --config configs/hermes/default.yaml --split_dir ./my_tasks
hermes skillopt import ./outputs/best_skill.md --name my-optimized-skill

RAW_BUFFERClick to expand / collapse

Feature Request: Integrate Microsoft SkillOpt for Self-Evolving Agent Skills

Summary

This issue proposes exploring SkillOpt integration with Hermes to enable automated skill optimization without touching model weights.

What is SkillOpt?

Paper: arXiv:2605.23904
Project Page: https://microsoft.github.io/SkillOpt/
License: MIT

Core concept: Train agent skills like you train neural networks — with epochs, (mini-)batchsize, learning rates, and validation gates — but without touching model weights.

Pipeline (6 stages):

Rollout — execute episodes with current skill
Reflect — analyze trajectories, generate patches
Aggregate — hierarchical merge of patches
Select — rank and select top edits
Update — apply edits to skill document
Evaluate — validate candidate skill, accept/reject

Output: best_skill.md — a deployable skill document

Why Integrate with Hermes?

Hermes Current	With SkillOpt
Skills are manually written SKILL.md files	Skills can be auto-optimized from task trajectories
Skill improvement requires human iteration	Skill improvement is data-driven with validation gates
No systematic skill evaluation framework	Built-in train/val/test split evaluation
Skills are static after creation	Skills can self-evolve over time

Proposed Integration Paths

Option A: CLI Wrapper Skill (Low Effort)

Create a Hermes skill that wraps SkillOpt CLI:

hermes skillopt train --config configs/hermes/default.yaml --split_dir ./my_tasks
hermes skillopt import ./outputs/best_skill.md --name my-optimized-skill

Option B: Native Integration (Medium Effort)

Add SkillOpt as optional dependency
Provide hermes skill optimize <skill-name> command
Auto-generate training data from Hermes session history
Integrate with existing skill registry

Option C: Deep Integration (High Effort)

Replace/adapt Hermes skill system to use SkillOpt's document-based skill representation
Real-time skill optimization during agent execution
Continuous learning from user feedback

Technical Compatibility

✅ Python 3.10+ (matches Hermes)
✅ Supports Azure OpenAI / OpenAI / Anthropic / local vLLM
✅ MIT license
⚠️ Requires JSON training data in specific format
⚠️ Currently benchmark-oriented (SearchQA, ALFWorld, etc.), needs adaptation for general tasks

Next Steps

Evaluate SkillOpt on a Hermes skill (e.g., github-pr-workflow or requesting-code-review)
Prototype Option A as a proof-of-concept
Gather feedback from maintainers and community

References

This is a research/experimental feature proposal. Happy to contribute a POC if there's interest!

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

hermes - 💡(How to fix) Fix Feature Request: Integrate Microsoft SkillOpt for Self-Evolving Agent Skills

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Fix Action

Fix / Workaround

Code Example

Feature Request: Integrate Microsoft SkillOpt for Self-Evolving Agent Skills

Summary

What is SkillOpt?

Why Integrate with Hermes?

Proposed Integration Paths

Option A: CLI Wrapper Skill (Low Effort)

Option B: Native Integration (Medium Effort)

Option C: Deep Integration (High Effort)

Technical Compatibility

Next Steps

References

Still need to ship something?

TRENDING