hermes - 💡(How to fix) Fix Research disclosure: static review observations for agent-facing skill artifacts [1 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
NousResearch/hermes-agent#17497Fetched 2026-04-30 06:47:09
View on GitHub
Comments
0
Participants
1
Timeline
3
Reactions
0
Participants
Timeline (top)
labeled ×3

Root Cause

This is a static artifact review and should not be read as a claim of confirmed exploitation. We are sharing it because this repository combined moderate finding volume with several higher-signal artifact-backed structural patterns.

RAW_BUFFERClick to expand / collapse

Hello maintainers,

We are conducting a research study on repository-local agent instruction surfaces and recently analyzed this repository with agent-audit.

This is a static artifact review and should not be read as a claim of confirmed exploitation. We are sharing it because this repository combined moderate finding volume with several higher-signal artifact-backed structural patterns.

High-level scan summary:

  • 96 raw findings
  • 87 clustered issue instances
  • 9 multi-signal issue instances
  • 1 cross-tool issue instance

Review these files first:

  1. skills/red-teaming/godmode/SKILL.md

    • canonical class: identity_rewrite_with_persistent_effect
    • rule: asamm.AI-04.persistent-identity-rewrite
    • why it stood out: identity-rewrite language plus persistent config/write-path cues in the same file
  2. skills/research/research-paper-writing/SKILL.md

    • canonical class: autonomous_execution_or_looping
    • rule: asamm.AD-02.autonomous-loop-with-writes
    • why it stood out: autonomous workflow framing combined with write-action language
  3. optional-skills/email/agentmail/SKILL.md

    • canonical class: broad_external_action_without_approval
    • rule: asamm.AD-02.broad-action-without-approval
    • why it stood out: external-action capability without clear per-action approval/scoping
  4. skills/autonomous-ai-agents/claude-code/SKILL.md

    • canonical class: unsafe_command_or_execution_surface
    • rule: atr.privilege-escalation.shell-metacharacter-injection-in-tool-arguments
    • why it stood out: shell / command execution pattern
  5. skills/autonomous-ai-agents/hermes-agent/SKILL.md

    • canonical class: remote_fetch_or_install_expands_trust_boundary
    • rule: aguara.external-download.curl-or-wget-piped-to-shell
    • why it stood out: remote fetch / install pattern associated with trust-boundary expansion

Questions that would help interpret these findings:

  • Are these skills intended as operative defaults, optional power-user workflows, or red-team/reference content?
  • Is persistence/approval logic intentionally documented outside the skill files themselves?
  • Are some of the flagged artifacts intentionally adversarial or exploratory rather than intended for ordinary execution?

Method and dataset reference: https://github.com/scadastrangelove/agent-audit/tree/main/artifacts/article-support-dataset-v1

We would appreciate any maintainer context, especially for whether these files are meant to be operative defaults versus optional or demonstrative content.

extent analysis

TL;DR

Review the identified SKILL.md files to determine if the flagged artifacts are intended for ordinary execution or are part of red-team/reference content.

Guidance

  • Examine the SKILL.md files in the specified directories to understand the context and purpose of the flagged code patterns.
  • Determine if the skills are intended as operative defaults, optional power-user workflows, or red-team/reference content to assess the risk associated with the findings.
  • Check if persistence/approval logic is intentionally documented outside the skill files themselves to ensure that the necessary security controls are in place.
  • Consider providing context on whether some of the flagged artifacts are intentionally adversarial or exploratory to help interpret the findings.

Notes

The issue lacks information on the specific repository and its intended use, which may affect the interpretation of the findings. Maintainer context is necessary to determine the appropriate course of action.

Recommendation

Apply workaround: Review and refine the identified SKILL.md files to ensure they align with the intended use and security requirements of the repository, and provide context to help interpret the findings.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING

hermes - 💡(How to fix) Fix Research disclosure: static review observations for agent-facing skill artifacts [1 participants]