hermes - ✅(Solved) Fix feat(agent): tool call deduplication to prevent redundant execution loops [1 pull requests, 1 participants]

andrewhosf · 2026-04-30T20:10:34Z



GitHub stats

NousResearch/hermes-agent#18076•Fetched 2026-05-01 05:54:03


Author

andrewhosf

Participants

andrewhosf

Timeline (top)

labeled ×3, cross-referenced ×1

Root Cause

PR #16641 (tool-call loop guardrails): Detects repeated failing or non-progressing tool calls within a single turn and injects warnings. Our bug was repeated successful calls across multiple turns that returned the same information.
PR #3006 / #8126 (tool result caching): Caches results for identical calls. This would help but doesn't address the root cause — the model shouldn't generate the redundant calls in the first place.

Fix Action

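Proposed (not merged)

Proposed in PR: feat(agent): add tool call deduplication to prevent redundant execution loops (https://github.com/NousResearch/hermes-agent/pull/18075). The PR is closed and was not merged; its only changed file is the proposal document PR_TOOL_DEDUP_PROPOSAL.md.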

PR fix notes

PR #18075: feat(agent): add tool call deduplication to prevent redundant execution loops

Repository: NousResearch/hermes-agent
Author: andrewhosf
State: closed | merged: False
Link: https://github.com/NousResearch/hermes-agent/pull/18075

Description (problem / solution / changelog)

PR Proposal: Agent-Level Tool Call Deduplication / Loop Prevention

Problem Statement

The Hermes agent can enter expensive loops where it executes the same tool call (e.g., git log, search_files) multiple times in a row with identical arguments, producing identical output each time. This wastes tokens, burns context window space, and delays actual work.

Real-world example from 2026-04-30: The agent ran git log --all --oneline --grep="tokens_per_sec\|tok/s\|token.*sec\|speed" -- cli.py run_agent.py 30+ times consecutively with slightly different grep patterns, all returning the same 7 unrelated commits. The agent never recognized it had already confirmed the feature didn't exist upstream.

Why Existing PRs Don't Fully Solve This

PR #16641 (tool-call loop guardrails): Detects repeated failing or non-progressing tool calls within a single turn and injects warnings. Our bug was repeated successful calls across multiple turns that returned the same information.
PR #3006 / #8126 (tool result caching): Caches results for identical calls. This would help but doesn't address the root cause — the model shouldn't generate the redundant calls in the first place.

Proposed Solution

Add an agent-level tool call deduplication and progress tracking layer that sits between the model's output and actual tool execution.

Core Mechanism

1. Per-turn tool call registry: Hash (tool_name + normalized_args) → output
2. Before executing any tool: Check if this exact call was already made this turn
3. If duplicate: Return cached result with a duplicate: true flag and append a system note: "Note: This tool call was already executed in this turn with identical arguments. Result was: ..."
4. If similar but not exact (same tool, slightly different args, same output): Flag as "no new information" after N repeats
5. Cross-turn tracking (optional): Maintain a short LRU cache of recent tool calls to catch loops that span multiple turns

Implementation Sketch

# agent/tool_dedup.py

import hashlib
import json
import time
from typing import Any


class ToolCallRegistry:
    """Tracks tool calls within a turn to prevent redundant execution."""

    def __init__(self, max_history=100):
        self._history = {}  # hash -> (args, output, timestamp)
        self._max_history = max_history

    def check(self, tool_name: str, args: dict) -> tuple[bool, Any]:
        """Return (True, cached_output) for a repeated call, else (False, None)."""
        key = self._hash(tool_name, args)
        if key in self._history:
            return True, self._history[key][1]
        return False, None

    def record(self, tool_name: str, args: dict, output: Any):
        """Record a tool call result for deduplication."""
        key = self._hash(tool_name, args)
        # Re-insert so a refreshed entry counts as the newest one.
        self._history.pop(key, None)
        self._history[key] = (args, output, time.time())
        # Enforce max_history by evicting the oldest entries
        # (dicts preserve insertion order).
        while len(self._history) > self._max_history:
            del self._history[next(iter(self._history))]

    def _hash(self, tool_name: str, args: dict) -> str:
        # Normalize args: dict keys are sorted, list order stays significant,
        # and values that are not JSON-serializable fall back to str().
        normalized = json.dumps(args, sort_keys=True, default=str)
        return hashlib.sha256(f"{tool_name}:{normalized}".encode()).hexdigest()[:32]

    def reset(self):
        """Clear history at the start of each turn."""
        self._history.clear()
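
The optional cross-turn tracking from the Core Mechanism list could be sketched as a small LRU cache. This is an illustration only, not code from the PR: the class name RecentCallCache, its methods, and the max_entries parameter are hypothetical, and the key derivation simply mirrors the hashing idea in the sketch above.

```python
import hashlib
import json
from collections import OrderedDict
from typing import Any, Optional


class RecentCallCache:
    """Hypothetical bounded LRU of recent tool calls (not part of the PR)."""

    def __init__(self, max_entries: int = 100) -> None:
        self._entries: "OrderedDict[str, Any]" = OrderedDict()
        self._max_entries = max_entries

    @staticmethod
    def make_key(tool_name: str, args: dict) -> str:
        # Same normalization idea as the proposal: sorted dict keys, str() fallback.
        normalized = json.dumps(args, sort_keys=True, default=str)
        return hashlib.sha256(f"{tool_name}:{normalized}".encode()).hexdigest()

    def get(self, tool_name: str, args: dict) -> Optional[Any]:
        """Return the cached output, or None if the call is not in the cache."""
        key = self.make_key(tool_name, args)
        if key not in self._entries:
            return None
        self._entries.move_to_end(key)  # mark as most recently used
        return self._entries[key]

    def put(self, tool_name: str, args: dict, output: Any) -> None:
        """Store an output and evict least recently used entries if over capacity."""
        key = self.make_key(tool_name, args)
        self._entries[key] = output
        self._entries.move_to_end(key)
        while len(self._entries) > self._max_entries:
            self._entries.popitem(last=False)

    def __len__(self) -> int:
        return len(self._entries)
```

Unlike the per-turn registry, a cache like this would not be cleared at turn boundaries, so it only makes sense for tools whose output is expected to stay valid across turns.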

Integration Points

run_agent.py: Instantiate ToolCallRegistry at turn start, check before each tool execution
cli.py: Add config option tool_deduplication.enabled (default: true)
Config: Add to hermes_cli/config.py and cli-config.yaml.example
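
A minimal sketch of how the check could wrap tool execution. The actual structure of run_agent.py is not shown in the PR, so execute_with_dedup, the run_tool callback, and the shape of the returned dict are all assumptions made for illustration; a plain dict stands in for the per-turn registry.

```python
import hashlib
import json
from typing import Any, Callable, Dict


def call_key(tool_name: str, args: dict) -> str:
    """Derive a stable key from the tool name and its normalized arguments."""
    normalized = json.dumps(args, sort_keys=True, default=str)
    return hashlib.sha256(f"{tool_name}:{normalized}".encode()).hexdigest()


def execute_with_dedup(
    seen: Dict[str, Any],
    tool_name: str,
    args: dict,
    run_tool: Callable[[str, dict], Any],
    append_system_note: bool = True,
) -> dict:
    """Run a tool once per turn; later identical calls reuse the first output.

    `seen` is the per-turn history (hypothetical: a plain dict keyed by call_key).
    """
    key = call_key(tool_name, args)
    if key in seen:
        result = {"output": seen[key], "duplicate": True}
        if append_system_note:
            result["note"] = (
                "Note: This tool call was already executed in this turn "
                "with identical arguments."
            )
        return result
    output = run_tool(tool_name, args)
    seen[key] = output
    return {"output": output, "duplicate": False}
```

Clearing `seen` at the start of each turn corresponds to the registry reset described in the proposal, so a new turn re-runs the tool and gets a fresh result.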

Config Defaults

tool_deduplication:
  enabled: true
  # How many identical calls before we force a cache hit
  exact_duplicate_threshold: 1  # Always dedup exact duplicates
  # How many similar calls (same tool, different args, same output) before warning
  no_progress_threshold: 3
  # Whether to append a system note when returning cached results
  append_system_note: true
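
One way these defaults might be represented in code, assuming the YAML has already been parsed into a plain mapping. ToolDedupConfig and load_tool_dedup_config are hypothetical names; how hermes_cli/config.py actually stores its defaults is not shown in the PR.

```python
from dataclasses import dataclass, fields
from typing import Any, Mapping


@dataclass(frozen=True)
class ToolDedupConfig:
    """Hypothetical typed view of the tool_deduplication section."""

    enabled: bool = True
    exact_duplicate_threshold: int = 1
    no_progress_threshold: int = 3
    append_system_note: bool = True


def load_tool_dedup_config(raw: Mapping[str, Any]) -> ToolDedupConfig:
    """Build the config from a parsed mapping, falling back to the defaults."""
    section = raw.get("tool_deduplication") or {}
    known = {f.name for f in fields(ToolDedupConfig)}
    # Unknown keys are ignored rather than raising, so older configs keep loading.
    return ToolDedupConfig(**{k: v for k, v in section.items() if k in known})
```

A missing or empty section yields the documented defaults, so existing configurations would keep working without changes.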

Test Plan

1. Unit tests: tests/agent/test_tool_dedup.py
   - Exact duplicate detection
   - Args normalization (dict order, list order)
   - Registry reset behavior
   - Hash collision safety
2. Integration tests: tests/run_agent/test_tool_dedup_runtime.py
   - Agent makes same read_file call twice in one turn → second returns cached
   - Agent runs git log with different grep patterns → all tracked separately
   - Turn reset: new turn can re-run same tool with fresh result
3. Regression tests:
   - Ensure normal multi-tool workflows still work
   - Ensure deliberate re-checks (e.g., polling process(action="poll")) aren't broken
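
The args-normalization cases could look roughly like the tests below. This is a sketch under assumptions: call_key is a stand-in that reproduces the hashing from the implementation sketch, and the argument names (path, limit, paths) are invented for the example. The proposal does not say whether list order should be normalized; these tests only document what json.dumps with sort_keys=True does.

```python
import hashlib
import json


def call_key(tool_name: str, args: dict) -> str:
    """Stand-in for the proposal's hashing: sorted dict keys, str() fallback."""
    normalized = json.dumps(args, sort_keys=True, default=str)
    return hashlib.sha256(f"{tool_name}:{normalized}".encode()).hexdigest()[:32]


def test_dict_key_order_is_ignored():
    first = call_key("read_file", {"path": "a.py", "limit": 10})
    second = call_key("read_file", {"limit": 10, "path": "a.py"})
    assert first == second


def test_list_order_is_significant():
    # sort_keys only reorders dict keys; lists are serialized as given.
    first = call_key("search_files", {"paths": ["a", "b"]})
    second = call_key("search_files", {"paths": ["b", "a"]})
    assert first != second


def test_tool_name_is_part_of_key():
    assert call_key("read_file", {"path": "a"}) != call_key("search_files", {"path": "a"})
```

If reordered lists are meant to count as the same call, the normalization step would need to sort them explicitly, which changes semantics for tools where order matters.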

Scope

Files to modify: ~5 files
- agent/tool_dedup.py (new)
- run_agent.py (wire into tool execution loop)
- hermes_cli/config.py (add defaults)
- cli-config.yaml.example (document)
- tests/agent/test_tool_dedup.py (new)
- tests/run_agent/test_tool_dedup_runtime.py (new)
Risk: Low. This is a pure optimization — it only prevents redundant execution, never blocks new calls.

Alternative: Simpler Approach

Instead of full deduplication, we could add a "no new information" detector:

After any tool call, hash the output
If the same tool was called recently with the same output, append a note: "This produced the same result as the previous call. Consider if you already have the answer."
This is lighter weight but less aggressive about preventing the redundant execution itself.
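
A minimal sketch of such a detector, assuming tool outputs are strings. NoNewInfoDetector and its observe method are hypothetical names, and the threshold parameter is borrowed from the no_progress_threshold setting rather than taken from any existing implementation.

```python
import hashlib
from collections import defaultdict
from typing import Dict, Optional

NOTE = (
    "This produced the same result as the previous call. "
    "Consider if you already have the answer."
)


class NoNewInfoDetector:
    """Hypothetical detector: flags a tool that keeps returning the same output."""

    def __init__(self, threshold: int = 3) -> None:
        self._threshold = threshold
        self._last_digest: Dict[str, str] = {}
        self._repeats: Dict[str, int] = defaultdict(int)

    def observe(self, tool_name: str, output: str) -> Optional[str]:
        """Record an output; return a note once it has repeated `threshold` times."""
        digest = hashlib.sha256(output.encode()).hexdigest()
        if self._last_digest.get(tool_name) == digest:
            self._repeats[tool_name] += 1
        else:
            # New output for this tool: restart the consecutive-repeat count.
            self._last_digest[tool_name] = digest
            self._repeats[tool_name] = 1
        if self._repeats[tool_name] >= self._threshold:
            return NOTE
        return None
```

Because it compares outputs rather than arguments, a detector like this would also catch the reported loop, where the grep pattern changed slightly on every call but the result did not.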

Recommendation

Implement the full deduplication approach. The cost is low (one dict lookup per tool call), the benefit is high (prevents the exact loop we hit), and it generalizes to any idempotent tool.

Related PRs

#16641: Tool-call loop guardrails (warning-first, single-turn)
#3006: RAM-backed tool result cache
#8126: Opt-in result memoization for idempotent tools

This PR would complement those by addressing the execution layer, not just the result caching or warning layer.

Changed files

PR_TOOL_DEDUP_PROPOSAL.md (added, +126/-0)

extent analysis

TL;DR

Implement the proposed agent-level tool call deduplication and progress tracking layer to prevent redundant tool executions.

Guidance

Review the ToolCallRegistry class implementation to ensure it correctly handles tool call deduplication and progress tracking.
Integrate the ToolCallRegistry into the run_agent.py file to check for duplicate tool calls before execution.
Configure the tool_deduplication settings in hermes_cli/config.py and cli-config.yaml.example to enable deduplication and set thresholds for exact and similar tool calls.
Develop comprehensive unit tests and integration tests to verify the correctness of the deduplication mechanism.

Example

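# Example usage of ToolCallRegistry (assumes the class from the Implementation Sketch is in scope)
registry = ToolCallRegistry()
tool_name = "git log"
args = {"--all": True, "--oneline": True}

is_duplicate, cached_output = registry.check(tool_name, args)
if is_duplicate:
    # Reuse the earlier result instead of running the tool again
    print("Duplicate tool call detected")
    output = cached_output
else:
    # Execute the tool call here, then record its result
    output = "commit hash"  # placeholder for the real tool output
    registry.record(tool_name, args, output)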

Notes

The proposed solution assumes that tool calls are idempotent: repeated calls with the same arguments produce the same output. Where that does not hold (for example polling a process, or reading a file that changed in between), the cached result can be stale, so such tools would need to bypass deduplication.
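
One possible way to respect that assumption is an explicit allowlist, so only tools known to be safe are deduplicated. This is a sketch, not something the PR specifies: DEDUP_SAFE_TOOLS, should_dedup, and the choice of which tools count as safe are all hypothetical.

```python
from typing import FrozenSet

# Hypothetical allowlist: tools assumed safe to serve from cache within a turn.
DEDUP_SAFE_TOOLS: FrozenSet[str] = frozenset({"read_file", "search_files"})


def should_dedup(tool_name: str, safe_tools: FrozenSet[str] = DEDUP_SAFE_TOOLS) -> bool:
    """Return True only for tools explicitly marked as safe to deduplicate."""
    return tool_name in safe_tools
```

With an allowlist like this, a polling call such as process(action="poll") would always run, because the process tool is not listed.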

Recommendation

Implement the full deduplication approach as proposed, as it provides a more comprehensive solution to preventing redundant tool executions. The cost is low, and the benefit is high in preventing the exact loop that was encountered.

#api #ssr #optimization #chain error #conversation history
