hermes - 💡(How to fix) Fix feat: limit CPU usage of Hermes-spawned scripts to prevent server hangs

StepCodex · 2026-05-08T17:43:41Z

[hermes] Problem Scripts and background tasks spawned by Hermes e.g. eval benchmarks, long-running terminal calls, execute code blocks can consume 100% CPU and… ## Problem Scripts and background tasks spawned by Hermes (e.g. eval benchmarks, long-running `terminal()` calls, `execute_code` blocks) can consume 100% CPU and hang the entire server — making the Hermes WebUI unresponsive and blocking all other Hermes operations. **Observed behaviour:** A benchmark script looping on HTTP requests pegged the CPU, caused the server to become unresponsive, and ultimately required a full reboot. ## Proposed Solutions ### 1. Auto-`nice` all spawned subprocesses (quick win) Wrap all `terminal()` and `execute_code` subprocess invocations with `nice -n 19` so the OS preempts them freely when Hermes or the WebUI needs CPU. ```python # In the subprocess launch code import os os.nice(19) ``` Or prefix shell commands automatically: ```python cmd = f"nice -n 19 {cmd}" ``` ### 2. `systemd` CPU quota on the Hermes process group Add a `CPUQuota=50%` constraint to the `hermes-gateway.service` unit so the entire Hermes process tree — including all spawned children — is capped at 50% CPU by the kernel. ```ini [Service] CPUQuota=50% ``` ### 3. `resource.setrlimit` in `execute_code` Before running user-submitted Python, set a CPU time limit: ```python import resource # Limit CPU time to 60s per execute_code call resource.setrlimit(resource.RLIMIT_CPU, (60, 60)) ``` ### 4. Configurable `max_cpu_percent` in `config.yaml` Allow operators to configure CPU limits per deployment: ```yaml execution: max_cpu_percent: 50 # applies to all spawned processes nice_level: 19 # default niceness for background tasks cpu_time_limit: 120 # seconds, for execute_code ``` ### 5. Separate cgroup for background tasks Run all Hermes-initiated background processes in a dedicated cgroup with CPU limits, isolating them from the gateway and WebUI processes. ## Priority - **Short-term:** Apply `nice -n 19` to all subprocess spawns — one-line fix, immediate protection - **Medium-term:** `systemd` `CPUQuota` on the service unit — kernel-enforced, no code changes needed - **Long-term:** Configurable limits via `config.yaml` for self-hosted deployments ## Environment - Ubuntu 22.04, systemd - Hermes WebUI on port 8787 (Tailscale) - 2-core VM — any runaway process immediately affects the UI

hermes2026-05-08 17:43:41

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

Code Example

# In the subprocess launch code
import os
os.nice(19)

---

cmd = f"nice -n 19 {cmd}"

---

[Service]
CPUQuota=50%

---

import resource
# Limit CPU time to 60s per execute_code call
resource.setrlimit(resource.RLIMIT_CPU, (60, 60))

---

execution:
  max_cpu_percent: 50      # applies to all spawned processes
  nice_level: 19           # default niceness for background tasks
  cpu_time_limit: 120      # seconds, for execute_code

RAW_BUFFERClick to expand / collapse

Problem

Scripts and background tasks spawned by Hermes (e.g. eval benchmarks, long-running terminal() calls, execute_code blocks) can consume 100% CPU and hang the entire server — making the Hermes WebUI unresponsive and blocking all other Hermes operations.

Observed behaviour: A benchmark script looping on HTTP requests pegged the CPU, caused the server to become unresponsive, and ultimately required a full reboot.

Proposed Solutions

1. Auto-`nice` all spawned subprocesses (quick win)

Wrap all terminal() and execute_code subprocess invocations with nice -n 19 so the OS preempts them freely when Hermes or the WebUI needs CPU.

# In the subprocess launch code
import os
os.nice(19)

Or prefix shell commands automatically:

cmd = f"nice -n 19 {cmd}"

2. `systemd` CPU quota on the Hermes process group

Add a CPUQuota=50% constraint to the hermes-gateway.service unit so the entire Hermes process tree — including all spawned children — is capped at 50% CPU by the kernel.

[Service]
CPUQuota=50%

3. `resource.setrlimit` in `execute_code`

Before running user-submitted Python, set a CPU time limit:

import resource
# Limit CPU time to 60s per execute_code call
resource.setrlimit(resource.RLIMIT_CPU, (60, 60))

4. Configurable `max_cpu_percent` in `config.yaml`

Allow operators to configure CPU limits per deployment:

execution:
  max_cpu_percent: 50      # applies to all spawned processes
  nice_level: 19           # default niceness for background tasks
  cpu_time_limit: 120      # seconds, for execute_code

5. Separate cgroup for background tasks

Run all Hermes-initiated background processes in a dedicated cgroup with CPU limits, isolating them from the gateway and WebUI processes.

Priority

Short-term: Apply nice -n 19 to all subprocess spawns — one-line fix, immediate protection
Medium-term: systemd CPUQuota on the service unit — kernel-enforced, no code changes needed
Long-term: Configurable limits via config.yaml for self-hosted deployments

Environment

Ubuntu 22.04, systemd
Hermes WebUI on port 8787 (Tailscale)
2-core VM — any runaway process immediately affects the UI

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#chain error #conversation history #tool integration #LLM response #prompt template

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

hermes - 💡(How to fix) Fix feat: limit CPU usage of Hermes-spawned scripts to prevent server hangs

Recommended Tools

GitHub issue graph ai analysis

Code Example

Problem

Proposed Solutions

1. Auto-`nice` all spawned subprocesses (quick win)

2. `systemd` CPU quota on the Hermes process group

3. `resource.setrlimit` in `execute_code`

4. Configurable `max_cpu_percent` in `config.yaml`

5. Separate cgroup for background tasks

Priority

Environment

Still need to ship something?

TRENDING

hermes - 💡(How to fix) Fix feat: limit CPU usage of Hermes-spawned scripts to prevent server hangs

Recommended Tools

GitHub issue graph ai analysis

Code Example

Problem

Proposed Solutions

1. Auto-nice all spawned subprocesses (quick win)

2. systemd CPU quota on the Hermes process group

3. resource.setrlimit in execute_code

4. Configurable max_cpu_percent in config.yaml

5. Separate cgroup for background tasks

Priority

Environment

Still need to ship something?

RELATED_DISCOVERY

TRENDING

1. Auto-`nice` all spawned subprocesses (quick win)

2. `systemd` CPU quota on the Hermes process group

3. `resource.setrlimit` in `execute_code`

4. Configurable `max_cpu_percent` in `config.yaml`