openclaw - 💡(How to fix) Fix Feature Request: Add child spans for detailed OTEL traces [1 participants]

shaohq · 2026-04-08T05:55:05Z

[openclaw] Feature Request: Add child spans for detailed OTEL traces Problem Currently, OpenClaw's OTEL instrumentation only provides coarse-grained spans. For… ## Feature Request: Add child spans for detailed OTEL traces ### Problem Currently, OpenClaw's OTEL instrumentation only provides coarse-grained spans. For example, `openclaw.message.processed` is a single span that encompasses the entire message processing time (~228 seconds), but there is no breakdown of where that time is spent. ### Current Spans Observed - `openclaw.message.processed` - entire message processing (no child spans) - `openclaw.model.usage` - model API call with attributes but no sub-steps - `openclaw.session.stuck` - session stuck detection ### Desired Behavior Add child spans under `openclaw.message.processed` to break down: 1. **Tool calls** - time spent in each tool execution 2. **Model API latency** - time for API request/response round-trip 3. **Tokenization** - time spent calculating/counting tokens 4. **Response building** - time spent constructing the final response 5. **Other sub-operations** - any significant internal steps ### Example Use Case When debugging slow responses, developers need to understand where time is spent: - Is it waiting on an LLM API? - Is it running tool executions? - Is it processing tokens? ### Proposed Implementation Wrap significant internal operations with child spans: ```typescript const parentSpan = tracer.startSpan("openclaw.message.processed"); // ... const toolSpan = tracer.startSpan("openclaw.tool.execution", { parent: parentSpan }); // tool work toolSpan.end(); // ... parentSpan.end(); ``` ### Environment - OpenClaw Version: 2026.4.5 - OTLP Backend: Jaeger - Protocol: http/protobuf

openclaw2026-04-08 05:55:05

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

openclaw/openclaw#62964•Fetched 2026-04-09 08:00:04

View on GitHub

Comments

Participants

Timeline

Reactions

Author

shaohq

Participants

shaohq

Code Example

const parentSpan = tracer.startSpan("openclaw.message.processed");
// ...
const toolSpan = tracer.startSpan("openclaw.tool.execution", { parent: parentSpan });
// tool work
toolSpan.end();
// ...
parentSpan.end();

RAW_BUFFERClick to expand / collapse

Feature Request: Add child spans for detailed OTEL traces

Problem

Currently, OpenClaw's OTEL instrumentation only provides coarse-grained spans. For example, openclaw.message.processed is a single span that encompasses the entire message processing time (~228 seconds), but there is no breakdown of where that time is spent.

Current Spans Observed

openclaw.message.processed - entire message processing (no child spans)
openclaw.model.usage - model API call with attributes but no sub-steps
openclaw.session.stuck - session stuck detection

Desired Behavior

Add child spans under openclaw.message.processed to break down:

Tool calls - time spent in each tool execution
Model API latency - time for API request/response round-trip
Tokenization - time spent calculating/counting tokens
Response building - time spent constructing the final response
Other sub-operations - any significant internal steps

Example Use Case

When debugging slow responses, developers need to understand where time is spent:

Is it waiting on an LLM API?
Is it running tool executions?
Is it processing tokens?

Proposed Implementation

Wrap significant internal operations with child spans:

const parentSpan = tracer.startSpan("openclaw.message.processed");
// ...
const toolSpan = tracer.startSpan("openclaw.tool.execution", { parent: parentSpan });
// tool work
toolSpan.end();
// ...
parentSpan.end();

Environment

OpenClaw Version: 2026.4.5
OTLP Backend: Jaeger
Protocol: http/protobuf

extent analysis

TL;DR

Implementing child spans under the openclaw.message.processed span can provide a detailed breakdown of where time is spent during message processing.

Guidance

Identify significant internal operations such as tool calls, model API latency, tokenization, response building, and other sub-operations that can be wrapped with child spans.
Use the proposed implementation approach of starting a child span with a specific name (e.g., openclaw.tool.execution) and setting the parent span to openclaw.message.processed.
Ensure that each child span is properly ended after its corresponding operation is completed to accurately measure the time spent.
Verify the effectiveness of the child spans by checking the OTLP backend (Jaeger) for the presence and accuracy of the detailed traces.

Example

const parentSpan = tracer.startSpan("openclaw.message.processed");
const toolSpan = tracer.startSpan("openclaw.tool.execution", { parent: parentSpan });
// tool execution code here
toolSpan.end();
const modelApiSpan = tracer.startSpan("openclaw.model.api.latency", { parent: parentSpan });
// model API call code here
modelApiSpan.end();
parentSpan.end();

Notes

The implementation of child spans should be tailored to the specific requirements of the OpenClaw application and may need to be adjusted based on the actual performance characteristics and debugging needs.

Recommendation

Apply the proposed workaround of implementing child spans to gain more detailed insights into the message processing time. This approach allows for a more fine-grained understanding of where time is spent without requiring significant changes to the existing instrumentation.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#api #docker error #permission error #memory optimization #batch processing

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

openclaw - 💡(How to fix) Fix Feature Request: Add child spans for detailed OTEL traces [1 participants]

Recommended Tools

GitHub issue graph ai analysis

Code Example

Feature Request: Add child spans for detailed OTEL traces

Problem

Current Spans Observed

Desired Behavior

Example Use Case

Proposed Implementation

Environment

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

TRENDING

openclaw - 💡(How to fix) Fix Feature Request: Add child spans for detailed OTEL traces [1 participants]

Recommended Tools

GitHub issue graph ai analysis

Code Example

Feature Request: Add child spans for detailed OTEL traces

Problem

Current Spans Observed

Desired Behavior

Example Use Case

Proposed Implementation

Environment

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

RELATED_DISCOVERY

TRENDING