Observability: Logs, Traces, And Soul Timelines
A long-running agent is a black box unless you instrument it. Logs tell you what; traces tell you why; the soul timeline tells you whether the runtime is healthy at all.
Lesson map
The main moves, in order:
1. Why agents need their own observability
2. Structured logs
3. A trace per heartbeat
4. The soul timeline
Why agents need their own observability
A web service that's slow is obvious — pages don't load. A soul that's quietly drifting — choosing the wrong skill, looping on the same heartbeat, burning model budget while you sleep — is invisible until you check. OpenClaw is opinionated here: every heartbeat emits a structured log, every skill call emits a trace span, and every soul has a timeline view in Mission Control. Use them or you're flying blind.
Three layers, three questions
Compare the options
| Layer | Question it answers | Where it lives |
|---|---|---|
| Logs | What happened in this heartbeat? | stdout / file / log drain (Loki, Datadog) |
| Traces | How long did each step take, and which step was the bottleneck? | OTLP endpoint (Jaeger, Honeycomb, Vercel Observability) |
| Soul timeline | Is this soul still healthy as a long-running thing? | Mission Control UI / Grafana dashboard |
| Audit log | Did the soul actually do what we authorized? | Append-only file in /var/openclaw/audit (lesson 1) |
What to surface in logs
OpenClaw's structured logs include heartbeat ID, soul slug, model used, token counts, skill calls, duration, and outcome. JSON-shaped, one line per event. The default level is info — keep it there. Cranking down to debug buries the useful patterns in noise; raising the floor to warn hides exactly the boring success events you need in order to spot the abnormal one.
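As a sketch of the shape these lines take, here is a stdlib-only emitter. The `log_heartbeat` helper is hypothetical, not OpenClaw's actual logger; it just shows the one-JSON-object-per-line convention the runtime follows.

```python
import json
import sys
from datetime import datetime, timezone

def log_heartbeat(record, stream=sys.stdout):
    """Emit one newline-delimited JSON log line per heartbeat event."""
    # Fill in the fields every event should carry if the caller omitted them.
    record.setdefault("ts", datetime.now(timezone.utc).isoformat())
    record.setdefault("level", "info")
    # Compact separators keep the line short; one line per event keeps it greppable.
    stream.write(json.dumps(record, separators=(",", ":")) + "\n")

log_heartbeat({
    "event": "heartbeat.complete",
    "soul": "inbox-triage",
    "interval_s": 900,
    "actual_duration_s": 12.4,
    "outcome": "success",
})
```

Because every event is a self-contained JSON object on one line, both `grep` and a Loki JSON pipeline can query it without any parsing ceremony.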
One line of OpenClaw heartbeat log. Grep-friendly, Loki-friendly, eyeball-friendly.

```json
{
  "ts": "2026-04-27T08:00:00.123Z",
  "level": "info",
  "event": "heartbeat.complete",
  "soul": "inbox-triage",
  "heartbeat_id": "hb_2k4n9",
  "interval_s": 900,
  "actual_duration_s": 12.4,
  "model": "qwen3.5:8b",
  "tokens_in": 4218,
  "tokens_out": 612,
  "skills_called": ["gmail.list", "gmail.label"],
  "approvals_pending": 0,
  "outcome": "success"
}
```

Traces: where the time actually went
A heartbeat looks like a single event in logs but is a tree of work — model call, skill call, sub-skill call, return. OTLP traces give you that tree. OpenClaw exports OpenTelemetry by default; point it at a collector (Jaeger locally, Honeycomb or Vercel Observability for hosted) and you get flame graphs of every heartbeat. The first time a soul feels 'slow,' a trace shows you it's the model — or it's that one skill that's quietly making three round-trips. Don't guess.
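To make the "tree of work" concrete, here is a stdlib-only sketch of walking a heartbeat's span tree down to its slowest leaf — the question a flame graph answers at a glance. The `Span` class and all names and timings are invented for illustration; this is not the OpenTelemetry API.

```python
from dataclasses import dataclass, field

@dataclass
class Span:
    """A minimal stand-in for a trace span: a named, timed unit of work."""
    name: str
    start_s: float
    end_s: float
    children: list = field(default_factory=list)

    @property
    def duration_s(self):
        return self.end_s - self.start_s

def bottleneck(span):
    """Follow the slowest child at each level down to the hot leaf."""
    current = span
    while current.children:
        current = max(current.children, key=lambda s: s.duration_s)
    return current

# One heartbeat: a model call, then a skill that quietly makes three round-trips.
heartbeat = Span("heartbeat", 0.0, 12.4, children=[
    Span("model.call", 0.1, 3.2),
    Span("skill.gmail.list", 3.3, 12.1, children=[
        Span("http.round_trip", 3.3, 6.0),
        Span("http.round_trip", 6.1, 9.2),
        Span("http.round_trip", 9.3, 12.0),
    ]),
])
```

Here `bottleneck(heartbeat)` lands on one of the skill's HTTP round-trips, not the model call — exactly the kind of answer you want before you start tuning the wrong layer.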
The soul timeline
Mission Control's soul-timeline view is the long-running version of the trace. It plots heartbeats over hours and days — interval, duration, outcome, token spend. Patterns you can only see here: a soul whose duration is creeping up day over day (memory bloat), a soul whose token-per-tick has 10x'd since you swapped models, a soul whose interval drifts because heartbeats run longer than the gap between them.
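Duration creep is easy to compute from the same log fields the timeline plots. A sketch, with a hypothetical helper: the 7-day window and 1.5x factor are arbitrary thresholds, not anything OpenClaw ships.

```python
from statistics import median

def duration_creep(daily_durations_s, window=7, factor=1.5):
    """Flag a soul whose recent heartbeat duration has drifted well above
    its own historical baseline (the memory-bloat signature)."""
    if len(daily_durations_s) < 2 * window:
        return False  # not enough history to call it a trend
    baseline = median(daily_durations_s[:-window])  # everything before the window
    recent = median(daily_durations_s[-window:])    # the last `window` days
    return recent > factor * baseline
```

Medians rather than means keep one slow outlier day from tripping the check; you want the trend, not the spike.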
Sketch your dashboard before you build it
1. Top row: number of healthy souls, number with pending approvals, number with errors in the last hour. Big numbers, no charts.
2. Per-soul row: last heartbeat timestamp, last duration, last token cost, status dot (green / yellow / red).
3. Trend chart: tokens-per-day for each soul, last 7 days. Spot a soul whose model swap doubled its cost overnight.
4. Heartbeat-anomaly chart: actual_duration vs. interval, log scale. Anything trending toward 1.0 is a soul that's about to overlap itself.
5. Audit feed: scrolling list of the last 50 skill calls. The sanity check — does what's happening match what you authorized?
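The status dot in the per-soul row collapses a few log fields into one color. A sketch; the thresholds are illustrative assumptions (red at a missed 2x-interval window, yellow when merely late), not a fixed OpenClaw rule.

```python
def status_dot(seconds_since_last_tick, interval_s, last_outcome):
    """Collapse a soul's recent health into a green/yellow/red dashboard dot."""
    if last_outcome != "success" or seconds_since_last_tick > 2 * interval_s:
        return "red"      # errored, or missed a whole heartbeat window
    if seconds_since_last_tick > interval_s:
        return "yellow"   # late, but not yet declared missing
    return "green"
```

A dot is deliberately lossy: it exists so the top row can say "how many are not green" without anyone reading a chart.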
Alerting on heartbeat anomalies
The high-leverage alerts are not 'soul errored' — those are loud and self-announcing. The ones that catch real problems are the silent failures: a soul that hasn't ticked, a soul whose tick is taking longer than its interval, a soul whose token spend doubled overnight without a model change. Wire these as paging-grade alerts; everything else is dashboard-grade.
| Alert | Condition | Why it matters |
|---|---|---|
| Heartbeat missed | No heartbeat.complete event in 2x interval window | Soul is dead, hung, or the host is down — and you wouldn't notice otherwise |
| Tick > interval | actual_duration_s > interval_s for 3 consecutive heartbeats | Soul is overlapping itself; ticks are queuing; cost will run away |
| Token spend spike | Daily tokens_in for a soul > 2x rolling 7-day median | Model swap, prompt regression, infinite tool loop, or context bloat |
| Pending approvals piling up | approvals_pending > 5 for over an hour | Soul is stuck waiting for a human; needs attention or the gate needs tuning |
| Repeated skill error | Same skill returning error in 5 consecutive heartbeats | Skill is broken, credentials expired, or the upstream API changed |
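The two paging-grade conditions from the table can be evaluated straight off the structured log fields. A stdlib-only sketch; the function names are invented and the thresholds simply mirror the table, not any OpenClaw API.

```python
def heartbeat_missed(last_ts_s, now_s, interval_s):
    """Page: no heartbeat.complete event seen within a 2x-interval window."""
    return now_s - last_ts_s > 2 * interval_s

def tick_exceeds_interval(records, streak=3):
    """Page: actual_duration_s > interval_s for `streak` consecutive heartbeats.

    `records` is a list of parsed heartbeat.complete log lines, oldest first.
    """
    tail = records[-streak:]
    return len(tail) == streak and all(
        r["actual_duration_s"] > r["interval_s"] for r in tail
    )
```

Note both checks fire on silence or drift, not on errors — errors already announce themselves; these are for the failures that don't.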
Apply: instrument one soul this week
1. Pick the soul that runs most often.
2. Tail its logs for one full heartbeat — read every field. If anything's missing that you'd want at 3am, raise a feature request or add a log line.
3. Wire OTLP to a free Honeycomb, Jaeger, or SigNoz collector. Look at one trace.
4. Sketch your one-screen dashboard on paper before you open Grafana.
5. Set the 'heartbeat missed' and 'tick > interval' alerts. Skip the rest until you've used the dashboard for a week.
The big idea: a long-running agent without observability is a long-running mystery. Wire logs, traces, and the soul timeline before you trust a soul with anything that matters.
Related lessons
- Codex With Custom Tools And MCP — Codex's real power shows when you connect it to your own tools — internal APIs, datastores, ticketing systems — usually via Model Context Protocol.
- Debugging A Heartbeat Loop: Observability, Replay, And Failure Modes — Heartbeats fail in ways reactive agents never do — silent drift, soul-state thrash, infinite loops. Debugging them takes different tools and a different mental model.
- Beyond The Basics: Federation, Custom Runtimes, Contributing Back — Once you trust the runtime, the next moves are scaling out (multiple machines), swapping the brain (different LLM provider), and giving back (clean upstream contributions). Each step compounds the value of the rest.
