Observability

BitVault adopts OpenTelemetry (ADR-0013) from the first commit. All three signal types — traces, metrics, and structured logs — are instrumented in every component using a vendor-neutral SDK. Backends are a deployment choice, not a code change.

:::note Observability is not optional. The async/eventual-consistency design makes distributed tracing the only reliable way to answer “where did this file indexing go?” When a NodeChanged event travels from the commit transaction through NATS JetStream to the Search Indexer, no single log line tells the whole story — only a connected trace does. :::

Contents

Page What it covers
OpenTelemetry Setup SDK setup, context propagation, collector, health endpoints
Metrics RED/USE/domain gauges, dashboards, alert thresholds
Distributed Tracing Trace anatomy, key flows, sampling, debugging async events
SLOs & Error Budgets SLO targets, error budgets, burn-rate alerts, canary gating

Design principles