ODOCK.AI

Latency

How slow it is, what shape the slowness has, and when the tail fires.

The Latency tab is the "why is it slow?" view.

Figure: the Latency tab, showing the trend, distribution, and heatmap charts.

Visualizations

| Chart | Answers |
| --- | --- |
| Latency Trend | Is it getting worse? Daily average latency with gateway overhead overlaid on a second axis. |
| Latency Distribution | Where is the mass? Bar buckets from <250ms to >10s. |
| Latency Heatmap | When does the tail fire? Day-by-hour grid split by latency band. |
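
To make the distribution chart concrete, here is a minimal sketch of how request latencies could be grouped into bar buckets. Only the outer buckets (<250ms and >10s) come from this page; the intermediate edges, the labels, and the function names are assumptions chosen for illustration, not the product's actual configuration.

```python
from bisect import bisect_right
from collections import Counter

# Hypothetical bucket edges in milliseconds. Only the first (<250ms) and
# last (>10s) buckets are documented; the edges in between are assumed.
EDGES_MS = [250, 500, 1000, 2500, 5000, 10000]
LABELS = ["<250ms", "250-500ms", "500ms-1s", "1-2.5s", "2.5-5s", "5-10s", ">10s"]


def bucket_label(latency_ms):
    """Return the bucket label for one latency value.

    A value that sits exactly on an edge falls into the higher bucket.
    """
    return LABELS[bisect_right(EDGES_MS, latency_ms)]


def latency_histogram(latencies_ms):
    """Count requests per bucket, keeping empty buckets so bars line up."""
    counts = Counter(bucket_label(ms) for ms in latencies_ms)
    return {label: counts.get(label, 0) for label in LABELS}
```

Keeping empty buckets in the result makes two histograms directly comparable bar by bar, which is what reading a shift in the distribution relies on.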

The two lines on Latency Trend are intentionally on separate axes. A jump in average latency while gateway overhead stays flat means the slowdown is upstream of the gateway. A jump in gateway overhead while average latency stays flat usually points at a plugin or safety module; see Plugins.
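
The reading rule above can be written down as a small decision function. This is a sketch under stated assumptions: the function name, the before/after inputs, and the 20% relative threshold are all hypothetical, and the dashboard itself exposes no such function.

```python
def classify_regression(avg_before_ms, avg_after_ms,
                        overhead_before_ms, overhead_after_ms,
                        threshold=0.2):
    """Hint at which side of the gateway slowed down.

    `threshold` is an assumed relative increase (20%) that counts as a jump.
    """
    def jumped(before, after):
        if before <= 0:
            return after > 0
        return (after - before) / before > threshold

    avg_jump = jumped(avg_before_ms, avg_after_ms)
    overhead_jump = jumped(overhead_before_ms, overhead_after_ms)

    if overhead_jump and not avg_jump:
        return "gateway (check plugins or safety modules)"
    if avg_jump and not overhead_jump:
        return "upstream"
    if avg_jump and overhead_jump:
        return "both"
    return "no clear regression"
```

For example, an average that moves from 800 ms to 1200 ms while overhead moves from 20 ms to 21 ms is classified as upstream.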

Workflow

Open this tab when the Latency KPI card in the header shows p95 climbing.

Read Latency Trend first — average vs gateway overhead tells you which side of the gateway is slowing down.

Read Latency Distribution — is the whole distribution shifting right, or is a new tail bucket growing?
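
One way to tell a right shift from a growing tail is to compare the median and the p95 across two time windows: if the median moves, the whole distribution is shifting; if only the p95 moves, a tail is forming. The sketch below assumes you have raw latency samples for a baseline and a current window; the function names and the 20% threshold are illustrative assumptions.

```python
import math


def percentile(values, pct):
    """Nearest-rank percentile of a non-empty list of numbers."""
    ordered = sorted(values)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def shift_or_tail(baseline_ms, current_ms, threshold=0.2):
    """Compare two windows of positive latencies (milliseconds).

    `threshold` is an assumed relative growth that counts as a change.
    """
    p50_growth = percentile(current_ms, 50) / percentile(baseline_ms, 50) - 1
    p95_growth = percentile(current_ms, 95) / percentile(baseline_ms, 95) - 1
    if p50_growth > threshold:
        return "whole distribution shifting right"
    if p95_growth > threshold:
        return "new tail growing"
    return "stable"
```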

Use Latency Heatmap to check whether the regression is constant or confined to particular hours.
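
The heatmap check can be approximated offline by counting slow requests per day-and-hour cell and then collapsing the grid to hours of the day. This is a sketch only: the input shape (pairs of timestamp and latency), the function names, and the 5-second cutoff for "slow" are assumptions, and the real heatmap splits by several latency bands rather than one threshold.

```python
from collections import Counter
from datetime import datetime


def slow_hour_grid(requests, slow_ms=5000):
    """Count slow requests per (date, hour) cell.

    `requests` is an iterable of (datetime, latency_ms) pairs.
    """
    grid = Counter()
    for ts, latency_ms in requests:
        if latency_ms >= slow_ms:
            grid[(ts.date(), ts.hour)] += 1
    return grid


def slow_share_by_hour(grid):
    """Collapse the grid to the share of slow requests per hour of day."""
    per_hour = Counter()
    for (_day, hour), count in grid.items():
        per_hour[hour] += count
    total = sum(per_hour.values())
    if not total:
        return {}
    return {hour: count / total for hour, count in sorted(per_hour.items())}
```

If one or two hours hold most of the share, the regression is hourly; a roughly even spread suggests it is constant.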

Switch to the Traces tab, sort by latency, and open the worst offender for a per-request inspection.

Tips

Filter the page to one Model before looking at the distribution. Mixing models with very different generation lengths makes the distribution chart hard to read.
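
To see why mixing models blurs the picture, it can help to group latencies by model before summarizing them. The sketch below assumes request records are available as (model, latency_ms) pairs; the record shape, the function name, and the model names in the usage line are hypothetical.

```python
from collections import defaultdict
from statistics import median


def median_latency_by_model(requests):
    """Group (model, latency_ms) pairs by model and return each median."""
    by_model = defaultdict(list)
    for model, latency_ms in requests:
        by_model[model].append(latency_ms)
    return {model: median(values) for model, values in by_model.items()}
```

With records such as `[("small", 200), ("small", 300), ("large", 4000), ("large", 6000)]`, the two medians are far apart, so a combined histogram would show two separate humps rather than one readable distribution.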
