Live end-to-end performance of the Lucy 2.1 realtime endpoint, measured from a fleet of real browser clients across US Google Cloud regions. Each chart shows one metric over time, per region โ p50 (solid) and p95 (dashed). Click a legend entry to isolate it; double-click to hide it.
This dashboard reports the real user experience of the Lucy 2.1 realtime video model โ measured end-to-end from a browser, not from server-side timers. Everything you see is what an actual client integration would observe.
A fleet of probes runs continuously across several US Google Cloud
regions. Each probe is a headless Chromium browser (driven by Playwright) running the
production @decartai/sdk โ the same SDK customers use. It connects to
lucy-2.1 at wss://api.decart.ai over WebRTC, streams a looped
synthetic camera feed, and decodes the model's returned video.
The probes execute as Google Cloud Run Jobs โ one job per region, each sized at
cpu=8 / 16 GiB. A central scheduler fires one region per tick
on a rotation, so the load placed on production Lucy stays flat no matter how many regions are
monitored. Every run opens several short back-to-back realtime sessions and aggregates them.
Results are bucketed over time and plotted per region (p50 = typical, p95 = tail).
connect() until the realtime session is ready โ WebRTC signaling, SDP negotiation, and ICE connectivity.getStats().getStats()) โ higher jitter means less even delivery.