Skip to content

Latency

[/ˈleɪtənsi/]

nounTechnology#performance#networking#speed#infrastructure
0 views1 definitions

Definitions

1
+870

The time delay between initiating an action and receiving the first response. In networking, latency is the round-trip time for a data packet; in AI, it often refers to time-to-first-token or end-to-end inference time. Lower latency means faster, more responsive user experiences.

The new model has lower latency but slightly less accuracy — a classic speed/quality trade-off.
by @devbuilder1/1/1970

Related Terms

Related terms are generated only from public tags, classes, translations, and explicit relationships. No unavailable semantic relationships are fabricated.