跳转到内容

#speed

1 approved public terms with this tag.

机器辅助翻译草稿 (Chinese) for "Latency": The time delay between initiating an action and receiving the first response. In networking, latency is the round-trip time for a data packet; in AI, it often refers to time-to-first-token or end-to-end inference time. Lower latency means faster, more responsive user experiences.

示例草稿: The new model has lower latency but slightly less accuracy — a classic speed/quality trade-off.