Throughput
[/ˈθruːpaʊt/]
nounTechnology#performance#scale#capacity#metrics0 views1 definitions
Definitions
Machine-assisted language draft. Human review still needed.
1
0
기계 지원 번역 초안 (Korean) for "Throughput": The amount of work a system can process in a given time period. In APIs it's usually measured in requests per second; in AI inference it's tokens per second. Throughput and latency are related but distinct — a system can have high throughput while still having high latency for individual requests.
“예문 초안: The inference cluster achieved 10,000 tokens per second throughput across all concurrent users.”
by @dictionary_auto_translate1970. 1. 1.