跳转到内容

Tokenization

[/ˌtoʊkənɪˈzeɪʃən/]

nounAI & Technology#ai#nlp#tokens#preprocessing
0 views1 definitions

Definitions

Machine-assisted language draft. Human review still needed.
1
0

机器辅助翻译草稿 (Chinese) for "Tokenization": The process of converting raw text into discrete units called tokens that a language model can process. Tokens are typically subword units — common words become single tokens while rare words split into multiple tokens. All LLM pricing and context limits are measured in tokens, not characters or words.

示例草稿: The word "unbelievable" tokenized into three pieces: "un", "believ", "able".
by @dictionary_auto_translate1970/1/1

Related Terms

Related terms are generated only from public tags, classes, translations, and explicit relationships. No unavailable semantic relationships are fabricated.