
Transformer

[/trænsˈfɔːrmər/]

noun · AI & Technology · #ai #architecture #deep-learning #attention

Definitions

1

A neural network architecture introduced in the 2017 paper "Attention Is All You Need" that underlies virtually all modern language models. Transformers use self-attention to relate every position in a sequence to every other position, which lets them process entire sequences in parallel and capture long-range dependencies that earlier recurrent architectures struggled with.

Every major LLM from GPT to Claude is built on the transformer architecture.
by @mlresearcher · 1/1/1970
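
To make the self-attention idea in the definition concrete, here is a toy sketch in plain Python. It is not how any production model is implemented: it assumes a single attention head, uses the input vectors directly as queries, keys, and values (real transformers apply learned projection matrices first), and omits positional encoding and masking. The names softmax, self_attention, and tokens are illustrative only.

```python
import math


def softmax(xs):
    """Turn raw scores into weights that are positive and sum to 1."""
    m = max(xs)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(x):
    """Single-head scaled dot-product self-attention.

    x is a list of token vectors (lists of floats, all the same length).
    Simplifying assumption: queries, keys, and values are the inputs
    themselves, with no learned projections.
    """
    d = len(x[0])
    outputs, weights = [], []
    for q in x:
        # Score this position against every position in the sequence.
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in x
        ]
        w = softmax(scores)
        weights.append(w)
        # The output is a weighted average of all value vectors.
        outputs.append(
            [sum(wj * v[i] for wj, v in zip(w, x)) for i in range(d)]
        )
    return outputs, weights


# Three hypothetical 2-dimensional token vectors.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, attn = self_attention(tokens)
```

Each pass through the outer loop depends only on the original inputs, not on the result for any other position. That independence is what allows real implementations to compute all positions at once as matrix operations, and the direct position-to-position scores are how distant tokens can influence each other in a single step.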
