Transformer
[/trænsˈfɔːrmər/]
noun
AI & Technology
#ai #architecture #deep-learning #attention
Definitions
1
A neural network architecture introduced in 2017 ("Attention Is All You Need") that underlies virtually all modern language models. Transformers use self-attention mechanisms to process entire sequences in parallel, capturing long-range dependencies that earlier recurrent architectures struggled with.
“Every major LLM from GPT to Claude is built on the transformer architecture.”
by @mlresearcher, 1/1/1970
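
To make "self-attention" concrete, here is a minimal sketch of scaled dot-product self-attention in plain Python. It is an illustration under simplifying assumptions, not how production models are written: queries, keys, and values are all taken to be the input vectors themselves (real transformers apply learned projection matrices, multiple heads, and positional information), and the function names `softmax` and `self_attention` are hypothetical.

```python
import math

def softmax(xs):
    # Subtract the max for numerical stability before exponentiating.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(x):
    """Scaled dot-product self-attention over a list of token vectors.

    Assumption for illustration: Q = K = V = x (no learned projections).
    Returns the attended outputs and the attention weights per token.
    """
    d = len(x[0])
    outputs, all_weights = [], []
    for q in x:
        # Each token scores every token in the sequence, near or far.
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in x]
        weights = softmax(scores)
        all_weights.append(weights)
        # The output is the weighted average of all value vectors.
        outputs.append([sum(w * v[j] for w, v in zip(weights, x)) for j in range(d)])
    return outputs, all_weights

tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = self_attention(tokens)
```

Each token's scores depend only on the input vectors, not on another token's output, so every row can be computed independently. That independence is what allows the whole sequence to be processed in parallel. The first token also weights the last token exactly as heavily as itself, regardless of the distance between them, which is how attention reaches long-range dependencies directly.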