Transformer

A transformer is a neural network architecture designed for processing sequential data, such as text. It relies on self-attention, in which every position in a sequence weighs its relevance to every other position, to capture long-range dependencies. It is the foundation of many large language models.
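As a rough illustration of the self-attention idea, the sketch below computes scaled dot-product attention over a short list of token vectors in plain Python. It makes a simplifying assumption: the queries, keys, and values are the input vectors themselves. Real transformers apply learned projection matrices and use multiple attention heads. The function names `softmax` and `self_attention` are chosen for this example only.

```python
import math


def softmax(xs):
    """Turn raw scores into positive weights that sum to 1."""
    m = max(xs)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Scaled dot-product self-attention over a list of equal-length vectors.

    Assumption for illustration: queries, keys, and values are the input
    vectors themselves (no learned projections, single head).
    """
    d = len(tokens[0])
    outputs = []
    for q in tokens:
        # Score this position against every position, including distant ones.
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
            for k in tokens
        ]
        weights = softmax(scores)
        # Each output is a weighted average of all value vectors.
        outputs.append(
            [sum(w * v[j] for w, v in zip(weights, tokens)) for j in range(d)]
        )
    return outputs


if __name__ == "__main__":
    sequence = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    for vector in self_attention(sequence):
        print(vector)
```

Because every position is scored against every other position in a single step, the distance between two tokens does not limit how strongly one can influence the other. This is the sense in which self-attention captures long-range dependencies.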
