Large Language Models
Encoder-Only Model
A transformer with only encoder layers and bidirectional attention, suited for understanding tasks (BERT family).
Because every token can attend to the entire input, encoder-only models excel at understanding tasks such as classification, named-entity tagging, and retrieval, rather than text generation.
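The bidirectional attention inside an encoder layer can be sketched in a few lines. This is a minimal single-head toy with random weights (seq_len, d_model, and the weight shapes are illustrative assumptions); real BERT layers add multi-head projections, residual connections, and layer normalization.

```python
import numpy as np

rng = np.random.default_rng(0)
seq_len, d_model = 4, 8                      # toy sizes: 4 tokens, 8-dim embeddings
x = rng.normal(size=(seq_len, d_model))      # stand-in for token embeddings

# Single-head query/key/value projections (random toy weights)
Wq, Wk, Wv = (rng.normal(size=(d_model, d_model)) for _ in range(3))
Q, K, V = x @ Wq, x @ Wk, x @ Wv

# Bidirectional attention: no causal mask, every token attends to all tokens
scores = Q @ K.T / np.sqrt(d_model)
weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
weights /= weights.sum(axis=-1, keepdims=True)  # row-wise softmax
out = weights @ V

print(weights.shape)  # (4, 4): a full attention matrix, no zeroed future positions
```

Note that every entry of `weights` is positive: unlike a decoder, nothing is masked out, which is exactly what "bidirectional" means here.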
Related Concepts
- BERT
- Bidirectional Attention
- Transformer
Tags
large-language-models bert bidirectional-attention transformer
Related Terms
BERT
Bidirectional Encoder Representations from Transformers - a model pretrained with masked language modeling, so each token's representation draws on context from both directions.
Bidirectional Attention
Allowing tokens to attend to both past and future context, used in encoder models like BERT.
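The contrast with causal (decoder-style) attention is easiest to see in the attention masks themselves. A small sketch for a 4-token sequence, where 1 means "this position may be attended to":

```python
import numpy as np

seq_len = 4

# Encoder (BERT-style): every token sees every position, past and future
bidirectional = np.ones((seq_len, seq_len), dtype=int)

# Decoder (GPT-style): lower-triangular mask hides future positions
causal = np.tril(np.ones((seq_len, seq_len), dtype=int))

print(bidirectional)
print(causal)
```

Row 1 (the second token) illustrates the difference: the bidirectional mask row is all ones, while the causal row is [1, 1, 0, 0], hiding the two future tokens.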
Transformer
A neural network architecture introduced in "Attention Is All You Need" (Vaswani et al., 2017) that replaces recurrence entirely with self-attention, becoming the foundation for modern LLMs.
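The self-attention mechanism at the heart of the architecture is the scaled dot-product form, where Q, K, V are the query, key, and value matrices and d_k is the key dimension:

```latex
\mathrm{Attention}(Q, K, V) = \mathrm{softmax}\!\left(\frac{QK^{\top}}{\sqrt{d_k}}\right)V
```

The division by the square root of d_k keeps the dot products in a range where the softmax still has useful gradients as dimensions grow.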