Large Language Models

Decoder-Only Model

A transformer architecture built only from decoder layers. Causal masking lets each token attend only to itself and earlier tokens, which makes the model suitable for autoregressive generation, as in the GPT family.

Many modern large language models use this design: the model predicts the next token from the tokens before it, and text is generated by appending each prediction to the input and repeating.
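The causal masking mentioned in the definition can be sketched in a few lines of plain Python. This is a minimal illustration, not how any particular library implements it: the function names causal_mask and masked_softmax are hypothetical, and the attention scores are set to zero purely for demonstration.

```python
import math

def causal_mask(n):
    """Return an n x n matrix; entry [i][j] is True if position i may attend to position j."""
    return [[j <= i for j in range(n)] for i in range(n)]

def masked_softmax(scores, mask):
    """Row-wise softmax that gives zero weight to masked-out (future) positions."""
    weights = []
    for row, allowed in zip(scores, mask):
        # Future positions are set to -inf so that exp() maps them to 0.
        masked = [s if ok else -math.inf for s, ok in zip(row, allowed)]
        peak = max(masked)  # subtract the row maximum for numerical stability
        exps = [math.exp(s - peak) for s in masked]
        total = sum(exps)
        weights.append([e / total for e in exps])
    return weights

# Illustrative input: four tokens with identical (zero) attention scores.
scores = [[0.0] * 4 for _ in range(4)]
weights = masked_softmax(scores, causal_mask(4))

# Token 0 can only attend to itself; token 1 splits its weight over tokens 0 and 1.
print(weights[0])  # [1.0, 0.0, 0.0, 0.0]
print(weights[1])  # [0.5, 0.5, 0.0, 0.0]
```

Because every row places zero weight on later positions, the output at position i depends only on tokens 0 through i, which is what allows the model to be trained on next-token prediction and then generate one token at a time.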

  • GPT
  • Autoregressive
  • Transformer

Tags

large-language-models gpt autoregressive transformer

Added: November 18, 2025