Query-Key-Value

Large Language Models

The three learned projections in attention mechanisms used to compute attention weights and outputs.

This concept is essential for understanding large language models and forms a key part of modern AI systems.

Attention Mechanism
Self-Attention
Transformer

Related Terms

Attention Mechanism

A technique that allows neural networks to focus on relevant parts of the input when producing each output, assigning different weights to different input elements.

Self-Attention

A mechanism where each token attends to all other tokens in the sequence to understand contextual relationships.

Transformer

A neural network architecture introduced in 'Attention is All You Need' (2017) that relies entirely on self-attention mechanisms, becoming the foundation for modern LLMs.

← Back to All Terms

Query-Key-Value

Related Concepts

Tags

Related Terms

Attention Mechanism

Self-Attention

Transformer