Large Language Models
BOS Token
Beginning Of Sequence token - marks the start of a sequence in language models.
This concept is essential for understanding large language models and forms a key part of modern AI systems.
Related Concepts
- Special Token
- EOS Token
- Tokenization
Tags
large-language-models special-token eos-token tokenization
Related Terms
EOS Token
End Of Sequence token - signals when the model has finished generating a complete output.
Special Token
Reserved tokens with special meanings like [CLS], [SEP], [MASK], [PAD] used in various model architectures.
Tokenization
The process of breaking text into smaller units (tokens) that language models can process, using algorithms like BPE or WordPiece.