Architecture
Decoder
Definition
“
A neural network component that generates output sequences from encoded representations. In transformer-based language models, the decoder processes tokens sequentially and predicts the next token using self-attention and cross-attention mechanisms. GPT models are decoder-only architectures, while models like T5 use encoder-decoder designs.
”
Related Terms
No related terms linked yet.
Explore all terms →