Test Your Knowledge - Large Language Models

Architecture

Easy

Question 1 of 10

In the Transformer architecture, which mechanism lets each token weigh the influence of the other tokens in the sequence when computing its contextual representation?

Convolutional filters

Self-attention

Max pooling

Recurrent gating