Architecture
Easy
Question 1 of 10

large language models

In the Transformer architecture, which mechanism allows tokens to weigh other tokens' influence when producing contextual representations?