Popular models such as the Transformer add the positional encoding to the existing feature dimensions of the embedding. Why is this preferred over concatenating extra features along the tensor's feature dimension to hold the positional information?
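To make the comparison concrete, here is a minimal sketch (NumPy, with hypothetical sizes `seq_len`, `d_model`, `d_pos`) of the two options being contrasted: summing a sinusoidal positional encoding into the existing embedding dimensions versus concatenating it as extra feature dimensions.

```python
import numpy as np

def sinusoidal_positional_encoding(seq_len: int, d_model: int) -> np.ndarray:
    """Standard sinusoidal encoding of shape (seq_len, d_model)."""
    positions = np.arange(seq_len)[:, None]               # (seq_len, 1)
    dims = np.arange(d_model)[None, :]                    # (1, d_model)
    angle_rates = 1.0 / np.power(10000.0, (2 * (dims // 2)) / d_model)
    angles = positions * angle_rates
    pe = np.zeros((seq_len, d_model))
    pe[:, 0::2] = np.sin(angles[:, 0::2])                 # even indices: sin
    pe[:, 1::2] = np.cos(angles[:, 1::2])                 # odd indices: cos
    return pe

seq_len, d_model, d_pos = 10, 512, 64                     # hypothetical sizes
token_embeddings = np.random.randn(seq_len, d_model)

# Option 1 (the Transformer approach): sum into existing dimensions,
# shape stays (seq_len, d_model)
summed = token_embeddings + sinusoidal_positional_encoding(seq_len, d_model)

# Option 2 (what the question proposes): concatenate extra positional features,
# shape grows to (seq_len, d_model + d_pos), so downstream weight matrices grow too
concatenated = np.concatenate(
    [token_embeddings, sinusoidal_positional_encoding(seq_len, d_pos)], axis=-1
)

print(summed.shape)        # (10, 512)
print(concatenated.shape)  # (10, 576)
```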
Does this answer your question? In a Transformer model, why does one sum positional encoding to the embedding rather than concatenate it? – noe Mar 12 '24 at 22:15