Position Information Emerges in Causal Transformers Without Positional Encodings via Similarity of Nearby Embeddings
- Chunsheng Zuo
- , Pavel Guerzhoy
- , Michael Guerzhoy
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
2
Link opens in a new tab
Scopus
citations