What Algorithms Can Transformers Learn? A Study in Length Generalization

Publication
International Conference on Learning Representations (ICLR), 2024