Incremental Learning of Sparse Attention Patterns in Transformers

2025-10-22

PriGM@EurIPS 2025Workshop on Principles of Generative Modeling·Training Dynamics

Incremental Learning of Sparse Attention Patterns in Transformers

October 2025