SEA: Sparse Linear Attention with Estimated Attention Mask Paper • 2310.01777 • Published Oct 3, 2023 • 1
Scalable Set Encoding with Universal Mini-Batch Consistency and Unbiased Full Set Gradient Approximation Paper • 2208.12401 • Published Aug 26, 2022 • 1
Delta Attention: Fast and Accurate Sparse Attention Inference by Delta Correction Paper • 2505.11254 • Published May 16, 2025 • 49