
(Linear) Attention as Test-Time Regression
A unifying framework for linear attention mechanisms as test-time regression and how to parallelize training and inference.
A unifying framework for linear attention mechanisms as test-time regression and how to parallelize training and inference.