Launch Week is here! We just announced Nexus - Follow along

Abstract

We consider the problem of estimating the Attention mechanism in small space, and prove the existence of coresets for it of nearly optimal size. Specifically, we show that for any set of unit-norm keys and values in , there exists a subset of size at most such that

simultaneously for all queries whose norm is bounded by . This outperforms the best known results for this problem. We also offer an improved lower bound showing that -coresets must have size .

Share:

Start building knowledgeable AI today

Create your first index for free, then pay as you go when you're ready to scale.