Don't be Dense, SLiCE the Cost! Structured Linear Controlled Differential Equations (SLiCEs) are a new class of sequence models that combine the maximal expressivity (i.e., universality) of dense, ...
Based is an efficient architecture inspired by recovering attention-like capabilities (i.e., recall). We do so by combining two simple ideas: short sliding window attention (e.g., window size 64), to ...