linear-attention topic
RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), combining the best of RNNs and transformers: great performance, fast inference, and more.
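To make the "RNN that trains like a GPT" claim concrete, here is a minimal, hedged sketch of a WKV-style recurrence in the spirit of RWKV's time mixing (not RWKV's actual operator: the real model uses per-channel decays, a bonus term for the current token, and token shifting). The same quantity can be computed step by step at inference (RNN mode) or unrolled over the whole sequence at training time (parallel mode).

```python
import numpy as np

def wkv_recurrent(w, k, v):
    """Simplified WKV-style recurrence (illustrative, not RWKV's exact code).

    w: scalar time decay > 0; k: keys of shape (T,); v: values of shape (T, d).
    Each output is a decayed, exp(k)-weighted average of past values:
        num_t = e^{-w} * num_{t-1} + e^{k_t} * v_t
        den_t = e^{-w} * den_{t-1} + e^{k_t}
        y_t   = num_t / den_t
    """
    T, d = v.shape
    num, den = np.zeros(d), 0.0
    ys = np.empty((T, d))
    for t in range(T):
        num = np.exp(-w) * num + np.exp(k[t]) * v[t]
        den = np.exp(-w) * den + np.exp(k[t])
        ys[t] = num / den  # constant state size, so O(1) memory per step
    return ys

rng = np.random.default_rng(0)
y = wkv_recurrent(w=0.5, k=rng.normal(size=8), v=rng.normal(size=(8, 4)))
print(y.shape)  # (8, 4)
```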
Multi-Attention-Network
Multi-Attention Network (MANet) for semantic segmentation of remote sensing images
MAResU-Net
Multi-stage Attention ResU-Net (MAResU-Net) for semantic segmentation of remote sensing images
autoregressive-linear-attention-cuda
CUDA implementation of autoregressive linear attention, with all the latest research findings
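The repo itself is a CUDA kernel; as a language-neutral illustration, below is a hedged NumPy sketch of the standard autoregressive (causal) linear-attention recurrence such kernels implement, in the style of Katharopoulos et al. (2020): a running outer-product state replaces the O(T²) attention matrix, so each token costs O(d_k · d_v). The feature map here is an illustrative shifted ReLU; elu(x)+1 is the common choice.

```python
import numpy as np

def causal_linear_attention(q, k, v, feature=lambda x: np.maximum(x, 0.0) + 1e-6):
    """Autoregressive linear attention via a running state (illustrative sketch).

    q, k: (T, d_k); v: (T, d_v). `feature` is any positive feature map phi.
        S_t = S_{t-1} + phi(k_t) v_t^T      # (d_k, d_v) state
        z_t = z_{t-1} + phi(k_t)            # (d_k,) normalizer
        y_t = (phi(q_t) @ S_t) / (phi(q_t) . z_t)
    """
    T, d_v = v.shape
    d_k = q.shape[1]
    S = np.zeros((d_k, d_v))
    z = np.zeros(d_k)
    ys = np.empty((T, d_v))
    for t in range(T):
        fk, fq = feature(k[t]), feature(q[t])
        S += np.outer(fk, v[t])   # accumulate key-value outer products
        z += fk                   # accumulate keys for normalization
        ys[t] = (fq @ S) / (fq @ z)
    return ys

rng = np.random.default_rng(0)
y = causal_linear_attention(*rng.normal(size=(3, 10, 4)))
print(y.shape)  # (10, 4)
```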
taylor-series-linear-attention
Explorations into the recently proposed Taylor Series Linear Attention
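A hedged sketch of the idea the name suggests: use the second-order Taylor expansion of exp(q·k) as a feature map, phi(x) = [1, x, vec(x xᵀ)/√2], so that phi(q)·phi(k) = 1 + q·k + (q·k)²/2 approximates the softmax kernel while staying linear in sequence length. Function names below are illustrative, not the repo's API.

```python
import numpy as np

def taylor_feature(x):
    """Second-order Taylor feature map: phi(x) = [1, x, vec(x x^T)/sqrt(2)].

    Then phi(q) . phi(k) = 1 + q.k + (q.k)^2 / 2, the 2nd-order Taylor
    approximation of exp(q.k), usable inside linear attention.
    """
    outer = np.outer(x, x).ravel() / np.sqrt(2.0)
    return np.concatenate(([1.0], x, outer))

rng = np.random.default_rng(0)
q, k = rng.normal(size=4), rng.normal(size=4)
qk = q @ k
approx = taylor_feature(q) @ taylor_feature(k)
print(np.isclose(approx, 1 + qk + qk**2 / 2))  # True: identity holds exactly
print(np.exp(qk), approx)  # closeness to exp(q.k) degrades as |q.k| grows
```

The feature dimension grows quadratically in d, so this trades head dimension for sequence-length scaling.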
agent-attention-pytorch
Implementation of Agent Attention in Pytorch
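A hedged sketch of the agent-attention idea (Han et al., 2023) as described in the paper's abstract: a small set of n agent tokens mediates attention in two softmax steps, softmax(Q Aᵀ) · softmax(A Kᵀ) · V, bringing cost from O(T²) down to O(T·n). In the paper the agents are typically pooled from the queries; here they are passed in directly as a simplification, and all names are illustrative.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def agent_attention(q, k, v, agents):
    """Agent attention (illustrative): two softmax attentions via agent tokens.

    q, k: (T, d); v: (T, d_v); agents: (n, d) with n << T. Cost is O(T * n).
    """
    d = q.shape[1]
    agg = softmax(agents @ k.T / np.sqrt(d)) @ v    # (n, d_v): agents attend to tokens
    return softmax(q @ agents.T / np.sqrt(d)) @ agg # (T, d_v): tokens attend to agents

rng = np.random.default_rng(0)
T, n, d = 16, 4, 8
out = agent_attention(rng.normal(size=(T, d)), rng.normal(size=(T, d)),
                      rng.normal(size=(T, d)), rng.normal(size=(n, d)))
print(out.shape)  # (16, 8)
```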
heinsen_attention
Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
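A heavily hedged sketch of the flavor of constant-cost-per-token attention, not necessarily the paper's exact equations: with a similarity that factorizes over queries and keys (here, exp(q)·exp(k) with elementwise exp), the numerator and denominator become running sums that can be maintained in log space for numerical stability, so each new token costs the same regardless of context length. The function name is hypothetical, and v > 0 is assumed for simplicity (signed values would need separate positive/negative accumulators).

```python
import numpy as np
from scipy.special import logsumexp

def constant_cost_attention(q, k, v):
    """Streaming attention with constant cost per token (illustrative sketch).

    q, k: (T, d); v: (T, d_v), all entries of v assumed positive.
    Running log-space sums:
        log_num = log( sum_j exp(k_j) v_j^T )   # shape (d, d_v)
        log_den = log( sum_j exp(k_j) )         # shape (d,)
    Output: (exp(q_t)^T num) / (exp(q_t)^T den), computed via logsumexp.
    """
    T, d = q.shape
    d_v = v.shape[1]
    log_num = np.full((d, d_v), -np.inf)
    log_den = np.full(d, -np.inf)
    ys = np.empty((T, d_v))
    for t in range(T):
        log_num = np.logaddexp(log_num, k[t][:, None] + np.log(v[t])[None, :])
        log_den = np.logaddexp(log_den, k[t])
        ys[t] = np.exp(logsumexp(q[t][:, None] + log_num, axis=0)
                       - logsumexp(q[t] + log_den))
    return ys

rng = np.random.default_rng(0)
y = constant_cost_attention(rng.normal(size=(6, 4)), rng.normal(size=(6, 4)),
                            rng.uniform(0.1, 1.0, size=(6, 3)))
print(y.shape)  # (6, 3)
```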
CARE-Transformer
CARE Transformer: Mobile-Friendly Linear Visual Transformer via Decoupled Dual Interaction