Mamba

Mamba

[論文導讀] Mamba : 挑戰 Transformer 的新星 EP 1 Structured State Space Sequence Model (S 4) - YouTube
I have read this paper 4 months ago from X, and I knew this paper would go big. But I just skimmed the paper.

Pasted image 20240623182208.png

Why not A???

Mamba is a new class of models known as State Space Models (SSMs) that promise similar performance to Transformers with the ability to handle long sequence lengths efficiently.