State Space Models

  1. S4 paper: Efficiently Modeling Long Sequences with Structured State Spaces

  2. HiPPO: Recurrent Memory with Optimal Polynomial Projections

  3. A new family of SSMs (a fusion of CNNs, RNNs, and classical SSMs such as the Kalman filter): Combining Recurrent, Convolutional, and Continuous-time Models with Linear State-Space Layers (a short recurrence-vs-convolution sketch follows this list)

  4. Follow-up work has focused on understanding S4 models, as well as refining them and augmenting their capabilities: [1, 2, 3, 4, 5]

    1. Diagonal State Spaces are as Effective as Structured State Spaces

    2. On the Parameterization and Initialization of Diagonal State Space Models

    3. Long Range Language Modeling via Gated State Spaces

    4. Simplifying and Understanding State Space Models with Diagonal Linear RNNs

    5. Simplified State Space Layers for Sequence Modeling

  5. A few recent methods optimize SSMs by integrating them with Transformers: [1, 2, 3]

    1. Hungry Hungry Hippos: Towards Language Modeling with State Space Models

    2. Block-State Transformers

    3. Efficient Long Sequence Modeling via State Space Augmented Transformer

  6. SSMs for time series

    1. Effectively Modeling Time Series with Simple Discrete State Spaces

  7. SSMs for RL 

    1. Mohammad’s work: Mastering Memory Tasks with World Models

    2. Decision S4: Efficient Sequence-Based RL via State Spaces Layers

    3. Meta-RL with S4: Structured State Space Models for In-Context Reinforcement Learning

  8. Mamba: Linear-Time Sequence Modeling with Selective State Spaces & follow-ups [1, 2, 3, 4] (a toy sketch of the selective mechanism also follows this list)

    1. Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

    2. VMamba: Visual State Space Model 

    3. U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation

    4. MambaTab: A Simple Yet Effective Approach for Handling Tabular Data
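
The first few items above describe SSM layers that can be computed either as a recurrence (like an RNN) or as a long convolution (like a CNN). Below is a minimal sketch of that equivalence for a discrete linear SSM; the names, shapes, and toy random parameters are illustrative and not taken from any of the papers listed.

import numpy as np

def ssm_recurrent(A, B, C, u):
    # RNN view: x_k = A x_{k-1} + B u_k,  y_k = C x_k
    x = np.zeros(A.shape[0])
    ys = []
    for u_k in u:
        x = A @ x + B * u_k
        ys.append(C @ x)
    return np.array(ys)

def ssm_convolutional(A, B, C, u):
    # CNN view: y = K * u with kernel K = (CB, CAB, CA^2B, ...)
    L = len(u)
    K = np.array([C @ np.linalg.matrix_power(A, k) @ B for k in range(L)])
    return np.array([np.dot(K[:k + 1][::-1], u[:k + 1]) for k in range(L)])

# Both views give the same output for the same (A, B, C) and input u.
rng = np.random.default_rng(0)
n, L = 4, 16
A = 0.5 * rng.standard_normal((n, n)) / np.sqrt(n)   # toy, roughly stable dynamics
B, C = rng.standard_normal(n), rng.standard_normal(n)
u = rng.standard_normal(L)
assert np.allclose(ssm_recurrent(A, B, C, u), ssm_convolutional(A, B, C, u))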

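The "selective" state spaces of item 8 let the SSM parameters depend on the current input, so the recurrence can choose per token what to keep in state and what to forget. The toy sketch below illustrates only that idea, using a diagonal A and scalar inputs; the parametrization and the hardware-aware scan in the Mamba paper are different.

import numpy as np

def selective_scan(u, A_diag, w_delta, W_B, W_C):
    # u: (L,) input sequence; A_diag: (n,) negative diagonal of A.
    # w_delta (scalar) and W_B, W_C ((n,) vectors) are hypothetical toy projections
    # that make the step size and the B/C matrices input-dependent.
    h = np.zeros(A_diag.shape[0])
    ys = np.zeros(len(u))
    for t, u_t in enumerate(u):
        delta_t = np.log1p(np.exp(w_delta * u_t))    # softplus -> positive step size
        B_t, C_t = W_B * u_t, W_C * u_t              # input-dependent input/readout
        A_bar = np.exp(delta_t * A_diag)             # discretized diagonal dynamics
        h = A_bar * h + delta_t * B_t * u_t          # state keeps or forgets per token
        ys[t] = C_t @ h
    return ys

rng = np.random.default_rng(1)
n, L = 8, 32
y = selective_scan(rng.standard_normal(L), -np.arange(1.0, n + 1),
                   0.5, rng.standard_normal(n), rng.standard_normal(n))
print(y.shape)  # (32,)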
