Add S₀ Tuning (PEFT for hybrid recurrent-attention models) by JackYoung27 · Pull Request #14 · xmindflow/Awesome_Mamba

JackYoung27 · 2026-04-08T15:14:28Z

S₀ tuning optimizes one state matrix per recurrent layer while freezing all model weights. On Qwen3.5-4B: +23.6 pp on HumanEval (p < 0.001, 10 seeds), +10.8 pp over LoRA, zero inference overhead. Tested on FalconH1-7B (Mamba-2).

Paper: https://arxiv.org/abs/2604.01168
Code: https://github.com/JackYoung27/s0-tuning

Add S₀ Tuning (PEFT for hybrid recurrent-attention models)

8c32202

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add S₀ Tuning (PEFT for hybrid recurrent-attention models)#14

Add S₀ Tuning (PEFT for hybrid recurrent-attention models)#14
JackYoung27 wants to merge 1 commit intoxmindflow:mainfrom
S0-Tuning:add-s0-tuning

JackYoung27 commented Apr 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

JackYoung27 commented Apr 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant