This is a blog about my research ideas. I try to keep things simple and visual.

Recent Posts

Nonlinear Advantage

Recently, with the democratization of big open-source foundation models such as Meta’s Llama 2 or Stable Diffusion, many people gained interest in model pruni...

How ResNets fix the Loss Surface

In the last post of this series, we came to understand that harmonic distortions created by repeatedly applying ReLU layers are responsible for the rough los...