Blog

Notes on learning theory, optimization, neural networks, and related theory.

2026

“A Simple Proof for the Gap Between the Muon and RMNP Descent Directions on Linear Regression.”

English 中文

May 17, 2026.

“Row Normalization and Orthogonalization Are Asymptotically Equivalent on Neural Networks.”

English 中文

May 8, 2026.