Blog Notes on learning theory, optimization, neural networks, and related theory. 2026 “A Simple Proof for the Gap Between the Muon and RMNP Descent Directions on Linear Regression.” English 中文 May 17, 2026. “Row Normalization and Orthogonalization Are Asymptotically Equivalent on Neural Networks.” English 中文 Related Note: Matrix factorization Deep linear network 2-layer ReLU May 8, 2026.
“A Simple Proof for the Gap Between the Muon and RMNP Descent Directions on Linear Regression.” English 中文 May 17, 2026.
“Row Normalization and Orthogonalization Are Asymptotically Equivalent on Neural Networks.” English 中文 Related Note: Matrix factorization Deep linear network 2-layer ReLU May 8, 2026.