Neural networks are effectively gauge invariant, and you have a huge space of va...

pizza · 2026-01-06T03:05:34 1767668734

Yes to this. Furthermore:

- you can write down the optimal weights of a neural network in analytic form with a Hodge star approach* [0]

- if you use a picture to set the initial weights of your NN, you can see visually how far your choice of optimizer actually moves the weights: e.g. non-dualized optimizers look like they barely change anything, whereas dualized Muon changes the weights so much that you can no longer recognize the original picture [1]

*unfortunately, this is exponential in memory

[0] M. Pilanci, "From Complexity to Clarity: Analytical Expressions of Deep Neural Network Weights via Clifford's Geometric Algebra and Convexity", https://arxiv.org/abs/2309.16512

[1] https://docs.modula.systems/examples/weight-erasure/
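If you want a number instead of eyeballing the picture, here is a rough sketch of one way to measure "how far did the weights move". Everything in it is an assumption for illustration: the random matrix standing in for the picture, the made-up gradient `G`, the learning rate, and the use of an exact SVD to orthogonalize the update (Muon itself approximates this with a Newton-Schulz iteration). It is not the code behind [1], and the size of the gap depends entirely on the gradient scale and learning rate you pick.

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-in for "a picture used as initial weights": values in [0, 1].
W = rng.uniform(0.0, 1.0, size=(64, 64))

# Hypothetical gradient with small entries (an assumption, not real training data).
G = 0.01 * rng.normal(size=W.shape)
lr = 0.1

# Plain (non-dualized) step: the update is just the scaled gradient.
raw_update = lr * G

# Orthogonalized step: replace G = U S V^T by U V^T, so every singular value is 1.
U, S, Vt = np.linalg.svd(G, full_matrices=False)
ortho_update = lr * (U @ Vt)

def relative_change(update):
    """Frobenius norm of the update relative to the Frobenius norm of W."""
    return float(np.linalg.norm(update) / np.linalg.norm(W))

raw_change = relative_change(raw_update)
ortho_change = relative_change(ortho_update)
print(f"raw: {raw_change:.4f}  orthogonalized: {ortho_change:.4f}")
```

The point of the sketch is only that the orthogonalized update has a size that does not depend on how small the gradient is, so with a small gradient it moves the weights much further than the plain step at the same learning rate.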

eru · 2026-01-06T04:36:10 1767674170

Thanks for the explanations and the great links!

srean · 2026-01-06T10:06:16 1767693976

Wouldn't such local invariance tie in with the flatness or shallowness of the minima?

This would tie in with the observation that flat/shallow minima are easier to find with stochastic gradient descent and that such weights generalise better.
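A toy way to see the connection, as a sketch under my own assumptions: take a two-layer ReLU net, scale the first layer by `alpha > 0` and the second by `1 / alpha`. The output does not change, so the loss is exactly constant along that direction, i.e. perfectly flat. The network sizes, random data and the `rescale` helper are made up for illustration. Note that this only shows that a symmetry produces zero-curvature directions; whether that is the same flatness that correlates with generalisation is a separate question.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy regression data and a two-layer ReLU network (sizes are arbitrary).
X = rng.normal(size=(32, 4))
y = rng.normal(size=(32, 1))
W1 = rng.normal(size=(4, 8))
W2 = rng.normal(size=(8, 1))

def loss(W1, W2):
    """Mean squared error of the network x -> relu(x W1) W2."""
    hidden = np.maximum(X @ W1, 0.0)
    pred = hidden @ W2
    return float(np.mean((pred - y) ** 2))

def rescale(W1, W2, alpha):
    """Hypothetical helper: move along the rescaling symmetry (alpha > 0)."""
    return alpha * W1, W2 / alpha

base = loss(W1, W2)

# Loss along the symmetry orbit: unchanged up to floating point error.
orbit_losses = [loss(*rescale(W1, W2, a)) for a in (0.1, 0.5, 2.0, 10.0)]

# Loss after a generic random perturbation of the same weights: it changes.
D1 = rng.normal(size=W1.shape)
D2 = rng.normal(size=W2.shape)
perturbed = loss(W1 + 0.5 * D1, W2 + 0.5 * D2)

print("base:", base)
print("along symmetry:", orbit_losses)
print("random direction:", perturbed)
```

This works because `relu(alpha * z) == alpha * relu(z)` for any positive `alpha`, so the two scalings cancel in the output.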