There's something I'm fundamentally missing here--if the standard basis and the Bernstein basis describe exactly the same set of polynomials of degree n, then surely the polynomial of degree n that minimizes the mean square error is unique (and independent of the basis--the error is between the samples and the approximation; the coefficients/basis are not involved), so both the standard-basis and Bernstein-basis solutions are the same (pathological, overfitted, oscillating) curve?
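To spell out what I mean (my own sketch of the unregularized case; V and B are the design matrices of the standard and Bernstein bases evaluated at the m > n sample points, y the sample values):

    \hat{c} = \arg\min_c \|Vc - y\|^2, \qquad \hat{d} = \arg\min_d \|Bd - y\|^2

The columns of V and B span the same space of degree-n polynomials, so both fits are the projection of y onto that space and V\hat{c} = B\hat{d}, even though \hat{c} \neq \hat{d}.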
Like I understand how the standard basis is pathological because the higher-degree powers diverge like mad, so given "reasonable" coefficients the Bernstein basis is more likely to give "reasonable" curves, but if you're already minimizing the same error I don't understand how you arrive at a different curve.

What am I missing?
The minimization is regularized: you add a penalty term for large coefficients. The same polynomial has different coefficients in the two bases, so the penalty, and therefore the regularized solution, comes out different.
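A minimal sketch of that difference, assuming NumPy/SciPy (helper names like bernstein_design and fit are mine, not from the article): fit the same noisy samples with a degree-n polynomial in both bases, once without a penalty and once with a small L2 penalty on the coefficients.

    import numpy as np
    from scipy.special import comb

    def monomial_design(x, n):
        # Columns are the standard basis 1, x, x^2, ..., x^n.
        return x[:, None] ** np.arange(n + 1)

    def bernstein_design(x, n):
        # Columns are the Bernstein basis polynomials b_{k,n}(x) on [0, 1].
        k = np.arange(n + 1)
        return comb(n, k) * x[:, None] ** k * (1 - x[:, None]) ** (n - k)

    def fit(A, y, lam=0.0):
        # Minimize ||A c - y||^2 + lam * ||c||^2 via an augmented least-squares solve.
        p = A.shape[1]
        A_aug = np.vstack([A, np.sqrt(lam) * np.eye(p)])
        y_aug = np.concatenate([y, np.zeros(p)])
        return np.linalg.lstsq(A_aug, y_aug, rcond=None)[0]

    rng = np.random.default_rng(0)
    n, m = 10, 30
    x = np.sort(rng.uniform(0.0, 1.0, m))
    y = np.sin(2 * np.pi * x) + 0.1 * rng.standard_normal(m)

    xs = np.linspace(0.0, 1.0, 200)          # dense grid for comparing the fitted curves
    V, B = monomial_design(x, n), bernstein_design(x, n)
    Vs, Bs = monomial_design(xs, n), bernstein_design(xs, n)

    # No penalty: same column space, same projection, same curve (up to rounding).
    print(np.max(np.abs(Vs @ fit(V, y) - Bs @ fit(B, y))))

    # With a penalty: it acts on different coefficient vectors, so the curves differ.
    lam = 1e-3
    print(np.max(np.abs(Vs @ fit(V, y, lam) - Bs @ fit(B, y, lam))))

The augmented-matrix form is just a numerically friendlier way of writing ridge regression; the point is only that the penalty ||c||^2 depends on the basis while the data term ||Ac - y||^2 does not.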
Ok, yeah, doing a little googling, that makes sense. I kind of feel the article author was burying the lede: this is really about ML optimization, where apparently regularization is the norm (so to speak lol), and basis selection is the whole ball game only indirectly, through the way it influences the convex optimization.