How could this help us understand the difference between the learned parameters ...

		bionhoward on May 4, 2024 \| parent \| context \| favorite \| on: Kolmogorov-Arnold Networks How could this help us understand the difference between the learned parameters and their gradients? Can the gradients become one with the parameters a la exponential function?