You've got to be careful with PDFs: we can't see how they're rendered internally for the LLM, so there may be differences in how it treats margins/gutters/bleeds that we ought to account for, but can't.
"Make this better in a loop" is less powerful than using evolution on a population. While it may seem like evolution is just single steps in a loop, something qualitatively different occurs due to the population dynamics - since you get the opportunity for multiple restarts / interpolation (according to an LLM) between examples / and 'novelty' not being instantly rejected.
I think the "Aha" is that the RL caused it to use an anthropomorphic tone.
One difference from the initial step is that the second pass includes both the initial step and the 'aha' comment in the context: it is, after all, just doing token-wise LLM prediction.
OTOH, the RL process means it has potentially learned the impact of the statements it emits on the success of its subsequent generation. That self-direction takes it somewhat beyond vanilla-LLM pattern mimicry, IMHO.
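Schematically, each round is just next-token prediction over a growing transcript (`llm_generate` is a hypothetical completion call, stubbed so the sketch runs):

```python
def llm_generate(transcript: str) -> str:
    """Hypothetical completion call; a real one would hit a model API.
    Stubbed with a canned string so the sketch runs."""
    return "<model continuation>"

# Round 1: the model sees only the problem statement.
transcript = "Problem: ...\n"
step1 = llm_generate(transcript)

# The model's own output, 'aha' comment included, becomes ordinary context.
transcript += step1 + "\nAha, wait. Let me reconsider.\n"

# Round 2: plain token-wise prediction over the whole transcript,
# initial step and aha comment included.
step2 = llm_generate(transcript)
print(step2)
```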
Google Colab gives you a free GPU (usually a 16 GB T4) preloaded with frameworks, ready to run. Later you might be tempted by the Pro(+) tiers, but there's plenty of scope to move up the learning curve before spending any money.
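If you want to confirm what you've been allocated, a cell like this works in a stock Colab runtime (PyTorch is preinstalled there):

```python
import torch

# Reports the allocated accelerator on the free tier (typically a Tesla T4).
if torch.cuda.is_available():
    name = torch.cuda.get_device_name(0)
    mem_gb = torch.cuda.get_device_properties(0).total_memory / 1024**3
    print(f"{name}: {mem_gb:.1f} GB")
else:
    print("No GPU allocated - check Runtime > Change runtime type.")
```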
I should check that out. JetBrains just integrated remote management for code and notebooks into their IDEs, and this seems like the perfect way to test it. Thanks for the tip!