There is no reason to believe Gemini Image is not a diffusion model. In fact, the generated results suggest it at least has a VAE and is very likely a diffusion-model variant (most likely a Transfusion-style model).
As a startup, they pivoted and focused on image models (they are model providers, and image models often have more use cases than video models; not to mention their data moat is bigger for images than for video).
If they have so much data, then why do Flux model outputs look so God-awful bad?
They have plastic skin, weird chins, and have that "AI" aura. Not the good AI aura, mind you. The cheap automated YouTube video kind that you immediately skip.
Flux 2 seems to suffer from the exact same problems.
Midjourney is ancient. Their CEO is off trying to build a 3D volume and dating companion or some such nonsense, leaving the product without guidance or much change. It almost feels abandoned. But even so, Midjourney has 10,000x better aesthetics despite terrible prompt adherence and control. Midjourney images are dripping with magazine-spread or Pulitzer-grade aesthetics. It's why Zuckerberg went to them to license their model instead of to the quasi-"open source" BFL.
Even SDXL looks better, and that's a literal dinosaur.
Most of the amazing things you see on social media either come from Midjourney or SDXL. To this day.
>Even SDXL looks better, and that's a literal dinosaur.
I’m not saying you are wrong in effect, but for reference, SDXL was released just slightly over two years ago, and it took about a year for great fine-tunes to appear.
10 is OK if you remove the ads and stop the random updates. I’ll never use 11 and beyond. I already switched to Linux for my dev box, and now that I play fewer and fewer games (haven’t played in weeks) I’ll switch to Linux for my personal box too, once the current one breaks down.
Yeah, the version history as perceived by the vendor and as perceived by ordinary users are somewhat out of sync. To me, Windows 10 is basically new, yet it is already considered out of date.
Windows 10 Pro is actually a pretty decent OS. It brought quite a few major improvements over Windows 7 and I can't really think of any notable downsides.
LuaTorch is eager-execution. The problem with LuaTorch is the GC. You cannot rely on a traditional tracing GC for this workload: each tensor was megabytes large at the time (gigabytes now), so they need to be collected aggressively rather than at intervals. Python's reference-counting system solves this issue. And by "collecting" I don't mean freeing the memory itself; PyTorch has a simple slab allocator to manage CUDA memory.
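The refcounting point can be illustrated in plain Python (a toy stand-in, not PyTorch internals; `FakeTensor` and its `__del__` hook are hypothetical): CPython frees an object the instant its refcount drops to zero, while anything caught in a reference cycle lingers until the tracing collector runs.

```python
import gc

freed = []

class FakeTensor:
    """Toy stand-in for a tensor; __del__ models returning memory to an allocator."""
    def __init__(self, name):
        self.name = name
    def __del__(self):
        freed.append(self.name)

t = FakeTensor("activations")
del t                       # refcount hits zero -> __del__ runs immediately
assert freed == ["activations"]

# A reference cycle defeats refcounting; it survives until the cyclic GC runs.
a, b = FakeTensor("a"), FakeTensor("b")
a.other, b.other = b, a
del a, b
assert "a" not in freed     # still alive: only the cycle collector can free it
gc.collect()                # a tracing GC pass, at some arbitrary later time
assert "a" in freed and "b" in freed
```

For multi-megabyte tensors, the gap between "refcount hit zero" and "the next GC pass" is exactly the memory pressure the comment is describing.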
With LuaTorch, model execution was eager, but you still had to construct the model graph beforehand; it wasn't "define-by-run" like PyTorch.
Back in the day, having completed Andrew Ng's ML course, I built my own C++ NN framework copying this graph-mode LuaTorch API. One of the nice things about explicitly building a graph was that my framework could have the model generate a GraphViz DOT representation of itself so I could visualize it.
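A sketch of that DOT-emitting trick (in Python rather than the original C++; the `Node` class and layer names are made up for illustration). With an explicit graph you can just walk the node structure and print edges:

```python
class Node:
    """A hypothetical graph node: a named op with a list of input nodes."""
    def __init__(self, name, inputs=()):
        self.name = name
        self.inputs = list(inputs)

def to_dot(output):
    """Walk the graph backwards from the output and emit GraphViz DOT."""
    lines = ["digraph model {"]
    seen = set()
    def walk(n):
        if n.name in seen:
            return
        seen.add(n.name)
        for i in n.inputs:
            walk(i)
            lines.append(f'  "{i.name}" -> "{n.name}";')
    walk(output)
    lines.append("}")
    return "\n".join(lines)

# Build a tiny explicit graph, then render it.
x = Node("input")
h = Node("linear1", [x])
a = Node("relu", [h])
y = Node("linear2", [a])
dot = to_dot(y)
print(dot)  # paste into `dot -Tpng` to visualize
```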
Ah, I get what you mean now. I was mixing up the nn module and the tensor-execution bits. (To be fair, the PyTorch nn module carries over many of these quirks!)
That's wrong. llama.cpp / Candle don't bring anything to the table that PyTorch cannot do (design-wise). What they offer is a smaller deployment footprint.
What's modern about LLMs is the training infrastructure and the single-coordinator pattern, which PyTorch has only just started on and which is still inferior to many internal implementations: https://pytorch.org/blog/integration-idea-monarch/
Note that busy_timeout does not help SQLite in this case (SQLITE_BUSY is returned immediately; there is no waiting involved).
Also, this is because of WAL mode (and I believe it applies only to WAL mode, since there are no truly concurrent reads in the other modes).
The reason is that in WAL mode pages are appended to a single log file. Hence, if you read something inside a BEGIN transaction and later want to mutate something else, another page could already have been appended in between, potentially violating WAL mode's strict-serializability guarantee. SQLite therefore has to fail at the point of the lock upgrade.
Immediate mode solves this problem because the write lock is acquired at BEGIN time, so no page can be appended between the read and the write, unlike in deferred mode.
How do you do site-to-site traffic over Tailscale / WireGuard encryption? From preliminary testing, it seems to have difficulty saturating a 10 Gbps connection, while plain HTTP (nginx) traffic manages that fine. Of course it will vary from CPU to CPU, but any tips on how to improve it? Ideally I would love to run everything over encrypted links; everything is public, but it would be one less thing to be careful about (in case I need to transport non-public data over them in the future).
Yeah, luckily, you can unit-test these and fix them. They are not concurrency bugs (again, luckily).
BTW, numeric differentiation can only be tested in a very limited way (due to the algorithmic cost once you are working with big matrices). It is much easier, and more effective, to test against multiple implementations.
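For reference, the limited finite-difference check looks something like this (a toy scalar example with a made-up function; real frameworks run the same idea over small random slices of large tensors, because two function evaluations per input don't scale):

```python
import math

def f(x, y):
    # Toy function with a hand-derivable gradient.
    return x * y + math.sin(x)

def analytic_grad(x, y):
    # d f/dx = y + cos(x),  d f/dy = x
    return y + math.cos(x), x

def numeric_grad(fn, x, y, eps=1e-6):
    # Central differences: O(eps^2) error, but 2 evaluations per input,
    # which is exactly what makes this impractical for big matrices.
    gx = (fn(x + eps, y) - fn(x - eps, y)) / (2 * eps)
    gy = (fn(x, y + eps) - fn(x, y - eps)) / (2 * eps)
    return gx, gy

ax, ay = analytic_grad(0.5, 2.0)
nx, ny = numeric_grad(f, 0.5, 2.0)
assert abs(ax - nx) < 1e-4 and abs(ay - ny) < 1e-4
```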
And it has always felt to me that it has lineage from the neural Turing machine line of work as a prior. The transformative part was: 1. finding a good task (machine translation) and a reasonable way to stack layers (the encoder-decoder architecture); 2. running the experiment; 3. ditching the external-KV-store idea and just using self-projected KV.
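What "self-projected KV" means, in a pure-Python toy (weights and shapes are illustrative, not a real implementation): K and V are linear projections of the input sequence itself, rather than reads from an external memory as in a neural Turing machine.

```python
import math

def matmul(A, B):
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def softmax(row):
    m = max(row)
    e = [math.exp(v - m) for v in row]
    s = sum(e)
    return [v / s for v in e]

def self_attention(X, Wq, Wk, Wv):
    # K and V come from projecting the sequence X itself -- no external KV store.
    Q, K, V = matmul(X, Wq), matmul(X, Wk), matmul(X, Wv)
    d = len(Q[0])
    scores = [[sum(q * k for q, k in zip(qr, kr)) / math.sqrt(d) for kr in K]
              for qr in Q]
    A = [softmax(r) for r in scores]   # each row: attention weights over positions
    return matmul(A, V)

# Toy 2-token sequence with identity projection weights.
X = [[1.0, 0.0], [0.0, 1.0]]
I = [[1.0, 0.0], [0.0, 1.0]]
out = self_attention(X, I, I, I)
```

Each output row is a convex combination of the value rows, so with these toy inputs every output row sums to 1, and each token attends most strongly to itself.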