
Yup, they win. My biggest SQLite database is 1.7TB with, as of just now, 2,314,851,188 records (all JSON documents with a few keyword indexes via json_extract).

Works like a charm, as in: the web app consuming the API linked to it returns paginated results for any relevant search term within a second or so, for a handful of concurrent users.
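
For anyone curious what that kind of setup looks like, here's a minimal sketch using Python's sqlite3; the table, column, and key names are made up for illustration, not the actual schema:

    import sqlite3

    conn = sqlite3.connect("docs.db")
    conn.executescript("""
        CREATE TABLE IF NOT EXISTS docs (
            id   INTEGER PRIMARY KEY,
            body TEXT NOT NULL  -- one JSON document per row
        );
        -- 'keyword indexes via json_extract': an expression index on a JSON field
        CREATE INDEX IF NOT EXISTS docs_keyword_idx
            ON docs (json_extract(body, '$.keyword'));
    """)

    # Paginated lookup that hits the expression index instead of scanning billions of rows
    page = conn.execute(
        "SELECT id, body FROM docs"
        " WHERE json_extract(body, '$.keyword') = ?"
        " ORDER BY id LIMIT 50 OFFSET ?",
        ("sqlite", 0),
    ).fetchall()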




I think FS-level compression would be a perfect match. Has anyone tried it successfully on large SQLite DBs? (I tried with btrfs, but it failed to compress the database, and I never got to the bottom of why.)


I did a small benchmark of VFS-based and column-level compression a while ago: https://logdy.dev/blog/post/part-3-log-file-compression-with... https://logdy.dev/blog/post/part-4-log-file-compression-with... It all depends on the use case and read/write patterns. IMO, if well designed, it could add real value.


> I think FS-level compression would be a perfect match. Has anyone tried it successfully on large SQLite DBs?

I've had decent success with `sqlite-zstd`[0], which does row-level compression, but only on small (~10GB) databases. No reason it couldn't work for bigger DBs, though.

[0] https://github.com/phiresky/sqlite-zstd
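
If it helps anyone, wiring it up from Python looks roughly like the sketch below. The extension filename, the table/column names, and the exact zstd_enable_transparent config keys are from my reading of the sqlite-zstd README, so treat them as assumptions and check the repo:

    import sqlite3

    conn = sqlite3.connect("events.db")
    conn.enable_load_extension(True)          # needs a Python/SQLite build that allows extension loading
    conn.load_extension("./libsqlite_zstd")   # path to the compiled sqlite-zstd library (assumed)
    conn.enable_load_extension(False)

    # Per the sqlite-zstd README (from memory): mark a column for transparent
    # row-level zstd compression; the extension then serves the original table
    # name through a view over the compressed data.
    conn.execute("""SELECT zstd_enable_transparent(
        '{"table": "events", "column": "body", "compression_level": 19, "dict_chooser": "''a''"}')""")

    # Compress existing rows incrementally so writers aren't blocked for long.
    conn.execute("SELECT zstd_incremental_maintenance(NULL, 1)")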


> My biggest SQLite database is 1.7TB with

What do you run this on? Just some AWS instance with a huge disk attached?


I can see that you're a user of AWS. Check some prices on dedicated servers one day. They're an order of magnitude cheaper than similar AWS instances, and more powerful because all compute and storage resources are local and unshared.

They do have a higher price floor, though. There are no $5/month dedicated servers anywhere - the cheapest is more like $40. There are $5/month virtual servers outside of AWS which are cheaper and more powerful than $5/month AWS instances.


A Windows Server VM on a self-hosted Hyper-V box, which has a whole bunch of 8TB NVMe drives; this VM has a 4TB virtual volume on one of those (plus a much smaller OS volume on another).


How do you back up a file like that?


Using the SQLite backup API, which pretty much corresponds to the .backup CLI command. It doesn't block any reads or writes, so the performance impact is minimal, even if you do it directly to slow-ish storage.
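
If anyone wants to script it, Python's sqlite3 exposes the same online backup API; filenames here are just placeholders:

    import sqlite3

    src = sqlite3.connect("big.db")
    dst = sqlite3.connect("big-backup.db")

    def progress(status, remaining, total):
        # Called after every batch of pages; handy for logging long-running backups.
        print(f"{total - remaining} of {total} pages copied")

    # Copies the database in batches of `pages`; between batches the lock on the
    # source is released, so other readers and writers keep going.
    src.backup(dst, pages=4096, progress=progress)

    dst.close()
    src.close()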


> It doesn't block any reads or writes.

That's neat! I bet it keeps growing a WAL file while the backup is ongoing, right?


Hard to imagine doing it any other way, which is probably fine up until you hit some larger file sizes.


That copies the entire file each time (not just deltas).

You may find sqlite3_rsync better.


I use ZFS snapshots; they work as diffs, so they're very cheap to store, create, and replicate.


sqlite3_rsync is a new tool created by the SQLite team. It might be useful.


Ctrl-c, Ctrl-v


That's not great advice for a large database (and I wouldn't recommend it for small databases either). That incidentally works when the db is small enough that the copy is nearly atomic, but with a big copy, you can end up with a corrupt database. SQLite is designed such that a crashed system at any time does not corrupt a database. It's not designed such that a database can be copied linearly from beginning to end while it's being written to without corruption. Simply copying the database is only good enough if you can ensure that there are no write transactions opened during the entire copy.

A reliable backup will need to use the .backup command, the .dump command, the backup API[0], or filesystem or volume snapshots to get an atomic snapshot.

[0]: https://www.sqlite.org/backup.html
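
For completeness, the .dump route can also be scripted; Python's iterdump() emits the same kind of SQL text dump (filenames are placeholders):

    import sqlite3

    conn = sqlite3.connect("app.db")

    # Equivalent of the CLI's .dump: stream the schema and data out as SQL
    # statements that can recreate the database from scratch.
    with open("app-dump.sql", "w", encoding="utf-8") as f:
        for stmt in conn.iterdump():
            f.write(stmt + "\n")

    conn.close()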


You should use mongodb. It’s web scale



This hits so much harder 15 years on.

Additionally, Xtranormal missed out on the generative video curve.


Someone might take that as advice.


MongoDB earns $1.7b a year in revenue.

A whole lot of people have already taken that advice.


Captive customers, my friend.



